Detail výsledku

LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation

LUONG, H.; LI, H.; ZHANG, L.; LEE, K.; CHNG, E. LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, Indická republika: Institute of Electrical and Electronics Engineers Inc., 2025. p. 1-5. ISBN: 979-8-3503-6874-1.

Typ

článek ve sborníku konference

Jazyk

angličtina

Autoři

Luong Hieu Thi
Li Haoyang
Zhang Lin, Ph.D.
Lee Kong Aik
Chng Eng Siong

Abstrakt

Previous fake speech datasets were constructed from a defender's perspective to develop countermeasure (CM) systems without considering diverse motivations of attackers. To better align with real-life scenarios, we created LlamaPartialSpoof, a 130-hour dataset that contains both fully and partially fake speech, using a large language model (LLM) and voice cloning technologies to evaluate the robustness of CMs. By examining valuable information for both attackers and defenders, we identify several key vulnerabilities in current CM systems, which can be exploited to enhance attack success rates, including biases toward certain text-to-speech models or concatenation methods. Our experimental results indicate that the current fake speech detection system struggle to generalize to unseen scenarios, achieving a best performance of 24.49% equal error rate.

Klíčová slova

dataset | deepfake | fake speech detection | large language model | voice cloning

URL

https://ieeexplore.ieee.org/document/10888070

Rok

2025

Strany

Sborník

Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

Konference

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

ISBN

979-8-3503-6874-1

Vydavatel

Institute of Electrical and Electronics Engineers Inc.

Místo

Hyderabad, Indická republika

DOI

10.1109/ICASSP49660.2025.10888070

EID Scopus

2-s2.0-105009694746

BibTeX

@inproceedings{BUT199992,
  author="{} and  {} and Lin {Zhang} and  {} and  {}",
  title="LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation",
  booktitle="Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing",
  year="2025",
  pages="5",
  publisher="Institute of Electrical and Electronics Engineers Inc.",
  address="Hyderabad, Indická republika",
  doi="10.1109/ICASSP49660.2025.10888070",
  isbn="979-8-3503-6874-1",
  url="https://ieeexplore.ieee.org/document/10888070"
}

Projekty

Soudobé metody zpracování, analýzy a zobrazování multimediálních a 3D dat, VUT, Vnitřní projekty VUT, FIT-S-23-8278, zahájení: 2023-03-01, ukončení: 2026-02-28, řešení

Výzkumné skupiny

Výzkumná skupina dolování dat z řeči BUT Speech@FIT (VZ SPEECH)

Pracoviště

Ústav počítačové grafiky a multimédií (UPGM)
Výzkumná skupina dolování dat z řeči BUT Speech@FIT (VZ SPEECH)