Publication Details

Evaluation Framework for Deepfake Speech Detection: A Comparative Study of State-of-the-art Deepfake Speech Detectors

FIRC Anton, MALINKA Kamil and HANÁČEK Petr. Evaluation Framework for Deepfake Speech Detection: A Comparative Study of State-of-the-art Deepfake Speech Detectors. Cybersecurity, vol. 8, no. 50, 2025, pp. 1-24. ISSN 2523-3246. Available from: https://cybersecurity.springeropen.com/articles/10.1186/s42400-024-00346-1

Czech title

Rámec pro hodnocení detekce deepfake řeči: Srovnávací studie nejmodernějších detektorů deepfake řeči

Type

journal article

Language

english

Authors

Firc Anton, Ing. (DITS FIT BUT)
Malinka Kamil, Mgr., Ph.D. (DITS FIT BUT)
Hanáček Petr, doc. Dr. Ing. (DITS FIT BUT)

URL

https://cybersecurity.springeropen.com/articles/10.1186/s42400-024-00346-1

Keywords

Deepfake speech, Detection, Robustness, Evaluation framework, Computer security

Abstract

The proliferation of deepfake speech poses a significant threat to cybersecurity, from manipulating political speeches and impersonating public figures to spoofing voice biometric systems. The increasing sophistication of adversaries increases the necessity of deploying adaptive detection methods. Moreover, real-world incidents such as fraudulent financial transactions highlight the severity of the problem. Although numerous detectors have been developed, their evaluation remains difficult due to different methodologies and benchmark datasets, making direct comparisons impossible. This study presents a general and detailed framework for evaluating and comparing deepfake speech detectors. We further demonstrate the use of this framework to evaluate 40 state-of-the-art deepfake speech detectors under various conditions and data samples. We objectively compare these methods and identify the key attributes influencing performance the most. We also stress the issue of generalisation, as current detectors struggle to detect previously unseen deepfake speech samples or samples that have been modified. Finally, to strengthen the defence against synthetic audio content, we provide recommendations for improving the robustness of future detectors.

Published

2025

Pages

1-24

Journal

Cybersecurity, vol. 8, no. 50, ISSN 2523-3246

Publisher

Springer Nature Switzerland AG

DOI

10.1186/s42400-024-00346-1

UT WoS

001541737700001

EID Scopus

2-s2.0-105012388167

BibTeX

@ARTICLE{FITPUB13100,
   author = "Anton Firc and Kamil Malinka and Petr Han\'{a}\v{c}ek",
   title = "Evaluation Framework for Deepfake Speech Detection: A Comparative Study of State-of-the-art Deepfake Speech Detectors",
   pages = "1--24",
   journal = "Cybersecurity",
   volume = 8,
   number = 50,
   year = 2025,
   ISSN = "2523-3246",
   doi = "10.1186/s42400-024-00346-1",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/13100"
}