Publication Details
Clustering Unsupervised Representations as Defense against Poisoning Attacks on Speech Commands Classification System
THEBAUD, T.
JOSHI, S.
LI, H.
ŠŮSTEK, M. (DCGM)
VILLALBA LOPEZ, J.
KHUDANPUR, S.
DEHAK, N.
poisoning attack, unsupervised representations, clustering, speech commands, defense against attacks on speech systems
Poisoning attacks entail attackers intentionally tampering with training data. In
this paper, we consider a dirty-label poisoning attack scenario on a speech
commands classification system. The threat model assumes that certain utterances
from one of the classes (the source class) are poisoned by superimposing a trigger
on them, and their labels are changed to another class selected by the attacker
(the target class). We propose a filtering defense against such an attack. First, we use
DIstillation with NO labels (DINO) to learn unsupervised representations for all
the training examples. Next, we use K-means and LDA to cluster these
representations. Finally, we keep the utterances with the most repeated label in
their cluster for training and discard the rest. For a 10% poisoned source class,
we demonstrate a drop in attack success rate from 99.75% to 0.25%. We test our
defense against a variety of threat models, including different target and source
classes, as well as trigger variations.
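The final filtering step described in the abstract, keeping only the utterances whose label matches the most frequent label in their cluster, can be sketched as follows. This is a minimal illustration, not the authors' implementation; the function name and data layout are assumptions.

```python
from collections import Counter

def filter_by_cluster_majority(cluster_ids, labels):
    """Keep indices of training examples whose label equals the
    majority label of the cluster they were assigned to; discard
    the rest as suspected poisoned samples."""
    # Majority label per cluster.
    majority = {}
    for c in set(cluster_ids):
        members = [labels[i] for i, cid in enumerate(cluster_ids) if cid == c]
        majority[c] = Counter(members).most_common(1)[0][0]
    # Keep only examples agreeing with their cluster's majority label.
    return [i for i, (cid, lab) in enumerate(zip(cluster_ids, labels))
            if lab == majority[cid]]

# Toy usage: index 2 carries a minority (suspicious) label in cluster 0
# and is discarded.
kept = filter_by_cluster_majority([0, 0, 0, 1, 1],
                                  ["yes", "yes", "up", "no", "no"])
print(kept)  # [0, 1, 3, 4]
```

In the paper's pipeline, the cluster assignments would come from K-means (with LDA) over DINO representations, and the surviving indices define the cleaned training set.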
@inproceedings{BUT187976,
author="THEBAUD, T. and JOSHI, S. and LI, H. and ŠŮSTEK, M. and VILLALBA LOPEZ, J. and KHUDANPUR, S. and DEHAK, N.",
title="Clustering Unsupervised Representations as Defense against Poisoning Attacks on Speech Commands Classification System",
booktitle="Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)",
year="2023",
pages="1--8",
publisher="IEEE Signal Processing Society",
address="Taipei",
doi="10.1109/ASRU57964.2023.10389650",
isbn="979-8-3503-0689-7",
url="https://ieeexplore.ieee.org/document/10389650"
}