Skip to main content

Video Not Available

    Details
    Author(s)
    Display Name
    Po-Hao Huang
    Affiliation
    Affiliation
    National Tsing Hua University
    Display Name
    Yung-Yuan Lan
    Affiliation
    Affiliation
    National Tsing Hua University
    Display Name
    Wilbert Harriman
    Affiliation
    Affiliation
    National Tsing Hua University
    Affiliation
    Affiliation
    National Tsing Hua University
    Display Name
    Ting-Chi Wang
    Affiliation
    Affiliation
    National Tsing Hua University
    Abstract

    Neural networks have become an attractive choice for audio applications, but they are known to suffer from adversarial examples. In this work, we propose a method for detecting adversarial examples of a keyword spotting system. We design convolutional neural networks for metric learning to map the internal representation of each layer of an input audio to a low-dimensional feature space. We then extract the distance information from the feature space of each layer and feed it into an LSTM network to determine whether the input audio is clean or adversarial. Promising experimental results are shown to support our detector.