Papers
arxiv:1711.08058
Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio
Published on Nov 21, 2017
Authors:
Abstract
We propose using cascaded classifiers for a <PRE_TAG>keyword spotting (KWS)</POST_TAG> task on narrow-band (NB), 8kHz audio acquired in non-IID environments --- a more challenging task than most state-of-the-art KWS systems face. We present a model that incorporates Deep Neural Networks (DNNs), cascading, multiple-feature representations, and multiple-instance learning. The cascaded classifiers handle the task's class imbalance and reduce power consumption on computationally-constrained devices via early termination. The KWS system achieves a false negative rate of 6% at an hourly false positive rate of 0.75
Models citing this paper 0
No model linking this paper
Cite arxiv.org/abs/1711.08058 in a model README.md to link it from this page.
Datasets citing this paper 0
No dataset linking this paper
Cite arxiv.org/abs/1711.08058 in a dataset README.md to link it from this page.
Spaces citing this paper 0
No Space linking this paper
Cite arxiv.org/abs/1711.08058 in a Space README.md to link it from this page.
Collections including this paper 0
No Collection including this paper
Add this paper to a
collection
to link it from this page.