Abstract
In this paper we propose a novel approach for the audio-based detection of events. The approach adopts the bag of words paradigm, and has two main advantages over other techniques present in the literature: the ability to automatically adapt (through a learning phase) to both short, impulsive sounds and long, sustained ones, and the ability to work in noisy environments where the sounds of interest are superimposed to background sounds possibly having similar characteristics. The proposed method has been experimentally validated on a large database of sounds, including several kinds of background noise, which are superimposed to the sounds to be recognized. The obtained performance has been compared with the results of another audio event detection algorithm from the literature, showing a significant improvement.
Original language | English |
---|---|
Title of host publication | 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2013 |
Place of Publication | Piscataway, NJ |
Publisher | IEEE |
Pages | 81-86 |
Number of pages | 6 |
ISBN (Print) | 978-1-4799-0703-8 |
DOIs | |
Publication status | Published - 1 Jan 2013 |
Externally published | Yes |
Event | 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2013 - Krakow, Poland Duration: 27 Aug 2013 → 30 Aug 2013 Conference number: 10 |
Conference
Conference | 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2013 |
---|---|
Abbreviated title | AVSS 2013 |
Country/Territory | Poland |
City | Krakow |
Period | 27/08/13 → 30/08/13 |