Abstract
We propose a sound analysis system for the detection of audio events in surveillance applications. The method that we propose combines short-and long-Time analysis in order to increase the reliability of the detection. The basic idea is that a sound is composed of small, atomic audio units and some of them are distinctive of a particular class of sounds. Similarly to the words in a text, we count the occurrence of audio units for the construction of a feature vector that describes a given time interval. A classifier is then used to learn which audio units are distinctive for the different classes of sound. We compare the performance of different sets of short-Time features by carrying out experiments on the MIVIA audio event data set. We study the performance and the stability of the proposed system when it is employed in live scenarios, so as to characterize its expected behavior when used in real applications.
Original language | English |
---|---|
Title of host publication | 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2016 |
Place of Publication | Piscataway, NJ |
Publisher | IEEE |
Pages | 438-443 |
Number of pages | 6 |
ISBN (Electronic) | 978-1-5090-3811-4 |
ISBN (Print) | 978-1-5090-3812-1 |
DOIs | |
Publication status | Published - 7 Nov 2016 |
Externally published | Yes |
Event | 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2016 - Colorado Springs, United States Duration: 23 Aug 2016 → 26 Aug 2016 Conference number: 13 |
Publication series
Name | 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2016 |
---|
Conference
Conference | 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2016 |
---|---|
Abbreviated title | AVSS 2016 |
Country/Territory | United States |
City | Colorado Springs |
Period | 23/08/16 → 26/08/16 |