Detailed info

Scalable neural architectures for end-to-end environmental sound classification

Authors	Francesco Paissan, Alberto Ancilotto, Alessio Brutti, Elisabetta Farella
Title	Scalable neural architectures for end-to-end environmental sound classification
Abstract	Sound Event Detection (SED) is a complex task that simulates the human ability to recognize what is happening in the surroundings from auditory signals alone. This technology is a crucial asset in many applications, such as smart cities, where urban sounds can be detected and processed by embedded devices in an Internet of Things (IoT) setting to identify meaningful events for municipalities or law enforcement. However, while current deep learning techniques for SED are effective, they are also resource- and power-hungry, and thus not appropriate for pervasive battery-powered devices. In this paper, we propose novel neural architectures based on PhiNets for real-time acoustic event detection on microcontroller units. The proposed models are easily scalable to fit the hardware requirements and can operate on both spectrograms and waveforms. In particular, our architectures achieve state-of-the-art performance on UrbanSound8K in spectrogram classification (around 77%) with an extreme compression factor of 99.8% with respect to current state-of-the-art architectures.
ISBN	Electronic ISSN: 2379-190X; Print on Demand (PoD) ISSN: 1520-6149
Conference	IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP)
Date	22-27/05/2022
Location	virtual
Year of Publication, Publisher	2022
Url	https://zenodo.org/record/6351853
DOI	10.1109/ICASSP43922.2022.9746093

Key Facts

Project Coordinator: Dr. Sotiris Ioannidis
Institution: Foundation for Research and Technology Hellas (FORTH)
E-mail: marvel-info@marvel-project.eu
Start: 01.01.2021
Duration: 36 months
Participating Organisations: 17
Number of countries: 12

Funding

This project has received funding from the European Union’s Horizon 2020 Research and Innovation program under grant agreement No 957337. The website reflects only the view of the author(s) and the Commission is not responsible for any use that may be made of the information it contains.
