Detailed info

Masking Speech Contents by Random Splicing: is Emotional Expression Preserved?

Authors	Burkhardt Felix, Derington Anna, Kahlau Matthias, Scherer Klaus, Eyben Florian, Schuller Bjorn
Title	Masking Speech Contents by Random Splicing: is Emotional Expression Preserved?
Abstract	We discuss the influence of random splicing on the perception of emotional expression in speech signals. Random splicing is the randomized reconstruction of short audio snippets with the aim of obfuscating the speech contents. A part of the German parliament recordings has been randomly spliced, and both versions (the original and the scrambled one) have been manually labeled with respect to the arousal, valence and dominance dimensions. Additionally, we run a state-of-the-art transformer-based pre-trained emotional model on the data. We find sufficiently high correlation for the annotations and predictions of emotional dimensions between both sample versions to be confident that machine learners can be trained with randomly spliced data.
Conference	ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date	04-10 June 2023
Location	Rhodes Island, Greece
Year of Publication	2023
Publisher	IEEE
Url	https://doi.org/10.5281/zenodo.10664711
DOI	10.1109/ICASSP49357.2023.10097094
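
The abstract describes random splicing as cutting a recording into short snippets and reassembling them in random order. A minimal sketch of that idea follows, assuming fixed-length segments and no smoothing at the joins; the function name `random_splice` and the parameters `segment_len` and `seed` are illustrative and not taken from the paper, whose actual procedure may differ.

```python
import random


def random_splice(samples, segment_len, seed=None):
    """Cut `samples` into consecutive segments and rejoin them in random order.

    Assumption: fixed-length segments (the last one may be shorter) and
    plain concatenation with no crossfade. This is a sketch, not the
    authors' implementation.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    # Slice the signal into consecutive chunks of segment_len samples.
    segments = [
        samples[i:i + segment_len]
        for i in range(0, len(samples), segment_len)
    ]
    # Shuffle the chunk order; a seed makes the result reproducible.
    rng = random.Random(seed)
    rng.shuffle(segments)
    # Concatenate the shuffled chunks back into one flat sequence.
    return [value for segment in segments for value in segment]
```

Every sample is preserved and only the segment order changes, so the word sequence is scrambled while the short-term acoustic content of each snippet stays intact. In practice `segment_len` would be chosen in samples from a snippet duration and the sampling rate.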

Key Facts

Project Coordinator: Dr. Sotiris Ioannidis
Institution: Foundation for Research and Technology Hellas (FORTH)
E-mail: marvel-info@marvel-project.eu
Start: 01.01.2021
Duration: 36 months
Participating Organisations: 17
Number of countries: 12

Funding

This project has received funding from the European Union’s Horizon 2020 Research and Innovation program under grant agreement No 957337. The website reflects only the view of the author(s) and the Commission is not responsible for any use that may be made of the information it contains.
