Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection
Joint sound event localization and detection (SELD) is an emerging audio signal processing task adding spatial dimensions to acoustic scene analysis and sound event detection.