Publications Archives - Page 4 of 4

Within-layer Diversity Reduces Generalization Gap

Post published:December 16, 2021
Post category:Publications

Detailed info Within-layer Diversity Reduces Generalization Gap AuthorsFiras Laakom, Jenni Raitoharju, Alexandros Iosifidis and Moncef GabboujTitleWithin-layer Diversity Reduces Generalization GapAbstractNeural networks are composed of multiple layers arranged in a hierarchical…

WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information

Post published:November 18, 2021
Post category:Publications

Automated audio captioning (AAC) is a novel task, where a method takes as an input an audio sample and outputs a textual description (i.e. a caption) of its contents.

Continual Learning for Automated Audio Captioning Using The Learning Without Forgetting Approach

Post published:November 18, 2021
Post category:Publications

Automated audio captioning (AAC) is the task of automatically creating textual descriptions (i.e. captions) for the contents of a general audio signal. Most AAC methods are using existing datasets to optimize and/or evaluate upon.

Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection

Post published:November 18, 2021
Post category:Publications

Joint sound event localization and detection (SELD) is an emerging audio signal processing task adding spatial dimensions to acoustic scene analysis and sound event detection.

Multi-Exit Vision Transformer for Dynamic Inference

Post published:October 29, 2021
Post category:Publications

Underspecification and fairness in machine learning (ML) applications have recently become two prominent issues in the ML community. Acoustic scene classification (ASC) applications have so far remained unaffected by this discussion, but are now becoming increasingly used in real-world systems where fairness and reliability are critical aspects.

Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations

Post published:October 5, 2021
Post category:Publications

Underspecification and fairness in machine learning (ML) applications have recently become two prominent issues in the ML community. Acoustic scene classification (ASC) applications have so far remained unaffected by this discussion, but are now becoming increasingly used in real-world systems where fairness and reliability are critical aspects.

MARVEL: Multimodal Extreme Scale Data Analytics for Smart Cities Environments

Post published:October 5, 2021
Post category:Publications

A Smart City based on data acquisition, handling and intelligent analysis requires efficient design and implementation of the respective AI technologies and the underlying infrastructure for seamlessly analyzing the large amounts of data in real-time.

Improving the Accuracy of Early Exits in Multi-Exit Architectures via Curriculum Learning

Post published:September 22, 2021
Post category:Publications

Deploying deep learning services for time-sensitive and resource-constrained settings such as IoT using edge computing systems is a challenging task that requires dynamic adjustment of inference time. Multi-exit architectures allow deep neural networks to terminate their execution early in order to adhere to tight deadlines at the cost of accuracy.

Automatic Analysis of the Emotional Content of Speech in Daylong Child-Centered Recordings from a Neonatal Intensive Care Unit

Post published:September 22, 2021
Post category:Publications

Researchers have recently started to study how the emotional speech heard by young infants can affect their developmental outcomes. As a part of this research, hundreds of hours of daylong recordings from preterm infants’ audio environments were collected from two hospitals in Finland and Estonia in the context of so-called APPLE study.

Enabling energy efficient machine learning on a Ultra-Low-Power vision sensor for IoT

Post published:March 24, 2021
Post category:Publications

he Internet of Things (IoT) and smart city paradigm includes ubiquitous technology to extract context information in order to return useful services to users and citizens. An essential role in this scenario is often played by computer vision applications, requiring the acquisition of images from specific devices.