Enriched Music Representations With Multiple Cross-Modal Contrastive Learning

Modeling various aspects that make a music piece unique is a challenging task, requiring the combination of multiple sources of information. Deep learning is commonly used to obtain representations using various sources of information, such as the audio, interactions between users and songs, or associated genre metadata.

Continue ReadingEnriched Music Representations With Multiple Cross-Modal Contrastive Learning