Audio Intelligence

39 publications
August 2023 | Interspeech

Lightweight and Efficient Spoken Language Identification of Long-form Audio

Winstead Zhu, Md Iftekhar Tanveer, Yang Janet Liu, Seye Ojumu, Rosie Jones

June 2023 | ICASSP

Contrastive Learning-based Audio to Lyrics Alignment for Multiple Languages

Simon Durand, Daniel Stoller, Sebastian Ewert

September 2022 | Interspeech

Unsupervised Speaker Diarization that is Agnostic to Language Overlap Aware and Free of Tuning

M Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren, Rosie Jones

September 2022 | Interspeech

Exploring audio-based stylistic variation in podcasts

Katariina Martikainen, Jussi Karlgren, Khiet Truong

July 2022 | SIGIR

What Makes a Good Podcast Summary?

Rezvaneh Rezapour, Sravana Reddy, Ann Clifton, Rosie Jones

June 2022 | ICWSM

The Contribution of Lyrics and Acoustics to Collaborative Understanding of Mood

Shahrzad Nazeri, Sravana Reddy, Joana Correia, Jussi Karlgren, Rosie Jones

May 2022 | ICASSP

Few-shot musical source separation

Yu Wang, Daniel Stoller, Rachel M. Bittner, Juan Pablo Bello

May 2022 | ICASSP

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

Rachel Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert

May 2022 | ICASSP

Improving Lyrics Alignment through Joint Pitch Detection

Jiawen Huang, Emmanouil Benetos, Sebastian Ewert

October 2021 | IEEE Signal Processing Magazine

Audio-Based Musical Version Identification: Elements and challenges

Furkan Yesiler, Guillaume Doras, Rachel M. Bittner, Christopher J. Tralie, Joan Serrà

September 2020 | PLOS One

The skipping behavior of users of music streaming services and its relation to musical structure

Nicola Montecchio, Pierre Roy, François Pachet

August 2020 | ISMIR - International Society for Music Information Retrieval Conference

Data Cleansing with Contrastive Learning for Vocal Note Event Annotations

Gabriel Meseguer-Brocal, Rachel Bittner, Simon Durand, Brian Brost

July 2020 | IJCAI - International Joint Conference on Artificial Intelligence

Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling

Daniel Stoller, Mi Tian, Sebastian Ewert, and Simon Dixon

July 2020 | WCCI/IJCNN - IEEE World Congress on Computational Intelligence / International Joint Conference on Neural Networks

Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks

Ishwarya Ananthabhotla, Sebastian Ewert, Joseph A. Paradiso

November 2019 | ISMIR

mirdata: Software for Reproducible Usage of Datasets

Rachel M Bittner, Magdalena Fuentes, David Rubinstein, Andreas Jansson, Keunwoo Choi, Thor Kell

November 2019 | ISMIR

Generalized Metrics for Single-F0 Estimation Evaluation

Rachel M. Bittner, Juan J Bosch

November 2019 | ISMIR - International Society for Music Information Retrieval Conference

Generalized Metrics for Single-F0 Estimation Evaluation

Rachel M. Bittner, Juan J Bosch

October 2019 | ACM MM - ACM International Conference on Multimedia

Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

Ishwarya Ananthabhotla, Sebastian Ewert, Joseph A. Paradiso