Audio Intelligence

33 publications
September 2022 | Interspeech

Unsupervised Speaker Diarization that is Agnostic to Language Overlap Aware and Free of Tuning

M Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren, Rosie Jones

September 2022 | Interspeech

Exploring audio-based stylistic variation in podcasts

Katariina Martikainen, Jussi Karlgren, Khiet Truong

July 2022 | SIGIR

What Makes a Good Podcast Summary?

Rezvaneh Rezapour, Sravana Reddy, Ann Clifton, Rosie Jones

June 2022 | ICWSM

The Contribution of Lyrics and Acoustics to Collaborative Understanding of Mood

Shahrzad Nazeri, Sravana Reddy, Joana Correia, Jussi Karlgren, Rosie Jones

May 2022 | ICASSP

Few-shot musical source separation

Yu Wang, Daniel Stoller, Rachel M. Bittner, Juan Pablo Bello

May 2022 | ICASSP

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

Rachel Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert

May 2022 | ICASSP

Improving Lyrics Alignment through Joint Pitch Detection

Jiawen Huang, Emmanouil Benetos, Sebastian Ewert

October 2021 | IEEE Signal Processing Magazine

Audio-Based Musical Version Identification: Elements and challenges

Furkan Yesiler, Guillaume Doras, Rachel M. Bittner, Christopher J. Tralie, Joan Serrà

August 2020 | ISMIR - International Society for Music Information Retrieval Conference

Data Cleansing with Contrastive Learning for Vocal Note Event Annotations

Gabriel Meseguer-Brocal, Rachel Bittner, Simon Durand, Brian Brost

July 2020 | IJCAI - International Joint Conference on Artificial Intelligence

Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling

Daniel Stoller, Mi Tian, Sebastian Ewert, and Simon Dixon

July 2020 | WCCI/IJCNN - IEEE World Congress on Computational Intelligence / International Joint Conference on Neural Networks

Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks

Ishwarya Ananthabhotla, Sebastian Ewert, Joseph A. Paradiso

November 2019 | ISMIR

mirdata: Software for Reproducible Usage of Datasets

Rachel M Bittner, Magdalena Fuentes, David Rubinstein, Andreas Jansson, Keunwoo Choi, Thor Kell

November 2019 | ISMIR

Generalized Metrics for Single-F0 Estimation Evaluation

Rachel M. Bittner, Juan J Bosch

November 2019 | ISMIR - International Society for Music Information Retrieval Conference

Generalized Metrics for Single-F0 Estimation Evaluation

Rachel M. Bittner, Juan J Bosch

October 2019 | ACM MM - ACM International Conference on Multimedia

Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

Ishwarya Ananthabhotla, Sebastian Ewert, Joseph A. Paradiso

September 2019 | EUSIPCO

Joint Singing Voice Separation and F0 Estimation with Deep U-Net Architectures

Andreas Jansson, Rachel M Bittner, Sebastian Ewert, Tillman Weyde

July 2019 | ICASSP

End-to-End Lyrics Alignment for Polyphonic Music Using An Audio-to-Character Recognition Model

Daniel Stoller, Simon Durand, Sebastian Ewert

May 2019 | ICASSP

Neural Music Synthesis for Flexible Timbre Control

Jong Wook Kim, Rachel Bittner, Aparna Kumar, Juan Pablo Bello

January 2019 | IEEE Signal Processing Magazine

An Introduction to Signal Processing for Singing-Voice Analysis: High Notes in the Effort to Automate the Understanding of Vocals in Music

Eric J. Humphrey, Sravana Reddy, Prem Seetharaman, Aparna Kumar, Rachel M. Bittner, Andrew Demetriou, Sankalp Gulati, Andreas Jansson, Tristan Jehan, Bernhard Lehner, Anna Krupse, Luwei Yang