Audio Intelligence

23 publications
July 2020 | IJCAI - International Joint Conference on Artificial Intelligence

Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling

Daniel Stoller, Mi Tian, Sebastian Ewert, and Simon Dixon

July 2020 | WCCI/IJCNN - IEEE World Congress on Computational Intelligence / International Joint Conference on Neural Networks

Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks

Ishwarya Ananthabhotla, Sebastian Ewert, Joseph A. Paradiso

November 2019 | ISMIR

mirdata: Software for Reproducible Usage of Datasets

Rachel M Bittner, Magdalena Fuentes, David Rubinstein, Andreas Jansson, Keunwoo Choi, Thor Kell

November 2019 | ISMIR

Generalized Metrics for Single-F0 Estimation Evaluation

Rachel M. Bittner, Juan J Bosch

October 2019 | ACM MM - ACM International Conference on Multimedia

Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

Ishwarya Ananthabhotla, Sebastian Ewert, Joseph A. Paradiso

September 2019 | EUSIPCO

Joint Singing Voice Separation and F0 Estimation with Deep U-Net Architectures

Andreas Jansson, Rachel M Bittner, Sebastian Ewert, Tillman Weyde

July 2019 | ICASSP

End-to-End Lyrics Alignment for Polyphonic Music Using An Audio-to-Character Recognition Model

Daniel Stoller, Simon Durand, Sebastian Ewert

May 2019 | ICASSP

Neural Music Synthesis for Flexible Timbre Control

Jong Wook Kim, Rachel Bittner, Aparna Kumar, Juan Pablo Bello

January 2019 | IEEE Signal Processing Magazine

Signal processing for singing voice analysis: Significance, Applications and Methods

Eric J. Humphrey, Sravana Reddy, Prem Seetharaman, Aparna Kumar, Rachel M. Bittner, Andrew Demetriou, Sankalp Gulati, Andreas Jansson, Tristan Jehan, Bernhard Lehner, Anna Krupse, Luwei Yang et al

January 2019 | IEEE Signal Processing Magazine

Open-Source Practices for Music Signal Processing Research: Recommendations for Transparent, Sustainable, and Reproducible Audio Research

Brian McFee, Jong Wook Kim, Mark Cartwright, Justin Salamon, Rachel M Bittner, Juan Pablo Bello

January 2019 | IEEE Signal Processing Magazine

Automatic Music Transcription – An Overview

Emmanouil Benetos, Simon Dixon, Zhiyao Duan, Sebastian Ewert

January 2019 | IEEE Signal Processing Magazine

Deep Learning for Audio-Based Music Classification and Tagging

Juhan Nam, Keunwoo Choi, Jongpil Lee, Szu-Yu Chou, Yu-Hsuan Yang

September 2018 | ISMIR

Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation

Daniel Stoller, Sebastian Ewert, Simon Dixon.

September 2018 | ISMIR

Revisiting Singing Voice Detection: A Quantitative Review and the Future Outlook

Kyungyun Lee, Keunwoo Choi, Juhan Nam

September 2018 | ISMIR

OpenMIC-2018: an Open Dataset for Multiple Instrument Recognition

Eric J. Humphrey, Simon Durand, Brian McFee

July 2018 | ICASSP

Shift-Invariant Kernel Additive Modelling for Audio Source Separation

Delia Fano Yela, Sebastian Ewert, Ken O'Hanlon, Mark B. Sandler

July 2018 | ICASSP

Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction

Daniel Stoller, Sebastian Ewert, Simon Dixon

July 2018 | LVA/ICA

Jointly Detecting and Separating Singing Voice: A Multi-Task Approach

Daniel Stoller, Sebastian Ewert, Simon Dixon

October 2017 | ISMIR

Mining Labeled Data from Web-Scale Collections for Vocal Activity Detection in Music

Eric J. Humphrey, Nicola Montecchio, Rachel Bittner, Andreas Jansson, Tristan Jehan