Spotify
  • Blog
  • Publications
  • Datasets
  • Research Areas
    • Algorithmic Responsibility
    • Audio Intelligence
    • Evaluation
    • Human-Computer Interaction
    • Language Technologies
    • Machine Learning
    • Music Creation
    • Search & Recommendations
    • User Modeling
  • Podcasts
  • Jobs

Datasets

Dive into datasets for everything from podcasts to music recommendation

The Million Playlist Dataset:

Learning from Music Playlists

Oct 05, 2020

Dataset for music recommendation and automatic music playlist continuation. Contains 1,000,000 playlists, including playlist- and track-level metadata.

Spotify Podcasts Dataset:

100,000 episodes with text and audio

Apr 15, 2020

Dataset for podcast research. Contains 100,000 episodes from thousands of different shows on Spotify, including audio files and speech transcriptions.

WSDM Cup:

The Music Streaming Sessions Dataset

Nov 15, 2018

Dataset for researching how to model user listening and interaction behavior in music streaming. Also includes data for music information retrieval and session-based sequential recommendations.

OpenMic:

Audio and Crowd-Sourced Instrument Labels

Sep 23, 2018

Dataset for researching multi-instrument recognition in polyphonic recordings, a fundamental problem in music information retrieval.

Spotify

Sign up for research updates

By clicking sign up you’ll receive occasional emails from Spotify. You always have the choice to adjust your interest settings or unsubscribe.

Spotify
  • Newsroom
  • Spotify Jobs
  • Spotify.com
  • Spotify R&D Engineering
  • Spotify R&D Design
  • Legal
  • Privacy
  • Cookies
  • About Ads
© 2023 Spotify AB