Author Archives for Spotify Engineering

LLark: A Multimodal Foundation Model for Music

LLark: A Multimodal Foundation Model for Music

October 13, 2023 10:49 am Published by Comments Off on LLark: A Multimodal Foundation Model for Music

Every day, music is enjoyed, created, and discovered by billions of people around the globe – and yet, existing AI systems largely struggle to model the nuances that make music different from other forms of audio.


Improving Retrievability in Search with Query Generation

Improving Retrievability in Search with Query Generation

May 4, 2023 9:53 am Published by Comments Off on Improving Retrievability in Search with Query Generation

Allowing users to discover new entities such as books, music, and movies is an important goal for online platforms. This can be achieved for example by recommending entities that the user has not yet interacted with. Another way users can find new entities is by exploring the catalog with the search system.


Exploring Goal-oriented Podcast Recommendations

Exploring Goal-oriented Podcast Recommendations

March 23, 2023 12:51 pm Published by Comments Off on Exploring Goal-oriented Podcast Recommendations

Recommender systems typically look to users' past consumption to predict what they may want next. In practice, this approach tends to work best when what the user wants is similar to what they have consumed recently, and when it is relatively easy for that person to evaluate new items.


Survival Analysis Meets Reinforcement Learning

Survival Analysis Meets Reinforcement Learning

November 25, 2022 10:08 am Published by Comments Off on Survival Analysis Meets Reinforcement Learning

TL;DR: Survival analysis provides a framework to reason about time-to-event data; at Spotify, for example, we use it to understand and predict the way users might engage with Spotify in the future. In this work, we bring temporal-difference learning, a central idea in reinforcement learning, to survival analysis. We develop a new algorithm that trains a survival model from sequential data by leveraging a temporal consistency condition, and show that it outperforms direct regression on observed outcomes......


Spotify’s Contributions to NeurIPS 2022

Spotify’s Contributions to NeurIPS 2022

November 24, 2022 11:08 am Published by Comments Off on Spotify’s Contributions to NeurIPS 2022

Cutting-edge research in Machine Learning, Language Technologies, User Modeling, Audio Intelligence, Search and Recommender Systems are some of the key areas we feel incredibly enthusiastic about at Spotify...


Can we correctly attribute changes among many possible causes when unobserved confounders are present?

Can we correctly attribute changes among many possible causes when unobserved confounders are present?

November 24, 2022 11:06 am Published by Comments Off on Can we correctly attribute changes among many possible causes when unobserved confounders are present?

There are many A/B tests we might like to run, but which are too technically challenging, risky in terms of user impact or even impossible to perform. For instance, in the classic example of whether smoking causes lung cancer, forcing a randomly selected group of people to smoke is unethical if we believe it might damage their health. In the context of technology companies, if we want to understand if app crashes cause users to churn, we would have to randomly select a subgroup of users and crash their apps on purpose – not something we would want to consider given we do not want to break their trust....


Scalable Dynamic Topic Modeling

Scalable Dynamic Topic Modeling

November 15, 2022 12:51 pm Published by Comments Off on Scalable Dynamic Topic Modeling

Dynamic topic modeling is a well established tool for capturing the temporal dynamics of the topics of a corpus....