sticky

Scaling Transformer-based Text-to-Speech with Knowledge Distillation

Transformer-based models have led to dramatic improvements in text-to-speech (TTS) quality...
sticky
Aug 7, 2025

ForTune: Running Offline Scenarios to Estimate Impact on Business Metrics

For product leaders at Spotify and other web-facing companies, making informed decisions about...
Evaluation
sticky
Aug 5, 2025

Modality-aware Multi-task Learning to Optimize Ad Targeting at Scale

Much of our on-platform listening happens while users are occupied with something else...
Search & RecommendationsArtificial IntelligenceUser Modeling
sticky
Jul 25, 2025

Optimizing Query Expansions via LLM Preference Alignment

One of the longstanding challenges in information retrieval is the vocabulary mismatch problem...
Search & Recommendations

Research Areas

How do we create more personalized experiences? What can we learn about listeners on how they use written language? How do we optimize testing methodologies? Explore all our research areas below.

Algorithmic ResponsibilityAlgorithmic Responsibility
Algorithmic Responsibility
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
Audio and Visual IntelligenceAudio and Visual Intelligence
Audio and Visual Intelligence
Causal InferenceCausal Inference
Causal Inference
EconomicsEconomics
Economics
EvaluationEvaluation
Evaluation
Search & RecommendationsSearch & Recommendations
Search & Recommendations
Speech and NLPSpeech and NLP
Speech and NLP
User ModelingUser Modeling
User Modeling

We are looking for pioneers to join us in all research areas

We're expanding knowledge of audio and video technology every day, sharing open source frameworks, tools, libraries, and models for everything from research exploration to large-scale production deployment.

Join Us