Scalable Personalized Recommendation System
Design a large-scale recommendation system for a music streaming platform to generate weekly personalized discovery playlists for 500 million users from a corpus of 100 million tracks. Your design should cover the end-to-end ML lifecycle: from multi-stage candidate retrieval and ranking to batch inference pipelines. Address specific challenges including cold-start for new tracks, popularity bias, and the use of audio-based vs. collaborative embeddings. Detail your approach to high-throughput offline serving, data consistency across weekly updates, and evaluation metrics that balance user satisfaction with discovery novelty.
Two-Tower DNNLightGBMWord2VecFAISSCNNApproximate Nearest NeighborSparkCassandraNegative SamplingMMR
00