The Question
ML DesignLarge-Scale Real-time Recommendation System
Design a high-scale, real-time recommendation system for a content platform with 100M+ items. Detail the multi-stage funnel architecture, handle real-time feature engineering, address position bias, and ensure the system meets sub-200ms P99 latency constraints for 100k+ QPS.
Two-Tower
DeepFM
HNSW
ANN
Kafka
Spark
Flink
Milvus
Triton
Feast
PyTorch
AUC
NDCG
March 20, 2026