The Question
ML DesignLarge-Scale Recommendation Ranking with Long User Sequences
Design a final-stage ranking system for a content discovery platform (like Pinterest) that effectively incorporates long-term user behavior history (1,000+ actions). Specifically, address the computational challenges of using attention mechanisms on long sequences within a low-latency production environment, discussing strategies for history retrieval, sequence compression, and online-offline architectural trade-offs.
SIM
DIN
Transformer
PinSage
Kafka
Flink
Spark
Redis
TensorRT
AUC
NDCG