Large-Scale Visual Similarity & Search System
Design a high-scale visual similarity system for a global e-commerce platform with a 100M+ item catalog. The system must support 'search-by-image' and 'similar product' recommendations with a P99 latency under 200ms. Your design should cover end-to-end data ingestion, embedding generation using deep vision models, efficient approximate nearest neighbor (ANN) retrieval at scale, and strategies for handling high QPS. Emphasize how you handle data freshness, model evaluation (online vs. offline), and the infrastructure required to serve and monitor these models in a production environment with constant catalog updates.
Vision Transformers, Siamese Networks, HNSW, Triplet Loss, TensorRT, FAISS, PyTorch, Spark, Milvus, Contrastive Learning
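One possible starting point for the offline side of the model evaluation mentioned in the prompt is to compare an ANN index against exact search on a sample of queries and report recall@k. The sketch below is only an illustration under stated assumptions: it uses NumPy, assumes cosine similarity over L2-normalized embeddings, and the helper names `l2_normalize`, `exact_top_k`, and `recall_at_k` are hypothetical rather than part of FAISS, Milvus, or any other library listed above.

```python
import numpy as np


def l2_normalize(x):
    """Scale each row to unit length so a dot product equals cosine similarity."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.clip(norms, 1e-12, None)


def exact_top_k(queries, catalog, k):
    """Brute-force cosine top-k (hypothetical helper, ground truth for ANN checks).

    Returns an (n_queries, k) array of catalog row indices, best match first.
    """
    scores = l2_normalize(queries) @ l2_normalize(catalog).T
    # Select the k best candidates per query without sorting the whole row.
    part = np.argpartition(-scores, k - 1, axis=1)[:, :k]
    # Sort only those k candidates by descending score.
    order = np.argsort(-np.take_along_axis(scores, part, axis=1), axis=1)
    return np.take_along_axis(part, order, axis=1)


def recall_at_k(approx_ids, exact_ids):
    """Fraction of exact top-k neighbors that the approximate result also returned."""
    hits = sum(len(set(a) & set(e)) for a, e in zip(approx_ids, exact_ids))
    return hits / exact_ids.size
```

In practice `approx_ids` would come from whichever ANN index is being evaluated (for example an HNSW index), while `exact_top_k` is run on a small sampled subset, since brute force over a 100M+ item catalog is too expensive to run routinely.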