How would you implement Google Search?
- What overall architecture would you use?
- Start with a search engine that handles ~100k documents
- then expand via sharding to around 50M docs.
- What data structures would you use?
- What services/machines is your archicture composed of?
- What would a typical query latency be?
- What about failover/split brain issues?