Catalogue
System/Solution/Product Design
- Design a Twitter?
- Design a Uber?
- Design a short URL service.
Product -> functionalities/use cases -> Architecture
Topic
Storage
- Distributed file system
- Distributed database
Computation
- Batch Processing
- Streaming Processing
Web Application
Data Center
- Use file system interfaces to manage your data (files and directions)
- Data is distributed in many machies
- Examples: GFS(Google File System), HDFS, Ceph FS, GlusterFS, MapR FS…
- When to use a DFS?
- Durability
Hadoop Distributed File system
- Key features/assumptions
- Scale up to 100+ PB of storage and a single cluster of several thousand servers,
supporting close to a billion files and blocks
- Designed to run on commodity hardware
- some components of HDFS is always non-functional
Architecture