Secara internal, Kafka menyimpan data sebagai append-only log di disk (diorganisir dalam segments), menggunakan teknik I/O yang efisien, dan mengelola metadata cluster melalui ZooKeeper (secara historis) atau KRaft (saat ini). Memahami internal Kafka memperdalam pemahaman tentang perilaku dan performa Kafka.
The commit log storage
Each partition is an append-only LOG stored on disk, split into SEGMENTS (files):
→ new events are APPENDED to the end (sequential writes → fast)
→ events are immutable once written; identified by OFFSET
→ old segments are deleted (retention) or compacted
→ an INDEX maps offsets to file positions (fast lookups)
→ the append-only log is the core of Kafka's design (durable, sequential, efficient)
