在内部,Kafka 将数据作为仅追加日志存储在磁盘上(按分段组织),使用高效的 I/O 技术,并通过 ZooKeeper(历史上)或 KRaft(现在)管理集群元数据。理解内部原理可以深化对 Kafka 行为和性能的理解。
提交日志存储
Each partition is an append-only LOG stored on disk, split into SEGMENTS (files):
→ new events are APPENDED to the end (sequential writes → fast)
→ events are immutable once written; identified by OFFSET
→ old segments are deleted (retention) or compacted
→ an INDEX maps offsets to file positions (fast lookups)
→ the append-only log is the core of Kafka's design (durable, sequential, efficient)
