Apache Kafkaは分散イベントストリーミングプラットフォームです — 高スループット、耐久性のあるシステムで、イベント(レコード)のストリームの発行、保存、処理に使用されます。メッセージング、リアルタイムデータパイプライン、イベント駆動アーキテクチャ、および大規模ストリーム処理に使用されます。
Kafkaとは
Kafka = a distributed, durable, high-throughput EVENT STREAMING platform:
→ PUBLISH events (producers write) and SUBSCRIBE to them (consumers read)
→ STORE streams of events durably (a distributed, replicated commit LOG)
→ PROCESS streams in real time
→ think of it as a durable, scalable, append-only LOG of events that many systems can
write to and read from
Kafkaの用途
✓ MESSAGING / event streaming → decoupled communication between systems (pub/sub at scale)
✓ DATA PIPELINES → move/stream data between systems reliably (ingestion, ETL)
✓ EVENT-DRIVEN architecture → services react to events; event sourcing
✓ REAL-TIME processing → analytics, monitoring, stream processing (Kafka Streams)
✓ LOG aggregation, metrics, activity tracking, change data capture (CDC)
→ used by huge companies for high-volume, real-time data
