Partitioner er centrale for Kafkas design — de muliggør parallelisme og skalerbarhed (distribuering af data og forbrug) samtidig med at de giver sorteringsgarantier inden for hver partition. Forståelse af partitioner er nøglen til at forstå, hvordan Kafka skaleres og ordner events.
Partitioner muliggør parallelisme og skala
A topic is split into multiple PARTITIONS, distributed across brokers:
→ data is spread across partitions → distribute storage and load
→ CONSUMER PARALLELISM → each partition consumed by one consumer in a group →
more partitions = more parallel consumers = higher throughput
→ scale a topic by adding partitions (and consumers)
→ partitions are the unit of PARALLELISM and horizontal scaling in Kafka
