Company
Date Published
Nov. 25, 2024
Author
Confluent Staff
Word count
1573
Language
English
Hacker News points
None

Summary

In this final installment of the Kafka Producer and Consumer Internals Series, the focus shifts to consumer fetch requests, mirroring the producer request process up until I/O threads access data on disk. The storage fundamentals of Kafka are refreshed, including topics as commit logs divided into segments with .log and .index files. When an I/O thread parses a fetch request, it checks for enough data to meet fetch.min.bytes requirements, calculates the data's location, and proceeds with fetching it. If there's not enough data, the unfulfilled request is queued in "Purgatory" until either fetch.min.bytes or fetch.max.wait.ms is reached. The response is then sent back to the consumer, which caches records and continues polling from its cache until empty. To manage this, consumers can limit records per poll with max.poll.records and set a time limit for processing with max.poll.interval.ms.