
Conversation

@xijiu (Collaborator) commented Oct 9, 2025

Add a cache for the automatic offset commit operation. If the offsets to be committed are identical between two consecutive commits, the cache is hit and a success response is returned immediately, without sending an OffsetCommit request to the broker. Note: this only applies to automatic offset commits in subscribe mode.
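To illustrate the idea, here is a minimal sketch of such a cache (the class and method names are hypothetical, not the PR's actual code): remember the offsets from the last acknowledged commit and short-circuit an auto-commit whose offsets are identical.

```java
// Hypothetical sketch of the caching idea, not the PR's implementation:
// skip the OffsetCommit RPC when the offsets to auto-commit are identical
// to the ones the broker last acknowledged.
import java.util.HashMap;
import java.util.Map;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;

public class AutoCommitCache {
    // Offsets from the last auto-commit that the broker acknowledged.
    private Map<TopicPartition, OffsetAndMetadata> lastCommitted = new HashMap<>();

    /** Returns true if the RPC can be skipped because nothing changed. */
    public boolean canSkipCommit(Map<TopicPartition, OffsetAndMetadata> toCommit) {
        return toCommit.equals(lastCommitted);
    }

    /** Called only after the broker confirms the commit succeeded. */
    public void onCommitSuccess(Map<TopicPartition, OffsetAndMetadata> committed) {
        lastCommitted = new HashMap<>(committed);
    }

    /** Invalidate whenever the cached value may have gone stale. */
    public void reset() {
        lastCommitted = new HashMap<>();
    }
}
```

Any real implementation also has to invalidate the cache whenever the cached offsets can go stale, which is what the assign-mode discussion below is about.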

github-actions bot added the triage, PRs from the community, consumer, and clients labels on Oct 9, 2025
@chia7712 (Member) commented Oct 9, 2025

@xijiu could you share the benchmark of your scenario with us?

@xijiu (Collaborator, Author) commented Oct 9, 2025

> @xijiu could you share the benchmark of your scenario with us?

Sure.

I ran the test with auto.commit.interval.ms = 100 under two scenarios: cache enabled and cache disabled. The test ran continuously for one minute, after which I observed the LEO of __consumer_offsets and the number of OFFSET_COMMIT RPCs.
[Screenshots: LEO of __consumer_offsets and OFFSET_COMMIT RPC counts, with the cache enabled vs. disabled]

@xijiu (Collaborator, Author) commented Oct 9, 2025

I think the cache should only take effect in subscribe mode. In assign mode, the committed offset of a TopicPartition can be modified not only by the consumer itself but also by the Admin client. Consider this scenario: the consumer has cached offset 10, so all subsequent commits of offset 10 return successfully right away without being sent to the broker. If the Admin client is then used to set the offset to 11, the consumer will not be aware of the change. As a result, the consumer keeps answering commits from a stale cache, which is inconsistent with expectations. The sketch below makes this concrete.
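To make the hazard concrete, here is a sketch using the Admin client (the bootstrap server, group, and topic names are placeholders): an external process moves the group's committed offset, and a consumer-side cache that still holds the old value would never observe the change.

```java
// Sketch of the assign-mode hazard described above (hypothetical names).
// A consumer in assign mode has cached committed offset 10 for tp and
// keeps short-circuiting commits of 10; meanwhile an external Admin
// client moves the group's offset behind the consumer's back.
import java.util.Map;
import java.util.Properties;
import org.apache.kafka.clients.admin.Admin;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;

public class ExternalOffsetChange {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        TopicPartition tp = new TopicPartition("topic12", 0);
        try (Admin admin = Admin.create(props)) {
            // Sets the group's committed offset to 11; a consumer-side
            // cache still holding 10 is now silently stale.
            admin.alterConsumerGroupOffsets("groupXX",
                    Map.of(tp, new OffsetAndMetadata(11L))).all().get();
        }
    }
}
```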

Additionally, although we could also check the cache on manual commits, a manual commit is a deliberate user action, so it is better to send the request to the broker. Alternatively, we could consider adding cache support for manual commits later, once this PR stabilizes.

@TaiJuWu (Collaborator) commented Oct 9, 2025

Hi @xijiu, is there a comparison for the normal case (where the consumer polls and receives data from the broker)?
If there is no obvious performance degradation, I think this is a great improvement.

@xijiu (Collaborator, Author) commented Oct 9, 2025

> Hi @xijiu, is there a comparison for the normal case (where the consumer polls and receives data from the broker)? If there is no obvious performance degradation, I think this is a great improvement.

Thanks for the reply; that's a great suggestion. I will put together a comparison chart of the benchmark results and share it later, though I don't expect any impact on performance.

@xijiu (Collaborator, Author) commented Oct 9, 2025

@TaiJuWu I ran a simple benchmark. First, I launched a cluster of 3 brokers, then created a 12-partition topic named topic12 with the following command:

sh kafka-topics.sh --bootstrap-server 10.255.225.107:9092 --create --topic topic12 --partitions 12 --replication-factor 2

Next, I sent a sufficient amount of data to topic12: each message was 1 MB, roughly 100 GB in total. I then ran consumer stress tests against the trunk branch and the 19735 branch, using the command:

sh kafka-consumer-perf-test.sh --bootstrap-server 10.255.225.107:9092 --topic topic12 --command-property auto.offset.reset=earliest --num-records 1000000 --group groupXX

The aggregated consumer throughput results are as follows:

[Screenshot: aggregated consumer throughput, trunk vs. the 19735 branch]

The performance of the two is nearly identical.

@TaiJuWu (Collaborator) commented Oct 9, 2025

> @TaiJuWu I ran a simple benchmark. First, I launched a cluster of 3 brokers, then created a 12-partition topic named topic12 with the following command:
>
> sh kafka-topics.sh --bootstrap-server 10.255.225.107:9092 --create --topic topic12 --partitions 12 --replication-factor 2
>
> Next, I sent a sufficient amount of data to topic12: each message was 1 MB, roughly 100 GB in total. I then ran consumer stress tests against the trunk branch and the 19735 branch, using the command:
>
> sh kafka-consumer-perf-test.sh --bootstrap-server 10.255.225.107:9092 --topic topic12 --command-property auto.offset.reset=earliest --num-records 1000000 --group groupXX
>
> The aggregated consumer throughput results are as follows:
>
> [Screenshot: aggregated consumer throughput, trunk vs. the 19735 branch]
>
> The performance of the two is nearly identical.

The results LGTM. Thanks for sharing and for your hard work.

github-actions bot commented

A label of 'needs-attention' was automatically added to this PR in order to raise the attention of the committers. Once this issue has been triaged, the triage label should be removed to prevent this automation from happening again.

