KAFKA-9693: Kafka latency spikes caused by log segment flush on roll #13782
novosibman wants to merge 11 commits into apache:trunk
Conversation
…y log segment flush on roll - trunk version
Hangleton
left a comment
Many thanks for the patch and the collected data! Really interesting to see the impact of this change. A few questions:
- What storage device and file system are used in the test?
- Would you have a real-life workload where the impact of this change can be quantified? The workload generated by producer-perf-test.sh exhibits the problem the most because the segments of all replicas on the brokers start rolling at the same time, which is why it is also interesting to assess the impact using topic-partitions which have different ingress rates and/or use segments of different sizes.
The AWS config used was i3en.2xlarge with 2 x 2500 NVMe SSDs. The FS format had a huge impact on results. Initially we used ext4 in our lab for regular testing:
We have no real-life workload scenarios available for Kafka perf testing. The alternative workload https://github.com/AzulSystems/kafka-benchmark has slightly different rolling behavior compared to OMB. OMB results example on the released kafka_2.13-3.4.0 version (using xfs), with the same params used: acks=1 batchSize=1048510 consumers=4 lingerMs=1 mlen=1024 partitions=100 producers=4 rf=1 targetRate=200k time=30m topics=1
  // we manually override the state offset here prior to taking the snapshot.
  producerStateManager.updateMapEndOffset(newSegment.baseOffset)
- producerStateManager.takeSnapshot()
+ producerStateManager.takeSnapshot(scheduler)
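The idea behind passing the scheduler into takeSnapshot can be sketched as follows. This is a hypothetical illustration, not Kafka's actual classes: the request-handler thread writes the snapshot bytes (fast, page cache only) and hands the still-open channel to a background executor, which performs the blocking force() and close() off the hot path.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch of offloading a snapshot flush to a background thread.
public class AsyncSnapshotFlush {
    private final ExecutorService scheduler = Executors.newSingleThreadExecutor();

    public void takeSnapshot(Path snapshotFile, byte[] snapshotBytes) throws IOException {
        FileChannel channel = FileChannel.open(snapshotFile,
                StandardOpenOption.CREATE, StandardOpenOption.WRITE);
        channel.write(ByteBuffer.wrap(snapshotBytes)); // fast: hits the page cache only
        scheduler.submit(() -> {
            try (FileChannel ch = channel) {
                ch.force(true); // blocking fsync now runs on the background thread
            } catch (IOException e) {
                e.printStackTrace(); // a real broker would trigger log-dir failure handling
            }
        });
    }

    public void close() throws InterruptedException {
        scheduler.shutdown();
        scheduler.awaitTermination(10, TimeUnit.SECONDS);
    }
}
```

Note that, as the PR description says, the channel must stay open when handed to the other thread, so no try-with-resources on the writing side.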
I am wondering if we need to increase the default number of background threads since we are adding more responsibility to them. Thoughts?
Thanks for the reply. In the tests we conducted in KAFKA-9693, an NVMe SSD (one log directory) and ext4 were used along with jbd2, which likely penalized performance. Are all the graphs shared for OMB and Kafka Tussle generated for Kafka with the fix in this PR?
Graphs with the fix are noted in the first description comment. Other graphs in the latter comment are examples of how rolling affects results on different configurations and benchmarks using a regular Kafka release.
… updated change according to feedback
Provided updated change:
divijvaidya
left a comment
Thank you for making the changes. The changes look good and I left some minor comments.
Since we are adding new responsibility to the scheduler threads, I think we should probably advise users to reconfigure the number of background threads [2]. Could you please add a note to the upgrade section [1] about considering an increase in background threads?
[1]
Line 24 in 6678f1b
[2] https://kafka.apache.org/documentation.html#brokerconfigs_background.threads
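A minimal server.properties fragment illustrating the suggestion (the value 12 is purely illustrative; the shipped default for background.threads is 10):

```properties
# server.properties -- illustrative only
# background.threads defaults to 10; with snapshot flushes offloaded to the
# scheduler, a modest increase may be worth evaluating under load.
background.threads=12
```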
showuon
left a comment
Thanks for the PR. LGTM, left a minor comment.
Also, there are compile errors; please fix them. Thanks.
… style check error corrected, open/close operations reduced for scheduler == null case
Open/close changes provided.
Hey @novosibman, could you please respond to the rest of the comments at #13782 (review) and #13782 (comment)?
showuon
left a comment
LGTM! Thanks for the patch!
Some minor comments:
- The PR title should start with KAFKA-9693, e.g. KAFKA-9693: Kafka latency spikes caused by log segment flush on roll
- Notable change for v3.6: please remember to update it. If you have any problem please let us know. Of course it can be a follow-up PR if you want. You can refer to this PR change.
- @divijvaidya, do you think we should backport this patch to the 3.5 branch? This has existed for a long time and should not be a regression bug. Maybe no? Thoughts?
@showuon - with this change we don't have consistent data in the different flushed files on disk (since earlier they were flushed together but now it's done async). I want to ensure that this inconsistency is OK and that recovery will not get hampered by it. Please wait for my review before merging this.
This PR is being marked as stale since it has not had any activity in 90 days. If you would like to keep this PR alive, please ask a committer for review. If the PR has merge conflicts, please update it with the latest from trunk (or the appropriate release branch). If this PR is no longer valid or desired, please feel free to close it. If no activity occurs in the next 30 days, it will be automatically closed.
@novosibman @ocadaruma do we still need this change after https://github.com/apache/kafka/pull/14242/files? Asking because with the latter PR merged in, we are not blocking the request handler thread while flushing the producer snapshot. This is the same as what this PR is trying to achieve. Hence, I think this could be closed.
@divijvaidya Yeah, my understanding is the same.
Thanks for checking @ocadaruma. I am going to close this PR; please feel free to re-open if you think this is still not fixed.




Trunk version of initial change: #13768 in branch "3.4"
Key difference from the branched change:
Passed and used the existing scheduler, which is already being used for flushing large segment logs and indices. In all cases the snapshot's fileChannel is kept open when passed to other threads for flushing and closing (so the try-with-resources is removed in this change).
Related issue https://issues.apache.org/jira/browse/KAFKA-9693
The issue with repeating latency spikes during Kafka log segment rolling is still reproduced on the latest versions, including kafka_2.13-3.4.0.
It was found that flushing the Kafka snapshot file during segment rolling blocks the producer request handling thread for some time. The latency improvement was reproduced in kafka_2.13-3.6.0-snapshot by offloading the flush operation. The single-node test configuration available on my side was used:
kafka_2.13-3.6.0-snapshot - trunk version
kafka_2.13-3.6.0-snapshot-fix - trunk version with provided change
Test time increased to 1hr.
partitions=10 # rolling every ~52 seconds

partitions=100 # rolling events about every 8.5 minutes:
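The roll cadence follows from the workload parameters. A back-of-envelope check, assuming the default 1 GiB log.segment.bytes and the targetRate=200k, mlen=1024 figures from the test above (the assumed segment size is not stated in the thread):

```java
// Back-of-envelope check of how often each partition's active segment rolls.
// SEGMENT_BYTES is an assumption (Kafka's default log.segment.bytes, 1 GiB);
// TARGET_RATE and MSG_LEN come from the benchmark parameters above.
public class RollInterval {
    static final long SEGMENT_BYTES = 1L << 30; // 1 GiB, assumed default
    static final long TARGET_RATE = 200_000;    // messages/s across the topic
    static final long MSG_LEN = 1024;           // bytes per message

    static double rollIntervalSeconds(int partitions) {
        double perPartitionBytesPerSec = (double) TARGET_RATE * MSG_LEN / partitions;
        return SEGMENT_BYTES / perPartitionBytesPerSec;
    }

    public static void main(String[] args) {
        System.out.printf("10 partitions:  %.0f s%n", rollIntervalSeconds(10));  // ~52 s
        System.out.printf("100 partitions: %.0f s%n", rollIntervalSeconds(100)); // ~524 s (~8.7 min)
    }
}
```

Under these assumptions the arithmetic matches the observed cadence: ~52 s at 10 partitions and roughly 8.5-8.7 minutes at 100 partitions.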
