-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Add concurrent batching to the file sink #20394
Copy link
Copy link
Open
Labels
domain: performanceAnything related to Vector's performanceAnything related to Vector's performancesink: fileAnything `file` sink relatedAnything `file` sink relatedtype: enhancementA value-adding code change that enhances its existing functionality.A value-adding code change that enhances its existing functionality.
Metadata
Metadata
Assignees
Labels
domain: performanceAnything related to Vector's performanceAnything related to Vector's performancesink: fileAnything `file` sink relatedAnything `file` sink relatedtype: enhancementA value-adding code change that enhances its existing functionality.A value-adding code change that enhances its existing functionality.
Type
Fields
Give feedbackNo fields configured for issues without a type.
A note for the community
Problem
I can see 100% utilization on file sink which then applies backpressure and slows-down whole pipeline. I am using tmpfs so disk is not a bottleneck but high cardinal partitioning could be. It seems that file sink is not batching concurrently and therefore applying backpressure quickly (especially with gzip compression).
Configuration
Version
0.37.0
Debug Output
No response
Example Data
No response
Additional Context
No response
References
No response