[core] Support delete stats in result of scan plan. #4506
Merged
JingsongLi merged 1 commit on Nov 18, 2024
@JingsongLi Hi, please CC. Thanks.
JingsongLi requested changes on Nov 18, 2024
            valueStatsCols);
    }

    public DataFileMeta withoutStats() {
                rowCount,
                minKey,
                maxKey,
                EMPTY_STATS,
Would it be better to keep the key stats?
                deleteRowCount,
                embeddedIndex,
                fileSource,
                valueStatsCols);
                file.embeddedIndex());
    }

    public ManifestEntry withoutStats() {

    }

    @Override
    public FileStoreScan withoutStatsInPlan() {

    FileStoreScan withMetrics(ScanMetrics metrics);

    FileStoreScan withoutStatsInPlan();

    }

    @Override
    public AbstractDataTableScan withoutStatsInPlan() {

        return this;
    }

    default InnerTableScan withoutStatsInPlan() {

    ReadBuilder withShard(int indexOfThisSubtask, int numberOfParallelSubtasks);

    /** Delete stats in scan plan result. */
    ReadBuilder withoutStatsInPlan();
@JingsongLi
Purpose
In my company's production environment, we use a Flink session cluster for OLAP scans on Paimon, and we found that the JobManager's memory usage is consistently high.
We plan to optimize this in two ways:
(1) Delete stats in DataSplit.
(2) During data skipping, cut unused stats in ManifestEntry.
This PR implements (1).
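To illustrate approach (1), here is a minimal, self-contained sketch of the idea. It is a toy model, not Paimon's actual code: `Stats`, `FileMeta`, and `plan` are hypothetical stand-ins for the real statistics, file metadata, and scan planning classes. The sketch assumes that only value stats are dropped and key stats are kept, in line with the review question in this conversation; the merged code may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Toy model of stripping stats from a scan plan result. Not Paimon's real classes. */
public class WithoutStatsSketch {

    /** Hypothetical stand-in for per-column statistics held by each file. */
    static final class Stats {
        /** Shared empty instance, so stripped files hold no per-file stats arrays. */
        static final Stats EMPTY = new Stats(new long[0]);

        final long[] values;

        Stats(long[] values) {
            this.values = values;
        }

        int size() {
            return values.length;
        }
    }

    /** Hypothetical stand-in for the file metadata carried inside a split. */
    static final class FileMeta {
        final String fileName;
        final long rowCount;
        final Stats keyStats;
        final Stats valueStats;

        FileMeta(String fileName, long rowCount, Stats keyStats, Stats valueStats) {
            this.fileName = fileName;
            this.rowCount = rowCount;
            this.keyStats = keyStats;
            this.valueStats = valueStats;
        }

        /** Returns a copy that keeps key stats (assumption) and drops value stats. */
        FileMeta withoutStats() {
            return new FileMeta(fileName, rowCount, keyStats, Stats.EMPTY);
        }
    }

    /** Builds the plan result, optionally stripping stats from every file. */
    static List<FileMeta> plan(List<FileMeta> files, boolean dropStats) {
        List<FileMeta> result = new ArrayList<>(files.size());
        for (FileMeta file : files) {
            result.add(dropStats ? file.withoutStats() : file);
        }
        return result;
    }

    public static void main(String[] args) {
        FileMeta file =
                new FileMeta(
                        "data-0.orc",
                        100L,
                        new Stats(new long[] {1, 9}),
                        new Stats(new long[] {0, 42, 7}));
        List<FileMeta> planned = plan(Arrays.asList(file), true);
        System.out.println("value stats entries: " + planned.get(0).valueStats.size());
    }
}
```

Because every stripped copy points at the same empty `Stats` instance, the memory held by the plan result no longer grows with the number of columns per file, which is the effect this change aims for on the JobManager.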
Linked issue: close #xxx
Tests
API and Format
Documentation