Add support for providing output-spec-id during rewrite - spark 3.5#9803

Merged
RussellSpitzer merged 11 commits into
apache:mainfrom
himadripal:main
Mar 7, 2024
@himadripal
Contributor

#7557 Support Rewrite Datafiles into a custom Partition Spec

Comment thread api/src/main/java/org/apache/iceberg/actions/RewriteDataFiles.java Outdated
Comment thread api/src/main/java/org/apache/iceberg/actions/RewriteDataFiles.java Outdated
Comment thread core/src/main/java/org/apache/iceberg/actions/SizeBasedFileRewriter.java Outdated
Comment thread core/src/main/java/org/apache/iceberg/actions/SizeBasedFileRewriter.java Outdated
Member

@RussellSpitzer RussellSpitzer left a comment

Generally looking very good. I have a few nits and comments but I think this is close to merge.

@himadripal
Contributor Author

Thanks for reviewing @RussellSpitzer, fixed all the review comments.

Comment thread api/src/main/java/org/apache/iceberg/actions/RewriteDataFiles.java Outdated
Member

@RussellSpitzer RussellSpitzer left a comment

This all seems pretty good to me. I have a bunch of nits on the comments (mostly to remove them), but once those are cleaned up I think we are good to merge.

@RussellSpitzer RussellSpitzer merged commit 99b1d0e into apache:main Mar 7, 2024
@RussellSpitzer
Member

Thanks @himadripal for this great improvement. Our general policy is that once a change is in for one Spark version, you open another PR applying the same change set to all of the other currently supported branches. The review on that PR will be much faster. Thanks again 🙇

RussellSpitzer pushed a commit that referenced this pull request Mar 9, 2024
…Backport #9803)

Co-authored-by: hpal <hpal@apple.com>
zachdisc pushed a commit to zachdisc/iceberg that referenced this pull request Dec 23, 2024
anuragmantri added a commit to anuragmantri/iceberg that referenced this pull request Aug 5, 2025
@mukund-thakur
Contributor

Can someone please give a real-world use case for this feature? I am trying to understand it better. Thanks so much.

@mukund-thakur
Contributor

I can think of one use case: the table was initially partitioned by day and later moved to hour, but for some historical files we want to rewrite using the old partition spec, which is day.
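
To make that scenario concrete, here is a minimal toy model in plain Java. It is not Iceberg code: the class, the method names, and the spec IDs (0 for day, 1 for hour) are all hypothetical and chosen only for illustration. It shows how the same rows end up in different output groups depending on which spec ID the rewrite is asked to use. In Iceberg itself the choice would be passed as the `output-spec-id` rewrite option named in this PR's title.

```java
import java.time.LocalDateTime;
import java.time.temporal.ChronoUnit;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model only: these names are hypothetical and are not Iceberg classes.
final class OutputSpecSketch {
    static final int DAY_SPEC_ID = 0;   // assumed ID of the original day spec
    static final int HOUR_SPEC_ID = 1;  // assumed ID of the current hour spec

    // Partition key a timestamp would receive under the given spec.
    static String partitionKey(LocalDateTime ts, int specId) {
        switch (specId) {
            case DAY_SPEC_ID:
                return ts.toLocalDate().toString();
            case HOUR_SPEC_ID:
                return ts.truncatedTo(ChronoUnit.HOURS).toString();
            default:
                throw new IllegalArgumentException("Unknown spec id: " + specId);
        }
    }

    // "Rewrite" the rows by grouping them under the requested output spec.
    static Map<String, List<LocalDateTime>> rewrite(List<LocalDateTime> rows, int outputSpecId) {
        Map<String, List<LocalDateTime>> groups = new TreeMap<>();
        for (LocalDateTime row : rows) {
            groups.computeIfAbsent(partitionKey(row, outputSpecId), k -> new ArrayList<>()).add(row);
        }
        return groups;
    }

    public static void main(String[] args) {
        List<LocalDateTime> rows = List.of(
            LocalDateTime.of(2024, 3, 7, 10, 15),
            LocalDateTime.of(2024, 3, 7, 11, 45),
            LocalDateTime.of(2024, 3, 8, 9, 0));

        // Rewriting with the hour spec keeps one group per hour.
        System.out.println(rewrite(rows, HOUR_SPEC_ID).keySet());
        // Rewriting with the old day spec collapses the rows into one group per day.
        System.out.println(rewrite(rows, DAY_SPEC_ID).keySet());
    }
}
```

With these three sample rows, the hour spec yields three groups and the day spec yields two, which is the kind of coarser layout one might want for historical data.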
