Merged
237 commits
b53595b
Build: Bump software.amazon.awssdk:bom from 2.27.2 to 2.27.7 (#10961)
dependabot[bot] Aug 19, 2024
9ddde50
Docs: Update MRAP endpoint and add notebook link (#9362)
singhpk234 Aug 19, 2024
ed07fd1
Flink: put everything together for range distribution in Flink sink (…
stevenzwu Aug 19, 2024
43bbf08
Flink: FLIP-27 IcebergSource builder missed a couple of configs compa…
stevenzwu Aug 19, 2024
3028552
Prevent implicit default locale/charset usage (#10969)
findepi Aug 20, 2024
2f2c367
Core: Add ManifestWrite benchmark (#8637)
Fokko Aug 20, 2024
b76d81a
S3OutputStream: Don't complete multipart upload on finalize (#10874)
jasonf20 Aug 20, 2024
ce33890
Enable UnusedMethod error-prone check (#10968)
findepi Aug 20, 2024
24afc1f
Build: Upgrade to Gradle 8.10 (#10976)
jbonofre Aug 20, 2024
40d5204
Core: Support case-insensitivity for column names in PartitionSpec (#…
sl255051 Aug 20, 2024
f17c225
Core: Remove unused throws declarations (#10974)
findepi Aug 21, 2024
85cf79d
Flink: deprecate ReaderFunction with a new Converter interface to sim…
stevenzwu Aug 21, 2024
bcb3281
Drop ParallelIterable's queue low water mark (#10978)
findepi Aug 21, 2024
f107649
Check for minimal queue size in ParallelIterable (#10977)
findepi Aug 21, 2024
2f6e7e6
API, Spark 3.5: Action to compute table stats (#10288)
karuppayya Aug 22, 2024
cbd71eb
Core,AWS: Fix NPE in ResolvingFileIO when HadoopConf is not set (#10872)
munendrasn Aug 22, 2024
0446178
Spark 3.5: Fix incorrect catalog loaded in TestCreateActions (#10952)
manuzhang Aug 22, 2024
b2cd6f3
Flink: Maintenance - TriggerManager (#10484)
pvary Aug 22, 2024
7fec19f
Flink: backport PR #10956 for converter interface that deprecates Rea…
stevenzwu Aug 22, 2024
bf459ee
Flink: backport PR #10777 from 1.19 to 1.18 for sink test refactoring…
stevenzwu Aug 22, 2024
ac0d206
Spec: Minor modifications for v3 (#10948)
rdblue Aug 22, 2024
ce772a6
Flink: backport PR #10859 for range distribution (#10990)
stevenzwu Aug 22, 2024
f2d6275
Flink: Port #10484 to v1.20 (#10989)
pvary Aug 23, 2024
aa1ecc8
Flink: Maintenance - TableChange refactor (#10992)
pvary Aug 23, 2024
2424e2c
Flink: Port #10992 to v1.19 (#10994)
pvary Aug 23, 2024
e0596fb
OpenAPI: Add endpoint field to CatalogConfig (#10928)
nastra Aug 23, 2024
5864850
Spark 3.5: Add utility to load table state reliably (#10984)
aokolnychyi Aug 24, 2024
b9a6645
AWS: Include http-auth-aws-crt module into iceberg-aws-bundle (#10972)
Aug 25, 2024
5958065
Build: Bump software.amazon.awssdk:bom from 2.27.7 to 2.27.12 (#11006)
dependabot[bot] Aug 25, 2024
9aa354d
Build: Bump org.apache.commons:commons-compress from 1.27.0 to 1.27.1…
dependabot[bot] Aug 25, 2024
4af2b9e
Build: Bump jetty from 11.0.22 to 11.0.23 (#11003)
dependabot[bot] Aug 26, 2024
244eb1e
Build: Bump mkdocs-material from 9.5.31 to 9.5.33 (#11002)
dependabot[bot] Aug 26, 2024
524fbb8
Docs: `_commit_snapshot_id` instead of `_change_snapshot_id` (#11000)
ebyhr Aug 26, 2024
99e4ab7
Docs: Rename Clickhouse to ClickHouse (#10998)
ebyhr Aug 26, 2024
2ed61a1
Flink: infer source parallelism for FLIP-27 source in batch execution…
stevenzwu Aug 26, 2024
a7398ab
Flink: Fix duplicate data with upsert writer in case of aborted check…
zhongqishang Aug 26, 2024
bea364c
Introduces the new IcebergSink based on the new V2 Flink Sink Abstrac…
rodmeneses Aug 26, 2024
e6f8ab9
Flink: Backport PR #10179 to Flink 1.20 for v2 sink (#11011)
rodmeneses Aug 26, 2024
1898e62
Core: Project data file stats only if there are equality deletes (#11…
aokolnychyi Aug 27, 2024
f1764c6
Build: Bump org.xerial:sqlite-jdbc from 3.46.0.1 to 3.46.1.0 (#11007)
dependabot[bot] Aug 27, 2024
64b3699
Docs: Add Druid docs url to sidebar (#10997)
techdocsmith Aug 27, 2024
bf00d51
Flink: backport PR #10832 of inferring parallelism in FLIP-27 source …
stevenzwu Aug 27, 2024
c95bf58
Add REST Compatibility Kit (#10908)
danielcweeks Aug 27, 2024
f88f128
Core: Generate realistic bounds in benchmarks (#11022)
aokolnychyi Aug 27, 2024
8e2eb9a
OpenAPI, Build: Apply spotless to testFixtures source code (#11024)
danielcweeks Aug 27, 2024
877f63b
Docs: bump latest version to 1.6.1 (#11036)
cwsteinbach Aug 28, 2024
cd32ec7
Spec: Add RemovePartitionSpecsUpdate REST update type (#10846)
amogh-jahagirdar Aug 28, 2024
3c01833
Build: Ignore benchmark output folders across all modules (#11030)
aokolnychyi Aug 28, 2024
6c79640
Core: Add benchmark for appending files (#11029)
aokolnychyi Aug 28, 2024
9c344f9
Spark 3.5: Use FileGenerationUtil in PlanningBenchmark (#11027)
aokolnychyi Aug 29, 2024
4b71d40
Flink: add unit tests for range distribution on bucket partition colu…
stevenzwu Aug 29, 2024
a07f862
Kafka Connect: Disable publish tasks in runtime project (#11032)
bryanck Aug 30, 2024
e8c6148
Flink: Backport PR #10526 to v1.18 and v1.20 (#11018)
zhongqishang Aug 30, 2024
fa8fbb3
Build: Bump mkdocs-material from 9.5.33 to 9.5.34 (#11062)
dependabot[bot] Sep 1, 2024
d128a2a
Build: Bump com.google.errorprone:error_prone_annotations (#11055)
dependabot[bot] Sep 1, 2024
113c6e7
API: implement types timestamp_ns and timestamptz_ns (#9008)
jacobmarble Sep 3, 2024
5319767
Core: Refactor ZOrderByteUtils (#10624)
ajantha-bhat Sep 3, 2024
a446164
Docs: Initial committer guidelines and requirements for merging (#10780)
emkornfield Sep 3, 2024
896dcd5
Flink: Fix compile warning (#11072)
ajantha-bhat Sep 3, 2024
7830a3b
Docs: Fix Flink 1.20 support versions (#11065)
manuzhang Sep 3, 2024
4f37041
Build: Enable more error-prone checks (#11078)
findepi Sep 5, 2024
2391bdd
open-api: Fix compile warnings for testFixtures (#11071)
ajantha-bhat Sep 5, 2024
f508a7e
Spark 3.3, 3.4: Parallelize reading files in migrate procedures (#11043)
manuzhang Sep 5, 2024
f7c6d57
Spark 3.5: Mandate identifier fields when create_changelog_view for t…
dramaticlly Sep 5, 2024
6e05ae0
Core: Fix setting hasNewDataFile flag in MergingSnapshotProducer (#11…
aokolnychyi Sep 7, 2024
ab2c6f8
Docs: Document accessing instance variables (#11087)
aokolnychyi Sep 7, 2024
44eca04
Build: Bump jetty from 11.0.23 to 11.0.24 (#11096)
dependabot[bot] Sep 9, 2024
cefb1bb
Spec: Fix rendering of partition stats file section (#11068)
ajantha-bhat Sep 9, 2024
3fe4f42
Build: Bump software.amazon.awssdk:bom from 2.27.12 to 2.27.21 (#11098)
dependabot[bot] Sep 9, 2024
ed73ec4
Build: Bump io.netty:netty-buffer from 4.1.112.Final to 4.1.113.Final…
dependabot[bot] Sep 9, 2024
153b070
Spark 3.3, 3.4: Fix incorrect catalog loaded in TestCreateActions (#1…
manuzhang Sep 9, 2024
41d00ae
Core: Prevent incremental file cleanup when expiring specified snapsh…
hantangwangd Sep 9, 2024
4873b4b
Core, Kafka, Spark: Use AssertJ instead of JUnit assertions (#11102)
nastra Sep 10, 2024
026166f
Docs: Add blogs written by rmoff (#11069)
rmoff Sep 10, 2024
e40fe4c
Build: Upgrade to Gradle 8.10.1 (#11104)
jbonofre Sep 10, 2024
a5c8f9c
Build: Remove unused variables, fields and parameters (#11101)
findepi Sep 10, 2024
8d97d54
OpenAPI: Add Scan Planning Endpoints to REST spec (#9695)
rahil-c Sep 10, 2024
5439cbd
Kafka Connect: Docs on configuring the sink (#10746)
bryanck Sep 10, 2024
0747b60
Kafka Connect: Terminate commits on coordinator stop (#10814)
bryanck Sep 10, 2024
34cd01b
Build: Upgrade google-java-format to 1.22.0 (#11050)
manuzhang Sep 11, 2024
6ff7a6e
Flink: Maintenance - Lock remover (#11010)
pvary Sep 12, 2024
8b4b2c1
Docs: Update Project links to includ contributing and REST spec (#11114)
danielcweeks Sep 12, 2024
ab0594b
Flink: Port #10484 to v1.19 (#11010) (#11117)
pvary Sep 12, 2024
e3d3f88
OpenAPI: Fix YAML example and value json formatting (#11119)
danielcweeks Sep 12, 2024
d2087a0
Core: Parallelize manifest writing for many new files (#11086)
aokolnychyi Sep 12, 2024
7991206
API, Core: Add manifestLocation API to ContentFile (#11044)
amogh-jahagirdar Sep 13, 2024
a2b8008
Core: Allow servers to express supported endpoints via endpoint field…
nastra Sep 13, 2024
e449d34
Hive: Add View support for HIVE catalog (#9852)
nk1506 Sep 13, 2024
5582b0c
Spark 3.4: Action to compute table stats (#11106)
karuppayya Sep 13, 2024
2e4d5b5
Docs: Fix missing options for remove_orphan_files procedure (#11080)
manuzhang Sep 14, 2024
5ce7c30
Flink: Increase the number of checkpoints from 4 to 6 to fix flakines…
stevenzwu Sep 16, 2024
d5b21d8
Spark 3.4: Add utility to load table state reliably (#11115)
dramaticlly Sep 16, 2024
06ed235
Build: switch to slf4j-simple 2.x for test implementation dependency …
stevenzwu Sep 17, 2024
f71c7df
Core: Update metadata location without updating lastUpdatedMillis (#1…
nastra Sep 18, 2024
40ffcb9
Kafka Connect: separate CI workflow (#11075)
bryanck Sep 18, 2024
bbeadea
Core: Move internal struct projection to SupportsIndexProjection (#11…
rdblue Sep 18, 2024
e3088bc
Core: Add explicit JSON parser for LoadTableResponse (#11148)
nastra Sep 19, 2024
ffa13b1
Build: Add .java-version to gitignore (#11167)
zhongyujiang Sep 19, 2024
e5d9a15
Docs: Clarify Partition Transform (#8337)
jasonmfehr Sep 19, 2024
82cedbb
API, Core: Enable removing rewritten delete files in RowDelta (#11166)
aokolnychyi Sep 19, 2024
79fd977
Docs: `field_id` in name serialisation spec should read `field-id` (#…
jonaswk Sep 19, 2024
60f61c3
AWS: Bump AWS SDK to version 2.28.5 (#11170)
sullis Sep 20, 2024
d4af40c
Build: Bump org.xerial.snappy:snappy-java from 1.1.10.6 to 1.1.10.7 (…
dependabot[bot] Sep 20, 2024
b2b65df
Spark 3.3, 3.4, 3.5: Supplement test case for `RollbackToTimestampPro…
hantangwangd Sep 20, 2024
4482565
Docs: Uppercase SQL keywords in branching docs (#11172)
ebyhr Sep 20, 2024
b92ed13
Build: Bump org.apache.httpcomponents.client5:httpclient5 (#11186)
dependabot[bot] Sep 23, 2024
5ed9307
Build: Bump tez010 from 0.10.3 to 0.10.4 (#11183)
dependabot[bot] Sep 23, 2024
fc6271c
Build: Bump nessie from 0.95.0 to 0.97.1 (#11184)
dependabot[bot] Sep 23, 2024
ddfe503
Build: Bump com.google.cloud:libraries-bom from 26.44.0 to 26.47.0 (#…
dependabot[bot] Sep 23, 2024
5a2c1c9
Build: Bump org.roaringbitmap:RoaringBitmap from 1.2.1 to 1.3.0 (#11187)
dependabot[bot] Sep 23, 2024
257bad2
API: Deprecate ContentFile#path API and add location API which return…
amogh-jahagirdar Sep 23, 2024
72fd9ab
Docs: Document AWS Redshift and Amazon Data Firehose support (#11192)
singhpk234 Sep 24, 2024
c0d73f4
API, AWS: Retry S3InputStream reads (#10433)
amogh-jahagirdar Sep 24, 2024
983ede3
AWS: Fix AWS doc URL (#11198)
laurentgo Sep 25, 2024
c07de6f
Build: Bump mkdocs-macros-plugin from 1.0.5 to 1.2.0 (#11189)
dependabot[bot] Sep 25, 2024
2fa8c7d
Core: Add rewritten delete files to write results (#11203)
aokolnychyi Sep 25, 2024
474a770
Core: Support iterating over positions in PositionDeleteIndex (#11202)
aokolnychyi Sep 25, 2024
f3c784e
Spark: Added merge schema as spark configuration (#9640)
aleenamg21-1 Sep 25, 2024
1e5dcb1
Core: Support merging in PositionDeleteIndex (#11208)
aokolnychyi Sep 25, 2024
b1d38b3
Core: Remove unused code for streaming position deletes (#11175)
wypoon Sep 26, 2024
26648ae
Build: Upgrade to Gradle 8.10.2 (#11212)
jbonofre Sep 26, 2024
7bd13a3
Core: Add a util to compute partition stats (#11146)
ajantha-bhat Sep 26, 2024
2d9c344
Parquet: update PruneColumns to inherit from TypeWithSchemaVisitor to…
aihuaxu Sep 26, 2024
95497ab
Core: Replace use of CharSequenceMap in DeleteFileIndex with String (…
amogh-jahagirdar Sep 26, 2024
dddb5f4
[Core] Fix TestFastAppend.testAddManyFiles() (#11218)
anuragmantri Sep 27, 2024
09370dd
Spark: Deprecate SparkAppenderFactory (#11076)
ajantha-bhat Sep 27, 2024
9601784
Build: Bump guava from 33.3.0-jre to 33.3.1-jre (#11230)
dependabot[bot] Sep 30, 2024
570af25
Build: Bump software.amazon.awssdk:bom from 2.28.5 to 2.28.11 (#11229)
dependabot[bot] Sep 30, 2024
152f02d
Build: Bump io.delta:delta-standalone_2.12 from 3.2.0 to 3.2.1 (#11228)
dependabot[bot] Sep 30, 2024
d00c493
Build: Bump junit-platform from 1.10.3 to 1.11.1 (#11227)
dependabot[bot] Sep 30, 2024
3ce09bc
Build: Bump io.delta:delta-spark_2.12 from 3.2.0 to 3.2.1 (#11225)
dependabot[bot] Sep 30, 2024
9454927
Core: Improve error handling when parsing view representations (#11236)
nastra Sep 30, 2024
97c9c53
Core: Update REST CatalogHandlers to handle page sizes exceeding numb…
rcjverhoef Sep 30, 2024
c8fe01e
Core: Support combining position deletes during writes (#11222)
aokolnychyi Oct 1, 2024
e4bc593
Core: Add DataFileSet / DeleteFileSet (#11195)
nastra Oct 1, 2024
e8a11cb
Build: Bump nessie from 0.97.1 to 0.99.0 (#11224)
dependabot[bot] Oct 1, 2024
168a983
ThreadPools introduce newExitingWorkerPool and newFixedThreadPool for…
fengjiajie Oct 1, 2024
8520b5b
Core: Deprecate legacy ways for loading position deletes (#11242)
aokolnychyi Oct 1, 2024
09c7376
AWS: Add configuration and set defaults for S3 retry behaviour (#11052)
ookumuso Oct 1, 2024
4099a67
Puffin: Document stats `ndv` value representation (#10793)
findepi Oct 3, 2024
fd8cb71
Build: Bump mkdocs-material from 9.5.34 to 9.5.38 (#11233)
dependabot[bot] Oct 3, 2024
2b38e09
Build: Bump org.eclipse.microprofile.openapi:microprofile-openapi-api…
dependabot[bot] Oct 3, 2024
b7b0a46
Build: Bump org.xerial:sqlite-jdbc from 3.46.1.0 to 3.46.1.3 (#11231)
dependabot[bot] Oct 3, 2024
746e719
Build: Update baseline-java 5.69.0 (#11252)
findepi Oct 4, 2024
f6cdf94
Build: Forbid implicit case fall-through without a comment and enable…
findepi Oct 4, 2024
8190ce7
API, Core: Add default value APIs and Avro implementation (#9502)
wmoustafa Oct 4, 2024
745e819
AWS: Make sure overridden configurations are applied (#11274)
hsiang-c Oct 7, 2024
5dde680
Build: Bump com.google.cloud:libraries-bom from 26.47.0 to 26.48.0 (#…
dependabot[bot] Oct 7, 2024
3220fad
Core: Fix UnicodeUtil#truncateStringMax returns malformed string. (#1…
zhongyujiang Oct 7, 2024
f0e4fd2
Core: Add internal Avro reader (#11108)
rdblue Oct 7, 2024
208ab20
Arrow: Remove unused readers (#11276)
wypoon Oct 8, 2024
67dc9e5
Spec: Add v3 types and type promotion (#10955)
rdblue Oct 9, 2024
d7f668a
Build: Bump mkdocs-material from 9.5.38 to 9.5.39 (#11272)
dependabot[bot] Oct 12, 2024
5fa3bbe
Build: Bump com.google.errorprone:error_prone_annotations (#11270)
dependabot[bot] Oct 12, 2024
410477f
Build: Bump junit-platform from 1.11.1 to 1.11.2 (#11266)
dependabot[bot] Oct 12, 2024
28265cd
Build: Bump org.testcontainers:testcontainers from 1.20.1 to 1.20.2 (…
dependabot[bot] Oct 12, 2024
d93677a
Build: Bump io.netty:netty-buffer from 4.1.113.Final to 4.1.114.Final…
dependabot[bot] Oct 12, 2024
7e5caf1
Build: Bump software.amazon.awssdk:bom from 2.28.11 to 2.28.16 (#11268)
dependabot[bot] Oct 12, 2024
337d05b
Build: Bump jackson-bom from 2.14.2 to 2.18.0 (#11226)
dependabot[bot] Oct 12, 2024
d3e0158
Build: Bump datamodel-code-generator from 0.25.9 to 0.26.1 (#11234)
dependabot[bot] Oct 12, 2024
0a1a666
Build: Bump software.amazon.s3.accessgrants:aws-s3-accessgrants-java-…
dependabot[bot] Oct 12, 2024
7a7d150
Build: Bump net.snowflake:snowflake-jdbc from 3.18.0 to 3.19.0 (#11057)
dependabot[bot] Oct 12, 2024
7912035
Build: Bump org.apache.hadoop.thirdparty:hadoop-shaded-guava (#11061)
dependabot[bot] Oct 12, 2024
12ff959
Arrow: Deprecate unused fixed width binary reader classes (#11292)
wypoon Oct 14, 2024
ca8a3a4
API, Spark: Make StrictMetricsEvaluator not fail on nested column pre…
zhongyujiang Oct 14, 2024
5832a7a
Build: Use the active shadow plugin (#11315)
ajantha-bhat Oct 14, 2024
c55a078
Build: Bump software.amazon.awssdk:bom from 2.28.16 to 2.28.21 (#11311)
dependabot[bot] Oct 14, 2024
919387f
Build: Bump org.apache.datasketches:datasketches-java (#11307)
dependabot[bot] Oct 14, 2024
6a5ae1a
Core: Switch usage to DataFileSet / DeleteFileSet (#11158)
nastra Oct 14, 2024
3d9fc1d
[AWS] S3FileIO - Add Cross-Region Bucket Access (#11259)
munendrasn Oct 14, 2024
6fdc69a
Core: Deprecate ContentCache.invalidateAll (#10494)
findepi Oct 14, 2024
6376b44
Spark 3.3, 3.4, 3.5: Remove unnecessary copying of FileScanTask (#11319)
huaxingao Oct 15, 2024
33b33f3
Core: Rename DeleteFileHolder to PendingDeleteFile / Optimize duplica…
nastra Oct 15, 2024
32b1ab6
Core: Fix version number in deprecation note for invalidateAll (#11325)
findepi Oct 16, 2024
5e279c8
Build, Spark, Flink: Bump junit from 5.10.1 to 5.11.1 (#11262)
tomtongue Oct 16, 2024
11a8a78
Core, Azure: Support wasb[s] paths in ADLSFileIO (#11294)
mrcnc Oct 16, 2024
22a6b19
Make connect compatable with kafka plugin.discovery (#10536)
joswlv Oct 16, 2024
17f1c4d
Spark 3.5: Spark Scan should ignore statistics not of type Apache Dat…
jeesou Oct 16, 2024
3c6c626
Kafka Connect: Add regex for property file match (#11303)
ryanjclark Oct 16, 2024
9d58865
OpenAPI: Remove repeated 'for' (#11338)
ebyhr Oct 17, 2024
bbbfd1e
AWS: Fix S3InputStream retry policy (#11335)
edgarRd Oct 17, 2024
3def1f4
API: (Test Only) Small fix to TestSerializableTypes.java (#11342)
aihuaxu Oct 17, 2024
f4ffe13
Core: lazily load default Hadoop Configuration to avoid NPE with Hado…
stevenzwu Oct 17, 2024
fd06438
Revert "Core, Azure: Support wasb[s] paths in ADLSFileIO (#11294)" (#…
RussellSpitzer Oct 18, 2024
2ac5c43
Flink: make FLIP-27 default in SQL and mark the old FlinkSource as de…
stevenzwu Oct 18, 2024
8a931e8
Flink: disable the flaky range distribution bucketing tests for now (…
stevenzwu Oct 18, 2024
44233fa
OpenAPI: Standardize credentials in loadTable/loadView responses (#10…
nastra Oct 18, 2024
8dc9eac
Core: Add credentials to loadTable / loadView responses (#11173)
nastra Oct 18, 2024
d61a98d
API: Add RewriteTablePath action interface (#10920)
laithalzyoud Oct 18, 2024
ea5da17
AWS: Switch to base2 entropy in ObjectStoreLocationProvider for optim…
ookumuso Oct 18, 2024
d4f0d7e
Flink: Add IcebergSinkBuilder interface allowed unification of most o…
arkadius Oct 21, 2024
b8c2b20
Build: Bump parquet from 1.13.1 to 1.14.3 (#11264)
dependabot[bot] Oct 21, 2024
de48a74
Build: Bump com.palantir.baseline:gradle-baseline-java (#11362)
dependabot[bot] Oct 21, 2024
1814764
Build: Bump com.google.errorprone:error_prone_annotations (#11360)
dependabot[bot] Oct 21, 2024
0640a38
Build: Bump software.amazon.awssdk:bom from 2.28.21 to 2.28.26 (#11359)
dependabot[bot] Oct 21, 2024
ce75f52
Build: Bump com.google.cloud:libraries-bom from 26.48.0 to 26.49.0 (#…
dependabot[bot] Oct 21, 2024
c16cefa
Core: Move deleteRemovedMetadataFiles(..) to CatalogUtil (#11352)
leesf Oct 21, 2024
d0a7ff9
Arrow: Fix indexing in Parquet dictionary encoded values readers (#11…
wypoon Oct 21, 2024
a198966
Spark 3.5: Update Spark to use planned Avro reads (#11299)
rdblue Oct 22, 2024
a8ec43d
Core, Spark 3.5: Remove dangling deletes as part of RewriteDataFilesA…
dramaticlly Oct 22, 2024
9794608
Spark: Randomize view/function names in testing (#11381)
nastra Oct 23, 2024
60181f9
Spark 3.4: Randomize view/function names in testing (#11382)
nastra Oct 23, 2024
9c0a806
Spark 3.4: Action to remove dangling deletes (#11377)
dramaticlly Oct 23, 2024
043757c
Spark 3.5: Reset Spark Conf for each test in TestCompressionSettings …
huaxingao Oct 23, 2024
02a988b
Spec: Adds Row Lineage (#11130)
RussellSpitzer Oct 23, 2024
4850b62
AWS: Support S3 directory bucket listing (#11021)
stubz151 Oct 24, 2024
1cb88a6
Core: Add LoadCredentialsResponse class/parser (#11339)
nastra Oct 24, 2024
35a02d0
OpenAPI: Add endpoint for refreshing vended credentials (#11281)
nastra Oct 24, 2024
6ba1a1f
AWS: Use testcontainers-minio instead of S3Mock (#11349)
sullis Oct 24, 2024
12ac3ee
Kafka Connect: Include third party licenses and notices in distributi…
bryanck Oct 24, 2024
fdc2c22
Deprecate iceberg-pig (#11379)
jbonofre Oct 24, 2024
6b04a6d
Core: Track data files by spec id instead of full PartitionSpec (#11323)
nastra Oct 25, 2024
32e9f40
Spark 3.5: Don't change table distribution when only altering local o…
manuzhang Oct 25, 2024
7738e1d
Spec: Fix table of content generation (#11067)
ajantha-bhat Oct 25, 2024
9ecd97b
[KafkaConnect] Fix RecordConverter for UUID and Fixed Types (#11346)
singhpk234 Oct 25, 2024
7ad11b2
Core: Snapshot `summary` map must have `operation` key (#11354)
kevinjqliu Oct 25, 2024
2b55fef
Core: Update TableMetadataParser to ensure all streams closed (#11220)
erik-grepr Oct 26, 2024
e3bbcac
Build: Bump Spark 3.4 to 3.4.4 (#11366)
manuzhang Oct 28, 2024
b4d178f
Build: Bump junit from 5.11.1 to 5.11.3 (#11401)
dependabot[bot] Oct 28, 2024
9565b9c
Build: Bump software.amazon.awssdk:bom from 2.28.26 to 2.29.1 (#11400)
dependabot[bot] Oct 28, 2024
681b09d
Core: Move Javadoc about commit retries to SnapshotProducer (#10995)
Oct 28, 2024
9fc9c05
Build: Bump junit-platform from 1.11.2 to 1.11.3 (#11402)
dependabot[bot] Oct 28, 2024
68a7102
Build: Bump org.xerial:sqlite-jdbc from 3.46.1.3 to 3.47.0.0 (#11407)
dependabot[bot] Oct 28, 2024
48acaad
Build: Bump software.amazon.s3.accessgrants:aws-s3-accessgrants-java-…
dependabot[bot] Oct 28, 2024
c3191ee
Build: Bump net.snowflake:snowflake-jdbc from 3.19.0 to 3.19.1 (#11406)
dependabot[bot] Oct 28, 2024
6e911e0
Build: Bump testcontainers from 1.20.2 to 1.20.3 (#11404)
dependabot[bot] Oct 28, 2024
d10e67d
Build: Bump com.google.errorprone:error_prone_annotations (#11403)
dependabot[bot] Oct 28, 2024
b4b0bdc
Build: Bump mkdocs-macros-plugin from 1.2.0 to 1.3.7 (#11399)
dependabot[bot] Oct 28, 2024
6962043
Build: Bump mkdocs-material from 9.5.39 to 9.5.42 (#11398)
dependabot[bot] Oct 28, 2024
86a8560
Flink: Fix disabling flaky range distribution bucketing tests (#11410)
manuzhang Oct 28, 2024
a6503f5
Bump Azurite to the latest version (#11411)
Fokko Oct 28, 2024
a28ebf7
Build: Bump datamodel-code-generator from 0.26.1 to 0.26.2 (#11356)
dependabot[bot] Oct 28, 2024
6c58f5b
Revert "Core: Snapshot `summary` map must have `operation` key (#1135…
kevinjqliu Oct 28, 2024
e013c67
Aliyun: Remove spring-boot dependency (#11291)
jbonofre Oct 28, 2024
7ac617a
Build: Bump com.azure:azure-sdk-bom from 1.2.25 to 1.2.28 (#11267)
dependabot[bot] Oct 28, 2024
47eac52
Spark: Flaky test due temp directory (#10811)
manuzhang Oct 28, 2024
1e3ee1e
Core: Add portable Roaring bitmap for row positions (#11372)
aokolnychyi Oct 28, 2024
5 changes: 5 additions & 0 deletions .baseline/checkstyle/checkstyle.xml
@@ -427,6 +427,11 @@
<property name="illegalPkgs" value="org.hamcrest"/>
<message key="import.illegal" value="Prefer using org.assertj.core.api.Assertions instead."/>
</module>
+ <module name="IllegalImport">
+ <property name="id" value="BanJUnit5Assertions"/>
+ <property name="illegalPkgs" value="org.junit.jupiter.api.Assertions"/>
+ <message key="import.illegal" value="Prefer using org.assertj.core.api.Assertions instead."/>
+ </module>
<module name="RegexpSinglelineJava">
<property name="ignoreComments" value="true"/>
<property name="format" value="@Json(S|Des)erialize"/>
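The `BanJUnit5Assertions` module added above rejects imports that fall under `org.junit.jupiter.api.Assertions`, for example static imports such as `Assertions.assertEquals`, and points authors to AssertJ instead. As a rough illustration of what a prefix-based import ban amounts to, here is a minimal Python sketch. It is not Checkstyle's implementation; the function name `find_banned_imports`, the regular expression, and the sample source are hypothetical.

```python
import re

# Package prefix taken from the checkstyle.xml change above.
BANNED_PREFIX = "org.junit.jupiter.api.Assertions"

# Matches `import a.b.C;`, `import static a.b.C.member;` and wildcard imports.
IMPORT_RE = re.compile(r"^\s*import\s+(?:static\s+)?([\w.]+(?:\.\*)?)\s*;")


def find_banned_imports(java_source, banned_prefix=BANNED_PREFIX):
    """Return (line_number, imported_name) pairs that fall under banned_prefix.

    Assumption: an import is "under" the prefix when it starts with the
    prefix followed by a dot. This approximates a package-style ban only.
    """
    hits = []
    for number, line in enumerate(java_source.splitlines(), start=1):
        match = IMPORT_RE.match(line)
        if match and match.group(1).startswith(banned_prefix + "."):
            hits.append((number, match.group(1)))
    return hits


# Hypothetical test-class header used only for this illustration.
SAMPLE = """\
import static org.junit.jupiter.api.Assertions.assertEquals;
import static org.assertj.core.api.Assertions.assertThat;
import org.junit.jupiter.api.Test;
"""

for number, name in find_banned_imports(SAMPLE):
    print(f"line {number}: {name} (prefer org.assertj.core.api.Assertions)")
```

Only the JUnit 5 assertion import is reported in this sketch; the AssertJ import and the `@Test` annotation import pass.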
3 changes: 2 additions & 1 deletion .github/ISSUE_TEMPLATE/iceberg_bug_report.yml
@@ -28,7 +28,8 @@ body:
description: What Apache Iceberg version are you using?
multiple: false
options:
- - "1.6.0 (latest release)"
+ - "1.6.1 (latest release)"
+ - "1.6.0"
- "1.5.2"
- "1.5.1"
- "1.5.0"
6 changes: 4 additions & 2 deletions .github/workflows/delta-conversion-ci.yml
@@ -35,6 +35,7 @@ on:
- '.github/workflows/hive-ci.yml'
- '.github/workflows/java-ci.yml'
- '.github/workflows/jmh-benchmarks-ci.yml'
+ - '.github/workflows/kafka-connect-ci.yml'
- '.github/workflows/labeler.yml'
- '.github/workflows/licence-check.yml'
- '.github/workflows/open-api.yml'
@@ -51,6 +52,7 @@ on:
- 'hive3-orc-bundle/**'
- 'hive-runtime/**'
- 'flink/**'
+ - 'kafka-connect/**'
- 'pig/**'
- 'docs/**'
- 'site/**'
@@ -88,7 +90,7 @@ jobs:
key: ${{ runner.os }}-gradle-${{ hashFiles('**/*.gradle*', '**/gradle-wrapper.properties') }}
restore-keys: ${{ runner.os }}-gradle-
- run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
- - run: ./gradlew -DsparkVersions=3.5 -DscalaVersion=2.12 -DhiveVersions= -DflinkVersions= :iceberg-delta-lake:check -Pquick=true -x javadoc
+ - run: ./gradlew -DsparkVersions=3.5 -DscalaVersion=2.12 -DhiveVersions= -DkafkaVersions= -DflinkVersions= :iceberg-delta-lake:check -Pquick=true -x javadoc
- uses: actions/upload-artifact@v4
if: failure()
with:
@@ -117,7 +119,7 @@ jobs:
key: ${{ runner.os }}-gradle-${{ hashFiles('**/*.gradle*', '**/gradle-wrapper.properties') }}
restore-keys: ${{ runner.os }}-gradle-
- run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
- - run: ./gradlew -DsparkVersions=3.5 -DscalaVersion=2.13 -DhiveVersions= -DflinkVersions= :iceberg-delta-lake:check -Pquick=true -x javadoc
+ - run: ./gradlew -DsparkVersions=3.5 -DscalaVersion=2.13 -DhiveVersions= -DkafkaVersions= -DflinkVersions= :iceberg-delta-lake:check -Pquick=true -x javadoc
- uses: actions/upload-artifact@v4
if: failure()
with:
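The unchanged `echo -e ... | sudo tee -a /etc/hosts` step above builds a hosts entry from the runner's `eth0` IPv4 address and its long and short hostnames. The sketch below mirrors that pipeline in Python for readers who find the shell one-liner dense. The helper names and the sample `ip addr` output are made up for illustration, and unlike the shell pipeline the sketch uses only the first IPv4 address it finds.

```python
def first_ipv4(ip_addr_output):
    """Return the first IPv4 address without its /prefix length, or None."""
    for line in ip_addr_output.splitlines():
        fields = line.split()
        # Only `inet` lines carry IPv4 addresses; `inet6` lines are skipped,
        # which is what the word-boundary grep in the workflow step achieves.
        if len(fields) >= 2 and fields[0] == "inet":
            # Equivalent of `awk '{print $2}' | cut -d/ -f1`.
            return fields[1].split("/")[0]
    return None


def hosts_entry(ip_addr_output, fqdn, short_name):
    """Build the line that the step appends to /etc/hosts."""
    address = first_ipv4(ip_addr_output)
    if address is None:
        raise ValueError("no IPv4 address found in the given output")
    return f"{address}\t{fqdn} {short_name}"


# Made-up output in the shape printed by `ip addr show eth0`.
SAMPLE = """\
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP
    link/ether 00:0d:3a:00:00:01 brd ff:ff:ff:ff:ff:ff
    inet 10.1.0.4/16 brd 10.1.255.255 scope global eth0
    inet6 fe80::20d:3aff:fe00:1/64 scope link
"""

print(hosts_entry(SAMPLE, "runner.example.internal", "runner"))
```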
4 changes: 3 additions & 1 deletion .github/workflows/flink-ci.yml
@@ -35,6 +35,7 @@ on:
- '.github/workflows/hive-ci.yml'
- '.github/workflows/java-ci.yml'
- '.github/workflows/jmh-benchmarks-ci.yml'
+ - '.github/workflows/kafka-connect-ci.yml'
- '.github/workflows/labeler.yml'
- '.github/workflows/licence-check.yml'
- '.github/workflows/open-api.yml'
@@ -50,6 +51,7 @@ on:
- 'hive3/**'
- 'hive3-orc-bundle/**'
- 'hive-runtime/**'
+ - 'kafka-connect/**'
- 'spark/**'
- 'pig/**'
- 'docs/**'
@@ -91,7 +93,7 @@ jobs:
key: ${{ runner.os }}-gradle-${{ hashFiles('**/*.gradle*', '**/gradle-wrapper.properties') }}
restore-keys: ${{ runner.os }}-gradle-
- run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
- - run: ./gradlew -DsparkVersions= -DhiveVersions= -DflinkVersions=${{ matrix.flink }} :iceberg-flink:iceberg-flink-${{ matrix.flink }}:check :iceberg-flink:iceberg-flink-runtime-${{ matrix.flink }}:check -Pquick=true -x javadoc
+ - run: ./gradlew -DsparkVersions= -DhiveVersions= -DkafkaVersions= -DflinkVersions=${{ matrix.flink }} :iceberg-flink:iceberg-flink-${{ matrix.flink }}:check :iceberg-flink:iceberg-flink-runtime-${{ matrix.flink }}:check -Pquick=true -x javadoc
- uses: actions/upload-artifact@v4
if: failure()
with:
6 changes: 4 additions & 2 deletions .github/workflows/hive-ci.yml
@@ -35,6 +35,7 @@ on:
- '.github/workflows/flink-ci.yml'
- '.github/workflows/java-ci.yml'
- '.github/workflows/jmh-benchmarks-ci.yml'
+ - '.github/workflows/kafka-connect-ci.yml'
- '.github/workflows/labeler.yml'
- '.github/workflows/licence-check.yml'
- '.github/workflows/open-api.yml'
@@ -49,6 +50,7 @@ on:
- 'arrow/**'
- 'spark/**'
- 'flink/**'
+ - 'kafka-connect/**'
- 'pig/**'
- 'docs/**'
- 'site/**'
@@ -86,7 +88,7 @@ jobs:
key: ${{ runner.os }}-gradle-${{ hashFiles('**/*.gradle*', '**/gradle-wrapper.properties') }}
restore-keys: ${{ runner.os }}-gradle-
- run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
- - run: ./gradlew -DsparkVersions= -DhiveVersions=2 -DflinkVersions= -Pquick=true :iceberg-mr:check :iceberg-hive-runtime:check -x javadoc
+ - run: ./gradlew -DsparkVersions= -DhiveVersions=2 -DflinkVersions= -DkafkaVersions= -Pquick=true :iceberg-mr:check :iceberg-hive-runtime:check -x javadoc
- uses: actions/upload-artifact@v4
if: failure()
with:
@@ -115,7 +117,7 @@ jobs:
key: ${{ runner.os }}-gradle-${{ hashFiles('**/*.gradle*', '**/gradle-wrapper.properties') }}
restore-keys: ${{ runner.os }}-gradle-
- run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
- - run: ./gradlew -DsparkVersions= -DhiveVersions=3 -DflinkVersions= -Pquick=true :iceberg-hive3-orc-bundle:check :iceberg-hive3:check :iceberg-hive-runtime:check -x javadoc
+ - run: ./gradlew -DsparkVersions= -DhiveVersions=3 -DflinkVersions= -DkafkaVersions= -Pquick=true :iceberg-hive3-orc-bundle:check :iceberg-hive3:check :iceberg-hive-runtime:check -x javadoc
- uses: actions/upload-artifact@v4
if: failure()
with:
3 changes: 2 additions & 1 deletion .github/workflows/java-ci.yml
@@ -35,6 +35,7 @@ on:
- '.github/workflows/flink-ci.yml'
- '.github/workflows/hive-ci.yml'
- '.github/workflows/jmh-benchmarks-ci.yml'
+ - '.github/workflows/kafka-connect-ci.yml'
- '.github/workflows/labeler.yml'
- '.github/workflows/licence-check.yml'
- '.github/workflows/open-api.yml'
@@ -82,7 +83,7 @@ jobs:
key: ${{ runner.os }}-gradle-${{ hashFiles('**/*.gradle*', '**/gradle-wrapper.properties') }}
restore-keys: ${{ runner.os }}-gradle-
- run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
- - run: ./gradlew check -DsparkVersions= -DhiveVersions= -DflinkVersions= -Pquick=true -x javadoc
+ - run: ./gradlew check -DsparkVersions= -DhiveVersions= -DflinkVersions= -DkafkaVersions= -Pquick=true -x javadoc
- uses: actions/upload-artifact@v4
if: failure()
with:
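The workflow changes above add an empty `-DkafkaVersions=` next to the existing empty `-DsparkVersions=`, `-DhiveVersions=` and `-DflinkVersions=` flags. The build scripts that read these properties are not part of this excerpt, so the following is only a hypothetical sketch of one way a comma-separated version property could be interpreted, with an empty value selecting no versions. `parse_versions` and `selected_versions` are made-up names and do not describe the project's Gradle logic.

```python
def parse_versions(raw):
    """Split a comma-separated property value; empty or missing selects nothing."""
    if not raw:
        return []
    return [part.strip() for part in raw.split(",") if part.strip()]


def selected_versions(properties):
    """Map each `<engine>Versions` property to the list of versions it names."""
    suffix = "Versions"
    return {
        name[: -len(suffix)]: parse_versions(value)
        for name, value in properties.items()
        if name.endswith(suffix)
    }


# Flags from the java-ci.yml step above: under this sketch's reading,
# no engine-specific versions are selected.
java_ci = {
    "sparkVersions": "",
    "hiveVersions": "",
    "flinkVersions": "",
    "kafkaVersions": "",
}

# Flags from the publish-snapshot.yml step changed in the same pull request.
publish_snapshot = {
    "flinkVersions": "",
    "sparkVersions": "3.3,3.4,3.5",
    "kafkaVersions": "3",
    "hiveVersions": "",
}

print(selected_versions(java_ci))
print(selected_versions(publish_snapshot))
```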
105 changes: 105 additions & 0 deletions .github/workflows/kafka-connect-ci.yml
@@ -0,0 +1,105 @@
#
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
#

name: "Kafka Connect CI"
on:
push:
branches:
- 'main'
- '0.*'
- '1.*'
- '2.*'
tags:
- 'apache-iceberg-**'
pull_request:
paths-ignore:
- '.github/ISSUE_TEMPLATE/**'
- '.github/workflows/api-binary-compatibility.yml'
- '.github/workflows/delta-conversion-ci.yml'
- '.github/workflows/flink-ci.yml'
- '.github/workflows/hive-ci.yml'
- '.github/workflows/java-ci.yml'
- '.github/workflows/jmh-benchmarks-ci.yml'
- '.github/workflows/labeler.yml'
- '.github/workflows/licence-check.yml'
- '.github/workflows/open-api.yml'
- '.github/workflows/publish-snapshot.yml'
- '.github/workflows/recurring-jmh-benchmarks.yml'
- '.github/workflows/site-ci.yml'
- '.github/workflows/spark-ci.yml'
- '.github/workflows/stale.yml'
- '.gitignore'
- '.asf.yml'
- 'dev/**'
- 'mr/**'
- 'flink/**'
- 'hive3/**'
- 'hive3-orc-bundle/**'
- 'hive-runtime/**'
- 'spark/**'
- 'pig/**'
- 'docs/**'
- 'site/**'
- 'open-api/**'
- 'format/**'
- '.gitattributes'
- 'README.md'
- 'CONTRIBUTING.md'
- 'LICENSE'
- 'NOTICE'

concurrency:
group: ${{ github.workflow }}-${{ github.ref }}
cancel-in-progress: ${{ github.event_name == 'pull_request' }}

jobs:

  kafka-connect-tests:
    runs-on: ubuntu-22.04
    strategy:
      matrix:
        jvm: [11, 17, 21]
    env:
      SPARK_LOCAL_IP: localhost
    steps:
    - uses: actions/checkout@v4
    - uses: actions/setup-java@v4
      with:
        distribution: zulu
        java-version: ${{ matrix.jvm }}
    - uses: actions/cache@v4
      with:
        path: |
          ~/.gradle/caches
          ~/.gradle/wrapper
        key: ${{ runner.os }}-gradle-${{ hashFiles('**/*.gradle*', '**/gradle-wrapper.properties') }}
        restore-keys: ${{ runner.os }}-gradle-
    - run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
    - run: |
        ./gradlew -DsparkVersions= -DhiveVersions= -DflinkVersions= -DkafkaVersions=3 \
          :iceberg-kafka-connect:iceberg-kafka-connect-events:check \
          :iceberg-kafka-connect:iceberg-kafka-connect:check \
          :iceberg-kafka-connect:iceberg-kafka-connect-runtime:check \
          -Pquick=true -x javadoc
    - uses: actions/upload-artifact@v4
      if: failure()
      with:
        name: test logs
        path: |
          **/build/testlogs
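
As a rough illustration of the trigger settings in the workflow above: GitHub Actions skips a `pull_request` run only when every changed path matches a `paths-ignore` pattern, and the `concurrency` block cancels an in-progress run only for pull request events. The sketch below models both rules in plain Java. The class name, the shortened pattern list, the sample paths, and the use of `java.nio` glob matching as a stand-in for GitHub's filter syntax are all assumptions made for illustration; none of this is part of the change itself.

```java
import java.nio.file.FileSystems;
import java.nio.file.Path;
import java.nio.file.PathMatcher;
import java.util.List;

// Hypothetical model of the workflow's trigger rules; not GitHub's implementation.
final class KafkaConnectCiTriggerSketch {
  // A shortened subset of the paths-ignore list from the workflow above.
  static final List<String> IGNORED =
      List.of(".github/workflows/spark-ci.yml", ".gitignore", "flink/**", "spark/**", "docs/**");

  // True when the changed path matches at least one ignore pattern.
  static boolean isIgnored(String changedPath) {
    Path path = Path.of(changedPath);
    for (String pattern : IGNORED) {
      PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + pattern);
      if (matcher.matches(path)) {
        return true;
      }
    }
    return false;
  }

  // The workflow runs unless every changed path is ignored.
  static boolean shouldRun(List<String> changedPaths) {
    return !changedPaths.stream().allMatch(KafkaConnectCiTriggerSketch::isIgnored);
  }

  // Mirrors: group: ${{ github.workflow }}-${{ github.ref }}
  static String concurrencyGroup(String workflow, String ref) {
    return workflow + "-" + ref;
  }

  // Mirrors: cancel-in-progress: ${{ github.event_name == 'pull_request' }}
  static boolean cancelInProgress(String eventName) {
    return "pull_request".equals(eventName);
  }

  public static void main(String[] args) {
    // Only ignored paths changed, so the run is skipped.
    System.out.println(shouldRun(List.of("docs/index.md", "spark/v3.5/build.gradle"))); // false
    // One path outside the ignore list is enough to trigger the run.
    System.out.println(shouldRun(List.of("docs/index.md", "kafka-connect/build.gradle"))); // true
    System.out.println(cancelInProgress("push")); // false
  }
}
```

Under this model a push to `main` never cancels an earlier run, while a new commit on a pull request replaces the run already in progress for the same ref.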
2 changes: 1 addition & 1 deletion .github/workflows/publish-snapshot.yml
@@ -41,4 +41,4 @@ jobs:
       - run: |
           ./gradlew printVersion
           ./gradlew -DallModules publishApachePublicationToMavenRepository -PmavenUser=${{ secrets.NEXUS_USER }} -PmavenPassword=${{ secrets.NEXUS_PW }}
-          ./gradlew -DflinkVersions= -DsparkVersions=3.3,3.4,3.5 -DscalaVersion=2.13 -DhiveVersions= publishApachePublicationToMavenRepository -PmavenUser=${{ secrets.NEXUS_USER }} -PmavenPassword=${{ secrets.NEXUS_PW }}
+          ./gradlew -DflinkVersions= -DsparkVersions=3.3,3.4,3.5 -DscalaVersion=2.13 -DkafkaVersions=3 -DhiveVersions= publishApachePublicationToMavenRepository -PmavenUser=${{ secrets.NEXUS_USER }} -PmavenPassword=${{ secrets.NEXUS_PW }}
4 changes: 3 additions & 1 deletion .github/workflows/spark-ci.yml
@@ -36,6 +36,7 @@ on:
     - '.github/workflows/hive-ci.yml'
     - '.github/workflows/java-ci.yml'
     - '.github/workflows/jmh-benchmarks-ci.yml'
+    - '.github/workflows/kafka-connect-ci.yml'
     - '.github/workflows/labeler.yml'
     - '.github/workflows/licence-check.yml'
     - '.github/workflows/open-api.yml'
@@ -52,6 +53,7 @@ on:
     - 'hive3-orc-bundle/**'
     - 'hive-runtime/**'
     - 'flink/**'
+    - 'kafka-connect/**'
     - 'pig/**'
     - 'docs/**'
     - 'open-api/**'
@@ -101,7 +103,7 @@ jobs:
         tool-cache: false
     - run: echo -e "$(ip addr show eth0 | grep "inet\b" | awk '{print $2}' | cut -d/ -f1)\t$(hostname -f) $(hostname -s)" | sudo tee -a /etc/hosts
     - run: |
-        ./gradlew -DsparkVersions=${{ matrix.spark }} -DscalaVersion=${{ matrix.scala }} -DhiveVersions= -DflinkVersions= \
+        ./gradlew -DsparkVersions=${{ matrix.spark }} -DscalaVersion=${{ matrix.scala }} -DhiveVersions= -DflinkVersions= -DkafkaVersions= \
           :iceberg-spark:iceberg-spark-${{ matrix.spark }}_${{ matrix.scala }}:check \
           :iceberg-spark:iceberg-spark-extensions-${{ matrix.spark }}_${{ matrix.scala }}:check \
           :iceberg-spark:iceberg-spark-runtime-${{ matrix.spark }}_${{ matrix.scala }}:check \
5 changes: 4 additions & 1 deletion .gitignore
@@ -37,7 +37,7 @@ spark/v3.4/spark/benchmark/*
 spark/v3.4/spark-extensions/benchmark/*
 spark/v3.5/spark/benchmark/*
 spark/v3.5/spark-extensions/benchmark/*
-data/benchmark/*
+*/benchmark/*
 
 __pycache__/
 *.py[cod]
@@ -66,3 +66,6 @@ metastore_db/
 # Spark/metastore files
 spark-warehouse/
 derby.log
+
+# jenv
+.java-version
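
The `.gitignore` hunk above swaps `data/benchmark/*` for the broader `*/benchmark/*`. In gitignore patterns a single `*` does not match a slash, so the new entry covers a `benchmark` directory exactly one level below the repository root. Java's glob matcher treats `*` the same way, which makes it a convenient stand-in for a quick check. Treat this as an approximation of gitignore matching rather than a reimplementation; the class name and file names are made up for the example. It also suggests why the deeper `spark/v3.x/...` entries are still listed explicitly.

```java
import java.nio.file.FileSystems;
import java.nio.file.Path;
import java.nio.file.PathMatcher;

// Hypothetical helper: approximates the `*/benchmark/*` ignore entry with a java.nio glob.
final class BenchmarkIgnoreSketch {
  // In a java.nio glob, `*` matches within one path component and never crosses `/`.
  private static final PathMatcher ONE_LEVEL =
      FileSystems.getDefault().getPathMatcher("glob:*/benchmark/*");

  static boolean matches(String relativePath) {
    return ONE_LEVEL.matches(Path.of(relativePath));
  }

  public static void main(String[] args) {
    System.out.println(matches("data/benchmark/result.json")); // true
    System.out.println(matches("core/benchmark/result.json")); // true
    // Too deep for a single `*` segment, so this path needs its own entry.
    System.out.println(matches("spark/v3.5/spark/benchmark/result.json")); // false
  }
}
```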
9 changes: 9 additions & 0 deletions .palantir/revapi.yml
@@ -873,6 +873,11 @@ acceptedBreaks:
       new: "method void org.apache.iceberg.encryption.Ciphers::<init>()"
       justification: "Static utility class - should not have public constructor"
   "1.4.0":
+    org.apache.iceberg:iceberg-api:
+    - code: "java.class.defaultSerializationChanged"
+      old: "class org.apache.iceberg.types.Types.NestedField"
+      new: "class org.apache.iceberg.types.Types.NestedField"
+      justification: "Add default value APIs."
     org.apache.iceberg:iceberg-core:
     - code: "java.class.defaultSerializationChanged"
       old: "class org.apache.iceberg.PartitionData"
@@ -1088,6 +1093,10 @@ acceptedBreaks:
       old: "class org.apache.iceberg.GenericManifestFile"
       new: "class org.apache.iceberg.GenericManifestFile"
       justification: "Serialization across versions is not supported"
+    - code: "java.class.defaultSerializationChanged"
+      old: "class org.apache.iceberg.io.WriteResult"
+      new: "class org.apache.iceberg.io.WriteResult"
+      justification: "Serialization across versions is not supported"
     - code: "java.class.removed"
       old: "enum org.apache.iceberg.BaseMetastoreTableOperations.CommitStatus"
       justification: "Removing deprecated code"
1 change: 1 addition & 0 deletions LICENSE
@@ -298,6 +298,7 @@ License: https://www.apache.org/licenses/LICENSE-2.0
 This product includes code from Delta Lake.
 
 * AssignmentAlignmentSupport is an independent development but UpdateExpressionsSupport in Delta was used as a reference.
+* RoaringPositionBitmap is a Java implementation of RoaringBitmapArray in Delta.
 
 Copyright: 2020 The Delta Lake Project Authors.
 Home page: https://delta.io/
@@ -65,7 +65,7 @@ public static AliyunOSSExtension initialize() {
     } else {
       LOG.info(
           "Initializing AliyunOSSExtension implementation with default AliyunOSSMockExtension");
-      extension = AliyunOSSMockExtension.builder().silent().build();
+      extension = AliyunOSSMockExtension.builder().build();
     }
 
     return extension;
@@ -70,7 +70,7 @@ public void testWrite() throws IOException {
     reset(ossMock);
 
     // Write large file.
-    writeAndVerify(ossMock, uri, randomData(32 * 1024 * 1024), arrayWrite);
+    writeAndVerify(ossMock, uri, randomData(32 * 1024), arrayWrite);
     verify(ossMock, times(1)).putObject(any());
     reset(ossMock);
   }
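
For scale, the two `randomData(...)` arguments in this hunk differ by a factor of 1024. The sketch below only spells out that arithmetic; the class and constant names are hypothetical and do not appear in the test.

```java
// Hypothetical constants that spell out the two payload sizes seen in the hunk above.
final class PayloadSizeSketch {
  static final int LARGER = 32 * 1024 * 1024; // 33,554,432 bytes (32 MiB)
  static final int SMALLER = 32 * 1024; // 32,768 bytes (32 KiB)

  public static void main(String[] args) {
    System.out.println(LARGER + " bytes vs " + SMALLER + " bytes");
    System.out.println("ratio: " + (LARGER / SMALLER)); // ratio: 1024
  }
}
```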