Skip to content

[SPARK-15719][SQL] Disables writing Parquet summary files by default#13455

Closed
liancheng wants to merge 3 commits into
apache:masterfrom
liancheng:spark-15719-disable-parquet-summary-files
Closed

[SPARK-15719][SQL] Disables writing Parquet summary files by default#13455
liancheng wants to merge 3 commits into
apache:masterfrom
liancheng:spark-15719-disable-parquet-summary-files

Conversation

@liancheng

Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

This PR disables writing Parquet summary files by default (i.e., when Hadoop configuration "parquet.enable.summary-metadata" is not set).

Please refer to SPARK-15719 for more details.

How was this patch tested?

New test case added in ParquetQuerySuite to check no summary files are written by default.

@yhuai

yhuai commented Jun 1, 2016

Copy link
Copy Markdown
Contributor

lgtm pending jenkins.

@SparkQA

SparkQA commented Jun 2, 2016

Copy link
Copy Markdown

Test build #59783 has finished for PR 13455 at commit bb93f5b.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Jun 2, 2016

Copy link
Copy Markdown

Test build #59816 has finished for PR 13455 at commit 4a17892.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Jun 2, 2016

Copy link
Copy Markdown

Test build #59877 has finished for PR 13455 at commit 81ebfcd.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@liancheng

Copy link
Copy Markdown
Contributor Author

Merging to master and branch-2.0.

asfgit pushed a commit that referenced this pull request Jun 2, 2016
## What changes were proposed in this pull request?

This PR disables writing Parquet summary files by default (i.e., when Hadoop configuration "parquet.enable.summary-metadata" is not set).

Please refer to [SPARK-15719][1] for more details.

## How was this patch tested?

New test case added in `ParquetQuerySuite` to check no summary files are written by default.

[1]: https://issues.apache.org/jira/browse/SPARK-15719

Author: Cheng Lian <lian@databricks.com>

Closes #13455 from liancheng/spark-15719-disable-parquet-summary-files.

(cherry picked from commit 4315427)
Signed-off-by: Cheng Lian <lian@databricks.com>
@asfgit asfgit closed this in 4315427 Jun 2, 2016
@liancheng liancheng deleted the spark-15719-disable-parquet-summary-files branch June 2, 2016 23:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants