Skip to content

[SPARK-14344][SQL] Not creating meta files when summary-metadata property is false in parquet#12355

Closed
kasjain wants to merge 3 commits into
apache:branch-1.3from
kasjain:branch-1.3
Closed

[SPARK-14344][SQL] Not creating meta files when summary-metadata property is false in parquet#12355
kasjain wants to merge 3 commits into
apache:branch-1.3from
kasjain:branch-1.3

Conversation

@kasjain

@kasjain kasjain commented Apr 13, 2016

Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

This change fixes a bug related to the property "parquet.enable.summary-metadata". Currently, the property's value is not considered before writing the metaData (_metadata and _common-metadata files) in the "saveAsParquetFile" code-flow. Hence making this property false still created the meta files.

How was this patch tested?

Integration tests and manual tests

@kasjain

kasjain commented Apr 13, 2016

Copy link
Copy Markdown
Contributor Author

Can one of the admins verify this patch?

1 similar comment
@AmplabJenkins

Copy link
Copy Markdown

Can one of the admins verify this patch?

@HyukjinKwon

Copy link
Copy Markdown
Member

It seems this is fixed in #13455? Could you confirm this please @kasjain ?

@HyukjinKwon

HyukjinKwon commented Oct 5, 2016

Copy link
Copy Markdown
Member

It seems working fine now. Therefore, it seems not a problem.

test("SPARK-14344 - write metadata")
  withSQLConf(ParquetOutputFormat.ENABLE_JOB_SUMMARY -> "true") {
    withTempPath { dir =>
      val path = s"${dir.getCanonicalPath}/part-r-0.parquet"
      spark.range(10).write.parquet(path)
      val files = new File(path).listFiles()
      assert(files.exists(_.getName.endsWith("_common_metadata")))
    }
  }

  withSQLConf(ParquetOutputFormat.ENABLE_JOB_SUMMARY -> "false") {
    withTempPath { dir =>
      val path = s"${dir.getCanonicalPath}/part-r-0.parquet"
      spark.range(10).write.parquet(path)
      val files = new File(path).listFiles()
      assert(!files.exists(_.getName.endsWith("_common_metadata")))
    }
  }
}

srowen added a commit to srowen/spark that referenced this pull request Oct 12, 2016
@asfgit asfgit closed this in eb69335 Oct 12, 2016
zifeif2 pushed a commit to zifeif2/spark that referenced this pull request Nov 22, 2025
Closes apache#15303
Closes apache#15078
Closes apache#15080
Closes apache#15135
Closes apache#14565
Closes apache#12355
Closes apache#15404

Author: Sean Owen <sowen@cloudera.com>

Closes apache#15451 from srowen/CloseStalePRs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants