[SPARK-18107][SQL][FOLLOW-UP] Insert overwrite statement runs much slower in spark-sql than it does in hive-client by viirya · Pull Request #15726 · apache/spark

viirya · 2016-11-02T02:30:08Z

What changes were proposed in this pull request?

As reported on the jira, insert overwrite statement runs much slower in Spark, compared with hive-client.

We have addressed this issue for static partition at #15667. This is a follow-up pr for #15667 to address dynamic partition.

How was this patch tested?

Jenkins tests.

There are existing tests using insert overwrite statement. Those tests should be passed. I added a new test to specially test insert overwrite into dynamic partition.

For performance issue, as I don't have Hive 2.0 environment, this needs the reporter to verify it. Please refer to the jira.

Please review https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark before opening a pull request.

viirya · 2016-11-02T02:31:15Z

cc @snodawn Would you like to test this patch for dynamic partition? Thanks.

SparkQA · 2016-11-02T04:42:16Z

Test build #67945 has finished for PR 15726 at commit eae8f1a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-11-02T07:06:38Z

cc @ericl

Do we need to do this for data source tables?

ericl · 2016-11-02T07:18:05Z

In datasource tables we already delete the partition beforehand, so this
should not be needed (we also don't follow the hive insert path so don't
know if the perf regression exists).

On Wed, Nov 2, 2016, 12:07 AM Reynold Xin notifications@github.com wrote:

cc @ericl https://github.com/ericl

Do we need to do this for data source tables?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#15726 (comment), or mute
the thread
https://github.com/notifications/unsubscribe-auth/AAA6SlK6-sNgly91PcU9d1v8qnwvNpdaks5q6Da5gaJpZM4Kmy8E
.

snodawn · 2016-11-02T07:24:41Z

@viirya Ok, I will try it soon.

snodawn · 2016-11-02T11:47:08Z

I have tested the new patch for dynamic partition. It still costs a long time in running overwrite statement as the same with hive 1.2.1. The execution logs show that when running in dynamic partition it move each file of the partition to .Trash instead of the whole partition, which may cost a lot of time in this way.

viirya · 2016-11-02T13:47:17Z

@snodawn Thanks for reporting this.

One thing I want to make sure is how you test that? Are you insert the partition first and then overwrite to the existing partition? Or you just use insert overwrite to write to a new partition, i.e., actually it is not overwriting?

viirya · 2016-11-02T15:28:12Z

@snodawn OK. I got the reason why dynamic partition is still much slower than Hive 2.0.

There is another patch to optimize dynamic partition, apache/hive@d297b51.

Basically it optimizes the sequential dynamic partition insertion as many asynchronous tasks with an executor pool.

We can also do it in InsertIntoHiveTable for dynamic partition. Don't know if it is worth? @ericl @rxin What do you think?

ericl · 2016-11-02T20:18:24Z

IIUC, you would have to call loadPartition in parallel for each new partition created instead of loadDynamicPartitions once? There might be an issue there since each Hive client operation in Spark currently holds a global lock. So it would all be serialized anyways.

viirya · 2016-11-03T02:39:59Z

@ericl Thanks. Looks like we have more than one level lock (at least two in HiveExternalCatalog, HiveClientImpl). This might hard to tackle.

Although it is still possibly to have a workaround by having customized methods to wrap those loadPartition as one task which obtains the locks, I think the work may not be worth.

@ericl @rxin Do you agree that?

snodawn · 2016-11-03T02:47:10Z

@viirya I test both inserting overwrite a new partition and a existing partition. Of cause, inserting overwrite a new partition runs faster.

ericl · 2016-11-03T18:39:51Z

Yeah, this sounds like more complexity than it's worth. We should probably fix the hive client locking issue first.

…rtoverwrite-followup

viirya · 2016-11-04T01:42:07Z

@ericl About the hive client locking issue, any thing you can suggest?

SparkQA · 2016-11-04T04:12:48Z

Test build #68102 has finished for PR 15726 at commit 4624f1a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

ericl · 2016-11-04T04:52:02Z

Hm, iirc the issue is not super hard to fix, but basically since the hive
thrift client is not thread safe, only one client can use it at a time. We
would need some sort of hive client pool to solve the issue (search for
retryLocked in the hive client management code to see the global lock.)

On Thu, Nov 3, 2016, 9:14 PM UCB AMPLab notifications@github.com wrote:

Merged build finished. Test PASSed.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#15726 (comment), or mute
the thread
https://github.com/notifications/unsubscribe-auth/AAA6Slm6mZjAdO3bnW30wira4IoDnSDMks5q6rE2gaJpZM4Kmy8E
.

viirya · 2016-11-04T08:27:22Z

@ericl yeah, I have checked the codes with retryLocked in HiveClientImpl. Do you mean we can create multiple hive client in the pool and serve concurrently?

ericl · 2016-11-04T19:57:22Z

@viirya I think there are a few options there, either HiveClientImpl can create multiple internal thrift clients (see private def client there), or the external catalog could create multiple clients.

viirya · 2016-11-05T01:48:01Z

@ericl Currently I prefer the first one, let HiveClientImpl create multiple internal thrift clients, since I don't like to change external catalog for this.

ericl · 2016-11-05T02:15:34Z

@viirya that makes sense to me

viirya · 2016-11-21T23:31:06Z

@ericl I am thinking this recently. What I am not very sure is this multiple hive client approach is safe to use under multiple thread environment. E.g., for now, because we synchronize on the single hive client, we run hive operations in sequence. Once we have multiple hive clients, would the concurrent hive operations conflict each other?

My first thought is because the hive operations use metastore, these operations would need to acquire some locks on the items (e.g., tables) in metastore before running. Is my guess correct or not?

ericl · 2016-11-22T01:07:38Z

@yhuai do you know if it would be safe to have multiple concurrent Hive operations in HiveClientImpl. From a cursory audit of the code it seems that only thread-local state is mutated for withHiveState so maybe it's no different from having multiple Spark clusters connect to the same metastore.

viirya · 2017-01-05T03:05:52Z

I would close this for now and may be reopen this when we get correct answer from @yhuai.

yuananf · 2017-04-06T10:21:36Z

@viirya Is it possible to upgrade the built-in hive-exec to resolve this problem? We are facing the same problem, insert overwrite dynamic partition is extremely slow, the data written is over in 5 minutes, but the following action takes more than 1 hour.

I believe the built-in hive-exec is this https://github.com/JoshRosen/hive/tree/release-1.2.1-spark2
Can we just upgrade this to release-2.1.1-spark2 or something else?

grantnicholas · 2018-01-18T16:36:22Z

@viirya @yuananf a quick check of recent spark releases shows this fix is not in. Any suggested workarounds in the meantime for dynamic partition insert overwrites?

It sounds like if the user does the logic of deleting the necessary partitions before running the dynamic insert overwrite query then hive will go down the "happy" performant path. This will require calculating the dynamic partitions before running the insert query, but if you can do that then this workaround will work right?

kaiseu · 2019-05-16T13:32:03Z

Do we have any solutions so far to resolve or workaround this issue? Spark2.4.3 also encountered this problem.

viirya · 2019-05-16T13:59:17Z

Hmm, since Spark community is working on upgrading Hive version in Spark, I think once it is done, this shouldn't be an issue after that.

viirya added 2 commits November 1, 2016 14:03

Address dynamic partition.

a0060ab

Add comments.

eae8f1a

Merge remote-tracking branch 'upstream/master' into improve-hive-inse…

4624f1a

…rtoverwrite-followup

viirya closed this Jan 5, 2017

yaooqinn mentioned this pull request May 12, 2020

[SPARK-31684][SQL] Overwrite partition failed with 'WRONG FS' when the target partition is not belong to the filesystem as same as the table #28511

Closed

viirya deleted the improve-hive-insertoverwrite-followup branch December 27, 2023 18:34

Uh oh!

Conversation

viirya commented Nov 2, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

viirya commented Nov 2, 2016

Uh oh!

SparkQA commented Nov 2, 2016

Uh oh!

rxin commented Nov 2, 2016

Uh oh!

ericl commented Nov 2, 2016

Uh oh!

snodawn commented Nov 2, 2016

Uh oh!

snodawn commented Nov 2, 2016

Uh oh!

viirya commented Nov 2, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

viirya commented Nov 2, 2016

Uh oh!

ericl commented Nov 2, 2016

Uh oh!

viirya commented Nov 3, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

snodawn commented Nov 3, 2016

Uh oh!

ericl commented Nov 3, 2016

Uh oh!

viirya commented Nov 4, 2016

Uh oh!

SparkQA commented Nov 4, 2016

Uh oh!

ericl commented Nov 4, 2016

Uh oh!

viirya commented Nov 4, 2016

Uh oh!

ericl commented Nov 4, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

viirya commented Nov 5, 2016

Uh oh!

ericl commented Nov 5, 2016

Uh oh!

viirya commented Nov 21, 2016

Uh oh!

ericl commented Nov 22, 2016

Uh oh!

viirya commented Jan 5, 2017

Uh oh!

yuananf commented Apr 6, 2017

Uh oh!

grantnicholas commented Jan 18, 2018

Uh oh!

kaiseu commented May 16, 2019

Uh oh!

viirya commented May 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

viirya commented Nov 2, 2016 •

edited

Loading

viirya commented Nov 3, 2016 •

edited

Loading

ericl commented Nov 4, 2016 •

edited

Loading