Skip to content

[SPARK-26078][SQL] Dedup self-join attributes on IN subqueries#23057

Closed
mgaido91 wants to merge 10 commits into
apache:masterfrom
mgaido91:SPARK-26078
Closed

[SPARK-26078][SQL] Dedup self-join attributes on IN subqueries#23057
mgaido91 wants to merge 10 commits into
apache:masterfrom
mgaido91:SPARK-26078

Conversation

@mgaido91

Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

When there is a self-join as result of a IN subquery, the join condition may be invalid, resulting in trivially true predicates and return wrong results.

The PR deduplicates the subquery output in order to avoid the issue.

How was this patch tested?

added UT

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants