[SPARK-12532] [SQL] Join-key Pushdown via Predicate Transitivity #10490

gatorsmile · 2015-12-27T19:03:17Z

More predicates in join conditions/filters can be inferred and pushed down via predicate transitivity. More predicate pushdown could greatly improve the join performance.

For example, we can infer the extra predicate upperCaseData.N = 3 in the following query:

"SELECT * 
 FROM upperCaseData JOIN lowerCaseData 
 WHERE lowerCaseData.n = upperCaseData.N AND lowerCaseData.n = 3"

Before the improvement, the optimized logical plan is

== Optimized Logical Plan ==
Project [N#16,L#17,n#18,l#19]
+- Join Inner, Some((n#18 = N#16))
   :- LogicalRDD [N#16,L#17], MapPartitionsRDD[17] at beforeAll at BeforeAndAfterAll.scala:187
   +- Filter (n#18 = 3)
      +- LogicalRDD [n#18,l#19], MapPartitionsRDD[19] at beforeAll at BeforeAndAfterAll.scala:187

After the improvement, the optimized logical plan should be like

== Optimized Logical Plan ==
Project [N#16,L#17,n#18,l#19]
+- Join Inner, Some((n#18 = N#16))
   :- Filter (N#16 = 3)
   :  +- LogicalRDD [N#16,L#17], MapPartitionsRDD[17] at beforeAll at BeforeAndAfterAll.scala:187
   +- Filter (n#18 = 3)
      +- LogicalRDD [n#18,l#19], MapPartitionsRDD[19] at beforeAll at BeforeAndAfterAll.scala:187

SparkQA · 2015-12-27T20:48:08Z

Test build #48355 has finished for PR 10490 at commit 918ea2c.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2016-01-12T02:02:23Z

Thanks, close it now.

gatorsmile added 2 commits December 27, 2015 10:42

Infer and push down join/filter conditions

fb84dba

Merge remote-tracking branch 'upstream/master' into conditionInfer

918ea2c

gatorsmile closed this Jan 12, 2016

gatorsmile mentioned this pull request Mar 11, 2016

[SPARK-13789] Infer additional constraints from attribute equality #11618

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-12532] [SQL] Join-key Pushdown via Predicate Transitivity #10490

[SPARK-12532] [SQL] Join-key Pushdown via Predicate Transitivity #10490

Uh oh!

gatorsmile commented Dec 27, 2015

Uh oh!

SparkQA commented Dec 27, 2015

Uh oh!

gatorsmile commented Jan 12, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[SPARK-12532] [SQL] Join-key Pushdown via Predicate Transitivity #10490

[SPARK-12532] [SQL] Join-key Pushdown via Predicate Transitivity #10490

Uh oh!

Conversation

gatorsmile commented Dec 27, 2015

Uh oh!

SparkQA commented Dec 27, 2015

Uh oh!

gatorsmile commented Jan 12, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants