
Conversation

@cfmcgrady (Contributor) commented Dec 20, 2021

Why are the changes needed?

  1. Move the spark-3.1 ForcedMaxOutputRowsRule to spark-common and rename it to ForcedMaxOutputRowsBase
  2. Handle the WithCTE logical plan in spark-3.2
  3. Move the spark-3.1 MaxPartitionStrategy to spark-common
  4. Add a nested CTE unit test for ForcedMaxOutputRowsRule
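The rule being refactored can be illustrated with a minimal, self-contained toy model (all names below — Plan, Scan, Limit, ForceMaxOutputRows — are illustrative stand-ins, not Kyuubi's or Spark's actual API): a ForcedMaxOutputRows-style rule leaves a plan alone when it already bounds its output, and otherwise forces a LIMIT on top.

```scala
// Toy plan model -- illustrative only, not Spark's Catalyst API.
sealed trait Plan { def maxRows: Option[Long] }

case class Scan(table: String) extends Plan {
  val maxRows: Option[Long] = None // a bare table scan has no row bound
}

case class Limit(n: Long, child: Plan) extends Plan {
  // LIMIT bounds the output by n (or tighter, if the child is already bounded)
  val maxRows: Option[Long] = Some(child.maxRows.fold(n)(math.min(_, n)))
}

// A ForcedMaxOutputRows-style rule: keep plans that already guarantee at most
// `max` rows, otherwise wrap them in a forced LIMIT.
object ForceMaxOutputRows {
  def apply(plan: Plan, max: Long): Plan =
    if (plan.maxRows.exists(_ <= max)) plan else Limit(max, plan)
}
```

In this sketch the decision hinges entirely on `maxRows`, which is why the WithCTE change below matters: a wrapper node that reports no bound defeats the check.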

How was this patch tested?

  • Add some test cases that check the changes thoroughly including negative and positive cases if possible

  • Add screenshots for manual tests if appropriate

  • Run tests locally before making a pull request

@codecov-commenter commented Dec 20, 2021

Codecov Report

Merging #1591 (5399a3f) into master (df1d9f3) will decrease coverage by 0.95%.
The diff coverage is 0.00%.

Impacted file tree graph

@@             Coverage Diff              @@
##             master    #1591      +/-   ##
============================================
- Coverage     59.02%   58.06%   -0.96%     
+ Complexity      196      140      -56     
============================================
  Files           256      256              
  Lines         12708    12683      -25     
  Branches       1601     1596       -5     
============================================
- Hits           7501     7365     -136     
- Misses         4570     4695     +125     
+ Partials        637      623      -14     
Impacted Files Coverage Δ
.../kyuubi/sql/watchdog/ForcedMaxOutputRowsBase.scala 0.00% <0.00%> (ø)
.../kyuubi/sql/watchdog/KyuubiWatchDogException.scala 0.00% <ø> (ø)
...che/kyuubi/sql/watchdog/MaxPartitionStrategy.scala 0.00% <ø> (ø)
...che/spark/sql/PruneFileSourcePartitionHelper.scala 0.00% <ø> (ø)
...he/kyuubi/engine/spark/repl/KyuubiSparkILoop.scala 90.16% <0.00%> (-3.28%) ⬇️
.../kyuubi/sql/watchdog/ForcedMaxOutputRowsRule.scala

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update df1d9f3...5399a3f.

@cfmcgrady (Contributor, Author) commented:

cc @ulysses-you and watchdog original author @zhouyifan279 @i7xh

|SELECT * FROM t2
|$sort
|LIMIT $limit
|""".stripMargin).queryExecution.optimizedPlan.maxRows.contains(expected))
@cfmcgrady (Contributor, Author) commented:
As WithCTE.maxRows is None, we should check optimizedPlan here.
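The point can be shown with a self-contained toy model (WithCte and inlineCte below are illustrative stand-ins, not Spark's classes): the analyzed plan's CTE wrapper reports no row bound, but once an optimizer step inlines the CTE the LIMIT's bound becomes visible — which is why the assertion targets optimizedPlan rather than the analyzed plan.

```scala
sealed trait Plan { def maxRows: Option[Long] }
case class Scan(table: String) extends Plan { val maxRows: Option[Long] = None }
case class Limit(n: Long, child: Plan) extends Plan { val maxRows: Option[Long] = Some(n) }

// Stand-in for Spark's WithCTE wrapper: it does not propagate its body's bound.
case class WithCte(body: Plan) extends Plan { val maxRows: Option[Long] = None }

// Stand-in for the optimizer step that inlines CTE definitions,
// removing the wrapper and exposing the LIMIT underneath.
def inlineCte(p: Plan): Plan = p match {
  case WithCte(body)   => inlineCte(body)
  case Limit(n, child) => Limit(n, inlineCte(child))
  case other           => other
}
```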

|$having
|$sort
|LIMIT $limit
|""".stripMargin).queryExecution.optimizedPlan.maxRows.contains(expected))

}

trait MarkAggregateOrderBase extends Rule[LogicalPlan] {
Contributor commented:
For Spark 3.2, we don't need MarkAggregateOrderBase for the aggregate case since apache/spark#32470; we only need to handle CTE.
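Under that premise, the Spark 3.2 rule only needs to recurse through the CTE wrapper before forcing the limit. A minimal self-contained sketch (Plan, WithCte, forceLimit are illustrative names, not Kyuubi's implementation):

```scala
sealed trait Plan { def maxRows: Option[Long] }
case class Scan(table: String) extends Plan { val maxRows: Option[Long] = None }
case class Limit(n: Long, child: Plan) extends Plan { val maxRows: Option[Long] = Some(n) }
case class WithCte(body: Plan) extends Plan { val maxRows: Option[Long] = None }

// Force a row bound, descending only through the CTE wrapper -- no special
// aggregate/order handling is needed in this sketch.
def forceLimit(p: Plan, max: Long): Plan = p match {
  case WithCte(body)                           => WithCte(forceLimit(body, max))
  case other if other.maxRows.exists(_ <= max) => other
  case other                                   => Limit(max, other)
}
```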

@ulysses-you (Contributor) commented:

thanks, merging to master
