[SPARK-21980][SQL]References in grouping functions should be indexed with semanticEquals #19202

DonnyZone · 2017-09-12T09:36:36Z

What changes were proposed in this pull request?

https://issues.apache.org/jira/browse/SPARK-21980

This PR fixes the issue in ResolveGroupingAnalytics rule, which indexes the column references in grouping functions without considering case sensitive configurations.

The problem can be reproduced by:

val df = spark.createDataFrame(Seq((1, 1), (2, 1), (2, 2))).toDF("a", "b") df.cube("a").agg(grouping("A")).show()

How was this patch tested?

unit tests

DonnyZone · 2017-09-12T09:41:00Z

ping @cloud-fan @gatorsmile

gatorsmile · 2017-09-12T17:00:07Z

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala

          }
        case e @ Grouping(col: Expression) =>
-          val idx = groupByExprs.indexOf(col)
+          val idx = groupByExprs.indexWhere(x => resolver(x.toString, col.toString))


indexWhere(_.semanticEquals(col))

gatorsmile · 2017-09-12T17:04:37Z

ok to test

SparkQA · 2017-09-12T19:42:15Z

Test build #81688 has finished for PR 19202 at commit b08fd93.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2017-09-13T04:31:28Z

Test build #81699 has finished for PR 19202 at commit e24fdb8.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2017-09-13T04:49:37Z

Test build #81700 has finished for PR 19202 at commit 09efc4d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2017-09-13T17:06:51Z

LGTM

gatorsmile · 2017-09-13T17:07:03Z

Thanks! Merged to master.

… with semanticEquals ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-21980 This PR fixes the issue in ResolveGroupingAnalytics rule, which indexes the column references in grouping functions without considering case sensitive configurations. The problem can be reproduced by: `val df = spark.createDataFrame(Seq((1, 1), (2, 1), (2, 2))).toDF("a", "b") df.cube("a").agg(grouping("A")).show()` ## How was this patch tested? unit tests Author: donnyzone <[email protected]> Closes #19202 from DonnyZone/ResolveGroupingAnalytics. (cherry picked from commit 21c4450) Signed-off-by: gatorsmile <[email protected]>

… with semanticEquals ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-21980 This PR fixes the issue in ResolveGroupingAnalytics rule, which indexes the column references in grouping functions without considering case sensitive configurations. The problem can be reproduced by: `val df = spark.createDataFrame(Seq((1, 1), (2, 1), (2, 2))).toDF("a", "b") df.cube("a").agg(grouping("A")).show()` ## How was this patch tested? unit tests Author: donnyzone <[email protected]> Closes apache#19202 from DonnyZone/ResolveGroupingAnalytics. (cherry picked from commit 21c4450) Signed-off-by: gatorsmile <[email protected]>

DonnyZone added 2 commits September 12, 2017 17:28

SPARK-21980

ac61a66

rename

b08fd93

gatorsmile reviewed Sep 12, 2017

View reviewed changes

use semanticEquals

e24fdb8

DonnyZone changed the title ~~[SPARK-21980][SQL]References in grouping functions should be indexed with resolver~~ [SPARK-21980][SQL]References in grouping functions should be indexed with semanticEquals Sep 13, 2017

test name

09efc4d

asfgit closed this in 21c4450 Sep 13, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-21980][SQL]References in grouping functions should be indexed with semanticEquals #19202

[SPARK-21980][SQL]References in grouping functions should be indexed with semanticEquals #19202

Uh oh!

DonnyZone commented Sep 12, 2017 •

edited

Loading

Uh oh!

DonnyZone commented Sep 12, 2017

Uh oh!

gatorsmile Sep 12, 2017

Uh oh!

gatorsmile commented Sep 12, 2017

Uh oh!

SparkQA commented Sep 12, 2017

Uh oh!

SparkQA commented Sep 13, 2017

Uh oh!

SparkQA commented Sep 13, 2017

Uh oh!

gatorsmile commented Sep 13, 2017

Uh oh!

gatorsmile commented Sep 13, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-21980][SQL]References in grouping functions should be indexed with semanticEquals #19202

[SPARK-21980][SQL]References in grouping functions should be indexed with semanticEquals #19202

Uh oh!

Conversation

DonnyZone commented Sep 12, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

DonnyZone commented Sep 12, 2017

Uh oh!

gatorsmile Sep 12, 2017

Choose a reason for hiding this comment

Uh oh!

gatorsmile commented Sep 12, 2017

Uh oh!

SparkQA commented Sep 12, 2017

Uh oh!

SparkQA commented Sep 13, 2017

Uh oh!

SparkQA commented Sep 13, 2017

Uh oh!

gatorsmile commented Sep 13, 2017

Uh oh!

gatorsmile commented Sep 13, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

DonnyZone commented Sep 12, 2017 •

edited

Loading