Skip to content

Conversation

@AngersZhuuuu
Copy link
Contributor

What changes were proposed in this pull request?

Ideally, in SQL query, nested columns should result to GetStructField with non-None name. But there are places that can create GetStructField with None name, such as UnresolvedStar.expand, Dataset encoder stuff, etc.
the current nestedFieldToAlias cannot catch it up and will cause job failed.

Why are the changes needed?

Fix bug

Does this PR introduce any user-facing change?

No

How was this patch tested?

Added UT,

@AngersZhuuuu
Copy link
Contributor Author

FYI @viirya

@github-actions github-actions bot added the SQL label Jul 6, 2021
@viirya viirya changed the title [SPARK-35972][SQL][3.0] When replace ExtractValue in NestedColumnAliasing we should use semanticEquals [SPARK-35972][SQL][3.1] When replace ExtractValue in NestedColumnAliasing we should use semanticEquals Jul 6, 2021
@SparkQA
Copy link

SparkQA commented Jul 6, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45203/

@SparkQA
Copy link

SparkQA commented Jul 6, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45203/

@SparkQA
Copy link

SparkQA commented Jul 6, 2021

Test build #140692 has finished for PR 33227 at commit bb5694c.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AngersZhuuuu
Copy link
Contributor Author

@viirya gen_alias can't have a stable value. just remove the ut?

Comment on lines 707 to 709
$"_extract_search_params".as("value"),
$"_extract_search_params.col1".as("col1"),
$"_extract_search_params.col2".as("col2")).analyze
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can use collectGeneratedAliases.

@SparkQA
Copy link

SparkQA commented Jul 16, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45635/

@SparkQA
Copy link

SparkQA commented Jul 16, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/45635/

@SparkQA
Copy link

SparkQA commented Jul 16, 2021

Test build #141122 has finished for PR 33227 at commit d8270ce.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@viirya
Copy link
Member

viirya commented Jul 16, 2021

Thanks. Merging to 3.1.

viirya pushed a commit that referenced this pull request Jul 16, 2021
…sing we should use semanticEquals

### What changes were proposed in this pull request?
Ideally, in SQL query, nested columns should result to GetStructField with non-None name. But there are places that can create GetStructField with None name, such as UnresolvedStar.expand, Dataset encoder stuff, etc.
the current `nestedFieldToAlias` cannot catch it up and will cause job failed.

### Why are the changes needed?
Fix bug

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Added UT,

Closes #33227 from AngersZhuuuu/SPARK-35972-3.0.

Authored-by: Angerszhuuuu <[email protected]>
Signed-off-by: Liang-Chi Hsieh <[email protected]>
@viirya viirya closed this Jul 16, 2021
flyrain pushed a commit to flyrain/spark that referenced this pull request Sep 21, 2021
…sing we should use semanticEquals

### What changes were proposed in this pull request?
Ideally, in SQL query, nested columns should result to GetStructField with non-None name. But there are places that can create GetStructField with None name, such as UnresolvedStar.expand, Dataset encoder stuff, etc.
the current `nestedFieldToAlias` cannot catch it up and will cause job failed.

### Why are the changes needed?
Fix bug

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Added UT,

Closes apache#33227 from AngersZhuuuu/SPARK-35972-3.0.

Authored-by: Angerszhuuuu <[email protected]>
Signed-off-by: Liang-Chi Hsieh <[email protected]>
fishcus pushed a commit to fishcus/spark that referenced this pull request Jan 12, 2022
…sing we should use semanticEquals

### What changes were proposed in this pull request?
Ideally, in SQL query, nested columns should result to GetStructField with non-None name. But there are places that can create GetStructField with None name, such as UnresolvedStar.expand, Dataset encoder stuff, etc.
the current `nestedFieldToAlias` cannot catch it up and will cause job failed.

### Why are the changes needed?
Fix bug

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Added UT,

Closes apache#33227 from AngersZhuuuu/SPARK-35972-3.0.

Authored-by: Angerszhuuuu <[email protected]>
Signed-off-by: Liang-Chi Hsieh <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants