[SPARK-27858][SQL] Fix for avro deserialization on union types with multiple non-null types #562

vinooganesh · 2019-05-28T20:17:53Z

What changes were proposed in this pull request?

This PR aims to fix an issue on a union avro type with more than one non-null value (for instance ["string", "null", "int"]) whose the deserialization to a DataFrame would throw a java.lang.ArrayIndexOutOfBoundsException. The issue was that the fieldWriter relied on the index from the avro schema before nulls were filtered out.

How was this patch tested?

A test for the case of multiple non-null values was added and the tests were run using sbt by running testOnly org.apache.spark.sql.avro.AvroSuite

…ultiple non-null types ## What changes were proposed in this pull request? This PR aims to fix an issue on a union avro type with more than one non-null value (for instance `["string", "null", "int"]`) whose the deserialization to a DataFrame would throw a `java.lang.ArrayIndexOutOfBoundsException`. The issue was that the `fieldWriter` relied on the index from the avro schema before nulls were filtered out. ## How was this patch tested? A test for the case of multiple non-null values was added and the tests were run using sbt by running `testOnly org.apache.spark.sql.avro.AvroSuite` Closes apache#24722 from gcmerz/master. Authored-by: Gabbi Merz <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>

vinooganesh · 2019-05-28T20:19:32Z

@gcmerz @schlosna @briandorne @gatesn

gatesn · 2019-05-28T20:56:56Z

👍

vinooganesh requested review from mccheah and yifeih May 28, 2019 20:33

mccheah approved these changes May 28, 2019

View reviewed changes

bulldozer-bot bot merged commit a2904ad into master May 28, 2019

bulldozer-bot bot deleted the vg/SPARK-27858 branch May 28, 2019 21:29

schlosna mentioned this pull request May 29, 2019

Error in AvroDeserialization code causes failure to deserialize union types ["null", "a", "b" ....] #560

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-27858][SQL] Fix for avro deserialization on union types with multiple non-null types #562

[SPARK-27858][SQL] Fix for avro deserialization on union types with multiple non-null types #562

Uh oh!

vinooganesh commented May 28, 2019

Uh oh!

vinooganesh commented May 28, 2019 •

edited

Loading

Uh oh!

gatesn commented May 28, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SPARK-27858][SQL] Fix for avro deserialization on union types with multiple non-null types #562

[SPARK-27858][SQL] Fix for avro deserialization on union types with multiple non-null types #562

Uh oh!

Conversation

vinooganesh commented May 28, 2019

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

vinooganesh commented May 28, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gatesn commented May 28, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vinooganesh commented May 28, 2019 •

edited

Loading