Spark 3.3: Remove unnecessary metadata columns reading when merge using Iceberg table #7985

ConeyLiu · 2023-07-04T11:57:11Z

There are many unnecessary metadata columns reading when merging using the Iceberg table. The problem should be caused by Spark 3.3 AddMetadataColumns rule which has been fixed in Spark 3.4.
In this PR, we add a rule to remove the unnecessary metadata column reading to fix the problem in Spark 3.3.

Before this PR:

After this PR:

ConeyLiu · 2023-07-04T11:59:50Z

Hi @rdblue @szehon-ho @aokolnychyi @RussellSpitzer @Fokko, could you please help to review this when you are free? Thanks a lot.

RussellSpitzer · 2023-07-04T13:45:08Z

@huaxingao I believe you did the Spark fix for this?

huaxingao · 2023-07-04T14:35:13Z

I think the problem has already been fixed in Spark 3.3 by this PR

ConeyLiu · 2023-07-04T15:43:07Z

Thanks @RussellSpitzer @huaxingao, I see, that's because the Spark 3.3.3 is not released.

ConeyLiu · 2023-07-05T03:00:49Z

Close this since Spark 3.3 has been fixed.

Remove unnecessary metadata columns reading

94086cf

github-actions bot added the spark label Jul 4, 2023

ConeyLiu closed this Jul 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Spark 3.3: Remove unnecessary metadata columns reading when merge using Iceberg table #7985

Spark 3.3: Remove unnecessary metadata columns reading when merge using Iceberg table #7985

Uh oh!

ConeyLiu commented Jul 4, 2023 •

edited

Loading

Uh oh!

ConeyLiu commented Jul 4, 2023

Uh oh!

RussellSpitzer commented Jul 4, 2023

Uh oh!

huaxingao commented Jul 4, 2023

Uh oh!

ConeyLiu commented Jul 4, 2023

Uh oh!

ConeyLiu commented Jul 5, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Spark 3.3: Remove unnecessary metadata columns reading when merge using Iceberg table #7985

Spark 3.3: Remove unnecessary metadata columns reading when merge using Iceberg table #7985

Uh oh!

Conversation

ConeyLiu commented Jul 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ConeyLiu commented Jul 4, 2023

Uh oh!

RussellSpitzer commented Jul 4, 2023

Uh oh!

huaxingao commented Jul 4, 2023

Uh oh!

ConeyLiu commented Jul 4, 2023

Uh oh!

ConeyLiu commented Jul 5, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ConeyLiu commented Jul 4, 2023 •

edited

Loading