[SPARK-48602][SQL] Make csv generator support different output style with spark.sql.binaryOutputStyle #46956

yaooqinn · 2024-06-12T12:16:33Z

What changes were proposed in this pull request?

In SPARK-47911, we introduced a universal BinaryFormatter to make binary output consistent
across all clients, such as beeline, spark-sql, and spark-shell, for both primitive and nested binaries.

But unfortunately, to_csv and csv writer have interceptors for binary output which is hard-coded to use SparkStringUtils.getHexString. In this PR we make it also configurable.

Why are the changes needed?

feature parity

Does this PR introduce any user-facing change?

Yes, we have make spark.sql.binaryOutputStyle work for csv but the AS-IS behavior is kept.

How was this patch tested?

new tests

Was this patch authored or co-authored using generative AI tooling?

no

…with spark.sql.binaryOutputStyle

yaooqinn · 2024-06-13T02:45:15Z

cc @dongjoon-hyun @HyukjinKwon @cloud-fan @LuciferYang @ulysses-you thanks

yaooqinn · 2024-06-13T03:52:15Z

Merged to master, thank you @ulysses-you

HyukjinKwon

LGTM2

[SPARK-48602][SQL] Make csv generator support different output style …

e886603

…with spark.sql.binaryOutputStyle

github-actions bot added the SQL label Jun 12, 2024

ulysses-you approved these changes Jun 13, 2024

View reviewed changes

yaooqinn closed this in ea2bca7 Jun 13, 2024

yaooqinn deleted the SPARK-48602 branch June 13, 2024 03:51

HyukjinKwon reviewed Jun 17, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-48602][SQL] Make csv generator support different output style with spark.sql.binaryOutputStyle #46956

[SPARK-48602][SQL] Make csv generator support different output style with spark.sql.binaryOutputStyle #46956

Uh oh!

yaooqinn commented Jun 12, 2024

Uh oh!

yaooqinn commented Jun 13, 2024

Uh oh!

yaooqinn commented Jun 13, 2024

Uh oh!

HyukjinKwon left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-48602][SQL] Make csv generator support different output style with spark.sql.binaryOutputStyle #46956

[SPARK-48602][SQL] Make csv generator support different output style with spark.sql.binaryOutputStyle #46956

Uh oh!

Conversation

yaooqinn commented Jun 12, 2024

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

yaooqinn commented Jun 13, 2024

Uh oh!

yaooqinn commented Jun 13, 2024

Uh oh!

HyukjinKwon left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants