Skip to content

Conversation

@HeartSaVioR
Copy link
Contributor

What changes were proposed in this pull request?

This PR proposes to support Spark Connect on transformWithState in PySpark. The code is mostly reused between Pandas version and Row version.

We rely on PythonEvanType to determine the user facing type of API, hence no proto change.

Why are the changes needed?

The new API needs to be supported with Spark Connect.

Does this PR introduce any user-facing change?

Yes, we will expose a new API to be available in Spark Connect.

How was this patch tested?

New test suites.

Was this patch authored or co-authored using generative AI tooling?

No.

@HeartSaVioR
Copy link
Contributor Author

@hvanhovell @HyukjinKwon @jingz-db Please take a look, thanks!

@HeartSaVioR
Copy link
Contributor Author

Thanks! Merging to master.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Try to fix: #50734

@yaooqinn
Copy link
Member

Sorry I missed your comment @LuciferYang

This is reverted via 4c9c41e due to CI failures

@HeartSaVioR
Copy link
Contributor Author

HeartSaVioR commented Apr 27, 2025

Argh, sorry. CI made me fun of me. This did not seem to go and pick the result (or really just merge conflicts, don't quite remember). I'll make a forward fix.

@HeartSaVioR HeartSaVioR force-pushed the WIP-transform-with-state-python-in-spark-connect branch from 5d1213f to a091fcc Compare April 27, 2025 12:31
@HeartSaVioR
Copy link
Contributor Author

HeartSaVioR commented Apr 27, 2025

a091fcc

This is the fix I added on top of merged PR. I'll merge if the CI passes since it was approved without 2 lines of changes, and I could have just +1 to @LuciferYang and merged.

Copy link
Contributor

@LuciferYang LuciferYang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, LGTM (pending tests)

@HeartSaVioR
Copy link
Contributor Author

@HeartSaVioR
Copy link
Contributor Author

Thanks! Merging to master (again).

yhuang-db pushed a commit to yhuang-db/spark that referenced this pull request Jun 9, 2025
…e in PySpark

### What changes were proposed in this pull request?

This PR proposes to support Spark Connect on transformWithState in PySpark. The code is mostly reused between Pandas version and Row version.

We rely on PythonEvanType to determine the user facing type of API, hence no proto change.

### Why are the changes needed?

The new API needs to be supported with Spark Connect.

### Does this PR introduce _any_ user-facing change?

Yes, we will expose a new API to be available in Spark Connect.

### How was this patch tested?

New test suites.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes apache#50704 from HeartSaVioR/WIP-transform-with-state-python-in-spark-connect.

Authored-by: Jungtaek Lim <[email protected]>
Signed-off-by: Jungtaek Lim <[email protected]>
yhuang-db pushed a commit to yhuang-db/spark that referenced this pull request Jun 9, 2025
…e in PySpark

### What changes were proposed in this pull request?

This PR proposes to support Spark Connect on transformWithState in PySpark. The code is mostly reused between Pandas version and Row version.

We rely on PythonEvanType to determine the user facing type of API, hence no proto change.

### Why are the changes needed?

The new API needs to be supported with Spark Connect.

### Does this PR introduce _any_ user-facing change?

Yes, we will expose a new API to be available in Spark Connect.

### How was this patch tested?

New test suites.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes apache#50704 from HeartSaVioR/WIP-transform-with-state-python-in-spark-connect.

Authored-by: Jungtaek Lim <[email protected]>
Signed-off-by: Jungtaek Lim <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants