-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-16613] [CORE] RDD.pipe returns values for empty partitions #14260
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #62521 has finished for PR 14260 at commit
|
|
Is it possible that the underlying command always return something even for 0 rows? e.g. if it is counting the number of elements? |
|
Yeah that's the 'problem' -- consider |
|
Test build #62595 has finished for PR 14260 at commit
|
|
Jenkins retest this please |
|
Test build #62601 has finished for PR 14260 at commit
|
|
LGTM |
|
Merging in master/2.0. |
## What changes were proposed in this pull request? Document RDD.pipe semantics; don't execute process for empty input partitions. Note this includes the fix in #14256 because it's necessary to even test this. One or the other will merge the fix. ## How was this patch tested? Jenkins tests including new test. Author: Sean Owen <[email protected]> Closes #14260 from srowen/SPARK-16613. (cherry picked from commit 4b079dc) Signed-off-by: Reynold Xin <[email protected]>
What changes were proposed in this pull request?
Document RDD.pipe semantics; don't execute process for empty input partitions.
Note this includes the fix in #14256 because it's necessary to even test this. One or the other will merge the fix.
How was this patch tested?
Jenkins tests including new test.