Commit db538b2
Marcelo Vanzin
[SPARK-24552][CORE][SQL][BRANCH-2.3] Use unique id instead of attempt number for writes .
This passes a unique attempt id instead of attempt number to v2
data sources and hadoop APIs, because attempt number is reused
when stages are retried. When attempt numbers are reused, sources
that track data by partition id and attempt number may incorrectly
clean up data because the same attempt number can be both committed
and aborted.
Author: Marcelo Vanzin <vanzin@cloudera.com>
Closes #21615 from vanzin/SPARK-24552-2.3.1 parent a1e9640 commit db538b2
File tree
2 files changed
+9
-2
lines changed- core/src/main/scala/org/apache/spark/internal/io
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2
2 files changed
+9
-2
lines changedLines changed: 5 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
76 | 76 | | |
77 | 77 | | |
78 | 78 | | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
79 | 83 | | |
80 | 84 | | |
81 | 85 | | |
82 | 86 | | |
83 | 87 | | |
84 | 88 | | |
85 | | - | |
| 89 | + | |
86 | 90 | | |
87 | 91 | | |
88 | 92 | | |
| |||
Lines changed: 4 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
123 | 123 | | |
124 | 124 | | |
125 | 125 | | |
126 | | - | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
127 | 130 | | |
128 | 131 | | |
129 | 132 | | |
| |||
0 commit comments