[SPARK-22428][DOC] Add spark application garbage collector configurat… #19826
|
Test build #3994 has finished for PR 19826 at commit
|
docs/configuration.md
Outdated
| <td> | ||
| Controls whether the cleaning thread should block on cleanup tasks (other than shuffle, which is controlled by | ||
| spark.cleaner.referenceTracking.blocking.shuffle Spark property).<br><br> | ||
| It is true as a workaround to <a href="https://issues.apache.org/jira/browse/SPARK-3015"> |
I think you can remove comments like this, or leave them to the source code. This doesn't directly help the reader understand when or if to change it. Or, add context here about why one would set this.
Removed.
docs/configuration.md
Outdated
| <td>true</td> | ||
| <td> | ||
| Controls whether the cleaning thread should block on cleanup tasks (other than shuffle, which is controlled by | ||
| spark.cleaner.referenceTracking.blocking.shuffle Spark property).<br><br> |
You might surround property names in <code> for extra clarity.
Surrounded.
docs/configuration.md
Outdated
| <td><code>spark.cleaner.periodicGC.interval</code></td> | ||
| <td>30min</td> | ||
| <td> | ||
| Controls how often to trigger a garbage collection. |
You might give at least a sentence of explanation about what this has to do with the context cleaner.
Gave a more detailed description.
docs/configuration.md
Outdated
| <td><code>spark.cleaner.referenceTracking</code></td> | ||
| <td>true</td> | ||
| <td> | ||
| Controls whether a ContextCleaner should be created when a SparkContext initializes. |
I'd just say it controls whether context cleaning is enabled at all. This sort of suggests it controls whether it's created at a certain time as opposed to another time.
Rephrased.
|
Test build #3995 has finished for PR 19826 at commit
|
docs/configuration.md
Outdated
| </tr> | ||
| </table> | ||
|
|
||
| ### Spark Application Garbage Collector |
Pardon, I just noticed one more thing. This is really about the Context Cleaner, not garbage collection per se, right? I'd probably put these properties under Memory Management instead of a new section.
Moved as asked.
|
retest this please |
| </tr> | ||
| </table> | ||
|
|
||
|
|
Maybe I am insane, but let's revert this unneeded change.
Reverted.
|
Test build #84343 has finished for PR 19826 at commit
|
|
retest this please |
|
Test build #84358 has finished for PR 19826 at commit
|
|
Merged to master |
What changes were proposed in this pull request?
The Spark properties for configuring the ContextCleaner are not documented in the official documentation at https://spark.apache.org/docs/latest/configuration.html#available-properties.
This PR adds that documentation.
How was this patch tested?
Manual.
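For readers who want to try the properties discussed in this PR, here is a minimal sketch of how overrides might be turned into `spark-submit --conf` arguments. The property names and defaults are the ones visible in the diff snippets above; the helper `cleaner_conf_args`, the application name `my_app.py`, and the `15min` value are hypothetical and chosen only for illustration, not as recommendations.

```python
import shlex

# Property names and default values as they appear in the docs/configuration.md
# diff reviewed in this PR. Other cleaner properties exist but are left out here.
CLEANER_DEFAULTS = {
    "spark.cleaner.periodicGC.interval": "30min",
    "spark.cleaner.referenceTracking": "true",
    "spark.cleaner.referenceTracking.blocking": "true",
}


def cleaner_conf_args(overrides):
    """Return `--conf key=value` arguments for the given property overrides.

    Hypothetical helper: it only accepts the properties listed above, so a
    typo in a property name fails early instead of being silently ignored.
    """
    unknown = set(overrides) - set(CLEANER_DEFAULTS)
    if unknown:
        raise KeyError(f"not a cleaner property from this PR: {sorted(unknown)}")
    args = []
    for key in sorted(overrides):
        args += ["--conf", f"{key}={overrides[key]}"]
    return args


# Assumed scenario: trigger the periodic GC every 15 minutes instead of 30.
args = cleaner_conf_args({"spark.cleaner.periodicGC.interval": "15min"})
command = ["spark-submit", *args, "my_app.py"]  # my_app.py is a placeholder
print(shlex.join(command))
```

The same key/value pairs could instead be placed in `spark-defaults.conf` or set on a `SparkConf` before the context is created; the command-line form is used here only because it needs no Spark installation to demonstrate.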