[SPARK-30362][CORE] Update InputMetrics in DataSourceRDD #680
In the case of DS v2, InputMetrics are not updated.
Before Fix

After Fix we can see that Input Size / Records is updated in the UI.

InputMetrics like bytesRead and recordsRead should be updated.
Authored-by: sandeep katta [email protected]
Signed-off-by: Wenchen Fan [email protected]
Cherry-pick conflicts:
Only import statements, as upstream reorganized packages.
The test in the original PR sets a SQLConf that doesn't exist here yet to an empty string.
It will need to be reintroduced after pulling from master.
Upstream SPARK-XXXXX ticket and PR link (if not applicable, explain)
https://issues.apache.org/jira/browse/SPARK-30362
apache#27021
What changes were proposed in this pull request?
DataSource RDDs now report records read (and bytes read based on filesystem stats). Previously those metrics were not present.
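As a rough illustration of the behavior described above, here is a minimal standalone sketch of an iterator wrapper that counts records as they are consumed and periodically refreshes a bytes-read figure from a filesystem-statistics callback. It is not the code from the upstream patch and does not use Spark: `InputMetrics`, `metered`, `fs_bytes_read`, and `update_interval` are hypothetical names chosen for this example, and the periodic refresh is an assumption about how one might avoid polling filesystem stats on every row.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, TypeVar

T = TypeVar("T")


@dataclass
class InputMetrics:
    """Hypothetical stand-in for a task's input metrics (not Spark's class)."""
    records_read: int = 0
    bytes_read: int = 0


def metered(
    rows: Iterable[T],
    metrics: InputMetrics,
    fs_bytes_read: Callable[[], int],
    update_interval: int = 1000,
) -> Iterator[T]:
    """Yield rows unchanged while keeping `metrics` up to date."""
    # Remember what was already recorded so this reader only adds to it.
    starting_bytes = metrics.bytes_read
    try:
        for row in rows:
            metrics.records_read += 1
            # Assumption: polling filesystem stats is costly, so refresh
            # the byte count only every `update_interval` records.
            if metrics.records_read % update_interval == 0:
                metrics.bytes_read = starting_bytes + fs_bytes_read()
            yield row
    finally:
        # Final refresh so the last partial interval is also counted.
        metrics.bytes_read = starting_bytes + fs_bytes_read()


# Usage: a fake source that "reads" 8 bytes per row.
consumed = {"bytes": 0}


def fake_rows() -> Iterator[int]:
    for i in range(2500):
        consumed["bytes"] += 8
        yield i


metrics = InputMetrics()
rows = list(metered(fake_rows(), metrics, lambda: consumed["bytes"]))
print(metrics.records_read, metrics.bytes_read)
```

Wrapping the iterator keeps the reader itself unaware of metrics, which is one plausible reason to put the accounting at the RDD level rather than in each data source.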
How was this patch tested?
unit tests from upstream PR