[SPARK-31159][SQL] Rebase date/timestamp from/to Julian calendar in parquet #27915
Closed
41 commits
ae83fdc Add rebaseGregorianToJulianMicros() (MaxGekk)
4c35ddb Add rebaseJulianToGregorianMicros() (MaxGekk)
74774fc Add a comment to rebaseJulianToGregorianMicros() (MaxGekk)
8d74214 Add comment about calendar (MaxGekk)
56ca744 Add round trip test for micros (MaxGekk)
0cfeed5 Rename to JULIAN_CUTOVER_MICROS (MaxGekk)
78cbd6c Add rebaseJulianToGregorianDays and rebaseGregorianToJulianDays (MaxGekk)
13aad60 Minor fix code style (MaxGekk)
96573a9 Add comments for rebaseGregorianToJulianDays() (MaxGekk)
36c0400 Add the SQL config spark.sql.legacy.parquet.rebaseDateTime.enabled (MaxGekk)
f0a2df6 Perform rebase in write (MaxGekk)
9e3c201 Perform rebase dates in write (MaxGekk)
053861c Perform rebase dates/timestamps in read (MaxGekk)
1624756 Rewrite days rebasing using Java 7 API (MaxGekk)
e3bbcb5 Rewrite micros rebasing using Java 7 API (MaxGekk)
d1e6d84 Extract common code (MaxGekk)
acd33f1 Revert "Extract common code" (MaxGekk)
41fc33f Revert "Rewrite micros rebasing using Java 7 API" (MaxGekk)
fe9f130 Revert "Rewrite days rebasing using Java 7 API" (MaxGekk)
c2c53b8 Remove branching by cutover days in rebase functions (MaxGekk)
8e94359 Rebasing via system time zone (MaxGekk)
d6f7e6b Rebase dates via UTC local date (MaxGekk)
81d342a Check more time zones in days rebasing (MaxGekk)
63428ab More dates/timestamps for testing (MaxGekk)
a34a9ce Rename utcCal to cal (MaxGekk)
e590d36 Test multiple time zones in rebasing timestamps (MaxGekk)
262f744 Test reading parquet files written by Spark 2.4 (MaxGekk)
8947298 Remove .asInstanceOf[DateType#InternalType] (MaxGekk)
276d159 Change SQL config description (MaxGekk)
167b463 Rebase timestamp INT96 (MaxGekk)
bbc4a1a Support rebasing in VectorizedColumnReader (MaxGekk)
a1b34cb Bug fix in write (MaxGekk)
d7debb4 Add test for write (MaxGekk)
6bebf3b Read SQL config in place (MaxGekk)
67cec02 Remove a gap (MaxGekk)
8fa19a6 Remove config settings from ParquetWriteBuilder (MaxGekk)
a061870 Initialize rebaseDateTime in default constructor in ParquetWriteSupport (MaxGekk)
ae49cc4 Check INT96 rebasing regardless of SQL config settings (MaxGekk)
a96392c Add JIRA id (MaxGekk)
5b52735 Test INT96 w/ and w/o vectorized reader (MaxGekk)
184fcd8 Merge remote-tracking branch 'remotes/origin/master' into rebase-parq… (MaxGekk)
Rebase dates via UTC local date
commit d6f7e6b4f37b47d6ae7a0c2d5de60189ae851774
We can use a particular time zone here because the conversion of "logical" days is actually independent of the time zone.
UTC is selected to avoid rounding problems when converting micros to/from days, because the zone offset in UTC is 0.
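
To illustrate the point about UTC, here is a minimal sketch of day rebasing through a UTC calendar. It is not the code from this PR: the class name `RebaseDaysSketch` is hypothetical, the method names only mirror those in the commit titles, and BCE dates and dates that exist in just one of the two calendars are deliberately not handled. Because the calendar is pinned to UTC, midnight of any local date is an exact multiple of one day in milliseconds, so the division involves no rounding.

```java
import java.time.LocalDate;
import java.util.Calendar;
import java.util.TimeZone;

// Hypothetical sketch, not the PR's implementation.
final class RebaseDaysSketch {
    private static final long MILLIS_PER_DAY = 24L * 60 * 60 * 1000;
    private static final TimeZone UTC = TimeZone.getTimeZone("UTC");

    // Days since epoch in the proleptic Gregorian calendar ->
    // days since epoch in the hybrid Julian/Gregorian calendar.
    static int rebaseGregorianToJulianDays(int days) {
        // Take the local date (year, month, day) in the proleptic Gregorian calendar.
        LocalDate localDate = LocalDate.ofEpochDay(days);
        // Interpret the same fields in the hybrid calendar at midnight UTC.
        // Calendar months are 0-based, LocalDate months are 1-based.
        Calendar cal = new Calendar.Builder()
            .setCalendarType("gregory")
            .setTimeZone(UTC)
            .setDate(localDate.getYear(), localDate.getMonthValue() - 1, localDate.getDayOfMonth())
            .build();
        // The UTC offset is 0, so this division is exact for midnight values.
        return Math.toIntExact(Math.floorDiv(cal.getTimeInMillis(), MILLIS_PER_DAY));
    }

    // The inverse direction: hybrid calendar days -> proleptic Gregorian days.
    // Assumption: the date also exists in the proleptic Gregorian calendar;
    // a Julian-only date such as 1000-02-29 makes LocalDate.of throw.
    static int rebaseJulianToGregorianDays(int days) {
        Calendar cal = new Calendar.Builder()
            .setCalendarType("gregory")
            .setTimeZone(UTC)
            .setInstant(Math.multiplyExact((long) days, MILLIS_PER_DAY))
            .build();
        LocalDate localDate = LocalDate.of(
            cal.get(Calendar.YEAR), cal.get(Calendar.MONTH) + 1, cal.get(Calendar.DAY_OF_MONTH));
        return Math.toIntExact(localDate.toEpochDay());
    }
}
```

Under this sketch, day counts after the 1582 cutover are unchanged, while the day count for 1200-01-01 shifts by 7 days, which is the gap between the two calendars in that period.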