Use timestamp statistics #1999
@ptkool thanks! I still need to check whether min/max is always in microseconds.
I think you will have to inspect the logical type to determine whether a long holds milliseconds or microseconds (I think limited-range nanos may be supported as well?). See the PR comments here; unfortunately I rebased and force-pushed out the timestamp changes, but the discussion there is probably applicable. Two things:
Looks like the code has changed slightly since I last looked at this, but you'll have to pass along the
3d34313 to 370cd61
    ParquetIntegerStatistics parquetIntegerStatistics =
            statistics.type().getOriginalType() == TIMESTAMP_MICROS
                    ? new ParquetIntegerStatistics(MICROSECONDS.toMillis(min), MICROSECONDS.toMillis(max))
                    : new ParquetIntegerStatistics(min, max);
I think you should do an explicit check on the original type for TIMESTAMP_MILLIS here. It looks like timestamp nanosecond support was added in apache/parquet-java#519, and it uses the new LogicalTypeAnnotation, which will replace OriginalType in the next release of parquet-mr (see apache/parquet-java#463). If the check is explicit, the code stays forward compatible: a newer file that uses nanoseconds won't accidentally go into this path.
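To make the suggestion concrete, here is a minimal sketch of what an explicit check could look like. It is not the PR's code and does not use the parquet-mr API: `TimestampStatisticsSketch`, `TimestampUnit`, `MillisRange`, and `toMillisRange` are hypothetical names, with `TimestampUnit` standing in for the original type and `MillisRange` standing in for `ParquetIntegerStatistics`. The point is only that milliseconds and microseconds are each matched by name, and anything else (for example nanoseconds) yields no statistics instead of falling through to the milliseconds branch.

```java
import java.util.Optional;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch; none of these types come from parquet-mr or the PR.
class TimestampStatisticsSketch {
    // Stand-in for the type annotation on an INT64 timestamp column (assumption).
    enum TimestampUnit { TIMESTAMP_MILLIS, TIMESTAMP_MICROS, TIMESTAMP_NANOS, NONE }

    // Stand-in for ParquetIntegerStatistics: a min/max pair in milliseconds.
    static final class MillisRange {
        final long min;
        final long max;

        MillisRange(long min, long max) {
            this.min = min;
            this.max = max;
        }
    }

    // Each supported unit is checked explicitly. Unknown or newer units return
    // Optional.empty(), so the caller skips statistics-based pruning for them.
    static Optional<MillisRange> toMillisRange(TimestampUnit unit, long min, long max) {
        switch (unit) {
            case TIMESTAMP_MILLIS:
                return Optional.of(new MillisRange(min, max));
            case TIMESTAMP_MICROS:
                return Optional.of(new MillisRange(
                        TimeUnit.MICROSECONDS.toMillis(min),
                        TimeUnit.MICROSECONDS.toMillis(max)));
            default:
                return Optional.empty();
        }
    }

    public static void main(String[] args) {
        Optional<MillisRange> micros = toMillisRange(TimestampUnit.TIMESTAMP_MICROS, 1_000_000L, 2_500_000L);
        micros.ifPresent(r -> System.out.println(r.min + " .. " + r.max));
        System.out.println(toMillisRange(TimestampUnit.TIMESTAMP_NANOS, 1L, 2L).isPresent());
    }
}
```

Returning an empty result for unrecognized units is a design choice in this sketch: it trades some pruning for safety when a file carries a unit the reader does not know about.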
Are you still working on this, @ptkool?
👋 @ptkool - this PR has become inactive. If you're still interested in working on it, please let us know, and we can try to get reviewers to help with that.
@bitsondatadev yes, I believe so.
Closing as confirmed duplicate.