[SPARK-11802][SQL] Kryo-based encoder for opaque types in Datasets #9789
 * (Scala-specific) Creates an encoder that serializes objects of type T using Kryo.
 * This encoder maps T into a single byte array (binary) field.
 */
def kryo[T: ClassTag]: Encoder[T] = {
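For context, here is a minimal sketch of how a caller might use this encoder from Scala. It assumes a Spark release that provides `SparkSession` (newer than the code under review here) and uses a hypothetical `Opaque` class that is not part of this PR. Because no implicit encoder can be derived for such a class, the Kryo encoder is brought into scope explicitly.

```scala
import org.apache.spark.sql.{Dataset, Encoder, Encoders, SparkSession}

// Hypothetical opaque type: a plain class, not a case class,
// so Spark cannot derive an encoder for it implicitly.
class Opaque(val id: Int, val payload: String)

object KryoEncoderSketch {
  def main(args: Array[String]): Unit = {
    // Assumption: a local SparkSession, used only for illustration.
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("kryo-encoder-sketch")
      .getOrCreate()

    // Supply the Kryo-based encoder explicitly as an implicit value.
    implicit val opaqueEncoder: Encoder[Opaque] = Encoders.kryo[Opaque]

    val ds: Dataset[Opaque] =
      spark.createDataset(Seq(new Opaque(1, "a"), new Opaque(2, "b")))

    // Each object is stored as one serialized binary field.
    ds.printSchema()

    // Objects are deserialized back into Opaque instances on collect.
    val ids = ds.collect().map(_.id)
    println(ids.mkString(", "))

    spark.stop()
  }
}
```

Since the whole object is stored as a single binary field, Spark cannot see or filter on individual fields of `Opaque` without deserializing each row.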
How about putting this method into ExpressionEncoder? I think Encoders is only used on the Java side, because Java lacks the implicit magic.
Then we can change the return type to ExpressionEncoder[T] and make the tests less verbose (no asInstanceOf casts).
What if we want to use Kryo in Scala?
Oh, this can't be done with an implicit. I was wrong, never mind.
Test build #46146 has finished for PR 9789 at commit
Test build #46179 has finished for PR 9789 at commit
I also found a bug with self-joins returning incorrect results in the Dataset API. Two test cases attached and filed SPARK-11803.
Author: Reynold Xin <rxin@databricks.com>
Closes #9789 from rxin/SPARK-11802.
(cherry picked from commit 5e2b444)
Signed-off-by: Reynold Xin <rxin@databricks.com>
Test build #46175 has finished for PR 9789 at commit