[SPARK-11727][SQL] Split ExpressionEncoder into FlatEncoder and ProductEncoder #9693

cloud-fan · 2015-11-13T15:10:43Z

also add more tests for encoders, and fix bugs that I found:

when convert array to catalyst array, we can only skip element conversion for native types(e.g. int, long, boolean), not AtomicType(String is AtomicType but we need to convert it)
we should also handle scala BigDecimal when convert from catalyst Decimal.
complex map type should be supported

other issues that still in investigation:

encode java BigDecimal and decode it back, seems we will loss precision info.
when encode case class that defined inside a object, ClassNotFound exception will be thrown.

I'll remove unused code in a follow-up PR.

cloud-fan · 2015-11-13T15:13:01Z

cc @marmbrus

SparkQA · 2015-11-13T18:32:50Z

Test build #45863 has finished for PR 9693 at commit 7e4f7fe.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2015-11-13T19:24:51Z

when encode case class that defined inside a object, ClassNotFound exception will be thrown.

I have a fix for this, but I'm still working out some repl issues.

…ctEncoder also add more tests for encoders, and fix bugs that I found: * when convert array to catalyst array, we can only skip element conversion for native types(e.g. int, long, boolean), not `AtomicType`(String is AtomicType but we need to convert it) * we should also handle scala `BigDecimal` when convert from catalyst `Decimal`. * complex map type should be supported other issues that still in investigation: * encode java `BigDecimal` and decode it back, seems we will loss precision info. * when encode case class that defined inside a object, `ClassNotFound` exception will be thrown. I'll remove unused code in a follow-up PR. Author: Wenchen Fan <[email protected]> Closes #9693 from cloud-fan/split. (cherry picked from commit d7b2b97) Signed-off-by: Michael Armbrust <[email protected]>

After some experiment, I found it's not convenient to have separate encoder builders: `FlatEncoder` and `ProductEncoder`. For example, when create encoders for `ScalaUDF`, we have no idea if the type `T` is flat or not. So I revert the splitting change in #9693, while still keeping the bug fixes and tests. Author: Wenchen Fan <[email protected]> Closes #9726 from cloud-fan/follow. (cherry picked from commit 47d1c23) Signed-off-by: Michael Armbrust <[email protected]>

After some experiment, I found it's not convenient to have separate encoder builders: `FlatEncoder` and `ProductEncoder`. For example, when create encoders for `ScalaUDF`, we have no idea if the type `T` is flat or not. So I revert the splitting change in #9693, while still keeping the bug fixes and tests. Author: Wenchen Fan <[email protected]> Closes #9726 from cloud-fan/follow.

After some experiment, I found it's not convenient to have separate encoder builders: `FlatEncoder` and `ProductEncoder`. For example, when create encoders for `ScalaUDF`, we have no idea if the type `T` is flat or not. So I revert the splitting change in apache/spark#9693, while still keeping the bug fixes and tests. Author: Wenchen Fan <[email protected]> Closes #9726 from cloud-fan/follow.

split ExpressionEncoder into FlatEncoder and ProductEncoder

7e4f7fe

cloud-fan force-pushed the split branch from a0164eb to 7e4f7fe Compare November 13, 2015 15:12

asfgit closed this in d7b2b97 Nov 13, 2015

cloud-fan deleted the split branch November 14, 2015 00:04

cloud-fan mentioned this pull request Nov 16, 2015

[SPARK-11750][SQL] revert SPARK-11727 and code clean up #9726

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-11727][SQL] Split ExpressionEncoder into FlatEncoder and ProductEncoder #9693

[SPARK-11727][SQL] Split ExpressionEncoder into FlatEncoder and ProductEncoder #9693

Uh oh!

cloud-fan commented Nov 13, 2015

Uh oh!

cloud-fan commented Nov 13, 2015

Uh oh!

SparkQA commented Nov 13, 2015

Uh oh!

marmbrus commented Nov 13, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-11727][SQL] Split ExpressionEncoder into FlatEncoder and ProductEncoder #9693

[SPARK-11727][SQL] Split ExpressionEncoder into FlatEncoder and ProductEncoder #9693

Uh oh!

Conversation

cloud-fan commented Nov 13, 2015

Uh oh!

cloud-fan commented Nov 13, 2015

Uh oh!

SparkQA commented Nov 13, 2015

Uh oh!

marmbrus commented Nov 13, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants