Updated the Spark SQL Programming guide with Custom object encoding for Dataset and unsupported operation error handling #16997

HarshSharma8 · 2017-02-20T06:54:54Z

What changes were proposed in this pull request?

Made some updates to SQL programming guide to explain the Encoding operation with kryo.

How was this patch tested?

Just updated the docs.

Please review http://spark.apache.org/contributing.html before opening a pull request.

…tion

AmplabJenkins · 2017-02-20T06:57:14Z

Can one of the admins verify this patch?

srowen · 2017-02-20T12:35:25Z

docs/sql-programming-guide.md

@@ -297,6 +297,9 @@ reflection and become the names of the columns. Case classes can also be nested
 types such as `Seq`s or `Array`s. This RDD can be implicitly converted to a DataFrame and then be
 registered as a table. Tables can be used in subsequent SQL statements.

+Spark Encoders are used to convert a JVM object to Spark SQL representation. When we want to make a datase, Spark requires an encoder which takes the form Encoder[T] where T is the type we want to be encoded. When we try to create dataset with a custom type of object, then may result into <b>java.lang.UnsupportedOperationException: No Encoder found for Object-Name</b>.


It's minor, but there are enough problems with the text to call it out. Please match the voice of the other text and avoid 'we'. Typos: "datase", "spark sql" and "kryo" for example. Use back-ticks to consistently format code if you're going to. What is Object-Name?

Hello srowen,
I have updated the content to match the void of the content, you can have another look at it.

…Guide

HyukjinKwon · 2017-02-21T05:26:44Z

docs/sql-programming-guide.md

@@ -297,6 +297,9 @@ reflection and become the names of the columns. Case classes can also be nested
 types such as `Seq`s or `Array`s. This RDD can be implicitly converted to a DataFrame and then be
 registered as a table. Tables can be used in subsequent SQL statements.

+Spark Encoders are used to convert a JVM object to Spark SQL representation. To create dataset, spark requires an encoder which takes the form of <b>Encoder[T]</b> where <b>T</b> is the type which has to be encoded. Creation of a dataset with a custom type of object, may result into <b>java.lang.UnsupportedOperationException: No Encoder found for Object-Name</b>.


It is trivial.. but maybe spark -> Spark? I am not an expert in grammar but up to my knowledge, capitalizing a proper noun is correct.

Yes, @HarshSharma8 this still doesn't address the comments. Use back-ticks for code, not bold, too. What is Object-Name?

HyukjinKwon · 2017-02-21T05:27:34Z

BTW, could we maybe make the title complete (not opera…)?

HarshSharma8 · 2017-02-21T05:33:57Z

Hello Sean, I apologize for bold instead of back-ticks, and i'm updating the content for this. Thank You Best Regards | *Harsh Sharma* Sr. Software Consultant Facebook <https://www.facebook.com/harsh.sharma.161446> | Twitter <https://twitter.com/harsh_sharma5> | Linked In <https://www.linkedin.com/in/harsh-sharma-0a08a1b0?trk=hp-identity-name> harshs316@gmail.com Skype*: khandal60* *+91-8447307237*

…

On Tue, Feb 21, 2017 at 10:58 AM, Sean Owen ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In docs/sql-programming-guide.md <#16997 (comment)>: > @@ -297,6 +297,9 @@ reflection and become the names of the columns. Case classes can also be nested types such as `Seq`s or `Array`s. This RDD can be implicitly converted to a DataFrame and then be registered as a table. Tables can be used in subsequent SQL statements. +Spark Encoders are used to convert a JVM object to Spark SQL representation. To create dataset, spark requires an encoder which takes the form of Encoder[T] where T is the type which has to be encoded. Creation of a dataset with a custom type of object, may result into java.lang.UnsupportedOperationException: No Encoder found for Object-Name. Yes, @HarshSharma8 <https://github.com/HarshSharma8> this still doesn't address the comments. Use back-ticks for code, not bold, too. What is Object-Name? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#16997 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AKIiQM8Tsz96c1KHGszvbFmgJnnRD62Gks5renYPgaJpZM4MF0vf> .

…stead of bold tags

HarshSharma8 · 2017-02-21T05:43:19Z

Hello Sean, I have updated the content with back-ticks, Can you have a look at this ? And i am not getting which object-name you are asking about. Thank You Best Regards | *Harsh Sharma* Sr. Software Consultant Facebook <https://www.facebook.com/harsh.sharma.161446> | Twitter <https://twitter.com/harsh_sharma5> | Linked In <https://www.linkedin.com/in/harsh-sharma-0a08a1b0?trk=hp-identity-name> harshs316@gmail.com Skype*: khandal60* *+91-8447307237*

…

On Tue, Feb 21, 2017 at 11:03 AM, Harsh Sharma ***@***.***> wrote: Hello Sean, I apologize for bold instead of back-ticks, and i'm updating the content for this. Thank You Best Regards | *Harsh Sharma* Sr. Software Consultant Facebook <https://www.facebook.com/harsh.sharma.161446> | Twitter <https://twitter.com/harsh_sharma5> | Linked In <https://www.linkedin.com/in/harsh-sharma-0a08a1b0?trk=hp-identity-name> ***@***.*** Skype*: khandal60* *+91-8447307237* On Tue, Feb 21, 2017 at 10:58 AM, Sean Owen ***@***.***> wrote: > ***@***.**** commented on this pull request. > ------------------------------ > > In docs/sql-programming-guide.md > <#16997 (comment)>: > > > @@ -297,6 +297,9 @@ reflection and become the names of the columns. Case classes can also be nested > types such as `Seq`s or `Array`s. This RDD can be implicitly converted to a DataFrame and then be > registered as a table. Tables can be used in subsequent SQL statements. > > +Spark Encoders are used to convert a JVM object to Spark SQL representation. To create dataset, spark requires an encoder which takes the form of Encoder[T] where T is the type which has to be encoded. Creation of a dataset with a custom type of object, may result into java.lang.UnsupportedOperationException: No Encoder found for Object-Name. > > Yes, @HarshSharma8 <https://github.com/HarshSharma8> this still doesn't > address the comments. Use back-ticks for code, not bold, too. What is > Object-Name? > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#16997 (comment)>, or mute > the thread > <https://github.com/notifications/unsubscribe-auth/AKIiQM8Tsz96c1KHGszvbFmgJnnRD62Gks5renYPgaJpZM4MF0vf> > . >

srowen · 2017-02-21T06:11:55Z

You are still bold-facing code elements, and now back-ticked a string, which isn't code. There are still typos like "create dataset" instead of "create a Dataset". Do you mean to write something to indicate a class name will be in the message? then write something like "[class name]". There is no object name here. Please review carefully before you ask for another review.

HarshSharma8 · 2017-02-21T09:46:46Z

I updated the content with a demo object. I would appreciate if anyone can have a look at this.

HyukjinKwon · 2017-02-21T09:56:29Z

Could you fix the PR title too while you are online maybe? It might be nice to have a good title for both a commit log and those who like to track down the history.

HarshSharma8 · 2017-02-21T10:03:35Z

Hello HyukjinKwon,
I have updated the title, i wish you like it, it shows what is there in the content. And commit has already been made.

HarshSharma8 · 2017-02-23T10:05:14Z

Did anyone get a chance to verify it or any changes required by me to make ?

srowen · 2017-03-05T16:43:25Z

This still has formatting and text problems. I'm sorry I don't think I can go around again for this when it's not an important change, and I'd like to close this.

HarshSharma8 · 2017-03-06T04:39:57Z

Sure, and thanks for kind attention to this pull request. Thank You Best Regards | *Harsh Sharma* Sr. Software Consultant Knoldus Software LLP FB <https://www.facebook.com/harsh.sharma.161446> | Twitter <https://twitter.com/harsh_sharma5> | LinkedIn <https://www.linkedin.com/in/harsh-sharma-0a08a1b0?trk=hp-identity-name> harshs316@gmail.com Skype*: khandal60* *+91-8447307237*

…

On Sun, Mar 5, 2017 at 10:13 PM, Sean Owen ***@***.***> wrote: This still has formatting and text problems. I'm sorry I don't think I can go around again for this when it's not an important change, and I'd like to close this. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#16997 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AKIiQARgsS9c8P7s7slP6T39bwCfW7ywks5riuZGgaJpZM4MF0vf> .

Closes apache#16819 Closes apache#13467 Closes apache#16083 Closes apache#17135 Closes apache#8785 Closes apache#16278 Closes apache#16997 Closes apache#17073 Closes apache#17220

Updated the SQL programming guide to explain about the Encoding opera…

103906f

…tion

srowen reviewed Feb 20, 2017

View reviewed changes

Updated the docs to match the voice of my updates in SQL Programming …

9c8f63c

…Guide

HyukjinKwon reviewed Feb 21, 2017

View reviewed changes

Modified the content and replaced the code block inside back-ticks in…

7a539a7

…stead of bold tags

HarshSharma8 added 2 commits February 21, 2017 15:14

Updated the content to provide object name

d49ae50

Updated the content to provide object name

c2fd0ad

HarshSharma8 changed the title ~~Updated the SQL programming guide to explain about the Encoding opera…~~ Updated the Spark SQL Programming guide with Encoder class specifications and possible error handling Feb 21, 2017

HarshSharma8 changed the title ~~Updated the Spark SQL Programming guide with Encoder class specifications and possible error handling~~ Updated the Spark SQL Programming guide with Custom object encoding for Dataset and unsupported operation error handling Feb 21, 2017

srowen added a commit to srowen/spark that referenced this pull request Mar 22, 2017

Close stale PRs.

d88bc61

Closes apache#16819 Closes apache#13467 Closes apache#16083 Closes apache#17135 Closes apache#8785 Closes apache#16278 Closes apache#16997 Closes apache#17073 Closes apache#17220

srowen mentioned this pull request Mar 22, 2017

[INFRA] Close stale PRs #17386

Closed

asfgit closed this in b70c03a Mar 23, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated the Spark SQL Programming guide with Custom object encoding for Dataset and unsupported operation error handling #16997

Updated the Spark SQL Programming guide with Custom object encoding for Dataset and unsupported operation error handling #16997

HarshSharma8 commented Feb 20, 2017

AmplabJenkins commented Feb 20, 2017

srowen Feb 20, 2017

HarshSharma8 Feb 20, 2017

HyukjinKwon Feb 21, 2017

srowen Feb 21, 2017

HyukjinKwon commented Feb 21, 2017

HarshSharma8 commented Feb 21, 2017 via email

HarshSharma8 commented Feb 21, 2017 via email

srowen commented Feb 21, 2017

HarshSharma8 commented Feb 21, 2017

HyukjinKwon commented Feb 21, 2017

HarshSharma8 commented Feb 21, 2017

HarshSharma8 commented Feb 23, 2017

srowen commented Mar 5, 2017

HarshSharma8 commented Mar 6, 2017 via email

Updated the Spark SQL Programming guide with Custom object encoding for Dataset and unsupported operation error handling #16997

Updated the Spark SQL Programming guide with Custom object encoding for Dataset and unsupported operation error handling #16997

Conversation

HarshSharma8 commented Feb 20, 2017

What changes were proposed in this pull request?

How was this patch tested?

AmplabJenkins commented Feb 20, 2017

srowen Feb 20, 2017

Choose a reason for hiding this comment

HarshSharma8 Feb 20, 2017

Choose a reason for hiding this comment

HyukjinKwon Feb 21, 2017

Choose a reason for hiding this comment

srowen Feb 21, 2017

Choose a reason for hiding this comment

HyukjinKwon commented Feb 21, 2017

HarshSharma8 commented Feb 21, 2017 via email

HarshSharma8 commented Feb 21, 2017 via email

srowen commented Feb 21, 2017

HarshSharma8 commented Feb 21, 2017

HyukjinKwon commented Feb 21, 2017

HarshSharma8 commented Feb 21, 2017

HarshSharma8 commented Feb 23, 2017

srowen commented Mar 5, 2017

HarshSharma8 commented Mar 6, 2017 via email