ZEPPELIN-1197. Should print output directly without invoking function print in pyspark interpreter #1232

zjffdu · 2016-07-27T05:43:04Z

What is this PR for?

For now, user need to invoke print to make the output displayed on the notebook. This behavior is not natural and consistent with other notebooks. This PR is to make the pyspark interpreter in zeppelin behave the same as other notebook. 2 main changes

use single mode to compile the last statement, so that the evaluation result of the last statement will be printed to stdout, this is consistent with other notebooks (like jupyter)
Make SparkOutputStream extends LogOutputStream so that we can see the output of inner process (Python/R), it is helpful for diagnosing.

What type of PR is it?

[Bug Fix]

What is the Jira issue?

https://issues.apache.org/jira/browse/ZEPPELIN-1197

How should this be tested?

Tested it manually. Input the following text in pyspark paragraph,

1+1
sc.version

And get the following output

u'1.6.1'

Questions:

Does the licenses files need update? No
Is there breaking changes for older versions? User don't need to call print explicitly.
Does this needs documentation? Yes

… print in pyspark interpreter

Leemoonsoo · 2016-07-28T14:37:41Z

Thanks @zjffdu for great improvement.

I have tested bit and i could see some inconsistent behavior.

Is it expected result?

zjffdu · 2016-07-28T23:29:40Z

@Leemoonsoo Thanks for the careful checking. I compare it with jupyter, only the second case is different. Let me investigate how to fix it.

zjffdu · 2016-07-28T23:50:47Z

I also compare it with native python repl, the second case is consistent. So I think this behvior is fine, although it is different from jyputer.

Leemoonsoo · 2016-07-29T10:10:22Z

Thanks you for explanation. LGTM

Leemoonsoo · 2016-08-02T21:56:57Z

Merge into master if there're no more discussions.

Leemoonsoo · 2016-08-03T15:37:41Z

@zjffdu @bzz How about bring this change to python interpreter as well?

zjffdu · 2016-08-03T23:18:17Z

Sure, let me do it for python interpreter as well.

zjffdu · 2016-08-04T01:00:41Z

Just take a look at python interpreter, it uses a different way with pyspark interpreter. Might need to take more time to investigate that.

… print in pyspark interpreter ### What is this PR for? For now, user need to invoke print to make the output displayed on the notebook. This behavior is not natural and consistent with other notebooks. This PR is to make the pyspark interpreter in zeppelin behave the same as other notebook. 2 main changes * use single mode to compile the last statement, so that the evaluation result of the last statement will be printed to stdout, this is consistent with other notebooks (like jupyter) * Make SparkOutputStream extends LogOutputStream so that we can see the output of inner process (Python/R), it is helpful for diagnosing. ### What type of PR is it? [Bug Fix] ### What is the Jira issue? * https://issues.apache.org/jira/browse/ZEPPELIN-1197 ### How should this be tested? Tested it manually. Input the following text in pyspark paragraph, ``` 1+1 sc.version ``` And get the following output ``` u'1.6.1' ``` ### Questions: * Does the licenses files need update? No * Is there breaking changes for older versions? User don't need to call print explicitly. * Does this needs documentation? Yes Author: Jeff Zhang <zjffdu@apache.org> Closes apache#1232 from zjffdu/ZEPPELIN-1197 and squashes the following commits: 3771245 [Jeff Zhang] fix and add test 10182e6 [Jeff Zhang] ZEPPELIN-1197. Should print output directly without invoking function print in pyspark interpreter

… print in pyspark interpreter ### What is this PR for? For now, user need to invoke print to make the output displayed on the notebook. This behavior is not natural and consistent with other notebooks. This PR is to make the pyspark interpreter in zeppelin behave the same as other notebook. 2 main changes * use single mode to compile the last statement, so that the evaluation result of the last statement will be printed to stdout, this is consistent with other notebooks (like jupyter) * Make SparkOutputStream extends LogOutputStream so that we can see the output of inner process (Python/R), it is helpful for diagnosing. ### What type of PR is it? [Bug Fix] ### What is the Jira issue? * https://issues.apache.org/jira/browse/ZEPPELIN-1197 ### How should this be tested? Tested it manually. Input the following text in pyspark paragraph, ``` 1+1 sc.version ``` And get the following output ``` u'1.6.1' ``` ### Questions: * Does the licenses files need update? No * Is there breaking changes for older versions? User don't need to call print explicitly. * Does this needs documentation? Yes Author: Jeff Zhang <zjffdu@apache.org> Closes #1232 from zjffdu/ZEPPELIN-1197 and squashes the following commits: 3771245 [Jeff Zhang] fix and add test 10182e6 [Jeff Zhang] ZEPPELIN-1197. Should print output directly without invoking function print in pyspark interpreter (cherry picked from commit b885f43) Signed-off-by: Mina Lee <minalee@apache.org>

zjffdu added 2 commits July 27, 2016 13:42

ZEPPELIN-1197. Should print output directly without invoking function…

10182e6

… print in pyspark interpreter

fix and add test

3771245

asfgit closed this in b885f43 Aug 3, 2016

minahlee mentioned this pull request Aug 9, 2016

ZEPPELIN-1311. Typo in ZEPPELIN-1197 #1307

Closed

1 task

minahlee mentioned this pull request Aug 10, 2016

ZEPPELIN-1287. No need to call print to display output in PythonInterpreter #1278

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ZEPPELIN-1197. Should print output directly without invoking function print in pyspark interpreter #1232

ZEPPELIN-1197. Should print output directly without invoking function print in pyspark interpreter #1232

zjffdu commented Jul 27, 2016

Leemoonsoo commented Jul 28, 2016

zjffdu commented Jul 28, 2016

zjffdu commented Jul 28, 2016

Leemoonsoo commented Jul 29, 2016

Leemoonsoo commented Aug 2, 2016

Leemoonsoo commented Aug 3, 2016

zjffdu commented Aug 3, 2016

zjffdu commented Aug 4, 2016

ZEPPELIN-1197. Should print output directly without invoking function print in pyspark interpreter #1232

ZEPPELIN-1197. Should print output directly without invoking function print in pyspark interpreter #1232

Conversation

zjffdu commented Jul 27, 2016

What is this PR for?

What type of PR is it?

What is the Jira issue?

How should this be tested?

Questions:

Leemoonsoo commented Jul 28, 2016

zjffdu commented Jul 28, 2016

zjffdu commented Jul 28, 2016

Leemoonsoo commented Jul 29, 2016

Leemoonsoo commented Aug 2, 2016

Leemoonsoo commented Aug 3, 2016

zjffdu commented Aug 3, 2016

zjffdu commented Aug 4, 2016