[SYSTEMML-1451][Phase 2] Decouple Scripts and HDFS support #575

krishnakalyan3 · 2017-07-14T06:03:12Z

Please refer to https://issues.apache.org/jira/browse/SYSTEMML-1451 for more details.

krishnakalyan3 · 2017-07-14T06:14:30Z

@nakul02 could you please share your thoughts on this initial approach?

Also, what do you think of the following points:
a) Creating a common utils file that can be shared by systemml-spark-submit.py and systemml-standalone.py (common functions like get_env, default_jars, find_script_file, etc.).
b) Changing systemml-standalone.py to point to jars instead of classes.

I have tested systemml-spark-submit.py and it works on my local system with the commands below.

./systemml-spark-submit.py -f genRandData4Kmeans.dml -nvargs nr=10000 nf=1000 nc=50 dc=10.0 dr=1.0 fbf=100.0 cbf=100.0 X=data/X.data C=data/C.data Y=data/Y.data YbyC=data/YbyC.data fmt=csv
./systemml-spark-submit.py -f ~/open-source/matmul.dml

akchinSTC · 2017-07-14T09:30:13Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1757/

akchinSTC · 2017-07-15T01:45:14Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1762/

akchinSTC · 2017-07-15T09:10:50Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1763/

akchinSTC · 2017-07-17T08:28:44Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1765/

akchinSTC · 2017-07-19T01:06:06Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1774/

akchinSTC · 2017-07-19T08:25:18Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1777/

j143-zz · 2017-07-20T09:12:18Z

@akchinSTC can you test this once again. Thanks.

nakul02 · 2017-07-20T15:42:43Z

@j143 - @akchinSTC is a bot. It will run the CI tests again when the author of this PR pushes another commit.

These scripts don't add integration tests, so running the CI will not tell you more than running the tests on the master branch.

Any reason you wanted this tested again?

akchinSTC · 2017-07-20T16:11:41Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1782/

j143-zz · 2017-07-20T17:40:42Z

Yes, @nakul02! Another PR merged before this one fixed the issue that was responsible for the previous build failure: https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1777/ . That particular test is now passing.

mboehm7 · 2017-07-20T18:04:38Z

FullReblockTest is one of our flaky tests which unfortunately fails once in a while.

deroneriksson · 2017-07-20T19:11:19Z

@j143 @nakul02 Note that it is possible to manually trigger a Jenkins PR test (if ever needed). See https://gist.github.com/deroneriksson/e0d6d0634f3388f0df5e#pull-request-magic for the command.

nakul02 · 2017-07-20T20:09:58Z

@deroneriksson - is there a way to disable it?

This PR introduces a set of Python scripts which are never invoked, directly or indirectly, by anything that Jenkins does. Running the tests over and over again for each commit seems wasteful. It would be good to be able to disable the automatic Jenkins run for every commit.

deroneriksson · 2017-07-20T20:13:59Z

I don't know if there's a way to disable it. There are situations where that would be very nice.

deroneriksson · 2017-07-20T20:56:40Z

I asked around and "skip ci" may do it.

skip ci

akchinSTC · 2017-07-21T05:40:42Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1789/

akchinSTC · 2017-07-22T03:26:43Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1793/

akchinSTC · 2017-07-22T13:14:24Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1795/

akchinSTC · 2017-07-23T21:47:52Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1799/

akchinSTC · 2017-07-25T20:57:58Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1807/

akchinSTC · 2017-07-28T16:24:51Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1822/

akchinSTC · 2017-07-28T17:41:41Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1823/

akchinSTC · 2017-07-28T20:10:53Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1825/

akchinSTC · 2017-07-29T05:23:16Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1828/

akchinSTC · 2017-07-29T07:08:53Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1829/

akchinSTC · 2017-07-29T18:41:57Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1831/

akchinSTC · 2017-07-29T23:02:29Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1833/

Failed Tests: 1

SystemML-PullRequestBuilder/org.apache.systemml:systemml: 1

org.apache.sysml.test.integration.functions.data.FullReblockTest.testBinaryBlockMultipleMSparseMR

akchinSTC · 2017-07-30T17:32:48Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1834/

nakul02 · 2017-07-30T21:57:47Z

bin/systemml-spark-submit.py

-    conf = default_conf
+def spark_submit_entry(master, driver_memory, num_executors, executor_memory,
+                       executor_cores, conf,
+                       nvargs, args, config, explain, debug, stats, gpu, f):



Could you please add a short comment describing this function?

nakul02 · 2017-07-30T22:23:52Z

bin/utils.py

+
+
+def get_env():
+    """


Could you please give this function a more descriptive name. get_env() is too generic and doesn't convey that only SPARK_HOME and SYSTEMML_HOME variables are being returned.
I would suggest you break this function up into two functions - one to get the environment variable for SYSTEMML_HOME and one for SPARK_HOME. You can then name them get_env_systemml_home and get_env_spark_home or something better.
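
A minimal sketch of what the suggested split might look like. The function names come from the comment above; the bodies are an assumption, since the actual implementation of get_env() is not shown in this hunk. Here each function simply reads one environment variable and returns None when it is not set.

```python
import os


def get_env_systemml_home():
    """Return the value of SYSTEMML_HOME, or None if it is not set."""
    # Assumption: the caller decides how to react to a missing variable.
    return os.environ.get('SYSTEMML_HOME')


def get_env_spark_home():
    """Return the value of SPARK_HOME, or None if it is not set."""
    return os.environ.get('SPARK_HOME')
```

Returning None instead of raising keeps the helpers small; a caller that needs the variable can check for None and print a readable error.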

nakul02 · 2017-07-30T22:25:15Z

bin/systemml-spark-submit.py

+
+    cparser = argparse.ArgumentParser(description='System-ML Spark Submit Script')
+    # SPARK-SUBMIT Options
+    cparser.add_argument('--master', default='local[*]', help='local, yarn-client, yarn-cluster', metavar='')


Can you please also print out the defaults for each of the options in the help message?
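
One way this could be done, as a sketch: argparse ships a formatter class, ArgumentDefaultsHelpFormatter, that appends the default value to every option that has a help string. The build_parser wrapper below is a hypothetical name used only for illustration, and only the --master option from the hunk above is reproduced.

```python
import argparse


def build_parser():
    """Build a parser whose --help output lists each option's default."""
    cparser = argparse.ArgumentParser(
        description='System-ML Spark Submit Script',
        formatter_class=argparse.ArgumentDefaultsHelpFormatter)
    # SPARK-SUBMIT Options
    cparser.add_argument('--master', default='local[*]',
                         help='local, yarn-client, yarn-cluster')
    return cparser


if __name__ == '__main__':
    build_parser().print_help()
```

An alternative that avoids changing the formatter is to put '%(default)s' directly into each help string.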

nakul02 · 2017-07-30T22:26:28Z

bin/systemml-standalone.py

-if len(sys.argv) < 2:
-    print('Wrong usage')
-    print_usage_and_exit()
+def standalone_entry(nvargs, args, config, explain, debug, stats, gpu, f):



Please add documentation for this function.
Also, do you think standalone_execution_entry or standalone_mode_entry is a better name?

nakul02 · 2017-07-30T22:36:44Z

bin/utils.py

+
+def find_file(name, path):
+    """
+


Could you please complete this documentation?
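
For illustration, a completed docstring might look like the sketch below. The signature find_file(name, path) is from the hunk above; the body and the described behavior (a recursive search that returns the first match) are assumptions, because the rest of the function is not visible here.

```python
import os


def find_file(name, path):
    """
    Search a directory tree for a file with the given name.

    name: String
    File name to look for, for example 'matmul.dml'

    path: String
    Root directory where the recursive search starts

    return: String or None
    Full path of the first match, or None if nothing matches
    """
    for root, _dirs, files in os.walk(path):
        if name in files:
            return os.path.join(root, name)
    return None
```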

nakul02 · 2017-07-30T22:50:56Z

scripts/perftest/python/utils_exec.py

+    return return_data
+
+
+def get_std_out(process):


This name is somewhat misleading, since you return both - stdout and stderr. Maybe you could rename it to something more appropriate?
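
A sketch of what a renamed helper could look like. The name get_std_out_err is only a suggestion, and the body is an assumption built on subprocess.Popen.communicate(); the variable names out_arr and error_arr are taken from the return statement visible in the next hunk.

```python
import subprocess
import sys


def get_std_out_err(process):
    """
    Wait for the process to finish and return its output.

    process: subprocess.Popen
    A process started with stdout and stderr set to subprocess.PIPE

    return: (list of String, list of String)
    Lines written to stdout and lines written to stderr
    """
    out, err = process.communicate()
    out_arr = out.decode('utf-8').splitlines()
    error_arr = err.decode('utf-8').splitlines()
    return out_arr, error_arr


if __name__ == '__main__':
    proc = subprocess.Popen([sys.executable, '-c', 'print("hello")'],
                            stdout=subprocess.PIPE, stderr=subprocess.PIPE)
    print(get_std_out_err(proc))
```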

nakul02 · 2017-07-30T22:52:28Z

scripts/perftest/python/utils_exec.py

+    return out_arr, error_arr
+
+
+def parse_dir(std_outs):


How about the function name parse_hdfs_paths instead of parse_dir?

nakul02 · 2017-07-30T22:53:46Z

scripts/perftest/python/utils_fs.py

+        os.makedirs(directory)
+
+
+def write_success(time, cwd):


I thought cwd is mostly used to refer to the current working directory.
How about renaming this variable to something else, like directory or dir or something similar?

nakul02 · 2017-07-30T22:55:00Z

scripts/perftest/python/utils_fs.py

+                open(full_path, 'w').close()
+
+
+def get_existence(path):


Maybe the name could be check_SUCCESS_file_exists
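
As a rough sketch of the renamed function: the marker file name '_SUCCESS' and the plain local-filesystem check are assumptions, since the body of get_existence is not shown in this hunk (the real function may also need to handle HDFS paths).

```python
import os


def check_SUCCESS_file_exists(path):
    """
    Check whether a previous run left a success marker in a directory.

    path: String
    Directory that is expected to contain the marker file

    return: Boolean
    True if the marker file exists, False otherwise
    """
    # Assumption: the marker is a file named '_SUCCESS' inside `path`.
    return os.path.isfile(os.path.join(path, '_SUCCESS'))
```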

nakul02 · 2017-07-30T22:56:37Z

scripts/perftest/python/utils_misc.py

+# This file contains all misc utility functions required by performance test module
+
+
+def sup_args(config_dict, spark_dict, exec_type):


Not sure what this name means. Could you please think of a more descriptive name?

krishnakalyan3 · 2017-07-30T23:02:35Z

Thanks for the review @nakul02.

akchinSTC · 2017-07-31T00:44:18Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1837/

akchinSTC · 2017-08-01T08:56:45Z

Refer to this link for build results (access rights to CI server needed):
https://sparktc.ibmcloud.com/jenkins/job/SystemML-PullRequestBuilder/1843/

nakul02 · 2017-08-01T20:42:51Z

LGTM, I shall merge.

Completed these tasks as part of Phase 2 for Google Summer of Code '17:
- Decouple systemml-spark-submit.py
- Decouple systemml-standalone.py
- Refactor perf test suite to accept args like debug, stats, config, etc.
- Add HDFS support
- Google Docs support
- Compare SystemML with previous versions
- Pylint, Comment
- Extra arguments configuration Test
- Windows Test
- Doc update
- systemml standalone comments
- systemml spark submit comments

Closes #575

[SYSTEMML-1451] phase 2 work

7367005

krishnakalyan3 force-pushed the SYSTEMML-1451-phase2 branch from fe378f6 to 7367005 on July 28, 2017 17:40

fix missing refs

1334d37

krishnakalyan3 added 2 commits July 28, 2017 23:37

add docstring

bfaa025

pylint and docstring

d3921cb

krishnakalyan3 added 2 commits July 29, 2017 12:23

fix error handling

377c0c7

update comments

e611851

krishnakalyan3 added 2 commits July 29, 2017 17:11

update doc

dd7fbc7

updates to docs

31900dc

update doc

fd68d73

krishnakalyan3 added 2 commits July 30, 2017 17:37

gspread

87ecbed

windows fix

b7022e5

nakul02 reviewed Jul 30, 2017

krishnakalyan3 added 3 commits August 1, 2017 01:30

update todos and fix comments

ac8178c

minor test changes

16ed00e

debug off

f668444

asfgit closed this in e94374a Aug 1, 2017
