refactor generate_optional_value function & add Terasort testcase #312
carsonwang merged 5 commits into Intel-bigdata:6.0 from …
Conversation
refactor generate_optional_value function to probe the JAVA_OPTS in a right way, gain a clearer hint & comments, wipe out support for CDH4 and MR1, and grant more readability.
```python
# CDH release
elif HibenchConf['hibench.hadoop.release'].startswith('cdh'):
    HibenchConf["hibench.hadoop.examples.test.jar"] = OneAndOnlyOneFile(
        HibenchConf['hibench.hadoop.home'] + "/share/hadoop/mapreduce2/hadoop-mapreduce-client-jobclient*-tests.jar")
```
Similar to the example jar, is there another path for CDH here?
```diff
  # set hibench.sleep.job.jar
  if not HibenchConf.get('hibench.sleep.job.jar', ''):
-     if HibenchConf['hibench.hadoop.release'] == 'apache' and HibenchConf["hibench.hadoop.version"] == "hadoop1":
+     if HibenchConf['hibench.hadoop.release'] == 'apache':
```
According to the original condition, this only applies to hadoop1. For hadoop2, the path is different. We should remove this.
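A hedged sketch of the guard the reviewer is referring to (names mirror the diff, but this is an illustration, not the PR's actual code): the original condition checked both the release and the version, so the sleep-job jar path applied to hadoop1 only.

```python
# Illustration only: the original guard required BOTH checks, so the
# path below it was hadoop1-specific; dropping the version check would
# wrongly apply it to hadoop2 as well.
def needs_hadoop1_sleep_jar(conf):
    return (conf.get('hibench.hadoop.release') == 'apache'
            and conf.get('hibench.hadoop.version') == 'hadoop1')
```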
```python
# CDH release
elif HibenchConf["hibench.hadoop.release"].startswith("cdh"):
    HibenchConf["hibench.hadoop.configure.dir"] = join(HibenchConf["hibench.hadoop.home"], "etc", "hadoop")
    HibenchConfRef["hibench.hadoop.configure.dir"] = "Inferred by: & 'hibench.hadoop.release'"
```
Is there no difference between apache, hdp, and cdh anymore? Can we combine these and remove the if/else?
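A hedged sketch of that suggestion (the function name is hypothetical; `HibenchConf`/`HibenchConfRef` stand in for the PR's dicts): if the configure dir is identical for apache, hdp, and cdh, the per-release branches collapse into a single assignment.

```python
import os.path

# Illustration of collapsing the per-release if/else: one assignment
# covers apache, hdp, and cdh when the relative path is the same.
def infer_configure_dir(HibenchConf, HibenchConfRef):
    HibenchConf["hibench.hadoop.configure.dir"] = os.path.join(
        HibenchConf["hibench.hadoop.home"], "etc", "hadoop")
    HibenchConfRef["hibench.hadoop.configure.dir"] = (
        "Inferred by: 'hibench.hadoop.home'")
```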
```diff
  # determine running mode according to spark master configuration
- if not (HibenchConf.get("hibench.masters.hostnames", "") or HibenchConf.get("hibench.slaves.hostnames", "")):  # no pre-defined hostnames, let's probe
+ if not (HibenchConf.get("hibench.masters.hostnames", "") or HibenchConf.get("hibench.slaves.hostnames",
+                                                                             "")):  # no pre-defined hostnames, let's probe
```
Is this the right Python style for us to follow?
Probably not; the two lines have the same length, yet they follow different styles.
The newest version of these two lines has many line breaks. It looks odd to new Python developers such as me, but it conforms to PEP 8.
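For illustration, one PEP 8-friendly way to wrap a long condition is to extract it into a helper and rely on implicit continuation inside parentheses, avoiding both the very long line and the awkward mid-call break (the function name here is a stand-in, not the PR's code):

```python
# A PEP 8-compatible wrapping of the same check: continuation happens
# inside parentheses, with the 'or' leading the wrapped line.
def has_predefined_hostnames(conf):
    return bool(
        conf.get("hibench.masters.hostnames", "")
        or conf.get("hibench.slaves.hostnames", "")
    )
```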
```python
probe_java_opts()
# test_succeed()


def test_succeed():
```
We need to write some unit tests later to cover this.
```diff
  log(spark_master, HibenchConf['hibench.masters.hostnames'])
  with closing(urllib.urlopen('http://%s:8080' % HibenchConf['hibench.masters.hostnames'])) as page:
-     worker_hostnames=[re.findall("http:\/\/([a-zA-Z\-\._0-9]+):8081", x)[0] for x in page.readlines() if "8081" in x and "worker" in x]
+     worker_hostnames = [re.findall("http:\/\/([a-zA-Z\-\._0-9]+):8081", x)[0] for x in page.readlines()
```
We need to fix the hard-coded port number later.
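A hedged sketch of that fix: `worker_port` is a hypothetical parameter (not in the PR), and `parse_worker_hostnames` mirrors the list comprehension in the diff above so the port appears in exactly one place.

```python
import re

# Hypothetical refactor: the worker UI port (8081 in the original code)
# becomes a parameter instead of being hard-coded in three places.
def parse_worker_hostnames(lines, worker_port=8081):
    pattern = re.compile(r"http://([a-zA-Z\-._0-9]+):%d" % worker_port)
    return [pattern.findall(x)[0]
            for x in lines
            if str(worker_port) in x and "worker" in x]
```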
Change probe_java_opts function to deal with any weird xml style, tune the codes and remove some unuseful codes
Add bin/conf for terasort, already finished test for Hadoop, Spark Standalone and Spark on yarn
```python
bufsize=0,  # default value of 0 (unbuffered) is best
shell=True,
stdout=subprocess.PIPE,
stderr=subprocess.PIPE
```
The style here seems inconsistent with the rest. Is a space needed before and after `=`? Is this caused by the auto-formatter?
Yes, it was modified by autopep8; you can install it with `pip install autopep8`.
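As a side note on the spacing question: PEP 8 itself says not to put spaces around `=` when it marks a keyword argument, so autopep8 leaves calls like the `Popen` in the diff above untouched. A minimal runnable sketch (the `echo` command is just a placeholder):

```python
import subprocess

# PEP 8: no spaces around '=' for keyword arguments, which is why
# autopep8 keeps `shell=True` rather than `shell = True`.
proc = subprocess.Popen(
    "echo hello",
    bufsize=0,
    shell=True,
    stdout=subprocess.PIPE,
    stderr=subprocess.PIPE,
)
out, _ = proc.communicate()
```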
```shell
SIZE=`dir_size $INPUT_HDFS`
START_TIME=`timestamp`
run-hadoop-job ${HADOOP_EXAMPLES_JAR} terasort \
    -D ${REDUCER_CONFIG_NAME}=${NUM_REDS} \
```
Can you update this to `-D mapreduce.job.reduces=${NUM_REDS}`? REDUCER_CONFIG_NAME will be removed later because mapreduce.job.reduces is its only value.
```shell
START_TIME=`timestamp`
run-hadoop-job ${HADOOP_EXAMPLES_JAR} teragen \
    -D ${MAP_CONFIG_NAME}=${NUM_MAPS} \
    -D ${REDUCER_CONFIG_NAME}=${NUM_REDS} \
```
Do not use MAP_CONFIG_NAME and REDUCER_CONFIG_NAME here either.
|
Thanks @gczsjdy for the work!
refactor generate_optional_value function & add Terasort testcase (Intel-bigdata#312)

* refactor generate_optional_value function to probe the JAVA_OPTS in a right way, gain a clearer hint & comments, wipe out support for CDH4 and MR1, and grant more readability.
* Change probe_java_opts function to deal with any weird xml style, tune the codes and remove some unuseful codes
* Use autopep8 to standardize the code
* Add bin/conf for terasort, already finished test for Hadoop, Spark Standalone and Spark on yarn
* Use new config name instead of the old