Parallel Diff-ing and other Tweaks #66

Myoldmopar · 2021-05-11T17:44:20Z

This has some changes in place to get parallel diff calculations working. I haven't tested it exhaustively, but it looks like it is working, and all unit tests continue to pass. I still need to verify that CI will be OK with it, by pointing a Decent CI config file to this branch and making sure some diffs appear in that branch.

This will continue to run serially wherever it did before (the existing check covers frozen apps on Windows/Mac). Now that E+ is thread safe, we need to make sure that table-diff and math-diff are also thread safe, and then we can get all this running using threading rather than multiprocessing, and it will be parallel on all platforms.

mitchute · 2021-05-11T17:49:03Z

Parallel diff-ing!! 🤩

Myoldmopar · 2021-05-11T17:45:16Z

epregressions/diffs/ci_compare_script.py

+        runner.test_output_dir,
+        runner.thresh_dict_file,
+        ci_mode=True
+    )  # returns an updated entry


process_diffs_for_one_case is now a @staticmethod to make it easier to use with the process pool. This required some extra parameters to be passed into it.
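
A minimal sketch of the idea, not the actual implementation: the class name, the returned dict, and the helper `run_all_diffs` are hypothetical, and the parameter names are only guessed from the call shown in the hunk above. To keep the sketch runnable anywhere, a `ThreadPool` stands in for the process pool; it exposes the same `apply_async` API.

```python
from multiprocessing.pool import ThreadPool


class SuiteRunnerSketch:
    """Hypothetical stand-in for the suite runner; not the real class."""

    @staticmethod
    def process_diffs_for_one_case(entry, test_output_dir, thresh_dict_file, ci_mode=False):
        # No `self` here: everything the worker needs arrives as an argument.
        return {
            "name": entry,
            "output_dir": test_output_dir,
            "thresholds": thresh_dict_file,
            "ci_mode": ci_mode,
        }


def run_all_diffs(entries, test_output_dir, thresh_dict_file, workers=2):
    # ThreadPool is used for portability; multiprocessing.Pool has the same call shape.
    with ThreadPool(workers) as pool:
        pending = [
            pool.apply_async(
                SuiteRunnerSketch.process_diffs_for_one_case,
                (entry, test_output_dir, thresh_dict_file),
                {"ci_mode": True},
            )
            for entry in entries
        ]
        # .get() blocks until that particular task has finished.
        return [p.get() for p in pending]
```

Because the method takes no `self`, the pool only has to hand over plain arguments rather than the whole runner instance.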

Myoldmopar · 2021-05-11T17:45:46Z

epregressions/runtests.py

        self.id_like_to_stop_now = False
+        self.completed_structure = None


The completed structure instance is now a member variable on the suite runner class so that the process pool callback can access it as processes complete.
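
Roughly how that looks, as a sketch under assumptions: `RunnerSketch`, `diff_one_case`, and `diff_done` are hypothetical names, the completed structure is modeled as a plain list, and a `ThreadPool` stands in for the process pool since both accept a `callback` in `apply_async`.

```python
from multiprocessing.pool import ThreadPool


def diff_one_case(name):
    # Placeholder for the real diff work.
    return name.upper()


class RunnerSketch:
    def __init__(self):
        # Set up in run(); held on the instance so the callback can reach it.
        self.completed_structure = None

    def diff_done(self, result):
        # Invoked by the pool as each task finishes, in completion order.
        self.completed_structure.append(result)

    def run(self, names):
        self.completed_structure = []
        pool = ThreadPool(2)
        for name in names:
            pool.apply_async(diff_one_case, (name,), callback=self.diff_done)
        pool.close()
        pool.join()  # all tasks and their callbacks have run once this returns
        return self.completed_structure
```

Results arrive in completion order, so anything reading the structure afterwards should not assume the submission order.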

Myoldmopar · 2021-05-11T17:46:19Z

epregressions/runtests.py

@@ -372,7 +375,8 @@ def run_build(self, build_tree):
        # `apply_async` approach I am using.  Blech.  Once again, on Windows, this means it will partially not be
        # multithreaded.
        if self.number_of_threads == 1 or frozen and system() in ['Windows', 'Darwin']:  # pragma: no cover
-            self.my_print("Ignoring num_threads on frozen Windows/Mac instance, just running with one thread.")
+            if self.number_of_threads > 1:
+                self.my_print("Ignoring num_threads on frozen Windows/Mac instance, just running with one thread.")


Only print this warning message if you are actually trying to run multithreaded. Previously it would show up even if you intentionally ran a single thread.
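
The decision can be sketched as a small pure function. This is an illustration only: `choose_run_mode` and its parameters are hypothetical, and the condition simply mirrors the one in the hunk above (`and` binds tighter than `or`, made explicit here with parentheses).

```python
def choose_run_mode(number_of_threads, frozen, platform_name):
    """Return (run_serially, warning); warning is None when there is nothing to say."""
    serial = number_of_threads == 1 or (frozen and platform_name in ("Windows", "Darwin"))
    warning = None
    if serial and number_of_threads > 1:
        # Only warn when the user actually asked for more than one thread.
        warning = "Ignoring num_threads on frozen Windows/Mac instance, just running with one thread."
    return serial, warning
```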

Myoldmopar · 2021-05-11T17:47:18Z

epregressions/runtests.py

@@ -697,8 +704,7 @@ def process_diffs_for_one_case(self, this_entry, ci_mode=False):
                        EndErrSummary.STATUS_SUCCESS,
                        runtime_case2
                    ))
-                self.my_print("TestMathAndKill Fatal-ed as expected, continuing with no diff checking on it")
-                return this_entry
+                return this_entry, "TestMathAndKill Fatal-ed as expected, continuing with no diff checking on it"


self.my_print is no longer available from this static method, so this function simply returns a string message that can be printed once back in the process pool callback function.
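
A sketch of that hand-off, assuming the worker returns an `(entry, message)` pair on every path with an empty message when there is nothing to report. `process_case`, `RunnerSketch`, and `diff_done` are hypothetical names; only `my_print` comes from the PR.

```python
def process_case(entry, expect_fatal=False):
    # Static-style worker: it cannot print through the runner, so any
    # message travels back alongside the entry.
    if expect_fatal:
        return entry, "Fatal-ed as expected, continuing with no diff checking on it"
    return entry, ""


class RunnerSketch:
    def __init__(self, printer=print):
        self.my_print = printer

    def diff_done(self, result):
        # Runs back in the parent, where my_print is available again.
        entry, message = result
        if message:
            self.my_print(message)
        return entry
```

Passing the printer in makes the callback easy to exercise in a unit test without a pool.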

Myoldmopar · 2021-05-11T17:47:40Z

epregressions/runtests.py

@@ -786,14 +786,14 @@ def process_diffs_for_one_case(self, this_entry, ci_mode=False):
                path_to_math_diff_log)), MathDifferences.SSZ)

        # Do sorta-math-diff JSON diff
-        if self.both_files_exist(case_result_dir_1, case_result_dir_2, 'eplusout_hourly.json'):
-            this_entry.add_math_differences(MathDifferences(self.diff_json_time_series(
+        if SuiteRunner.both_files_exist(case_result_dir_1, case_result_dir_2, 'eplusout_hourly.json'):


Several functions were already static methods but still referenced by self, so these were cleaned up.

Myoldmopar · 2021-05-11T17:48:05Z

epregressions/runtests.py

            self.build_tree_a['source_dir'], self.build_tree_a['build_dir'],
            self.build_tree_b['source_dir'], self.build_tree_b['build_dir'],
            os.path.join(self.build_tree_a['build_dir'], self.test_output_dir),
            os.path.join(self.build_tree_b['build_dir'], self.test_output_dir)
        )
+        diff_runs = []


New functionality to allow diffs to run in parallel through the process pool.

Myoldmopar · 2021-05-11T17:48:47Z

epregressions/tests/test_runtests.py

+        if diff_results.entries_by_file[0].basename == 'my_file':
+            results_for_file = diff_results.entries_by_file[0]
+        elif diff_results.entries_by_file[1].basename == 'my_file':
+            results_for_file = diff_results.entries_by_file[1]


This was an interesting one: previously the two files always came out in serial order, but now they sometimes come out reversed. I simply check which entry is the one I'm inspecting before calling assertions on it.
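
One way to generalize that check beyond two entries is to look the entry up by name. This is a sketch, not what the test does: `entry_for` is a hypothetical helper, and `SimpleNamespace` objects stand in for the real result entries.

```python
from types import SimpleNamespace


def entry_for(entries, basename):
    # Search by name instead of relying on list position.
    for entry in entries:
        if entry.basename == basename:
            return entry
    raise AssertionError(f"no entry named {basename!r}")


# Stand-in entries, deliberately in "backwards" order.
entries = [SimpleNamespace(basename="other_file"), SimpleNamespace(basename="my_file")]
results_for_file = entry_for(entries, "my_file")
```

Raising when nothing matches also avoids silently leaving `results_for_file` unset.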

Myoldmopar · 2021-05-11T17:49:22Z

epregressions/tk_window.py

@@ -638,6 +643,24 @@ def idf_select_all(self):
            self.active_idf_listbox.insert(END, idf)
        self.idf_refresh_count_status()

+    def idf_select_all_except_long_runs(self):


Add in a new way to select all but the longest running files. This can take tons of time off the run just by skipping these few files.
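
The selection logic might look something like this. It is a sketch only: the function name, its parameters, and the file names in the usage line are hypothetical, and how the real method decides which files count as long-running is not shown in this PR excerpt.

```python
def select_all_except_long_runs(all_idfs, long_running):
    # Keep the original ordering; drop only names flagged as long-running.
    skip = set(long_running)
    return [idf for idf in all_idfs if idf not in skip]


selected = select_all_except_long_runs(["a.idf", "b.idf", "c.idf"], ["b.idf"])
```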

Myoldmopar · 2021-05-11T17:49:59Z

> Parallel diff-ing!!

Take a look at this branch at your leisure and let me know if you run into any issues.

Myoldmopar · 2021-05-11T18:04:45Z

I just ran this on the rmPlantHack branch and it worked flawlessly, showing all the diffs. The best part: I pulled the branch and it took just over 7 minutes to do a full rebuild including unit tests, then the full regression suite of 723 files took just over 4 minutes including both sets of runs. Things are looking up!

mitchute · 2021-05-11T18:09:46Z

Is this run on DecentCI?

mitchute · 2021-05-11T22:04:49Z

Looks great to me! We're definitely parallel-processing the diffs now. Nice job, @Myoldmopar 🥇

Myoldmopar · 2021-05-11T22:42:35Z

> Is this run on DecentCI?

It is run on Decent CI, but not in a way that will make a difference with this multi-processing. The call out to the regression script is made as part of a CTest execution. So if you run ctest -j N, each of those tests will independently be calling the regression script in different processes. So it's all good over there already.

Myoldmopar · 2021-05-12T01:14:05Z

I just pushed a commit based on the rmPlantHack branch that points to this branch in the regression repo. If it looks all clean I'll go ahead and merge this down.

Myoldmopar · 2021-05-23T02:55:39Z

OK, I think I'm going to merge this...if we notice anything later I'll revert or address it.

Myoldmopar added 4 commits March 24, 2021 09:06

A couple small tweaks to the UI

3c440e9

Diffs might run parallellelly now

97772d9

Fix parallel test in case they are completed out of order

b5fbac1

Tweak error handling for bad diffs

fc9c103

Myoldmopar requested a review from mitchute May 11, 2021 17:44

Myoldmopar commented May 11, 2021


mitchute approved these changes May 11, 2021


Coverage back up to 100%

5f2634d

Myoldmopar linked an issue May 12, 2021 that may be closed by this pull request

Make it more parallel #31

Closed

Myoldmopar merged commit 237b8a3 into master May 23, 2021

Myoldmopar deleted the SmallTweaks branch May 23, 2021 02:55
