feat (search): adding option to disable dumping the results to civ #325

wigging · 2025-05-15T20:16:47Z

Allow the user to disable writing search results to a CSV file. Collect search results using a SearchHistory class. Resolves issue #309 .

…pareto-efficient attribute

src/deephyper/hpo/_search.py

codecov · 2025-05-15T20:25:48Z

Codecov Report

Attention: Patch coverage is 85.87786% with 37 lines in your changes missing coverage. Please review.

Project coverage is 49.92%. Comparing base (c9c39f8) to head (ff2d967).
Report is 20 commits behind head on develop.

Files with missing lines	Patch %	Lines
src/deephyper/evaluator/callback.py	87.37%	4 Missing and 9 partials ⚠️
src/deephyper/hpo/_search.py	88.46%	6 Missing and 6 partials ⚠️
src/deephyper/hpo/_cbo.py	21.42%	10 Missing and 1 partial ⚠️
src/deephyper/skopt/optimizer/optimizer.py	83.33%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #325      +/-   ##
===========================================
+ Coverage    43.69%   49.92%   +6.23%     
===========================================
  Files          124      108      -16     
  Lines         8377     7609     -768     
  Branches      1375     1219     -156     
===========================================
+ Hits          3660     3799     +139     
+ Misses        4387     3468     -919     
- Partials       330      342      +12

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

wigging · 2025-05-15T20:29:08Z

src/deephyper/hpo/_search.py

+        if len(resultsList) > 0:
+            started_dumping = self._csv_cursor > 0
+            file_mode = "a" if started_dumping else "w"
+
+            if not (started_dumping):
+                for result in resultsList:
+                    # Waiting to start receiving non-failed jobs before dumping results
+                    is_single_obj_and_has_success = (
+                        "objective" in result and type(result["objective"]) is not str
+                    )
+                    is_multi_obj_and_has_success = (
+                        "objective_0" in result and type(result["objective_0"]) is not str
+                    )
+                    if is_single_obj_and_has_success or is_multi_obj_and_has_success or flush:
+                        self._csv_columns = result.keys()
+                        break
+
+            if self._csv_columns is not None:
+                with open(os.path.join(path), file_mode) as fp:
+                    writer = csv.DictWriter(fp, self._csv_columns, extrasaction="ignore")
+                    if not (started_dumping):
+                        writer.writeheader()
+                    writer.writerows(resultsList)
+                    self._csv_cursor += len(resultsList)


Can this be avoided by converting the results list to a dataframe then use pandas to write the dataframe to csv file? This would eliminate importing the csv module.

This logic is to avoid redundant writing. It just appends new lines. The pandas logic would re-write everything again an again. Making the complexity squared instead of linear.

The self.jobs = [] is a list of dictionaries, is that right? Each dictionary represents the results of a completed job, is that correct?

The self.jobs is actually a list of HPOJob objects.

…ct in history

wigging · 2025-05-16T13:57:50Z

At the end of the Search.search method is this:

        self.dump_jobs_done_to_csv(flush=True)

        self.history.compute_pareto_efficiency()

        df_results = self.history.to_dataframe()

        return df_results

It looks like the job results are written to CSV then the pareto results are calculated. Does this update the CSV with the pareto results?

…gather callback for callback interface, cleaning up evaluator code, adapting code in search

Deathn0t · 2025-05-16T15:38:00Z

The PR is looking better but we still need to add a few unit tests.

Deathn0t · 2025-05-16T15:44:15Z

The Redis tests needs to be checked and updated if necessary.

wigging · 2025-05-19T19:08:18Z

I merged the latest develop branch into this branch and apparently it fixed the failing Redis tests.

wigging · 2025-05-19T20:00:18Z

For the Search class, is it necessary to name the input argument as checkpoint_history_to_csv? This seems very wordy, why not something shorter like csv_output?

wigging · 2025-05-20T01:23:57Z

The search method is currently defined as:

search(self, max_evals: int = -1, timeout: int = None, max_evals_strict: bool = False)

A better definition would be the following:

search(self, max_evals=-1, timeout=0, max_evals_strict=False) -> pd.DataFrame

When you have default values there is no need to explicitly give the type, it is inferred from the default value. I don't know why timeout would be an int or None; can it have a default value of 0? Providing a return type for this method helps type checkers like pyright otherwise it will think the return type is optional.

Deathn0t · 2025-05-26T07:31:32Z

Hi @wigging ,

adding the test for Pareto efficient is great.
adding the returned type to search(…): seems like a good idea, I will do it.
changing the type def/default value of timeout: let’s do it in an other PR, because it may have an impact.
I am voting for keeping the verbose name of checkpointing_history_to_csv because it is sort of explicit and with auto-completion in idea easy.

Deathn0t added 5 commits March 28, 2025 11:01

Merge branch 'release/0.10.0'

1d4f590

Merge branch 'develop'

df21038

Merge branch 'develop'

3f3e178

chore (search): refactoring management of results

c86dbcd

chore (search): search now using the SearchHisory class also for the …

eb5b22a

…pareto-efficient attribute

wigging commented May 15, 2025

View reviewed changes

src/deephyper/hpo/_search.py Outdated Show resolved Hide resolved

wigging commented May 15, 2025

View reviewed changes

Deathn0t added 5 commits May 16, 2025 11:19

chore (search): refactoring detection of rows with failures

162a76c

chore (tests): adding a test to run a search without dumping csv file

7c8acb5

chore (clean): removed unused import

e523b28

chore (clean): ruff format

819505f

chore (refactor): improved implementation of conversion to list of di…

562eefe

…ct in history

Deathn0t added 3 commits May 16, 2025 16:13

chore (refactor): adding csvlogger callback for evaluator, adding on_…

4e5cf47

…gather callback for callback interface, cleaning up evaluator code, adapting code in search

chore (search): adding checkpoint_history_to_csv for all search init

0060fea

chore (clean): ruff format and check

c9ca9a5

Deathn0t mentioned this pull request May 16, 2025

Remove CSV results file #319

Closed

Merge branch 'develop' into 309-results-output-v2

431b1b0

Remove the convert_for_csv method for Evaluator class

cdd3f45

wigging added 2 commits May 19, 2025 21:10

Add test for pareto efficiency

b897c5e

Fix linter warning

6015155

Fix test

ff2d967

chore (types): adding type hints

b6cb168

chore (tests): adding a check for pareto_efficient indicator

37d4c38

Deathn0t marked this pull request as ready for review May 26, 2025 07:50

Deathn0t changed the title ~~CSV output parameter and search history~~ feat (search): adding option to disable dumping the results to civ May 26, 2025

Deathn0t merged commit 0cc0256 into develop May 26, 2025
9 checks passed

Deathn0t deleted the 309-results-output-v2 branch May 26, 2025 08:05

Deathn0t mentioned this pull request May 26, 2025

[FEATURE] Provide an option to disable creation of results.csv #309

Closed

feat (search): adding option to disable dumping the results to civ #325

feat (search): adding option to disable dumping the results to civ #325

Uh oh!

Conversation

wigging commented May 15, 2025

Uh oh!

Uh oh!

codecov bot commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

wigging May 15, 2025

Choose a reason for hiding this comment

Uh oh!

Deathn0t May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wigging May 16, 2025

Choose a reason for hiding this comment

Uh oh!

wigging May 19, 2025

Choose a reason for hiding this comment

Uh oh!

wigging commented May 16, 2025

Uh oh!

Deathn0t commented May 16, 2025

Uh oh!

Deathn0t commented May 16, 2025

Uh oh!

wigging commented May 19, 2025

Uh oh!

wigging commented May 19, 2025

Uh oh!

wigging commented May 20, 2025

Uh oh!

Deathn0t commented May 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented May 15, 2025 •

edited

Loading

Deathn0t May 16, 2025 •

edited

Loading