@LiekevdHeide you can use this branch to add the statistics tracking stuff!
hgs_vrptw/include/Statistics.h (outdated):

```cpp
using clock = std::chrono::system_clock;
using timedDatapoints = std::vector<std::pair<clock::time_point, double>>;

// TODO measure and store population diversity statistic?
```
@LiekevdHeide (and/or @leonlan): do you have any idea on how to measure population diversity?
Are you asking how it is currently measured (broken pairs distance), or are you asking whether we have other suggestions for measuring diversity?
The latter; the former I already know about. I'm looking for some alternatives that we can quickly compute for the whole population, rather than using the broken pairs distance, which is currently only computed for the neighbourhood defined by nbGranular.
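For context, the broken pairs distance mentioned here counts the edges (consecutive-client pairs) of one giant tour that do not appear in the other. A minimal Python sketch, with a hypothetical helper name, assuming both tours are permutations of the same client indices and ignoring wrap-around edges (the actual C++ implementation in `Individual` may handle these details differently):

```python
def broken_pairs_distance(tour_a, tour_b):
    """Number of undirected edges of tour_a that do not occur in tour_b.

    Tours are giant-tour chromosomes: permutations of the same client
    indices. Edges are taken between consecutive clients only.
    """
    def edges(tour):
        # Undirected edges as frozensets so (u, v) == (v, u).
        return {frozenset(pair) for pair in zip(tour, tour[1:])}

    return len(edges(tour_a) - edges(tour_b))


# Example: the tours share edges {3,4} and {1,2}, differ in one edge.
print(broken_pairs_distance([1, 2, 3, 4], [2, 1, 3, 4]))  # → 1
```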
You could calculate broken pairs over the entire population if you ignore nbClose in Individual::avgBrokenPairDistanceClosest, no?
Prins (p. 36) also outlines three other measures for the giant tour chromosome: 1) Hamming, 2) Broken Pairs, and 3) Levenshtein distance. I think the Hamming distance might be interesting to try out. It's easy to compute and cheap (same time complexity as broken pairs), and for the VRPTW it does not have the drawback of "circular shifts" as stated in the slides.
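A rough Python sketch of the Hamming-based idea (all names hypothetical; the solver itself is C++, and whether to average over all pairs or only over close individuals is an open design choice): the Hamming distance counts positions at which two equal-length giant tours differ, and averaging it over all pairs gives a simple whole-population diversity proxy.

```python
from itertools import combinations


def hamming(tour_a, tour_b):
    """Number of positions at which two equal-length giant tours differ."""
    return sum(a != b for a, b in zip(tour_a, tour_b))


def avg_pairwise_hamming(population):
    """Mean Hamming distance over all pairs of individuals.

    O(p^2 * n) for p individuals with tours of length n, so fine for
    typical population sizes, but restricting to nearest individuals
    (as avgBrokenPairDistanceClosest does) would also work.
    """
    pairs = list(combinations(population, 2))
    return sum(hamming(a, b) for a, b in pairs) / len(pairs)


pop = [[1, 2, 3, 4], [1, 3, 2, 4], [4, 3, 2, 1]]
print(avg_pairwise_hamming(pop))  # mean over the three pairs
```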
I'll add something like this and then I think this PR is done.
Defaults to false, since we do not usually want to collect statistics. Also fix a rounding issue with the seconds/runtime calculation in the plots.
I think this is done. @LiekevdHeide @leonlan can you review this (hopefully today)?
I will review it today!
benchmark.py (outdated):

```
@@ -8,6 +8,7 @@
from tqdm.contrib.concurrent import process_map

import tools
from python.classes import Measures


def parse_args():
```
Suggestion: add an argument --collect_statistics to make it optional.
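A minimal sketch of what the suggested flag could look like in parse_args (only the flag name comes from this thread; the surrounding parser setup is hypothetical, and any existing arguments in benchmark.py would sit alongside it):

```python
import argparse


def make_parser():
    parser = argparse.ArgumentParser()
    # store_true makes the flag default to False: statistics are only
    # collected when --collect_statistics is passed explicitly.
    parser.add_argument(
        "--collect_statistics",
        action="store_true",
        help="collect per-iteration solver statistics (small overhead)",
    )
    return parser


# Off by default; enabled only when the flag is given.
print(make_parser().parse_args([]).collect_statistics)  # False
print(make_parser().parse_args(["--collect_statistics"]).collect_statistics)  # True
```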
Question: how should we evaluate a benchmark without statistics?
If we want to benchmark the solver on the average costs for the given competition time limits, then I don't see the need for collecting statistics. (Assuming that collecting statistics adds non-negligible computational overhead.)
It's small, as in, I don't see a difference in solution quality with stats on/off ATM.
Two issues:
- It's a hassle to make this optional, and I want to sleep. :-)
- I already use these stats to make decisions on how well ideas work, in addition to just the average cost. E.g., the number of iterations tells me whether the local search went faster or slower than previously, and similarly the number of improving moves is useful in checking that the algorithm actually does something for each instance.
Since the performance hit is minuscule, I do not yet see the need to remove collection from the benchmark script.
If the performance hit is minuscule, then I don't see any issues here.
I didn't realize that measures/stats were coupled, so when reviewing I thought it would be an almost trivial addition. But I now understand your first point so I think it's bedtime for me :-)
Closes #21.