Rollout algorithm #90
Conversation
This generalizes random, dqn, and rollout so we don't have to write a separate `run_xxx` for every strategy.
dqn needs to load a network model before starting the epochs. This is currently not possible in run_dispatch.
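One way the generalized entry point could look, as a minimal sketch (only the name `run_dispatch` comes from the PR; the strategy registry, the optional `setup` hook for loading the DQN model before the epochs, and the environment interface are all assumptions):

```python
# Hypothetical sketch (not the PR's actual code) of a generalized
# run_dispatch: each strategy is a plain callable, and strategies that
# need one-time setup (e.g. dqn loading a network model) register an
# optional `setup` hook that runs once before the epochs start.
STRATEGIES = {}

def register(name, func, setup=None):
    """Register a dispatching strategy with an optional setup hook."""
    STRATEGIES[name] = (func, setup)

def run_dispatch(name, env):
    func, setup = STRATEGIES[name]
    state = setup(env) if setup is not None else None  # e.g. load DQN weights
    obs, done = env.reset(), False
    while not done:
        obs, done = env.step(func(obs, state))
    return obs
```

With this shape, adding a new strategy is one `register` call instead of a new `run_xxx` script, and the dqn model-loading concern becomes its `setup` hook.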
Because of moving data between C++ and Python? I suspect we can shorten this a bit if rollout turns out to be promising.
I checked again: it's roughly 10-20 ms for the C++ and Python interaction due to …
In multiple places we now have …
I benchmarked 78408e3. Performance is retained, i.e., roughly a 10k improvement over greedy+.
I'll continue with this today, mostly by profiling stuff and finding ways to make it faster.
dynamic/rollout/simulate_instance.py
cust_idx = rng.integers(n_customers, size=n_samples) + 1
tw_idx = rng.integers(n_customers, size=n_samples) + 1
service_idx = rng.integers(n_customers, size=n_samples) + 1

# These are unnormalized time windows and release times, which are used to
# determine request feasibility. Will be clipped later.
sim_tw = tws[tw_idx]
sim_epochs = np.repeat(np.arange(1, max_lookahead + 1), EPOCH_N_REQUESTS)
sim_release = start_time + sim_epochs * EPOCH_DURATION
sim_service = static_inst["service_times"][service_idx]

# Earliest arrival is release time + drive time or earliest time window.
earliest_arrival = np.maximum(sim_release + dist[0, cust_idx], sim_tw[:, 0])
earliest_return = earliest_arrival + sim_service + dist[cust_idx, 0]
feas = (earliest_arrival <= sim_tw[:, 1]) & (earliest_return <= tws[0, 1])
It happens fairly often that `feas` holds for less than 30% of the initial customers. That's unfortunate, of course, but I do not see a good way around this, because we should replicate the way the controller generates customers (and they do this as well).
But it does mean we can postpone some work until after we know the (much smaller) subset of feasible customers.
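The suggested postponement can be sketched as follows (a toy NumPy example: the sizes, depot time window, and sampled data are all made up, and the array names only loosely mirror the diff above, with the separate `tw_idx` sampling simplified away):

```python
import numpy as np

# Made-up problem data; in the real code these come from the instance.
rng = np.random.default_rng(42)
n_samples, n_customers = 1_000, 50

dist = rng.integers(1, 100, size=(n_customers + 1, n_customers + 1))
tws = np.sort(rng.integers(0, 1_000, size=(n_customers + 1, 2)), axis=1)
tws[0] = (0, 2_000)  # depot time window (assumed)
service = rng.integers(1, 30, size=n_customers + 1)

cust_idx = rng.integers(n_customers, size=n_samples) + 1
release = rng.integers(0, 500, size=n_samples)

# Compute the cheap feasibility mask first...
earliest_arrival = np.maximum(release + dist[0, cust_idx], tws[cust_idx, 0])
earliest_return = earliest_arrival + service[cust_idx] + dist[cust_idx, 0]
feas = (earliest_arrival <= tws[cust_idx, 1]) & (earliest_return <= tws[0, 1])

# ...then restrict any expensive follow-up work to the feasible subset,
# which is often much smaller than n_samples.
feas_cust = cust_idx[feas]
feas_release = release[feas]
```

Since boolean-mask indexing copies only the selected rows, everything built after the mask scales with the feasible count rather than with `n_samples`.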
if n_new_customers == 0:  # this should not happen a lot
    return simulate_instance(info, obs, rng, n_lookahead)
This will probably never happen, but it might, and then the code below it crashes. So to avoid that we should just try again.
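The retry idea can also be written as a bounded loop, which avoids the (purely theoretical) unbounded recursion of calling `simulate_instance` again from inside itself. This is a hypothetical helper, not the PR's code; `sample_fn` stands in for the sampling step:

```python
def sample_until_nonempty(sample_fn, rng, max_retries=100):
    """Resample until at least one new customer is drawn.

    `sample_fn(rng)` is a stand-in for the sampling step inside
    simulate_instance and returns the number of new customers drawn.
    """
    for _ in range(max_retries):
        n_new = sample_fn(rng)
        if n_new > 0:  # the common case: proceed with the simulation
            return n_new
    raise RuntimeError("no new customers sampled after max_retries draws")
```

Either form works; the loop just puts an explicit bound on the retry behavior.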
@leonlan @jaspervd96 I think this is basically done for a first implementation. Shall we merge this?
LGTM. Two remaining (small) points are:
- Use the environment to get the constants `EPOCH_N_REQUESTS` and `EPOCH_DURATION`.
- Prevent the simulation from exceeding the time limit by too much.

Do you mind addressing these points as well?
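The first point could look something like this sketch (the attribute names on the environment and the fallback defaults are assumptions; the PR only names the two constants):

```python
def get_epoch_constants(env, default_n_requests=100, default_duration=3600):
    """Hypothetical accessor: read EPOCH_N_REQUESTS and EPOCH_DURATION from
    the environment instead of hard-coded module-level constants, falling
    back to (made-up) defaults if the env does not expose them."""
    n_requests = getattr(env, "EPOCH_N_REQUESTS", default_n_requests)
    duration = getattr(env, "EPOCH_DURATION", default_duration)
    return n_requests, duration
```

Centralizing the lookup in one accessor also makes it easy to later swap the fallbacks for a hard error once every environment provides the constants.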
Sure!
Turns out that removing these is pretty hard because the …
@jaspervd96 can you approve if you're OK with this PR? I'll wait for the CI to complete as well.
… reason. So we'll just use the (static) benchmark for now.
Will do. Only added 1 more small suggestion.
Co-authored-by: jaspervd96 <jasper.vandoorn@hotmail.com>
This PR adds:
- `run_dispatch`, which takes as input a dispatching strategy to solve the dynamic problem
- `solve_static` (not used yet)