Feature/Population Based Training #285
Conversation
This pull request introduces 6 alerts when merging c234d31 into 7e6fd83 - view on LGTM.com.
This pull request introduces 6 alerts when merging 3a59f3c into 7e6fd83 - view on LGTM.com.
This pull request introduces 6 alerts when merging 24d6707 into 7e6fd83 - view on LGTM.com.
This pull request introduces 6 alerts when merging b25af4a into 7e6fd83 - view on LGTM.com.
This pull request introduces 6 alerts when merging 3fdfcba into 7e6fd83 - view on LGTM.com.
What?
Add the first example of population-based training (PBT) in Mava. This example uses the recurrent MAD4PG algorithm to train a population of 5 networks, using 5 trainers and 5 executors, on the debugging environment. The hyperparameters being tuned are the discount factor, the target update rate, and the target update period. This PR will remain a draft for now, as it still needs to be tested on a more complex environment over longer training runs.
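To make the setup concrete, here is a minimal illustrative sketch (plain Python, not Mava's actual API) of how a population of 5 members could each be initialised with its own sampled values for the three hyperparameters named above; the ranges and class names are assumptions for illustration only:

```python
import random
from dataclasses import dataclass


@dataclass
class MemberConfig:
    """Hypothetical per-member hyperparameters tuned by PBT."""
    discount: float            # discount factor
    target_update_rate: float  # soft target update rate
    target_update_period: int  # steps between target updates


def sample_config(rng: random.Random) -> MemberConfig:
    # Sample each hyperparameter from an assumed, illustrative range.
    return MemberConfig(
        discount=rng.uniform(0.95, 0.999),
        target_update_rate=rng.uniform(0.001, 0.05),
        target_update_period=rng.choice([50, 100, 200]),
    )


rng = random.Random(0)
# One config per member: 5 networks, 5 trainers, 5 executors.
population = [sample_config(rng) for _ in range(5)]
```

Each trainer/executor pair would then train its network under its own member's configuration.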
Why?
Population-based training allows the joint optimisation of hyperparameters and network parameters within a single training run.
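The core of that joint optimisation is an exploit/explore step: a poorly performing member copies the state of a well-performing one and then perturbs the copied hyperparameters. A minimal sketch of one such step, with `perturb`, the dict layout, and the scoring all illustrative assumptions rather than Mava's implementation:

```python
import copy
import random


def pbt_step(population, scores, rng, perturb=1.2):
    """One exploit/explore step: the worst-scoring member copies the
    best member's state, then multiplies each hyperparameter by
    `perturb` or `1 / perturb` at random."""
    ranked = sorted(range(len(population)), key=lambda i: scores[i])
    worst, best = ranked[0], ranked[-1]
    member = copy.deepcopy(population[best])  # exploit: inherit weights + hparams
    member["hparams"] = {
        k: v * (perturb if rng.random() < 0.5 else 1 / perturb)
        for k, v in member["hparams"].items()  # explore: perturb hparams
    }
    population[worst] = member
    return population


rng = random.Random(0)
population = [{"hparams": {"discount": 0.9 + 0.02 * i}} for i in range(5)]
scores = [1.0, 5.0, 3.0, 2.0, 4.0]  # illustrative episode returns per member
population = pbt_step(population, scores, rng)
```

Because network parameters are copied along with hyperparameters, no training progress is discarded when a member switches to a better configuration.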
How?
Various hooks have been added inside the MADDPG system. A PBT wrapper has also been added; it can wrap a MADDPG or MAD4PG system and override the appropriate hooks to add PBT to the system.
Extra