Develop generalization training #2232

sankalp04 · 2019-07-08T07:20:42Z

Reopening a new PR since I accidentally closed the other one; This PR consists of all the edits from the previous PR and omit the lesson_controller class.

…and min_lesson_length

…ecks

chriselion · 2019-07-08T17:01:54Z

ml-agents-envs/mlagents/envs/sampler_class.py

+    def __init__(self, intervals, **kwargs):
+        self.intervals = intervals
+        # Measure the length of the intervals
+        self.interval_lengths = list(map(lambda x: abs(x[1] - x[0]), self.intervals))


I find list comprehensions more readable than maps and reduces (and they were also removed from python3). I think this works:

self.interval_lengths = [abs(x[1] - x[0]) for x in self.intervals] self.cum_interval_length = sum(self.interval_lengths) self.interval_weights = [x / self.cum_interval_length for x in self.interval_lengths]

and I also think you can keep interval_lengths and cum_interval_length as local variables.

Converted them to local variables

chriselion · 2019-07-08T17:12:53Z

ml-agents/mlagents/trainers/trainer_controller.py

+        If self.samplers is empty, then bool of it returns false, indicating
+        there is no sampler manager.
+        """
+        return not bool(self.sampler_manager.samplers)


How about making this a method or property on SamplerManager instead?

+1 on making check_empty_sampler_manager part of SamplerManager

Made the check part of the SamplerManager class

chriselion · 2019-07-08T17:14:48Z

ml-agents/mlagents/trainers/trainer_controller.py

-                if changed:
-                    self.trainers[brain_name].reward_buffer.clear()
-
+        if ( ((self.meta_curriculum)  and any(lessons_incremented.values()))


This is way too complicated. If you're actually checking what the comment says, then split it up into something like

lessons_were_incremented = ... ready_for_reset = ... if lessons_were_incremented or ready_for_reset: # do stuff

(also watch the modulo by zero)

Added a check to ensure that global_step isn't 0 to ensure modulo safety

chriselion · 2019-07-08T17:15:47Z

ml-agents/mlagents/trainers/trainer_controller.py

@@ -37,6 +39,8 @@ def __init__(
        external_brains: Dict[str, BrainParameters],
        training_seed: int,
        fast_simulation: bool,
+        sampler_manager,


type annotation here: sampler_manager: SamplerManager,

Added type annotation

chriselion · 2019-07-08T17:17:06Z

ml-agents/mlagents/trainers/learn.py

@@ -73,6 +76,21 @@ def run_training(sub_id: int, run_seed: int, run_options, process_queue):
            docker_target_name=docker_target_name
        )

+    sampler = None


nit: I think sampler_config is a better name for this. sampler implies it's an instance of a Sampler

chriselion · 2019-07-08T17:18:34Z

ml-agents-envs/mlagents/envs/sampler_class.py

+
+    def sample_all(self):
+        res = {}
+        if self.samplers == {}:


I'm not sure this is what you want here; it's definitely more pythonic to do if not self.samplers: But even then, you don't need this if, since doing a for loop will iterate over 0 items.

chriselion · 2019-07-08T17:19:16Z

ml-agents-envs/mlagents/envs/sampler_class.py

+        if self.samplers == {}:
+            pass
+        else:
+            for param_name, param_sampler in list(self.samplers.items()):


Don't need the list() here (only if we were trying for some sort of python2+3 compatibility).

ml-agents/mlagents/trainers/trainer_controller.py

ervteng · 2019-07-08T17:43:45Z

ml-agents/mlagents/trainers/trainer_controller.py

@@ -203,20 +215,30 @@ def _create_model_path(model_path):
                "permissions are set correctly.".format(model_path)
            )

+    @staticmethod
+    def _check_reset_params(reset_params, new_config):


Just checking, where is this method called?

Code trailing from use of lesson_controller; not needed anymore so removed from next update.

ervteng · 2019-07-08T17:48:40Z

ml-agents-envs/mlagents/envs/sampler_class.py

+
+from .exception import SamplerException
+
+class SamplerException(Exception):


BTW this should inherit from UnityException as with all of the other exception classes

Instance of dead code I was using for testing; removed in the next update

vincentpierre · 2019-07-23T17:44:31Z