[refactor] Make PyTorch the default and TensorFlow optional #4517

ervteng · 2020-09-26T00:01:16Z

Proposed change(s)

This PR does a couple things:

Makes TensorFlow optional (i.e., code can be run without TF installed) and PyTorch required
Adds PyTorch to the setup.py, throwing an error in Windows telling the user to install PyTorch from the website (see:Pytorchvision 0.5.0 not published to pypi or Conda for Windows pytorch/vision#1774).
Reports PyTorch version in TrainingStatus and Timers. For now, TrainingStatus reports both TF and PyTorch versions, with -1 set if TF is not installed.
Switch TensorboardWriter to use torch.utils.tensorboard
Install documentation
Modify Yamato test to look for ONNX files only

Furthermore, we need to verify Barracuda compatibility and package versioning before merging

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

Requires ML-Agents Cloud PR after merging

* Adding correst setup commands for verifying torch is installed * Editing the test_requirments to add tf and remove torch

vincentpierre · 2020-10-01T01:12:18Z

~~Leaving this here for posterity: This will not raise an error correctly on windows when installing via pip. That is because the cmdclass will NOT work when installing from whl~~

Edit : Removed cmdclass, the user will see an error when trying to use mlagents-learn without having torch installed. cannot directly install torch on windows via pip.

* Torch not imported error to raise at first usage * Torch not imported error to raise at first usage

* Convert stats writer to use PyTorch TB support * Use common function to print params * Update test * Bump tensorboard to 1.15 to fix the tests * putting tensorboard 1.15.0 as min version requirement Co-authored-by: vincentpierre <vincentpierre@unity3d.com>

) * Initial commit * Forgotten doc * Removing the `Installation-Anaconda-Windows.md` as it is deprecated * Readding the depreacted Installation-Anaconda-Windows.md but leaving it unchanged * more references to tensorflow removed * Update README.md Co-authored-by: Ervin T. <ervin@unity3d.com> * Change references to .nn to .onnx in docs (#4583) Co-authored-by: Ervin T. <ervin@unity3d.com>

* Add --tensorflow option * Switch framework to Pytorch default * Update changelog * Re-add --torch * Edit warning

…ml-agents into develop-torchdefault

docs/Unity-Inference-Engine.md

ml-agents/mlagents/trainers/ppo/trainer.py

ml-agents/mlagents/trainers/sac/trainer.py

chriselion · 2020-10-20T18:28:30Z

ml-agents/mlagents/trainers/trainer/rl_trainer.py

                    self.stats_reporter.add_stat(
-                        optimizer.reward_signals[name].stat_name,
+                        f"Policy/{optimizer.reward_signals[name].name.capitalize()} Reward",


Move this to a field/property on BaseRewardProvider?

I think that would be cleaner. @vincentpierre what do you think about this? (probably a separate PR).

dongruoping · 2020-10-20T18:28:40Z

ml-agents/mlagents/trainers/trainer/trainer_factory.py

@@ -27,6 +27,7 @@ def __init__(
        init_path: str = None,
        multi_gpu: bool = False,
        force_torch: bool = False,
+        force_tensorflow: bool = False,


are force_torch and force_tensorflow both needed?

We left force_torch in case users were still using --torch in places (discussion here: #4582)

It's behavior is pretty subtle, it will only take effect if the user specifies framework: tensorflow in the YAML and then adds --torch.

ml-agents/mlagents/trainers/training_status.py

chriselion · 2020-10-20T18:31:10Z

test_constraints_min_version.txt

@@ -5,3 +5,4 @@ Pillow==4.2.1
 protobuf==3.6
 tensorflow==1.14.0
 h5py==2.9.0
+tensorboard==1.15.0


Do we need this here? should be handled by setup.py.

If we do not add it here, tensorboard version 1.14.0 is installed (because tensorflow 1.14.0 is installed here) and we do not support tensorboard 1.14.0 with torch

dongruoping · 2020-10-20T18:32:33Z

docs/Installation.md

@@ -128,6 +123,7 @@ To install the `mlagents` Python package, activate your virtual environment and
 run from the command line:

 ```sh
+pip3 install torch -f https://download.pytorch.org/whl/torch_stable.html


should we mention this is only for windows?

It doesn't hurt on the other OS's, but I agree it's clearer if we separate windows. Updated the instructions - let me know what you think

dongruoping · 2020-10-20T18:37:23Z

Since PyTorch is default now, we should also update all .nn model files to .onnx files for the example scenes (or in separate PR?)

chriselion · 2020-10-20T22:31:48Z

Since PyTorch is default now, we should also update all .nn model files to .onnx files for the example scenes (or in separate PR?)

@dongruoping - I have that logged as a followup in https://jira.unity3d.com/browse/MLA-1435

chriselion · 2020-10-20T22:32:14Z

@ervteng is there a separate PR to update the CI configs to use --tensorflow?

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

…ml-agents into develop-torchdefault

ervteng · 2020-10-21T18:01:00Z

@ervteng is there a separate PR to update the CI configs to use --tensorflow?

Yep - https://github.com/Unity-Technologies/ml-agents-cloud-internal/pull/223. It only affects the old CI, as the new CI doesn't use --torch but instead overrides the YAML/RunOptions and won't require changes.

Ervin Teng and others added 7 commits September 25, 2020 11:13

Torch setup.py

44df681

Set torch to default

9df44fd

Make torch default in setup.py

19a88ee

Remove indents

f5761f6

Remove other instances of TF being used

c0d9b81

Add tensorboard to setup.py

978c52f

Adding correst setup commands for verifying torch is installed (#4524)

c7303f0

* Adding correst setup commands for verifying torch is installed * Editing the test_requirments to add tf and remove torch

Develop torchdefault raise outside setup (#4530)

86faff2

* Torch not imported error to raise at first usage * Torch not imported error to raise at first usage

ervteng marked this pull request as ready for review October 19, 2020 18:25

Ervin Teng and others added 8 commits October 19, 2020 11:38

Merge branch 'master' into develop-torchdefault

7419374

[refactor] Add --tensorflow, enable Torch as default setting (#4582)

03f7e79

* Add --tensorflow option * Switch framework to Pytorch default * Update changelog * Re-add --torch * Edit warning

Modify Yamato tests (#4584)

ad958ca

Don't check for PB file in Yamato inference

78bd740

Merge branch 'develop-torchdefault' of github.com:Unity-Technologies/…

0e038ec

…ml-agents into develop-torchdefault

Only run inference on ONNX

5ab0ca0

ervteng requested review from vincentpierre, chriselion and dongruoping October 20, 2020 18:10