Documentation 0.5 Release Check List (Part 1) #1154
Conversation
  This tutorial walks through the process of creating a Unity Environment. A Unity
  Environment is an application built using the Unity Engine which can be used to
- train Reinforcement Learning agents.
+ train Reinforcement Learning Agents.
Should be lowercased.
  steps:

- 1. Create an environment for your agents to live in. An environment can range
+ 1. Create an environment for your Agents to live in. An environment can range
Should be lowercased.
  The Agent sends the information we collect to the Brain, which uses it to make a
- decision. When you train the agent (or use a trained model), the data is fed
+ decision. When you train the Agent (or use a trained model), the data is fed
  into a neural network as a feature vector. For an agent to successfully learn a
Should be lowercased.
- * Position of the agent itself within the confines of the floor. This data is
+ * Position of the Agent itself within the confines of the floor. This data is
    collected as the agent's distance from each edge of the floor.
Should be lowercased.
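
(For context, this hunk and the previous one describe the RollerAgent tutorial's observation collection. A minimal sketch, assuming the 0.5-era C# API; the field names and the normalization are illustrative assumptions, not code quoted from the tutorial.)

```csharp
using UnityEngine;
using MLAgents; // namespace assumed for ML-Agents 0.4+; omit on older versions

// Hypothetical Agent subclass patterned on the RollerAgent tutorial.
public class RollerAgent : Agent
{
    public Transform target;         // assumed: the object the Agent must reach
    public float floorHalfSize = 5f; // assumed: half-extent of the square floor

    public override void CollectObservations()
    {
        // Position within the confines of the floor, encoded as the normalized
        // distance from each edge, as the passage describes.
        AddVectorObs((transform.position.x + floorHalfSize) / (2f * floorHalfSize));
        AddVectorObs((transform.position.z + floorHalfSize) / (2f * floorHalfSize));
        // Offset to the target; each AddVectorObs call appends to the single
        // feature vector the Brain feeds into the neural network.
        AddVectorObs(target.position - transform.position);
    }
}
```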
  the task. For example, the RollerAgent reward system provides a small reward if
- the agent moves closer to the target in a step and a small negative reward at
+ the Agent moves closer to the target in a step and a small negative reward at
  each step which encourages the agent to complete its task quickly.
Should be lowercased.
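
(The reward logic this hunk refers to can be sketched as follows, assuming the 0.5-era `AgentAction(float[], string)` signature. The distance threshold and reward magnitudes are recalled from that tutorial and may differ in your copy.)

```csharp
using UnityEngine;
using MLAgents; // namespace assumed for ML-Agents 0.4+; omit on older versions

public class RollerAgent : Agent
{
    public Transform target;                 // assumed target object
    float previousDistance = float.MaxValue; // distance at the previous step

    public override void AgentAction(float[] vectorAction, string textAction)
    {
        // (movement code omitted)
        float distanceToTarget = Vector3.Distance(transform.position, target.position);

        // Reached the target: full reward and end of episode.
        if (distanceToTarget < 1.42f)
        {
            SetReward(1.0f);
            Done();
        }

        // Small reward for moving closer to the target this step...
        if (distanceToTarget < previousDistance)
        {
            AddReward(0.1f);
        }

        // ...and a small negative reward every step, which encourages the
        // Agent to complete its task quickly.
        AddReward(-0.05f);
        previousDistance = distanceToTarget;
    }
}
```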
- Heuristics or Internal brains game sessions. You can then use this data to train
- an agent in a supervised context.
+ Heuristics or Internal Brains game sessions. You can then use this data to train
+ an Agent in a supervised context.
Should be lowercased.
  that you can use with the Internal Brain type.

- A __model__ is a mathematical relationship mapping an agent's observations to
+ A __model__ is a mathematical relationship mapping an Agent's observations to
Should be lowercased.
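
(Stated formally, as one possible reading of "mathematical relationship", not notation used in the docs: the exported model is a fixed function

    f_\theta : \mathcal{O} \to \mathcal{A}, \qquad a_t = f_\theta(o_t)

where \mathcal{O} is the observation space, \mathcal{A} the action space, and \theta the network weights frozen when the model is exported.)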
docs/Learning-Environment-Design.md (outdated)
  Reinforcement learning is an artificial intelligence technique that trains
  _agents_ to perform tasks by rewarding desirable behavior. During reinforcement
- learning, an agent explores its environment, observes the state of things, and,
+ learning, an Agent explores its environment, observes the state of things, and,
Should be lowercased.
docs/Learning-Environment-Design.md (outdated)
- state, the agent receives a positive reward. If it leads to a less desirable
+ state, the Agent receives a positive reward. If it leads to a less desirable
  state, then the agent receives no reward or a negative reward (punishment). As
  the agent learns during training, it optimizes its decision making so that it
Should be lowercased.
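
(Concretely, "maximum reward over time" is the standard reinforcement-learning objective, which the docs do not spell out: the Agent's policy is trained to maximize the expected discounted return

    G_t = \sum_{k=0}^{\infty} \gamma^k \, r_{t+k+1}, \qquad 0 \le \gamma < 1

where r_t is the reward received at step t and the discount factor \gamma trades off immediate against future reward.)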
docs/Learning-Environment-Design.md (outdated)
  [Proximal Policy Optimization (PPO)](https://blog.openai.com/openai-baselines-ppo/).
- PPO uses a neural network to approximate the ideal function that maps an agent's
+ PPO uses a neural network to approximate the ideal function that maps an Agent's
  observations to the best action an agent can take in a given state. The
Should be lowercased.
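
(For readers who want more than the black-box view recommended later in this file: the network mentioned here is trained against PPO's clipped surrogate objective, quoted from the PPO paper rather than from the docs under review,

    L^{\mathrm{CLIP}}(\theta) = \hat{\mathbb{E}}_t\Big[\min\big(r_t(\theta)\,\hat{A}_t,\ \operatorname{clip}(r_t(\theta),\, 1-\epsilon,\, 1+\epsilon)\,\hat{A}_t\big)\Big], \qquad r_t(\theta) = \frac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_{\mathrm{old}}}(a_t \mid s_t)}

where \hat{A}_t is the advantage estimate and \epsilon is the clipping range, exposed as the epsilon hyperparameter in the trainer configuration.)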
docs/Learning-Environment-Design.md (outdated)
  **Note:** if you aren't studying machine and reinforcement learning as a subject
- and just want to train agents to accomplish tasks, you can treat PPO training as
+ and just want to train Agents to accomplish tasks, you can treat PPO training as
Should be lowercased.
docs/Learning-Environment-Design.md (outdated)
  a _black box_. There are a few training-related parameters to adjust inside
  Unity as well as on the Python training side, but you do not need in-depth
- knowledge of the algorithm itself to successfully create and train agents.
+ knowledge of the algorithm itself to successfully create and train Agents.
Should be lowercased.
  class. The Academy works with Agent and Brain objects in the scene to step
  through the simulation. When either the Academy has reached its maximum number
- of steps or all agents in the scene are _done_, one training episode is
+ of steps or all Agents in the scene are _done_, one training episode is
Should be lowercased.
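
(As a reminder of the API this passage refers to: an Academy is a concrete subclass in the scene, and the stepping behavior comes from optional overrides. A minimal sketch, assuming the 0.5-era virtual methods `AcademyReset`/`AcademyStep`.)

```csharp
using MLAgents; // namespace assumed for ML-Agents 0.4+; omit on older versions

// Minimal Academy subclass; both overrides are optional hooks invoked while
// the Academy steps the simulation.
public class RollerAcademy : Academy
{
    public override void AcademyReset()
    {
        // Runs at the start of each training episode, i.e. after the maximum
        // step count is reached or all Agents in the scene are done.
    }

    public override void AcademyStep()
    {
        // Runs every simulation step, before the Agents act.
    }
}
```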
docs/Learning-Environment-Design.md (outdated)
  An _environment_ in the ML-Agents toolkit can be any scene built in Unity. The
- Unity scene provides the environment in which agents observe, act, and learn.
+ Unity scene provides the environment in which Agents observe, act, and learn.
Should be lowercased.
  * You can put your executable on a remote machine for faster training.
  * You can use `Headless` mode for faster training.
- * You can keep using the Unity Editor for other tasks while the agents are
+ * You can keep using the Unity Editor for other tasks while the Agents are
Should be lowercased.
docs/ML-Agents-Overview.md (outdated)
  - **Observations** - what the medic perceives about the environment.
    Observations can be numeric and/or visual. Numeric observations measure
-   attributes of the environment from the point of view of the agent. For our
+   attributes of the environment from the point of view of the Agent. For our
Should be lowercased.
docs/ML-Agents-Overview.md (outdated)
  - Single-Agent. A single Agent linked to a single Brain, with its own reward
-   signal. The traditional way of training an agent. An example is any
+   signal. The traditional way of training an Agent. An example is any
Should be lowercased.
  - **Monitoring Agent’s Decision Making** - Since communication in ML-Agents is a
-   two-way street, we provide an agent Monitor class in Unity which can display
+   two-way street, we provide an Agent Monitor class in Unity which can display
    aspects of the trained agent, such as the agents perception on how well it is
Keep as it is. All others in this file should be lowercased.
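
(For reference, the Monitor class mentioned here is used roughly as follows; the exact `Monitor.Log` overloads are assumed from the 0.5-era source and worth double-checking against Monitor.cs in your version.)

```csharp
using UnityEngine;
using MLAgents; // namespace assumed for ML-Agents 0.4+; omit on older versions

// Hypothetical helper that displays a value above an Agent in the Game view.
public class RewardDisplay : MonoBehaviour
{
    public Agent agent; // the Agent whose decision making we want to watch

    void Update()
    {
        // Draw the Agent's cumulative reward next to its Transform; the
        // overload Log(string, float, Transform) is assumed, not confirmed.
        Monitor.Log("Reward", agent.GetCumulativeReward(), transform);
    }
}
```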
docs/Migrating.md (outdated)
  packages, `mlagents.env` and `mlagents.trainers`. `mlagents.env` can be used
  to interact directly with a Unity environment, while `mlagents.trainers`
- contains the classes for training agents.
+ contains the classes for training Agents.
Should be lowercased.
docs/Training-ML-Agents.md (outdated)
  The ML-Agents toolkit conducts training using an external Python training
  process. During training, this external process communicates with the Academy
- object in the Unity scene to generate a block of agent experiences. These
+ object in the Unity scene to generate a block of Agent experiences. These
Should be lowercased.
All in this file should be lowercased.
docs/Training-PPO.md (outdated)
  [Proximal Policy Optimization (PPO)](https://blog.openai.com/openai-baselines-ppo/).
- PPO uses a neural network to approximate the ideal function that maps an agent's
+ PPO uses a neural network to approximate the ideal function that maps an Agent's
  observations to the best action an agent can take in a given state. The
Should be lowercased.
@awjuliani hopefully the last round of capitalizations :) Let me know if any last-minute changes are needed.
Looks good @unityjeffrey! Thanks for making all these changes.
Only the checked items. There are some structure and flow issues to be resolved based on the directory changes. I will tie off with @dericp tomorrow.
Wanted to get these into review before I start the bigger changes.
Please see the GitHub Documentation Pre-Release Checklist for 0.5 for what has been addressed.