Imitation Learning using ML-Agents (Under Development)

This project is based on the Imitation learning which uses the interaction between Teacher and Student rather than depending on reward/punishment mechanism accommodated by the Reinforcement learning.

I am using the Hover Racer assets developed by Unity for the application of Imitation Learning using ML-Agents toolkit.

Objective - To automate the vehicle's movement such that it imitates the behavior of the player.

This project is under development.

Work

The student agent needs to understand Observations and Actions in order to be trained properly.
The vehicle sends out a series of raycasts to detect the nearby objects around. The five raycasts are used around the agent which returns the value of the distance from the object it overlaps with.
The value of the distance is passed if it hits any object otherwise -1 value is passed if it hits nothing.

    Debug.DrawRay(transform.position, this.transform.forward * visibleDistance, Color.red);
    Debug.DrawRay(transform.position, this.transform.right * visibleDistance, Color.red);
    Debug.DrawRay(transform.position, -this.transform.right * visibleDistance, Color.red);
    Debug.DrawRay(transform.position, Quaternion.AngleAxis(45, Vector3.down) * this.transform.right * visibleDistance, Color.red);
    Debug.DrawRay(transform.position, Quaternion.AngleAxis(45, Vector3.up) * -this.transform.right * visibleDistance, Color.red);
    
    RaycastHit hit;

    //forward 
    if (Physics.Raycast(transform.position, this.transform.forward, out hit, visibleDistance))
    {
        fDist = hit.distance / visibleDistance;
    }
    else { fDist = -1f; }
    //right
    if (Physics.Raycast(transform.position, this.transform.right, out hit, visibleDistance))
    {
        rDist = hit.distance / visibleDistance;
    }
    else { rDist = -1f; }
    //left
    if (Physics.Raycast(transform.position, -this.transform.right, out hit, visibleDistance))
    {
        lDist = hit.distance / visibleDistance;
    }
    else { lDist = -1f; }

    //right 45
    if (Physics.Raycast(transform.position, Quaternion.AngleAxis(45, Vector3.down) * this.transform.right, out hit,visibleDistance))
    {
        r45Dist = hit.distance / visibleDistance;
    }
    else { r45Dist = -1f; }

    //left 45
    if (Physics.Raycast(transform.position, Quaternion.AngleAxis(45, Vector3.up) * -this.transform.right, out hit, visibleDistance))
    {
        l45Dist = hit.distance / visibleDistance;
    }
    else { l45Dist = -1f; }

The CollectObservations() method for the agent is responsible for adding information of the nearby surrounding objects to avoid it.

public override void CollectObservations()
  {
 
    if (fDist != -1f)
    {
       AddVectorObs(fDist);
       AddVectorObs(1f);
    }
    else
    {
       AddVectorObs(1f);
       AddVectorObs(0f);
    }
  
    Vector3 localVelocity = transform.InverseTransformVector(rigidBody.velocity);
    Vector3 localAngularVelocity = transform.InverseTransformVector(rigidBody.angularVelocity);
    AddVectorObs(localVelocity.x);
    AddVectorObs(localVelocity.y);
    AddVectorObs(localVelocity.z);
    AddVectorObs(localAngularVelocity.y);

    print(localVelocity.x);
 }

Further this information is sent to the Brain as it needs to know how many observations has been collected.

The AgentAction() method is responsible for performing actions during both training and testing mode.

public override void AgentAction(float[] act, string txt)
 {
   float propulsion = driveForce * input.thruster - drag * Mathf.Clamp(speed, 0f, terminalVelocity);
   rigidBody.AddForce(transform.forward * propulsion, ForceMode.Acceleration);
   input.rudder = Mathf.Clamp(act[0], -1, 1);
   AddReward(.1f);
}

The observations during testing mode is saved in a .demo file which is further used for the training of the agent.

Training Process -

Tools used -

Unity - Game Engine used
Visual Studio - IDE used for scripting
ML-Agents tookit - Machine Learning Applications

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.vs/UnitySDK/v15		.vs/UnitySDK/v15
Assets		Assets
Library		Library
ProjectSettings		ProjectSettings
UnityPackageManager		UnityPackageManager
MacBLAS.csproj		MacBLAS.csproj
README.md		README.md
UnitySDK.Editor.Plugins.csproj		UnitySDK.Editor.Plugins.csproj
UnitySDK.Editor.csproj		UnitySDK.Editor.csproj
UnitySDK.Plugins.csproj		UnitySDK.Plugins.csproj
UnitySDK.csproj		UnitySDK.csproj
UnitySDK.sln		UnitySDK.sln
iOSBLAS.csproj		iOSBLAS.csproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.vs/UnitySDK/v15

.vs/UnitySDK/v15

Assets

Assets

Library

Library

ProjectSettings

ProjectSettings

UnityPackageManager

UnityPackageManager

MacBLAS.csproj

MacBLAS.csproj

README.md

README.md

UnitySDK.Editor.Plugins.csproj

UnitySDK.Editor.Plugins.csproj

UnitySDK.Editor.csproj

UnitySDK.Editor.csproj

UnitySDK.Plugins.csproj

UnitySDK.Plugins.csproj

UnitySDK.csproj

UnitySDK.csproj

UnitySDK.sln

UnitySDK.sln

iOSBLAS.csproj

iOSBLAS.csproj

Repository files navigation

Imitation Learning using ML-Agents (Under Development)

Work

Tools used -

About

Releases

Packages

Languages

pkunjam/Imitation-Learning-using-mlagents

Folders and files

Latest commit

History

Repository files navigation

Imitation Learning using ML-Agents (Under Development)

Work

Tools used -

About

Resources

Stars

Watchers

Forks

Languages