Unity ML-Agents Training Project

A Unity project for training reinforcement learning agents using ML-Agents toolkit with custom network support.

Setup

Clone ML-Agents Ocramaru Fork

git clone https://github.com/Ocramaru/unity-ml-agents.git ml-agents

Install Dependencies with uv

Install uv:

curl -LsSf https://astral.sh/uv/install.sh | sh

Sync dependencies with GPU support (CUDA 12.8):

uv sync --extra cu128

Or sync with CPU-only PyTorch:

uv sync --extra cpu

Project Structure

.
├── Assets/
│   ├── DodgingAgent/             # Main project directory
│   │   ├── config/               # Training configurations
│   │   ├── Scripts/              # C# scripts
│   │   │   ├── Agents/           # ML-Agent implementations
│   │   │   ├── Core/             # Core game logic
│   │   │   ├── Sensors/          # Custom sensor implementations
│   │   │   ├── Weapons/          # Weapon systems
│   │   │   ├── Utilities/        # Helper utilities
│   │   │   └── Editor/           # Unity Editor extensions
│   │   ├── Prefabs/              # Unity prefabs
│   │   ├── Models/               # Trained ONNX models
│   │   ├── Resources/            # Unity resources
│   │   └── results/              # Training results and checkpoints
│   └── ThirdParty/               # Third-party assets
│
├── Custom/                       # Custom Python networks
│   ├── networks.py               # CustomActor, CustomActorCritic, CustomCritic
│   └── models.py                 # Model components (vae, cnn, etc.)
│
├── ml-agents/                    # ML-Agents toolkit fork
└── builds/                       # Unity executable builds for training

Makefile Commands

Training with executable builds

make train MODEL=<build_name> RUN=<run_id>

Optional parameters:

NUM_ENVS - Number of parallel environments (default: 64)
NUM_AREAS - Number of areas per environment (default: 16)
ARGS - Additional mlagents-learn arguments

Example:

make train MODEL=linux_drone RUN=experiment_1 NUM_ENVS=32

Training with custom networks

make custom_train MODEL=<build_name> RUN=<run_id>

This sets PYTHONPATH to include Custom/ for custom network injection.

TensorBoard Dashboard

make dashboard

Opens TensorBoard at http://localhost:6006

Features

Custom drone agent
Bezier curve movement system
Weapon orchestration system

Credits

This project uses third-party assets and code:

Assets

Katana Model: "katana" by Rubixboy101 - CC BY 4.0
Warehouse FBX Model: "Warehouse FBX Model Free" by Nicholas-3D - CC BY 4.0

Code Libraries

ml-drone-collection: by MBaske - MIT License
ZigguratGaussian: Adapted from Redzen by Colin D. Green - MIT License
IMU Noise Model: Based on Kalibr by ETH Zurich ASL

See Assets/ThirdParty/CREDITS.md for full attribution details.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
Assets		Assets
Custom		Custom
Packages		Packages
ProjectSettings		ProjectSettings
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unity ML-Agents Training Project

Setup

Clone ML-Agents Ocramaru Fork

Install Dependencies with uv

Project Structure

Makefile Commands

Training with executable builds

Training with custom networks

TensorBoard Dashboard

Features

Credits

Assets

Code Libraries

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Unity ML-Agents Training Project

Setup

Clone ML-Agents Ocramaru Fork

Install Dependencies with uv

Project Structure

Makefile Commands

Training with executable builds

Training with custom networks

TensorBoard Dashboard

Features

Credits

Assets

Code Libraries

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages