Unity ML-Agents Toolkit (Enhanced Fork)

Enhanced fork of Unity ML-Agents Toolkit with performance optimizations, security fixes, and Python 3.12 support.

Based on: Release 23 / Unity Package 4.0.0

Improvements

Performance

GPU Training:

TorchScript compilation (2.50x faster inference)
Automatic Mixed Precision (AMP) support
Fused optimizers
Verified 1.89% profiling overhead

Unity Inference:

Pre-allocated collections (512 capacity)
Array-based batch storage
Tensor reference caching
Object pooling

Profiling:

Profiler.BeginSample markers throughout pipeline
Benchmark tools included

New Performance Features (2026-01):

Shared Memory Manager (5-10x training speedup)
GPU Processing Utilities (2-3x with batching)
Model Quantization (4x size reduction, 2-4x inference speedup)
Decision Transformer (offline RL from logged data)
Async Batching (10x throughput, <10ms latency)

Python 3.12 Support

Updated deprecated APIs (pkg_resources, distutils.version)
Fixed pytest hooks for pytest-xdist
Black formatting applied

Supported: 3.10.1 - 3.12.8

Unity 6

All examples migrated to new Input System
Auto time scale controller (press 1-9 during training)
Fixed package manifest
Tested with Unity 6000.0.40f1

Security

Fixed MD5 hash usage (usedforsecurity=False)
Secured file permissions (0o700)
URL scheme validation
Safe PyTorch load (weights_only=True)
HuggingFace revision pinning
Bandit scan: 0 critical vulnerabilities

Code Quality

Fixed 47 silent failures
Removed debug statements
Extracted magic numbers
Thread-safe global state

Tools:

upgrade_config.py - config migration
optimizer_utils.py - shared optimizer helpers
Benchmark and doctor CLI tools

Tests:

120+ tests passing
75%+ coverage

Bug Fixes

CPUTensorData resource leak
Unused variable warnings
pytest hook compatibility
Thread safety in compilation stats

Quick Start

Prerequisites

Python: 3.10.1 to 3.12.8 (3.12 fully supported)
Unity: 6000.0 or later
OS: Windows, macOS, or Linux

Installation

git clone https://github.com/quanticsoul4772/ml-agents.git
cd ml-agents

# One-command setup
./setup-dev.sh  # or setup-dev.ps1 on Windows

# Or manual
python -m venv venv
source venv/bin/activate
pip install -e ./ml-agents-envs
pip install -e ./ml-agents

Training

# Standard training (now includes TorchScript + GPU processing automatically)
mlagents-learn config/ppo/3DBall.yaml --run-id=3DBall_01

# With built executable for 4 parallel environments (faster)
mlagents-learn config/ppo/3DBall.yaml --run-id=3DBall_Fast --num-envs=4 --env=builds/3DBall.exe

# With MaxGPU optimizations (AMP + fused optimizers)
mlagents-learn config/ppo/3DBall_MaxGPU.yaml --run-id=3DBall_GPU --num-envs=4 --env=builds/3DBall.exe

# Monitor
tensorboard --logdir=results

What's automatically enabled:

TorchScript compilation (2.5x speedup) - enabled in standard configs
GPU observation processing - auto-enabled on CUDA GPUs
Multi-environment parallelization - when using built executables

Using New Features

Model Quantization (after training):

# Compress model to 4x smaller, 2-4x faster inference
python -m mlagents.trainers.optimization.quantize \
    results/3DBall_01/3DBall.pt \
    --output results/3DBall_01/3DBall_quantized.pt \
    --type int8

Decision Transformer (offline RL):

# Train from recorded demonstrations or logged data
# See claudedocs/new-features-guide.md for full examples
from mlagents.trainers.dt import DecisionTransformerTrainer

Async Batching (production deployment):

# Deploy trained models with low-latency batched inference
from mlagents.trainers.inference import AsyncBatchInference

**Advanced Features

Quantize model (4x smaller, 2-4x faster)

python -m mlagents.trainers.optimization.quantize results/Walker/policy.pt --type int8

Use shared memory manager (5-10x faster) - enable in trainer config

Use GPU processing - enable in trainer config

Use async batching - for production deployment


See [claudedocs/new-features-guide.md](./claudedocs/new-features-guide.md) for detailed usage.

---

## Syncing with Upstream

This fork receives updates from Unity's upstream repository:

```bash
# Fetch upstream changes
git fetch upstream

# Merge into your branch
git checkout main
git merge upstream/develop

# Push to this fork only
git push origin main

Project Structure

ml-agents/
├── ml-agents/                    # Python training package
│   └── mlagents/
│       ├── trainers/             # Training algorithms (PPO, SAC, MA-POCA)
│       │   ├── optimizer/        # Optimizer utilities (new)
│       │   └── upgrade_config.py # Config migration tool (new)
│       └── torch_utils/          # PyTorch utilities with AMP support
├── ml-agents-envs/               # Python environment interface
│   └── mlagents_envs/
│       └── registry/             # Binary download utilities (security hardened)
├── com.unity.ml-agents/          # Unity C# package (optimized)
│   └── Runtime/
│       ├── Inference/            # Model inference pipeline (optimized)
│       └── Scripts/              # Agent behaviors and sensors
├── Project/                      # Unity example project (Unity 6 compatible)
│   └── Assets/ML-Agents/
│       └── Examples/             # 17+ example environments
├── config/                       # Training configurations
│   └── ppo/                      # PPO configs including MaxGPU variants
├── scripts/                      # CLI tools (doctor, benchmark)
├── docs/                         # Technical documentation (2200+ lines)
└── test_requirements.txt         # Test dependencies

Documentation

Development Guides

AGENTS.md - Comprehensive development guide with build, test, and training commands
PROJECT-NOTES.md - Improvement notes and roadmap
claudedocs/new-features-guide.md - User guide for new performance features (shared memory, GPU processing, quantization, Decision Transformer, async batching)

Technical Notes

All technical debt remediation is complete. Key achievements documented in this README include security hardening, performance optimizations, and comprehensive testing.

Official Documentation

Unity Package Docs - Official ML-Agents documentation

Running Tests

# Install test dependencies
pip install -r test_requirements.txt

# Run all tests (excluding slow integration tests)
pytest --cov=ml-agents --cov=ml-agents-envs -m "not slow"

# Run tests in parallel (8 workers)
pytest --cov=ml-agents --cov=ml-agents-envs -m "not slow" -n 8

# Run slow integration tests
pytest -m "slow"

# Run with coverage report
pytest --cov=ml-agents --cov=ml-agents-envs --cov-report=html -m "not slow"

# Unity C# tests (run from Unity Editor)
# Window → General → Test Runner → EditMode/PlayMode

Quick Verification

# Run a quick training test (30 seconds)
mlagents-learn config/ppo/3DBall.yaml --run-id=test --max-steps=1000

# Run Python test suite
pytest ml-agents-envs/tests/ -v -x --tb=short

# Check code quality
pre-commit run --all-files

Test Coverage

Phase 3 Tests: 120/120 passing (100%)
Overall Coverage: 75%+ (enforced in CI)
Zero Regressions: All improvements maintain backward compatibility

Key Differences from Upstream

Python 3.12 support
TorchScript optimization (2.50x speedup)
Security fixes (0 critical vulnerabilities)
Unity 6 Input System migration
Configuration migration tool
120+ tests, 75%+ coverage

Upstream Project

This fork is based on Unity ML-Agents Toolkit - an open-source project that enables games and simulations to serve as environments for training intelligent agents.

Upstream Features:

17+ example Unity environments
PPO, SAC, MA-POCA training algorithms
Imitation learning (BC and GAIL)
Curriculum learning
Multi-agent training
Gym and PettingZoo wrappers

For the full upstream documentation, see the Unity ML-Agents Documentation.

Community and Support

For help with ML-Agents (this fork or upstream):

Unity ML-Agents Discussions - Q&A and community support
Discord - Real-time chat
GitHub Issues - Bug reports (upstream)
ML-Agents Tutorials - Video tutorials by CodeMonkeyUnity
Hugging Face Course - Introduction to ML-Agents

Contributing

Contributions to this fork are welcome! Please:

Fork this repository
Create a feature branch (git checkout -b feature/amazing-feature)
Make your changes following the existing code style
Run tests: pytest --cov=ml-agents --cov=ml-agents-envs -m "not slow"
Run quality checks: pre-commit run --all-files
Commit your changes (git commit -m "Add amazing feature")
Push to the branch (git push origin feature/amazing-feature)
Submit a pull request

Development Setup

# Install development dependencies
pip install -r test_requirements.txt
pip install -r requirements-dev.txt

# Install pre-commit hooks
pre-commit install

# Run full test suite
pytest --cov=ml-agents --cov=ml-agents-envs

# Check code quality
pre-commit run --all-files

Code Quality Standards

Minimum test coverage: 60%
Black formatting (line length: 88)
Type hints for public APIs
Comprehensive documentation for new features
Security considerations documented

License

Apache License 2.0

Citation

If you use this fork or Unity ML-Agents in research, please cite:

@article{juliani2020,
  title={Unity: A general platform for intelligent agents},
  author={Juliani, Arthur and Berges, Vincent-Pierre and Teng, Ervin and Cohen, Andrew and Harper, Jonathan and Elion, Chris and Goy, Chris and Gao, Yuan and Henry, Hunter and Mattar, Marwan and Lange, Danny},
  journal={arXiv preprint arXiv:1809.02627},
  year={2020}
}

Acknowledgments

Unity Technologies - Original ML-Agents Toolkit
Community Contributors - Bug reports, feature requests, and improvements
OpenAI - PPO algorithm and training insights

Name		Name	Last commit message	Last commit date
Latest commit History 3,650 Commits
.devcontainer		.devcontainer
.factory/skills		.factory/skills
.github		.github
.yamato		.yamato
DevProject		DevProject
PerformanceProject		PerformanceProject
Project		Project
claudedocs		claudedocs
colab		colab
com.unity.ml-agents		com.unity.ml-agents
config		config
docs		docs
localized_docs		localized_docs
ml-agents-envs		ml-agents-envs
ml-agents-plugin-examples		ml-agents-plugin-examples
ml-agents-trainer-plugin		ml-agents-trainer-plugin
ml-agents		ml-agents
protobuf-definitions		protobuf-definitions
scripts		scripts
unity-volume		unity-volume
utils		utils
.editorconfig		.editorconfig
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.pre-commit-search-and-replace.yaml		.pre-commit-search-and-replace.yaml
.secrets.baseline		.secrets.baseline
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
Dockerfile		Dockerfile
LICENSE.md		LICENSE.md
PROJECT-NOTES.md		PROJECT-NOTES.md
Readme.md		Readme.md
SURVEY.md		SURVEY.md
Third Party Notices.md		Third Party Notices.md
catalog-info.yaml		catalog-info.yaml
codecov.yml		codecov.yml
colab_requirements.txt		colab_requirements.txt
conftest.py		conftest.py
markdown-link-check.fast.json		markdown-link-check.fast.json
markdown-link-check.full.json		markdown-link-check.full.json
mjx_benchmark.py		mjx_benchmark.py
mkdocs.yml		mkdocs.yml
pytest.ini		pytest.ini
requirements-dev.txt		requirements-dev.txt
requirements-lock.txt		requirements-lock.txt
setup-dev.ps1		setup-dev.ps1
setup-dev.sh		setup-dev.sh
setup.cfg		setup.cfg
test_constraints_version.txt		test_constraints_version.txt
test_failures.txt		test_failures.txt
test_requirements.txt		test_requirements.txt
wsl-setup.sh		wsl-setup.sh

License

quanticsoul4772/ml-agents

Folders and files

Latest commit

History

Repository files navigation

Unity ML-Agents Toolkit (Enhanced Fork)

Improvements

Performance

Python 3.12 Support

Unity 6

Security

Code Quality

Bug Fixes

Quick Start

Prerequisites

Installation

Training

Using New Features

Quantize model (4x smaller, 2-4x faster)

Use shared memory manager (5-10x faster) - enable in trainer config

Use GPU processing - enable in trainer config

Use async batching - for production deployment

Project Structure

Documentation

Development Guides

Technical Notes

Official Documentation

Running Tests

Quick Verification

Test Coverage

Key Differences from Upstream

Upstream Project

Community and Support

Contributing

Development Setup

Code Quality Standards

License

Citation

Acknowledgments

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 160

Uh oh!

Languages

Packages