About

Development

Author: Dylan Zelkin
Employer: University of Colorado, Denver
Supervisor: Mazen Al Borno
Lab: http://cse.ucdenver.edu/~alborno/

Description

This is an imitation learning project which uses reinforcmenent learning to train deep neural networks to control biomechanical and torque driven models by minimizing the difference between a desired kinematic motion and the actual motion enacted by a network. In this implementation of imitation learning, the network takes, as input, the joint angles and velocities, and outputs the muscle activations or torque activations respectively; there is no kinematic information given to the network, and each network learns a single unique motion.

The DRL used here is StableBaselines, and when training is completed, agents are saved in the ./agents/ folder and contain the following: training logs, the config file used when the model was created, and a zip managed by StableBaselines. Currently the only supported model architecture is a shared LSTM backbone, split off into dense layered reward and action heads.

This project was created and tested on linux (specifically ubuntu), and while it might work on other systems, is not guarenteed.

Examples

Torque Driven Solution

Muscle Driven Solution

Setup

Miniconda Installation (if not done so already)

Download Miniconda

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh

Run the Installer
```
bash Miniconda3-latest-Linux-x86_64.sh
```

Repo Setup

Install Git (if not done so already)

sudo apt update && sudo apt install -y git

Clone Repo from Github and Open

git clone git clone https://github.com/Al-Borno-Lab/MouseArmImitationLearning.git
cd MouseArmImitationLearning

Create Python Environment and Activate

conda env create -f environment.yml
conda activate MouseArmImitationLearningEnv

(Optional) Install Tensorboard for Numerical Results Visualization
```
pip install tensorboard
```

Huggingface Installations

Download Huggingface Hub (if not done so already)
```
pip install -U huggingface_hub
```

Download Mujoco Model and Kinematic Data

hf download AlBornoLab/MouseArmModel --repo-type dataset --local-dir ./models
hf download AlBornoLab/MouseArmKinematics --repo-type dataset --local-dir ./data

(OPTIONAL) Download Pre-Trained Torque and Muscle Models

hf download AlBornoLab/MouseArmSampleMuscleAgent --repo-type model --local-dir ./agents/muscle_model
hf download AlBornoLab/MouseArmSampleTorqueAgent --repo-type model --local-dir ./agents/torque_model

How to Use

Configuration Parameters

This section details which parameters can be tuned from the imitation learning environment, policy, algorithm, and training and testing scripts.

General
- name: Name of the model (if there is no folder under ./agents/... with that name, then the train script will create one instead of continuing training; the test script will fail; if an existing model is used, all config data is pulled from it's relevant config file instead)
Environment
- model: Mujoco model file to use
- kinematics: Kinematic data file to use
- w_bone_diff: A weight on the average difference between tracked bone locations in the reward function
- w_elbow: A weight on the elbow in the bone average difference
- w_paw: A weight on the paw in the bone average difference
- w_effort: A weight on the effort used by all actuaturos in the reward function
- w_jitter: A weight on the difference between qvel on the joints in the reward function
- w_action: A weight on the difference between action outputs in the reward function
- control_dt: Total simulation time step size per environment step
- n_substeps: Simulation substeps per environment step (increasing improves simulation stability)
Policy
- lstm_hidden_size: Number of parameters in the lstm
- n_lstm_layers: Number of lstm layers
- net_arch_pi: A list of layers for the action head
- net_arch_vf: A list of layers for the reward head
Algorithm (There are more advanced terms in the config that are unlisted here, see the stablebaselines RecurrentPPO API for more info)
- learning_rate: Learning rate for training
- n_steps: Total number of steps per environment per iteration
- batch_size: Total number of steps per batch
- n_epochs: Training epochs per iteration
Training
- timesteps: total timesteps across all training
- num_envs: number of environments running in parallel
Testing
- slowmo: sleep time between frames (visually only), increase for greater slowmo effect

Running the Programs

Train a Model
```
python train.py
```

Visualize Training Results with Tensorboard

PORT=$(shuf -i 6006-9000 -n 1); tensorboard --logdir ./agents --port $PORT & sleep 2 && xdg-open http://localhost:$PORT

Test a Model's Performance in a Live Viewer
```
python test.py
```

References

Gilmer, Jesse I., Susan K. Coltman, Geraldine Cuenu, John R. Hutchinson, Daniel Huber, Abigail L. Person, and Mazen Al Borno. "A novel biomechanical model of the proximal mouse forelimb predicts muscle activity in optimal control simulations of reaching movements." Journal of neurophysiology 133, no. 4 (2025): 1266-1278.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
readme		readme
README.md		README.md
config.yml		config.yml
environment.yml		environment.yml
imitation_env.py		imitation_env.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Development

Description

Examples

Setup

Miniconda Installation (if not done so already)

Repo Setup

Huggingface Installations

How to Use

Configuration Parameters

Running the Programs

References

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Development

Description

Examples

Setup

Miniconda Installation (if not done so already)

Repo Setup

Huggingface Installations

How to Use

Configuration Parameters

Running the Programs

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages