⚠️ PROJECT STATUS: NO LONGER MAINTAINED
This project is provided as-is for educational and research purposes. It is free to use forever but will not receive updates, bug fixes, or feature additions. Feel free to fork and adapt it to your needs.
ML-EV3 is an educational project demonstrating Q-Learning reinforcement learning applied to robot navigation. The project contains two complementary components:
- Webots Simulation – Train and test a virtual robot using Q-Learning in a simulated environment
- Real-World EV3 Control – Deploy trained policies to a physical LEGO Mindstorms EV3 robot
The robot learns to navigate toward a colored goal (red square) while avoiding obstacles, using sensor data (touch, distance, color) to make decisions.
| Audience | Use Case |
|---|---|
| Students | Learn reinforcement learning fundamentals with hands-on robotics |
| Researchers | Study sim-to-real transfer and Q-Learning behavior |
| Hobbyists | Experiment with EV3 robots and machine learning |
| Educators | Teaching material for robotics and AI courses |
Prerequisites:
- Basic Python knowledge
- Familiarity with reinforcement learning concepts (helpful but not required)
- For the real robot: a LEGO Mindstorms EV3 with ev3dev OS
```
ML-EV3/
├── simulation/                    # Webots simulation environment
│   ├── ev3_bot.py                 # Robot controller with Q-Learning
│   ├── world.wbt                  # Webots world definition
│   ├── QTable.json                # Pre-trained Q-table (simulation)
│   └── q_table.npy                # NumPy format Q-table backup
│
├── real_robot/                    # Physical EV3 robot control
│   ├── ev3control.py              # EV3 robot controller class
│   ├── run.py                     # Main execution script
│   ├── screen_utils.py            # EV3 display utilities
│   └── QTable.json                # Q-table for deployment
│
├── results/                       # Experimental results
│   ├── results.txt                # Test runs with obstacles
│   └── results_no_obstacles.txt   # Test runs without obstacles
│
├── README.md                      # This file
├── LICENSE                        # MIT License
└── .gitignore                     # Git ignore rules
```
The robot perceives its environment through three sensors:
| Sensor | States | Description |
|---|---|---|
| Touch | 2 | Binary: pressed (1) or not pressed (0) |
| Distance | 256 | Discretized distance readings (0-255 cm) |
| Color | 8 | Color indices: None(0), Black(1), Blue(2), Green(3), Yellow(4), Red(5), White(6), Brown(7) |
Total State Space: 2 × 256 × 8 = 4,096 states
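To make the discretization concrete, here is a minimal sketch of how the three readings can be flattened into a single table index. The function name and the component ordering are illustrative assumptions, not necessarily what `ev3_bot.py` does:

```python
NUM_TOUCH_STATES = 2
NUM_DISTANCE_STATES = 256
NUM_COLOR_STATES = 8

def encode_state(touch, distance_cm, color_index):
    """Flatten (touch, distance, color) into one index in [0, 4095].

    Assumes row-major ordering with touch as the most significant component.
    """
    distance = min(int(distance_cm), NUM_DISTANCE_STATES - 1)  # clamp to 0-255
    return (touch * NUM_DISTANCE_STATES + distance) * NUM_COLOR_STATES + color_index

# Example: touch not pressed, 42 cm to the nearest obstacle, red floor (index 5)
state = encode_state(0, 42, 5)  # -> 341
```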
| Action | Index | Description |
|---|---|---|
| Move Forward | 0 | Drive straight ahead |
| Turn Left | 1 | Rotate left |
| Turn Right | 2 | Rotate right |
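For intuition, mapping an action index onto the differential drive might look like the sketch below. The `set_speed` helper is a stand-in for whichever motor API is in use (Webots' `setVelocity` or ev3dev2's `on`), and the pivot-style turns are an assumption:

```python
def apply_action(action, left_motor, right_motor, speed=50):
    """Translate a Q-table action index into left/right wheel speeds.

    Illustrative only: actual turn behavior depends on wheel base and tuning.
    """
    if action == 0:    # Move Forward: both wheels at the same speed
        left_motor.set_speed(speed)
        right_motor.set_speed(speed)
    elif action == 1:  # Turn Left: pivot by driving wheels in opposite directions
        left_motor.set_speed(-speed)
        right_motor.set_speed(speed)
    elif action == 2:  # Turn Right
        left_motor.set_speed(speed)
        right_motor.set_speed(-speed)
```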
| Parameter | Value | Description |
|---|---|---|
| α (Learning Rate) | 0.1 | How much new information overrides old |
| γ (Discount Factor) | 0.99 | Importance of future rewards |
| ε (Exploration Rate) | 1.0 → 0.01 | Decays exponentially during training |
| Episodes | 1000 | Number of training episodes |
| Goal Reward | +100 | Reaching the red target |
| Collision Penalty | -10 | Hitting obstacles |
| Step Penalty | -1 | Each movement step |
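These parameters plug straight into the standard tabular Q-Learning update. The sketch below shows one update step plus ε-greedy selection and decay, assuming a NumPy table of shape (4096, 3); the 0.995 decay rate is an illustrative choice, not the repo's exact schedule:

```python
import numpy as np

ALPHA, GAMMA = 0.1, 0.99
NUM_STATES, NUM_ACTIONS = 4096, 3
q_table = np.zeros((NUM_STATES, NUM_ACTIONS))

def choose_action(state, epsilon):
    """Epsilon-greedy: random action with probability epsilon, else the best known one."""
    if np.random.random() < epsilon:
        return np.random.randint(NUM_ACTIONS)
    return int(np.argmax(q_table[state]))

def update(state, action, reward, next_state):
    """Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))"""
    td_target = reward + GAMMA * np.max(q_table[next_state])
    q_table[state, action] += ALPHA * (td_target - q_table[state, action])

def decayed_epsilon(episode, eps_start=1.0, eps_min=0.01, decay=0.995):
    """Exponential decay from 1.0 toward the 0.01 floor."""
    return max(eps_min, eps_start * decay ** episode)
```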
- Webots R2023b or later ([Download](https://cyberbotics.com))
- Python 3.x
- Python packages: `numpy`, `pandas`, `opencv-python`, `colorthief`
1. Install Webots from [cyberbotics.com](https://cyberbotics.com)
2. Install Python dependencies:
   ```bash
   pip install numpy pandas opencv-python colorthief
   ```
3. Clone this repository:
   ```bash
   git clone https://github.com/OzSho/ML-EV3.git
   cd ML-EV3/simulation
   ```
4. Open `simulation/world.wbt` in Webots
5. The simulation will automatically run `ev3_bot.py` as the robot controller
6. The robot will train using Q-Learning and output results to the console
The Webots world contains:
- A 2x2 meter arena with walls
- Colored floor squares (goal is red)
- Wooden box obstacles
- An EV3-like differential drive robot with:
- Touch sensor
- Distance sensor (ultrasonic)
- Color camera
- LEGO Mindstorms EV3 brick
- ev3dev OS installed (ev3dev.org)
- Python 3.x on ev3dev
- Hardware configuration:
- Left motor: Port A
- Right motor: Port B
- Touch sensor: Port 1
- Ultrasonic sensor: Port 2
- Color sensor: Port 4
1. Connect to your EV3 via SSH
2. Copy the `real_robot/` folder to your EV3:
   ```bash
   scp -r real_robot/ robot@ev3dev.local:~/
   ```
3. Install the ev3dev Python library (usually pre-installed):
   ```bash
   pip3 install python-ev3dev2
   ```
4. Run the controller:
   ```bash
   cd ~/real_robot
   python3 run.py
   ```

The robot will:
- Load the trained Q-table
- Execute 30 test runs
- Record completion times to `results.txt`
- Each run has a 2-minute timeout (sketched below)
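In outline, such a test loop can look like the following; `read_state`, `act`, and `on_goal` are hypothetical helpers standing in for the actual `ev3control.py` interface, and the JSON table is assumed to be a list of per-state action values:

```python
import json
import time

def run_test(robot, q_table, timeout=120):
    """Act greedily from the loaded Q-table until the red goal or the 2-minute timeout."""
    start = time.time()
    while time.time() - start < timeout:
        state = robot.read_state()    # hypothetical: encode current sensor readings
        action = max(range(3), key=lambda a: q_table[state][a])  # greedy, no exploration
        robot.act(action)             # hypothetical: drive the motors
        if robot.on_goal():           # hypothetical: red square detected
            return time.time() - start
    return timeout                    # logged as 02:00 in results.txt

with open("QTable.json") as f:
    q_table = json.load(f)
```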
- Create a colored floor with a red target zone
- Place colored squares matching the simulation (black, blue, green, yellow, white, brown)
- Optionally add obstacles for navigation challenges
The `results/` folder contains experimental data:
With obstacles (`results.txt`):

```
Test num: 1, run time: 00:00:23
Test num: 2, run time: 00:00:03
...
```

Without obstacles (`results_no_obstacles.txt`):

```
Test num: 1, run time: 00:00:30
Test num: 2, run time: 00:00:20
...
```
Key Metrics:
- Run times under 2 minutes indicate successful goal completion
- `02:00` indicates a timeout (goal not reached)
- Compare runs with and without obstacles to evaluate policy robustness (a parsing sketch follows below)
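For example, a success rate per results file can be computed with a few lines of Python; the line format and 2-minute timeout follow the samples above:

```python
import re

def success_rate(path, timeout_secs=120):
    """Parse 'Test num: N, run time: HH:MM:SS' lines; runs under the timeout are successes."""
    times = []
    with open(path) as f:
        for line in f:
            match = re.search(r"run time: (\d+):(\d+):(\d+)", line)
            if match:
                h, m, s = map(int, match.groups())
                times.append(h * 3600 + m * 60 + s)
    return sum(t < timeout_secs for t in times) / len(times) if times else 0.0

print(f"with obstacles:    {success_rate('results/results.txt'):.0%}")
print(f"without obstacles: {success_rate('results/results_no_obstacles.txt'):.0%}")
```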
This project demonstrates several key concepts:
- Q-Learning algorithm implementation
- Epsilon-greedy exploration strategy
- Reward shaping for desired behavior
- State discretization for continuous environments
- Sensor fusion (touch + distance + color)
- Differential drive kinematics
- Sim-to-real transfer challenges
- ev3dev programming
- Modular code design
- Configuration management (JSON Q-tables)
- Cross-platform development (simulation → real robot; see the sketch below)
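As one concrete illustration of the last two points, a JSON round-trip lets the NumPy-trained table run on the EV3 without NumPy. A minimal sketch; the repo's actual JSON schema may differ:

```python
import json
import numpy as np

def save_q_table(q_table, path="QTable.json"):
    """Serialize a NumPy Q-table to JSON so ev3dev can read it without NumPy."""
    with open(path, "w") as f:
        json.dump(q_table.tolist(), f)

def load_q_table(path="QTable.json"):
    """Load the table back as plain nested lists, indexable as q_table[state][action]."""
    with open(path) as f:
        return json.load(f)

save_q_table(np.zeros((4096, 3)))
assert load_q_table()[0][0] == 0.0
```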
In `simulation/ev3_bot.py`, adjust the `main()` function:

```python
# Q-learning parameters
EPSILON = 1           # Initial exploration rate
ALPHA = 0.1           # Learning rate
GAMMA = 0.99          # Discount factor
NUM_EPISODES = 1000   # Training episodes
collision_reward = -10
goal_reward = 100
```

Modify `get_color_index()` in `ev3_bot.py`:
```python
def get_color_index(b, g, r):
    # Add new color thresholds here
    if r > 200 and g < 100 and b > 200:  # Purple
        return 8  # Remember to update NUM_COLOR_STATES
    # ... existing colors
```

In `real_robot/ev3control.py`, update the port assignments:
```python
self.left_motor = LargeMotor(OUTPUT_A)    # Change port here
self.right_motor = LargeMotor(OUTPUT_B)   # Change port here
self.touch_sensor = TouchSensor(INPUT_1)  # Change port here
```
1. **Sim-to-Real Gap**: Policies trained in simulation may not transfer perfectly to real hardware due to:
   - Sensor noise differences
   - Motor response variations
   - Lighting conditions affecting color detection
2. **Fixed State Discretization**: The color and distance discretization may not be optimal for all environments
3. **No Continuous Learning**: The real robot runs inference only; it doesn't update the Q-table online
4. **Limited Error Handling**: The code assumes proper hardware configuration
Q: Can I use a different robot simulator?
A: The Q-Learning logic is portable. You'll need to adapt the sensor/motor interfaces for your simulator.
Q: Why does the robot sometimes spin in circles?
A: This can happen with insufficient training or when the robot encounters unseen states. Try increasing training episodes.
Q: How do I retrain from scratch?
A: Delete `QTable.json` and `q_table.npy`, then run the simulation. The Q-table will be reinitialized.
Q: Can I use this with EV3-G or other LEGO software?
A: No, this requires ev3dev OS and Python. EV3-G doesn't support custom Python scripts.
This project is licensed under the MIT License - see the LICENSE file for details.
You are free to:
- ✅ Use commercially
- ✅ Modify
- ✅ Distribute
- ✅ Use privately
This project is no longer actively maintained, but you're welcome to:
- Fork this repository for your own experiments
- Adapt the code for your specific use case
- Share your improvements with the community
If you create something interesting, feel free to tag @OzSho on GitHub!
If you use this project in academic work, please cite:

```bibtex
@software{ml_ev3,
  author = {OzSho},
  title  = {ML-EV3: Q-Learning Robot Navigation},
  year   = {2024},
  url    = {https://github.com/OzSho/ML-EV3},
  note   = {Educational reinforcement learning project for EV3 robots}
}
```

- Webots Documentation
- ev3dev Documentation
- Q-Learning Tutorial
- Reinforcement Learning: An Introduction (Sutton & Barto)
Made with ❤️ for robotics education
This project is provided free of charge, forever.