Skip to content

A bachelor thesis project about autonomous car maneuver around roundabout using RL-DQN

License

Notifications You must be signed in to change notification settings

roychanmeliaz/carla-rl-dqn-thesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

# Carla RL-DQN Thesis Project

A bachelor thesis project about autonomous car maneuver around roundabout using reinforcement learning with deep q network.

Conference paper published at: https://ieeexplore.ieee.org/document/10221077

Environment Setup

Two kinds of roundabouts are used in this research.

  • Roundabout with Intersection

roundabout

  • Roundabout without Intersection

roundabout2

Sensor

The camera sensor was obtained, and then we did the semantic segmentation using CARLA's built-in function.
rgb -> segmentasi

RGB -> Semantic Segmentation

Two kinds of camera setup are used in this research.

  • Segmented and Grayscaled

grayscale

  • Segmented and retouched

segmentasi_hitam_putih

Action

There are three actions that the agent can do.

  • Forward Throttle = 1; Steer = 0;
  • Forward Left Throttle = 1; Steer = -1;
  • Forward Right Throttle = 1; Steer = 1;

Reward Function

The reward functions are different for each kinds of roundabout.

Roundabout with intersection

image

  1. Angle Deviation. The value is Reward1 = 1/alpha. Alpha is the difference in angle from the target line and the agent angle.

image

  1. Distance Deviation. The value is Reward2 = 1/distance_deviation*10. Distance deviation is the distance from agent to the target line in m.

image

  1. Collision. The value is Reward3 = -1. Collision event triggered when the agent touched another object.

  2. Agent too far. The value is Reward2 = -0.5. Agent too far event triggered when the distance between the center of roundabout and the agent is more than 30 meters.

  3. Total Reward:. The total reward is Reward = Reward1 + Reward+2 + Reward3 + Reward4

Roundabout without intersection

image

  1. Angle Deviation. The value is Reward1 = 1/alpha. Alpha is the angle difference between the agent's direction and the direction from the agent to the nearest waypoint + 5 from the agent.

image

  1. Collision The value is Reward2 = -1. Collision event triggered when the agent touched another object.

  2. Total Reward: The total reward is Reward = Reward1 + Reward+2

Training

Using reinforcement learning with deep q network.

methodology

About

A bachelor thesis project about autonomous car maneuver around roundabout using RL-DQN

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages