The agent was trained based on the DQN exercises provided in the course and achieved an average score of 13 or higher. I ran the code in the workspace provided by Udacity, so the installation steps listed in the review requirements were skipped.
Environment:
- State space: 37 dimensions.
- Action space: 4 discrete actions (move forward, move backward, turn left, turn right).
- Reward: +1 for collecting a yellow banana, -1 for collecting a blue banana.
Using the Bellman operator, we compute target action values from previously collected experience. The action-value function is approximated by a neural network: the input is the state, and the output is a Q-value for each action. The network weights are learned by minimizing the error between the approximated Q-values and the target values. Double DQN is not applied.
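The update described above can be sketched without a deep-learning framework. The following is a minimal NumPy illustration, assuming a linear Q-function in place of the project's neural network and a randomly generated transition batch; all variable names (`W`, `W_target`, `q_values`, etc.) are hypothetical, not the project's actual code:

```python
import numpy as np

# Minimal sketch only: a linear Q-function stands in for the neural network,
# and the transition batch is random rather than drawn from real experience.
rng = np.random.default_rng(0)
STATE_DIM, N_ACTIONS = 37, 4      # Banana environment sizes from the text
GAMMA, LR, BATCH = 0.99, 1e-3, 8

W = rng.normal(scale=0.1, size=(STATE_DIM, N_ACTIONS))  # online parameters
W_target = W.copy()                                     # frozen target parameters

def q_values(states, weights):
    """Q(s, a) for every action: rows are states, columns are actions."""
    return states @ weights

# A fake batch of transitions (s, a, r, s', done) standing in for replay memory.
s = rng.normal(size=(BATCH, STATE_DIM))
a = rng.integers(0, N_ACTIONS, size=BATCH)
r = rng.normal(size=BATCH)
s_next = rng.normal(size=(BATCH, STATE_DIM))
done = np.zeros(BATCH)

# Bellman (TD) target with the vanilla DQN max over the target network;
# Double DQN would instead select the argmax action with the online network.
y = r + GAMMA * (1.0 - done) * q_values(s_next, W_target).max(axis=1)

q_sa = q_values(s, W)[np.arange(BATCH), a]
td_error = q_sa - y
loss_before = 0.5 * np.mean(td_error ** 2)

# One gradient-descent step on the squared error; only the taken actions
# contribute to the gradient of their weight columns.
grad = np.zeros_like(W)
for i in range(BATCH):
    grad[:, a[i]] += td_error[i] * s[i] / BATCH
W -= LR * grad

q_sa_new = q_values(s, W)[np.arange(BATCH), a]
loss_after = 0.5 * np.mean((q_sa_new - y) ** 2)
print(loss_before, loss_after)  # the step should shrink the approximation error
```

In the actual project, the same idea is carried out with a neural network and an optimizer: the target network is updated only periodically, which keeps the targets stable while the online network is trained.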