Path-Finder

Implementation of a deep reinforcement learning model trained to solve pathfinder puzzles.

See results commit on 4-22-17. Observation: convergence and convergence rate are relatively unphased by maximum number of actions except when the number is insufficient.

5/28 todo

implement scheduler:
1. learning rate scheduler (1.5. MNA scheduler)
2. state presentation scheduler
implement deuling architectures (easy modification)

Name		Name	Last commit message	Last commit date
Latest commit History 110 Commits
10-23		10-23
10-5		10-5
10-6		10-6
10-8		10-8
Experiments		Experiments
data_files/components		data_files/components
openai_etc		openai_etc
trial-exp-1		trial-exp-1
README.md		README.md
analysis.py		analysis.py
cleaner.py		cleaner.py
environment.py		environment.py
environment3.py		environment3.py
experiment.py		experiment.py
network.py		network.py
out-9-17.py		out-9-17.py
reinforcement_batch.py		reinforcement_batch.py
save_as_plot.py		save_as_plot.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Path-Finder

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Path-Finder

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages