useless-thing

Useless try to create neural network.

The idea

Instead of using gradient descent for learning my neural network I've tried simplier method. I tried to do the following:
layer[j]=layer[j]+activity[i]*sign(layer[j])*reward*lr
where layer[j] is weight of ith input of each neuron. activity[i] is latest value of that input. i is number of input. So it increase the absolute value of weight if action was rewarded and decrease it if not. But unfortanetely it doesn't work.

Testing it

This network was tested on LunarLanderContinous-v2 from gym library. On average it do 1 landing over 1000 iterations.

Running

python rl1.py
NOTICE: gym and Box2D libraries are required to run. Python version>=3.6

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
rl1.py		rl1.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

useless-thing

The idea

Testing it

Running

About

Releases

Packages

Languages

License

0xf104a/useless-thing

Folders and files

Latest commit

History

Repository files navigation

useless-thing

The idea

Testing it

Running

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages