Deep Q Network

An implementation of q algorithm of Reinforcement Learning.

Installation Dependencies:

Python 3
TensorFlow 1.0.1
pygame
gym

How to Run?

git clone https://github.com/lufficc/dqn.git
cd dqn
python run.py

Tricks for flappybird

Remove background image:

clip useless part:

resize and using binary image:

decayed ε-greedy exploration, and when exploration, 0.95 probability to do nothing(because in flappy bird, most time wo do nothing). This is very important. It makes model converge in less than 2 hours.

def egreedy_action(self, state):
    #Exploration
    if random.random() <= self.epsilon:
        if random.random() < 0.95:
            action_index = 0
        else:
            action_index = 1
        # action_index = random.randint(0, self.num_actions - 1)
    else:
        #Exploitation
        action_index = self.action(state)
    if self.epsilon > self.final_epsilon:
        self.epsilon *= self.decay_factor
    return action_index

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.vscode		.vscode
assets		assets
core		core
game		game
.gitignore		.gitignore
README.md		README.md
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Q Network

Installation Dependencies:

How to Run?

Tricks for flappybird

Thanks

About

Releases

Packages

Languages

lufficc/dqn

Folders and files

Latest commit

History

Repository files navigation

Deep Q Network

Installation Dependencies:

How to Run?

Tricks for flappybird

Thanks

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages