dennybritz / reinforcement-learning Public

Fork 6k
Star 20.1k

Pull requests: dennybritz/reinforcement-learning

20 Open 74 Closed

Pull requests list

Update README.md

#248 opened Mar 17, 2023 by pajjaecat

Modify "v (list) : state value function" to "V"

#242 opened Oct 29, 2021 by hslyu

Hello

#241 opened Oct 21, 2021 by simplephi

Update README.md

#240 opened Oct 1, 2021 by hardlyhuman

Error 'show() takes 1 positional argument but 2 were given' fixed in plotting.py

#237 opened Feb 1, 2021 by Dolores2333

Minor fixes

#234 opened Dec 22, 2020 by rafardenas

update slides

#233 opened Oct 10, 2020 by harsh306

1 comment

added: Double DQN Proportional Prioritized Experience Replay Solution

#226 opened Apr 30, 2020 by makaveli10

2 comments

Exercise notebooks with no outputs.

#207 opened Aug 3, 2019 by avullo

Add Links to Deepnote

#206 opened Aug 1, 2019 by jirkalhotka

Test the policy in "Value Iteration" exercise

#205 opened Jun 23, 2019 by link2xt

1 comment

Proposal of Expected SARSA algorithm

#197 opened Mar 25, 2019 by AntonioSerrano

1 comment

Adding k-bandit implementation

#178 opened Oct 1, 2018 by rae83

Create MDP_David_class_first_example.py

#169 opened Jul 11, 2018 by olmerg

Modify Policy Evaluation Solution.ipynb according to David Silver's slides.

#166 opened Jul 5, 2018 by QikeLi

1 comment

Update dqn.py

#165 opened Jun 7, 2018 by zmonoid

updated DQN model for tf 1.0

#115 opened Oct 18, 2017 by Airconaaron

Workaround for environment max step limit of 200.

#107 opened Sep 12, 2017 by sedand

6 comments

update from upstream & make the implement more robust and meaningful in DP/Policy Evaluation Solution

#105 opened Aug 24, 2017 by liu-jc

1 comment

fix the probabilities for each action bug

#86 opened May 26, 2017 by fstonezst
