policy-evaluation

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

dyth / causalEntropicForces

Star

Emergent unsupervised policy generation from thermodynamics

entropy simulation tic-tac-toe unsupervised-learning policy-evaluation

Updated Dec 1, 2018
Python

nima-siboni / narrow-corridor-ai

Star

A reinforcement learning project for crowd-dynamics in a very narrow corridor

dynamic-programming policy-evaluation policy-iteration world-models multi-agent-reinforcement-learning

Updated Apr 21, 2020
Python

QasimWani / off-policy-evaluation

Star

Approaching OPE as a regression problem using meta-learning.

reinforcement-learning policy-evaluation meta-learning

Updated Feb 11, 2021
Python

vishal-keshav / xcelerator

Star

Exploring RL ideas for deep neural network hyper-parameter search

deep-neural-networks optimization pruning policy-evaluation meta-learning deep-learning-architectures

Updated Dec 25, 2018
Python

koulanurag / opcc

Star

Benchmark for "Offline Policy Comparison with Confidence"

reinforcement-learning policy-evaluation uncertainty-estimation confidence-estimation offline-reinforcement-learning offline-policy-comparison

Updated Oct 25, 2023
Python

Prakhar-FF13 / Reinforcement-Learning-With-Python

Star

Reinforcement Learning Notebooks

machine-learning reinforcement-learning deep-learning monte-carlo deep-reinforcement-learning policy-gradient policy-evaluation markov-decision-processes policy-iteration value-iteration actor-critic deep-q-learning temporal-differencing-learning cross-entropy-method

Updated Mar 31, 2019
Python

willyfh / grid-world-reinforcement-learning-2

Star

Implementation of td policy evaluation and q-learning on a grid world.

reinforcement-learning q-learning artificial-intelligence policy-evaluation temporal-difference

Updated Feb 17, 2024
Python

sparshgarg23 / Basic-Reinforcement-Learning

Star

This includes sample reinfrocement learning algorithms .Currently working on an approach to use RL for more comlex navigation issues

policy dynamic-programming policy-evaluation policy-iteration

Updated Aug 10, 2018
Python

agoryuno / robust_control

Star

A PyTorch implementation of the "robust" synthetic control model

python pytorch econometrics policy-evaluation causal-inference synthetic-control causal-models program-evaluation pytorch-implementation synthetic-control-method

Updated Jun 11, 2023
Python

sarthakmittal92 / mdp-and-cricket

Star

Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.

python mdp policy-evaluation policy-iteration cricket-game

Updated Oct 14, 2022
Python

wothmag07 / ReinforceMe

Star

A Python-based repository with implementations of RL algorithms, featuring visualization tools and benchmarks

policy-evaluation value-iteration actor-critic sarsa-lambda sarsa-learning qlearning-on-gridworld

Updated Apr 23, 2024
Python

heath3rq / Python_Opioid-Policy-Evaluation

Star

The primary objective of the project is to assess the effectiveness of opioid drug regulations in three U.S. states.

big-data policy-evaluation social-sciences-data data-analysis-python difference-in-differences

Updated Jan 11, 2023
Python

Animesh-Chourey / Frozen-Lake

Star

Various reinforcement learning algorithms implemented on the frozen lake grid world.

reinforcement-learning q-learning sarsa policy-evaluation policy-iteration value-iteration model-based-reinforcement-learning policy-improvement frozen-lake

Updated Aug 29, 2022
Python

ChukwumaChukwuma / enyimba_ai

Star

Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction

machine-learning natural-language-processing reinforcement-learning ai chatbot artificial-intelligence strategy policy-evaluation alphazero muzero prompt-engineering llms generative-ai rlhf llama2

Updated Jan 5, 2024
Python

farkoo / DP-for-FMDP

Star

Dynamic Programming for Finite Markov Decision Processes

reinforcement-learning dynamic-programming policy-evaluation markov-decision-processes policy-iteration value-iteration

Updated Apr 19, 2023
Python

Improve this page

Add a description, image, and links to the policy-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the policy-evaluation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

policy-evaluation

Here are 22 public repositories matching this topic...

benedekrozemberczki / awesome-monte-carlo-tree-search-papers

OscarEngelbrektson / SyntheticControlMethods

linesd / tabular-methods

antoine-hochart / bandit_algo_evaluation

akshaykhadse / reinforcement-learning

dyth / causalEntropicForces

nima-siboni / narrow-corridor-ai

QasimWani / off-policy-evaluation

vishal-keshav / xcelerator

koulanurag / opcc

Prakhar-FF13 / Reinforcement-Learning-With-Python

willyfh / grid-world-reinforcement-learning-2

sparshgarg23 / Basic-Reinforcement-Learning

agoryuno / robust_control

sarthakmittal92 / mdp-and-cricket

wothmag07 / ReinforceMe

heath3rq / Python_Opioid-Policy-Evaluation

Animesh-Chourey / Frozen-Lake

ChukwumaChukwuma / enyimba_ai

farkoo / DP-for-FMDP

Improve this page

Add this topic to your repo