# policy-evaluation

Here are 22 public repositories matching this topic...

sarthakmittal92 / mdp-and-cricket

Repository for the course project done as part of the CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.

python mdp policy-evaluation policy-iteration cricket-game

Updated Oct 14, 2022
Python

willyfh / grid-world-reinforcement-learning-2

Implementation of TD policy evaluation and Q-learning on a grid world.

reinforcement-learning q-learning artificial-intelligence policy-evaluation temporal-difference

Updated Feb 17, 2024
Python

wothmag07 / ReinforceMe

A Python-based repository with implementations of RL algorithms, featuring visualization tools and benchmarks

policy-evaluation value-iteration actor-critic sarsa-lambda sarsa-learning qlearning-on-gridworld

Updated Apr 23, 2024
Python

nima-siboni / narrow-corridor-ai

A reinforcement learning project for crowd-dynamics in a very narrow corridor

dynamic-programming policy-evaluation policy-iteration world-models multi-agent-reinforcement-learning

Updated Apr 21, 2020
Python

heath3rq / Python_Opioid-Policy-Evaluation

The primary objective of the project is to assess the effectiveness of opioid drug regulations in three U.S. states.

big-data policy-evaluation social-sciences-data data-analysis-python difference-in-differences

Updated Jan 11, 2023
Python

Animesh-Chourey / Frozen-Lake

Various reinforcement learning algorithms implemented on the frozen lake grid world.

reinforcement-learning q-learning sarsa policy-evaluation policy-iteration value-iteration model-based-reinforcement-learning policy-improvement frozen-lake

Updated Aug 29, 2022
Python

ChukwumaChukwuma / enyimba_ai

Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction

machine-learning natural-language-processing reinforcement-learning ai chatbot artificial-intelligence strategy policy-evaluation alphazero muzero prompt-engineering llms generative-ai rlhf llama2

Updated Jan 5, 2024
Python

sparshgarg23 / Basic-Reinforcement-Learning

This includes sample reinforcement learning algorithms. Currently working on an approach to use RL for more complex navigation issues.

policy dynamic-programming policy-evaluation policy-iteration

Updated Aug 10, 2018
Python

farkoo / DP-for-FMDP

Dynamic Programming for Finite Markov Decision Processes

reinforcement-learning dynamic-programming policy-evaluation markov-decision-processes policy-iteration value-iteration

Updated Apr 19, 2023
Python

koulanurag / opcc

Benchmark for "Offline Policy Comparison with Confidence"

reinforcement-learning policy-evaluation uncertainty-estimation confidence-estimation offline-reinforcement-learning offline-policy-comparison

Updated Oct 25, 2023
Python

glee1228 / RL_basic

rl policy-evaluation jacks-car-rental small-grid-world

Updated Mar 23, 2021
Python

agoryuno / robust_control

A PyTorch implementation of the "robust" synthetic control model

python pytorch econometrics policy-evaluation causal-inference synthetic-control causal-models program-evaluation pytorch-implementation synthetic-control-method

Updated Jun 11, 2023
Python

vishal-keshav / xcelerator

Exploring RL ideas for deep neural network hyper-parameter search

deep-neural-networks optimization pruning policy-evaluation meta-learning deep-learning-architectures

Updated Dec 25, 2018
Python

QasimWani / off-policy-evaluation

Approaching OPE as a regression problem using meta-learning.

reinforcement-learning policy-evaluation meta-learning

Updated Feb 11, 2021
Python

jeremylhour / CIC-asymptotics

Code for the Change-in-change Asymptotics project

econometrics policy-evaluation causal-inference

Updated Jan 14, 2024
Python

Prakhar-FF13 / Reinforcement-Learning-With-Python

Reinforcement Learning Notebooks

machine-learning reinforcement-learning deep-learning monte-carlo deep-reinforcement-learning policy-gradient policy-evaluation markov-decision-processes policy-iteration value-iteration actor-critic deep-q-learning temporal-differencing-learning cross-entropy-method

Updated Mar 31, 2019
Python

antoine-hochart / bandit_algo_evaluation

Offline evaluation of multi-armed bandit algorithms

thompson-sampling epsilon-greedy policy-evaluation multi-armed-bandit upper-confidence-bound

Updated Dec 1, 2020
Python

dyth / causalEntropicForces

Emergent unsupervised policy generation from thermodynamics

entropy simulation tic-tac-toe unsupervised-learning policy-evaluation

Updated Dec 1, 2018
Python

akshaykhadse / reinforcement-learning

Implementations of basic concepts under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

linesd / tabular-methods

Tabular methods for reinforcement learning

algorithm reinforcement-learning q-learning reinforcement-learning-algorithms sarsa gridworld policy-evaluation policy-iteration value-iteration reinforcement-learning-agent tabular-q-learning gridworld-environment sarsa-learning q-learning-vs-sarsa cliffwalking gridworld-cliff tabular-environments sarsa-algorithm tabular-methods q-learning-algorithm

Updated Jul 3, 2020
Python
