A curated list of Monte Carlo tree search papers with implementations.
-
Updated
Mar 16, 2024 - Python
A curated list of Monte Carlo tree search papers with implementations.
A Python package for causal inference using Synthetic Controls
Tabular methods for reinforcement learning
Offline evaluation of multi-armed bandit algorithms
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Emergent unsupervised policy generation from thermodynamics
A reinforcement learning project for crowd-dynamics in a very narrow corridor
Approaching OPE as a regression problem using meta-learning.
Exploring RL ideas for deep neural network hyper-parameter search
Benchmark for "Offline Policy Comparison with Confidence"
Reinforcement Learning Notebooks
Implementation of td policy evaluation and q-learning on a grid world.
This includes sample reinfrocement learning algorithms .Currently working on an approach to use RL for more comlex navigation issues
A PyTorch implementation of the "robust" synthetic control model
Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
A Python-based repository with implementations of RL algorithms, featuring visualization tools and benchmarks
The primary objective of the project is to assess the effectiveness of opioid drug regulations in three U.S. states.
Various reinforcement learning algorithms implemented on the frozen lake grid world.
Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction
Dynamic Programming for Finite Markov Decision Processes
Add a description, image, and links to the policy-evaluation topic page so that developers can more easily learn about it.
To associate your repository with the policy-evaluation topic, visit your repo's landing page and select "manage topics."