Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
-
Updated
Oct 14, 2022 - Python
Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
Implementation of td policy evaluation and q-learning on a grid world.
A Python-based repository with implementations of RL algorithms, featuring visualization tools and benchmarks
A reinforcement learning project for crowd-dynamics in a very narrow corridor
The primary objective of the project is to assess the effectiveness of opioid drug regulations in three U.S. states.
Various reinforcement learning algorithms implemented on the frozen lake grid world.
Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction
This includes sample reinfrocement learning algorithms .Currently working on an approach to use RL for more comlex navigation issues
Dynamic Programming for Finite Markov Decision Processes
Benchmark for "Offline Policy Comparison with Confidence"
A PyTorch implementation of the "robust" synthetic control model
Exploring RL ideas for deep neural network hyper-parameter search
Approaching OPE as a regression problem using meta-learning.
Codes for Change-in-change Asymptotics project
Reinforcement Learning Notebooks
Offline evaluation of multi-armed bandit algorithms
Emergent unsupervised policy generation from thermodynamics
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Tabular methods for reinforcement learning
Add a description, image, and links to the policy-evaluation topic page so that developers can more easily learn about it.
To associate your repository with the policy-evaluation topic, visit your repo's landing page and select "manage topics."