Pinned Loading
-
google-research-datasets/Education-Dialogue-Dataset
google-research-datasets/Education-Dialogue-Dataset Public archiveDataset of conversations, generated by prompting Gemini Ultra. These are conversations between a teacher and a student, where the teacher is prompted with specific topic to teach the student, and t…
-
Policy-Iteration-with-Adaptive-Planning-Horizon
Policy-Iteration-with-Adaptive-Planning-Horizon PublicAn implementation of Policy Iteration with adaptive planning horizons on a grid world environment.
Python
-
StableBaselines3-Added-Features
StableBaselines3-Added-Features PublicAdding to StableBaselines3 DQN: n-step TD error and an auxiliary task of predicting the next state.
Python
-
Factored-MDP-with-Unknown-Structure
Factored-MDP-with-Unknown-Structure PublicImplementation of the experiments for the paper "Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure" by Aviv Rosenberg and Yishay Mansour (NeurIPS 2021).
Python 1
-
If the problem persists, check the GitHub status page or contact support.
