Aviv Rosenberg avivros007

google-research-datasets/Education-Dialogue-Dataset Public archive

A dataset of conversations generated by prompting Gemini Ultra. Each conversation is between a teacher and a student, where the teacher is prompted with a specific topic to teach the student, and t…

37 9
Policy-Iteration-with-Adaptive-Planning-Horizon Public

An implementation of Policy Iteration with adaptive planning horizons on a grid world environment.

Python
StableBaselines3-Added-Features Public

Extends the StableBaselines3 DQN with an n-step TD error and an auxiliary task of predicting the next state.

Python
Factored-MDP-with-Unknown-Structure Public

Implementation of the experiments for the paper "Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure" by Aviv Rosenberg and Yishay Mansour (NeurIPS 2021).

Python 1
SummarizationNEWSROOM Public

Python 1