Skip to content
View avivros007's full-sized avatar

Block or report avivros007

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. google-research-datasets/Education-Dialogue-Dataset google-research-datasets/Education-Dialogue-Dataset Public archive

    Dataset of conversations, generated by prompting Gemini Ultra. These are conversations between a teacher and a student, where the teacher is prompted with specific topic to teach the student, and t…

    37 9

  2. Policy-Iteration-with-Adaptive-Planning-Horizon Policy-Iteration-with-Adaptive-Planning-Horizon Public

    An implementation of Policy Iteration with adaptive planning horizons on a grid world environment.

    Python

  3. StableBaselines3-Added-Features StableBaselines3-Added-Features Public

    Adding to StableBaselines3 DQN: n-step TD error and an auxiliary task of predicting the next state.

    Python

  4. Factored-MDP-with-Unknown-Structure Factored-MDP-with-Unknown-Structure Public

    Implementation of the experiments for the paper "Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure" by Aviv Rosenberg and Yishay Mansour (NeurIPS 2021).

    Python 1

  5. SummarizationNEWSROOM SummarizationNEWSROOM Public

    Python 1