Research Scientist, Instadeep | Deep RL
- Paris, France
An easy-to-use reinforcement learning library for research and education.
🔬Research Framework for Single and Multi-Players 🎰Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-play…
AGAC: Adversarially Guided Actor-Critic
AVEC: Actor with Variance Estimated Critic