Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
reinforcement-learning
paper
semi-supervised-learning
bandits
bandit
contextual-bandits
contextual-bandit
self-supervised-learning
nonstationary-environments
Updated Sep 21, 2020 - MATLAB