Skip to content

Files

Latest commit

 

History

History

bandits_statlog_vw_customEnv

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Contextual Bandits with Amazon SageMaker RL

This Notebook demonstrates how you can manage your own contextual multi-armed bandit workflow on SageMaker using the built-in Vowpal Wabbit (VW) container to train and deploy contextual bandit models. We show how to train these models that interact with a live environment (using a simulated client application) and continuously update the model with efficient exploration.

Contents

  • bandits_statlog_vw_custom.ipynb: Notebook used for running the contextual bandit notebook.
  • config.yaml: The configuration parameters used for the bandit example.
  • sim_app: Simulated client application that pings SageMaker for recommended action given a state. Also computes the rewards for each interaction.
  • common: Code that manages the different AWS components required for training workflow.
  • src:
    • train-vw.py: Script for training with Vowpal Wabbit library.
    • eval-cfa-vw.py: Script for evaluation with Vowpal Wabbit library.