Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add step-by-step and pipeline tutorials for reinforcement learning with Vertex AI. #19

Merged
merged 5 commits into from
Aug 6, 2021
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
25 changes: 25 additions & 0 deletions tutorials/community/reinforcement_learning/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
# Reinforcement Learning Protytopes on Vertex AI
KathyFeiyang marked this conversation as resolved.
Show resolved Hide resolved

This directory demos the use of TF-Agents, Kubeflow Pipelines (KFP) and Vertex
AI in building reinforcement learning applications. The RL applications we build
are movie recommendation systems based on the MovieLens 100K dataset.

There are 2 demos:

1. Step-by-step demo: showcase how to use custom training, custom
hyperparameter tuning, custom prediction and endpoint deployment of Vertex
AI to build a RL movie recommendation system

2. End-to-end pipeline demo: showcase how to build a RL-specific MLOps pipeline
using KFP and Vertex Pipelines, as well as additional Vertex AI and GCP
services such as BigQuery, Cloud Functions, Cloud Scheduler, Pub/Sub.
Highlights of this end-to-end pipeline are:

- RL-specific implementation for training and prediction
- Simulation for initial training data, prediction requests and
re-training
- Closing of the feedback loop from prediction results back to training
- Customizable and reproducible KFP components

Each demo contains a notebook that carries out the full workflow and user
instructions, and a `src/` directory for Python modules and unit tests.
Loading