
Add step-by-step and pipeline tutorials for reinforcement learning with Vertex AI. #19

Merged · 5 commits · Aug 6, 2021

Conversation

@KathyFeiyang (Contributor) commented Aug 3, 2021


Add two prototypes for reinforcement learning (RL) on Vertex AI. The prototypes use TF-Agents, Kubeflow Pipelines (KFP), and Vertex AI to build an RL application: a movie recommendation system based on the MovieLens 100K dataset.

  • Step-by-step demo: showcases how to use Vertex AI custom training, custom hyperparameter tuning, custom prediction, and endpoint deployment to build an RL movie recommendation system (see the sketch after this list).

  • End-to-end pipeline demo: showcases how to build an RL-specific MLOps pipeline using KFP and Vertex Pipelines, together with additional Vertex AI and GCP services such as BigQuery, Cloud Functions, Cloud Scheduler, and Pub/Sub.

Each demo contains a notebook that carries out the full workflow with user instructions, and a src/ directory for Python modules and unit tests.
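
For orientation, here is a minimal sketch of the step-by-step flow using the Vertex AI SDK. All names (project, bucket, script path, display names) are placeholders and the container images are only illustrative; this is not the notebook's actual code:

```python
from google.cloud import aiplatform

# Placeholder project/bucket. The staging bucket must be a regional bucket
# in the same region as the Vertex AI services (see the discussion below).
aiplatform.init(
    project="my-project",
    location="us-central1",
    staging_bucket="gs://my-staging-bucket",
)

# Custom training: the TF-Agents RL training logic lives in the script.
job = aiplatform.CustomTrainingJob(
    display_name="movielens-rl-train",
    script_path="src/trainer/task.py",  # hypothetical path
    container_uri="us-docker.pkg.dev/vertex-ai/training/tf-cpu.2-6:latest",
    model_serving_container_image_uri=(
        "us-docker.pkg.dev/vertex-ai/prediction/tf2-cpu.2-6:latest"
    ),
)
model = job.run(replica_count=1, model_display_name="movielens-rl-policy")

# Endpoint deployment for serving online movie recommendations.
endpoint = model.deploy(machine_type="n1-standard-4")
```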


Before submitting a Jupyter notebook, follow this mandatory checklist:

  • Use the notebook template as a starting point.
  • Follow the style and grammar rules outlined in the above notebook template.
  • Verify that the notebook runs successfully in Colab, since passing the automated tests does not guarantee this.
  • [N/A] Passes all the required automated checks
  • [N/A] You have consulted with a tech writer to see if tech writer review is necessary. If so, the notebook has been reviewed by a tech writer, and they have approved it.
  • This notebook has been added to the CODEOWNERS file, pointing to the author or the author's team. If the CODEOWNERS file doesn't exist, create one in the nearest folder that makes sense.
  • The Jupyter notebook cleans up any artifacts it has created (datasets, ML models, endpoints, etc.) so as not to consume unnecessary resources.


@yinghsienwu (Collaborator) commented

LGTM! Thanks for the PR.

@ivanmkc (Contributor) left a comment

LGTM because community tutorials don't need review.

Just add the CODEOWNERS file.

@Ark-kun (Contributor) commented Aug 4, 2021

Got this error when running at the ModelUpload stage: google.api_core.exceptions.FailedPrecondition: 400 The Cloud Storage bucket of gs://avolkov/tmp/artifacts is in location us. It must be in the same regional location as the service location us-central1.

/cc @sasha-gitg
/cc @chensun

This is one of the reasons why, when I proposed adding the InputUri placeholders, I proposed giving them a supportedSchemas property and room for expansion. Not all URIs are equal: component authors need to be able to tell the system which URIs the component supports.

@KathyFeiyang (Contributor, Author) commented

> Got this error when running at the ModelUpload stage: google.api_core.exceptions.FailedPrecondition: 400 The Cloud Storage bucket of gs://avolkov/tmp/artifacts is in location us. It must be in the same regional location as the service location us-central1.
>
> /cc @sasha-gitg

Thanks for pointing this out. I encountered the same issue with CustomContainerTrainingJob, and also logged this in a friction log (which I will send you offline).

The two notebooks include instructions in their GCS bucket creation sections about avoiding multi-regional buckets.
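
For reference, a minimal sketch of creating such a single-region bucket with the google-cloud-storage client (placeholder names; the notebooks give their own instructions):

```python
from google.cloud import storage

# Placeholder project and bucket names. The key point is the explicit
# single-region location matching the Vertex AI region, e.g. us-central1,
# rather than a multi-region such as "US".
client = storage.Client(project="my-project")
bucket = client.create_bucket("my-vertex-staging-bucket", location="us-central1")
print(bucket.location)  # "US-CENTRAL1"
```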

@morgandu (Contributor) commented Aug 5, 2021

@KathyFeiyang, please add the prototype to the CODEOWNERS file.

@sasha-gitg (Member) commented

In Vertex, buckets must be in a single region and must match the region of the Vertex service. We have an open ticket to add verification before we call the API (b/183494969) in the Vertex SDK. In many scenarios the service exception is informative enough (like the example above) and is generally raised immediately.

For components, I don't think this is feasible, because the storage URI could be passed in as a PipelineParam, and we will not know the identity of the URI until the task is executed.

Are you thinking of a different solution?
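
For illustration only, a hypothetical sketch of the kind of SDK-side pre-call check discussed above; this is not part of the Vertex SDK, and the helper name is made up:

```python
from google.cloud import storage

def check_bucket_matches_region(bucket_uri: str, vertex_region: str) -> None:
    """Fail fast if a staging bucket is not in the Vertex service region.

    Hypothetical helper sketching the verification discussed above
    (b/183494969); requires permission to read the bucket's metadata.
    """
    bucket_name = bucket_uri[len("gs://"):].split("/")[0]
    location = storage.Client().get_bucket(bucket_name).location
    # location is e.g. "US" (multi-region) or "US-CENTRAL1" (single region).
    if location.lower() != vertex_region.lower():
        raise ValueError(
            f"Bucket {bucket_name!r} is in location {location}, but the "
            f"Vertex service location is {vertex_region}; they must match."
        )

# Example: this would raise for the failure reported above.
# check_bucket_matches_region("gs://avolkov/tmp/artifacts", "us-central1")
```

As noted, such a check only works when the URI is known client-side; for KFP components the URI may be a PipelineParam that is resolved only at task execution time.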

@KathyFeiyang requested review from morgandu and a team as code owners on August 5, 2021 at 19:49
@morgandu (Contributor) left a comment

LGTM, thank you @KathyFeiyang for your contribution to our community content!

@KathyFeiyang (Contributor, Author) commented

@Ark-kun @sasha-gitg Thank you for the discussion above on potential improvements to using GCS buckets with Vertex. For the scope of this PR, which adds the two RL prototypes, the platform change is not immediately necessary: the notebooks instruct users to match the bucket region to the Vertex region, and platform changes can't be implemented within this PR anyway. I'll therefore prepare to wrap up the PR. Meanwhile, I do think the discussion is valuable to continue.
