MLOps on GCP Vertex AI: a gradual tutorial

Please read MLOps: Continuous delivery and automation pipelines in machine learning before beginning this tutorial.

To summarize the MLOps levels:

MLOps level 0: Manual process
MLOps level 1: ML pipeline automation
MLOps level 2: CI/CD pipeline automation

The goal of MLOps level 2 is to achieve the same velocity and quality of the DevOps teams: new web applications and features are created, tested, deployed, and destroyed every day, if not multiple times a day, all with zero impact to a user. For example, Google constantly adds or updates products across its entire portfolio, which means there are hundreds to thousands of new deployments on any given day. Even if your company is not as big as Google, you and your AI/ML team can and should aspire to the same velocity as the application development teams. This means adopting the practices and principles of DevOps.

MLOps level 0: Manual process

Topics

Jupyter Notebooks

Data Scientist workflow

I load the data in Jupyter Notebook
I iterate on model
I run training notebook to output model

MLOps level 1: ML pipeline automation

Topics

Non-Jupyter IDE: Code is written in .py files to accomodate containers, not .ipynb
Docker Containers: Runs custom code repeatedly
Pipelines

Data Scientist workflow

I may iterate on training container
1. Manually build Docker image and push to Artifact Repository
I may modify Vertex Pipeline
1. Manually recompile pipeline.yaml
I may need new bucket
1. Manually create a new bucket in Console

MLOps level 2: CI/CD pipeline automation

Topics

Unit testing
Production environments: Production environments should not be manually touched, only vetted code can deploy
Terraform: Infrastructure as Code e.g. create new buckets using code instead of console
Cloud Build: Build service e.g. docker build and push
Git automation: When source code changes, changes are automatically tested and applied

Data Scientist workflow

I iterate on training container locally
I git push the changes to dev branch in repo of choice e.g. Gitlab, Github
Cloud Build detects that change, then runs the steps in cloudbuild.yaml (*), which may include:
1. Running unit tests
2. Running Docker build and pushing to Artifact Registry
3. Running terraform apply
4. Running functional tests
Once all code passes is dev, new changes may automatically pushed to Production depending on your DevOps
The above checks and builds take place in Production and your new model is launched

(*) In this tutorial we will not connect Cloud Build to a repo, instead we will run Cloud Build manually to mimic the trigger would execute.

Best practices

Use .gitignore in same directory as cloudbuild yaml to ignore temp files e.g. terraform
- https://cloud.google.com/sdk/gcloud/reference/builds/submit

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
level0		level0
level1		level1
level2		level2
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MLOps on GCP Vertex AI: a gradual tutorial

MLOps level 0: Manual process

Topics

Data Scientist workflow

MLOps level 1: ML pipeline automation

Topics

Data Scientist workflow

MLOps level 2: CI/CD pipeline automation

Topics

Data Scientist workflow

Best practices

Resources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

jakechen/gcp-mlops-tutorial

Folders and files

Latest commit

History

Repository files navigation

MLOps on GCP Vertex AI: a gradual tutorial

MLOps level 0: Manual process

Topics

Data Scientist workflow

MLOps level 1: ML pipeline automation

Topics

Data Scientist workflow

MLOps level 2: CI/CD pipeline automation

Topics

Data Scientist workflow

Best practices

Resources

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages