Conversation

@redanrd redanrd commented Apr 17, 2024

Summary

Improve MLOps example:

  • Make the MLOps pipeline automatic after the first deployment using CRON schedules: one job regularly fetches data from the source and retrains the model, and another deploys the latest model version from the model registry. This scheme is useful when the data provided by the source grows over time, since retraining on more data generally improves inference server performance.

  • Code formatting.
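
The two scheduled jobs described above can be sketched as plain functions; the function names and the minimal in-memory registry below are illustrative stand-ins, not the example's actual code.

```python
# Hypothetical sketch of the two CRON jobs: (1) fetch data and retrain,
# pushing the result to a model registry; (2) deploy the latest version.

def fetch_data(source):
    """Pull the latest records from the data source (stub)."""
    return list(source)

def train_model(data):
    """Train a model on the fetched data (stub: tags it with a version)."""
    return {"version": len(data), "data_size": len(data)}

class ModelRegistry:
    """Minimal stand-in for a model registry keyed by version."""
    def __init__(self):
        self.models = {}

    def push(self, model):
        self.models[model["version"]] = model

    def latest(self):
        return self.models[max(self.models)]

def scheduled_run(source, registry):
    # CRON job 1: fetch the (possibly grown) data and retrain.
    data = fetch_data(source)
    registry.push(train_model(data))
    # CRON job 2: deploy the latest registered model version.
    return registry.latest()

registry = ModelRegistry()
deployed = scheduled_run(range(100), registry)
print(deployed["version"])
```

As the source dataset grows between runs, each scheduled pass retrains on the larger dataset and the deploy job always picks up the newest registry entry.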

Checklist

  • I have reviewed this myself.
  • I have attached a README to my example. You can use this template as reference.
  • I have updated the project README to link my example.

@redanrd redanrd marked this pull request as ready for review May 7, 2024 13:52
memory_limit = 2048
min_scale = 1
-max_scale = 5
+max_scale = 1

@redanrd redanrd May 7, 2024


We need to preserve the container state because we load the model into memory. With many instances, some could end up with no model loaded in memory and would fail when asked for inference.

@redanrd redanrd requested review from Shillaker and cyclimse May 7, 2024 14:01
@redanrd redanrd merged commit 4eecda3 into main Jun 28, 2024
@redanrd redanrd deleted the mlops-example branch June 28, 2024 08:33