
Feat support continue training #668

Merged — 9 commits merged into main from feat-support-continue-training, Jan 30, 2023
Conversation

guenthermi
Member

@guenthermi guenthermi commented Jan 26, 2023

Add support for continuing training

This PR makes it possible to continue training from an existing artifact of a fine-tuned model:

```python
train_data = 'path/to/another/data.csv'

run2 = finetuner.fit(
    model='efficientnet_b0',
    train_data=train_data,
    model_artifact=run.artifact_id,
)
print(f'Run name: {run2.name}')
print(f'Run status: {run2.status()}')
```

Since we need the model name to detect the task and construct the training dataset correctly, users still need to pass the `model` property to the `fit` function.

  • This PR references an open issue
  • I have added a line about this change to CHANGELOG

@guenthermi
Member Author

Tests are failing because I have not committed the requirements changes for commons and stubs yet.

@guenthermi guenthermi marked this pull request as ready for review January 26, 2023 16:56
@github-actions github-actions bot added the area/testing This issue/PR affects testing label Jan 27, 2023
@guenthermi
Member Author

I added a pre-release to see if the tests are running. Everything seems to work as expected. One test is failing because loss-optimizers are not added yet. This should work when #664 is merged.

@guenthermi guenthermi requested review from gmastrapas, LMMilliken and bwanglzu and removed request for gmastrapas and LMMilliken January 27, 2023 08:26
Member

@bwanglzu bwanglzu left a comment


LGTM! But why is CI failing?

Comment on the usage example:

```python
train_data = 'path/to/another/data.csv'

run2 = finetuner.fit(
    model='efficientnet_b0',
```
Member


Do we still need the `model` parameter here?

Member


Okay, now I see the hint. Do I understand correctly that the model type must be identical to the artifact's model type?

Member Author


We need the model name to detect the task, so the tasks they are used for must be equal; so basically yes. If you use a DocArray dataset, it probably does not matter.
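To illustrate the constraint discussed here, a minimal sketch of the task-compatibility check (the mapping and helper below are hypothetical, not Finetuner's actual model registry):

```python
# Hypothetical sketch: the model name is used to detect the task, so the
# new `model` must resolve to the same task as the model that produced the
# artifact. This mapping is illustrative only.
MODEL_TO_TASK = {
    'efficientnet_b0': 'image-to-image',
    'resnet50': 'image-to-image',
    'bert-base-cased': 'text-to-text',
}


def same_task(model: str, artifact_model: str) -> bool:
    """Return True if both model names resolve to the same task."""
    return MODEL_TO_TASK[model] == MODEL_TO_TASK[artifact_model]
```

Under this sketch, continuing an `efficientnet_b0` artifact with a text model would be rejected, while another image model of the same task would pass.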

Contributor

@LMMilliken LMMilliken left a comment


Once the PR for the new pooling and loss options is merged, everything should pass.
LGTM!

Comment on lines 59 to 61:

```
:class: hint
When you want to continue training, you still need to provide the `model` parameter
beside the `model_artifact` parameter for Finetuner to correctly configure the new run.
```

Suggested change:

```
:class: hint
When you want to continue training, you still need to provide the `model` parameter
as well as the `model_artifact` parameter for Finetuner to correctly configure the new run.
```

Comment on lines -240 to +242:

```diff
-            name=model,
+            name=model if not kwargs.get(MODEL_ARTIFACT) else None,
+            artifact=kwargs.get(MODEL_ARTIFACT),
```
Contributor

@LMMilliken LMMilliken Jan 27, 2023


Just to make sure I understand, this means that if there is a MODEL_ARTIFACT, then name is set to None?

Member Author


Yes, because George implemented the JSON schema so that both arguments are mutually exclusive, which makes sense: otherwise it would be a bit hard to see what will happen, e.g., whether a new model is constructed or the artifact is used. This way, the final config sent to the API is clearer about what a run should do.
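The mutual exclusivity discussed here can be sketched as follows; this is a minimal illustration of the one-liner in the diff above, and the function name and dict shape are hypothetical, not the actual Finetuner internals:

```python
MODEL_ARTIFACT = 'model_artifact'


def build_model_description(model, **kwargs):
    # Set `name` only when no artifact is given: `name` and `artifact` are
    # mutually exclusive in the config sent to the API, so the run config
    # states unambiguously whether a new model is constructed or an
    # existing artifact is continued.
    artifact = kwargs.get(MODEL_ARTIFACT)
    return {
        'name': model if not artifact else None,
        'artifact': artifact,
    }
```

With an artifact present the `name` field is dropped to `None`, so a consumer of the config never has to guess which of the two sources wins.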

@guenthermi
Member Author

> LGTM! But why is CI failing?

As said in the comment above, the stubs version already includes the ArcFace changes, specifically the loss optimizers, etc. The tests are therefore failing, but this will be addressed by Louis' PR. So I need to merge main into this branch after Louis finishes his PR; then it should not fail anymore.

@github-actions

📝 Docs are deployed on https://ft-feat-support-continue-training--jina-docs.netlify.app 🎉

@guenthermi guenthermi merged commit fea1263 into main Jan 30, 2023
@guenthermi guenthermi deleted the feat-support-continue-training branch January 30, 2023 14:24
Successfully merging this pull request may close these issues.

continuous training using finetuner
4 participants