feat: support multiple models on autotune service #107

Merged
NOBLES5E merged 74 commits into master from autotune_support_multi_models
Jul 7, 2021

@shjwudp shjwudp commented Jul 6, 2021

No description provided.

@@ -37,6 +37,7 @@ class BaguaHyperparameter(BaseModel):
"""

buckets: List[List[TensorDeclaration]] = []
bucket_size: int = 0
Contributor

Is this hyperparameter class only used in the service?

Member Author

Yes. I put it here because this is the simplest and most intuitive way to achieve this.

Contributor

I mean the BaguaHyperparameter class. If it is only used in the service, we can move the whole class to the service module.

Member Author

@shjwudp shjwudp Jul 7, 2021

Got it, but not exactly: new distributed hyperparameters, such as is_hierarchical_reduce, should also be configured here.
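
For readers following the thread, here is a minimal sketch of what a shared hyperparameter class along these lines could look like. It is not the code from this PR: the real class derives from pydantic's BaseModel, while this sketch uses standard-library dataclasses to stay dependency-free. The TensorDeclaration fields and the default value of is_hierarchical_reduce are assumptions made for illustration only.

```python
from dataclasses import asdict, dataclass, field, replace
from typing import List


@dataclass
class TensorDeclaration:
    # Hypothetical fields; the real declaration is not shown in this diff.
    name: str
    num_elements: int
    dtype: str


@dataclass
class BaguaHyperparameter:
    # The two fields visible in the diff hunk above.
    buckets: List[List[TensorDeclaration]] = field(default_factory=list)
    bucket_size: int = 0
    # Flag named in the review reply; the default value is an assumption.
    is_hierarchical_reduce: bool = False


# The service side could build a recommendation and serialize it to a dict...
recommended = BaguaHyperparameter(bucket_size=10 * 1024 ** 2)
payload = asdict(recommended)

# ...and the training side could rebuild it and toggle a distributed setting.
received = BaguaHyperparameter(**payload)
tuned = replace(received, is_hierarchical_reduce=True)
```

Keeping the class outside the service module, as the reply suggests, would let both the service and the training code import the same definition.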

@@ -1,17 +1,30 @@
import unittest
Contributor

Let's include the autotune test in CI.

Member Author

OK, I think so too.

@shjwudp shjwudp requested a review from NOBLES5E July 7, 2021 06:16
NOBLES5E commented Jul 7, 2021

One more thing: we also need test cases for different data types (fp16, fp32, and the mixed-precision case).
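
As a rough illustration of the kind of test requested here, the sketch below runs one check over fp16, fp32, and mixed inputs using unittest's subTest. Everything in it is hypothetical: split_into_buckets is a stand-in helper rather than a Bagua function, the dtype labels "f16" and "f32" are invented, and the rule that a bucket holds a single dtype is an assumption, not something stated in this PR.

```python
import unittest
from collections import defaultdict
from typing import List, Tuple

# (name, num_elements, dtype); a hypothetical stand-in for TensorDeclaration.
Tensor = Tuple[str, int, str]

# Hypothetical dtype labels and their element sizes in bytes.
BYTES_PER_ELEMENT = {"f16": 2, "f32": 4}


def split_into_buckets(tensors: List[Tensor], bucket_size: int) -> List[List[Tensor]]:
    """Greedily pack tensors into buckets of at most bucket_size bytes,
    never mixing dtypes within one bucket (an assumed constraint)."""
    by_dtype = defaultdict(list)
    for tensor in tensors:
        by_dtype[tensor[2]].append(tensor)

    buckets: List[List[Tensor]] = []
    for dtype, group in by_dtype.items():
        current: List[Tensor] = []
        current_bytes = 0
        for tensor in group:
            nbytes = tensor[1] * BYTES_PER_ELEMENT[dtype]
            # Start a new bucket when adding this tensor would overflow.
            if current and current_bytes + nbytes > bucket_size:
                buckets.append(current)
                current, current_bytes = [], 0
            current.append(tensor)
            current_bytes += nbytes
        if current:
            buckets.append(current)
    return buckets


class TestBucketDtypes(unittest.TestCase):
    CASES = {
        "fp16": [("a", 4, "f16"), ("b", 4, "f16")],
        "fp32": [("a", 4, "f32"), ("b", 4, "f32")],
        "mixed": [("a", 4, "f16"), ("b", 4, "f32"), ("c", 4, "f16")],
    }

    def test_buckets_are_dtype_homogeneous(self):
        for label, tensors in self.CASES.items():
            with self.subTest(case=label):
                buckets = split_into_buckets(tensors, bucket_size=16)
                # Each bucket contains exactly one dtype.
                for bucket in buckets:
                    self.assertEqual(len({t[2] for t in bucket}), 1)
                # No tensor is lost or duplicated.
                flat = sorted(t for bucket in buckets for t in bucket)
                self.assertEqual(flat, sorted(tensors))
```

A file containing this sketch could be run with python -m unittest, which would also make it straightforward to include in CI.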

@NOBLES5E NOBLES5E changed the title from "feat: autotune support multi models" to "feat: support multiple models on autotune service" Jul 7, 2021
@NOBLES5E NOBLES5E merged commit 0aec789 into master Jul 7, 2021
@NOBLES5E NOBLES5E deleted the autotune_support_multi_models branch July 7, 2021 09:00