Submission checker version 4.0 #1560

Merged (3 commits) on Jan 17, 2024

@pgmpablo157321 pgmpablo157321 commented Jan 16, 2024

  • Add the Llama 2 and Stable Diffusion models to the submission checker
  • Check the new accuracy metrics, performance metrics, and compliance tests
  • Check that the image files for Stable Diffusion are present
  • Update seeds

This PR contains the changes necessary for v4.0 submissions, with the exception of some pending changes for Llama 2.

github-actions bot commented Jan 16, 2024

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

@@ -1309,6 +1509,7 @@ def __init__(
self.seeds = self.base["seeds"]
self.test05_seeds = self.base["test05_seeds"]
self.accuracy_target = self.base["accuracy-target"]
self.accuracy_upper_limit = self.base["accuracy-upper-limit"]
Contributor

I think we should use self.base.get("accuracy-upper-limit") as it is not there for all the benchmarks.

Contributor Author

This is done here: https://github.com/pgmpablo157321/inference/blob/f7c755a0e156f51c1684f984889b309b8afaffc2/tools/submission/submission_checker.py#L1618

The script passes the whole dictionary to the config and then calls get on it for each model, so models without an upper limit are handled there.
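
A minimal sketch of the pattern described above, assuming the base config holds a single "accuracy-upper-limit" dictionary keyed by model name. The `Config` class shape, the `get_accuracy_upper_limit` method name, and every metric value below are illustrative assumptions, not copied from submission_checker.py.

```python
# Hypothetical base config: only some models define an upper limit.
# Metric names and numbers are placeholders, not official values.
BASE = {
    "accuracy-target": {
        "gptj-99": ("ROUGE1", 42.0),
        "stable-diffusion-xl": ("CLIP_SCORE", 31.0),
    },
    "accuracy-upper-limit": {
        "stable-diffusion-xl": ("CLIP_SCORE", 32.0),
    },
}


class Config:
    def __init__(self, base):
        self.base = base
        # Indexing works as long as the config defines the key at all,
        # even when the dictionary only covers a few models.
        self.accuracy_target = self.base["accuracy-target"]
        self.accuracy_upper_limit = self.base["accuracy-upper-limit"]

    def get_accuracy_upper_limit(self, model):
        # dict.get returns None for models that have no upper limit.
        return self.accuracy_upper_limit.get(model)


config = Config(BASE)
print(config.get_accuracy_upper_limit("stable-diffusion-xl"))  # ('CLIP_SCORE', 32.0)
print(config.get_accuracy_upper_limit("gptj-99"))  # None
```

If some config version omitted the key entirely, `self.base.get("accuracy-upper-limit", {})` in the constructor would cover that case as well, which is closer to what the review comment suggests.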

@nvzhihanj
Contributor

@pgmpablo157321 can you also change the loadgen version, as was done in these PRs?
https://github.com/mlcommons/inference/pull/1452/files
https://github.com/mlcommons/inference/pull/1440/files
Thanks!

"3d-unet-99.9",
"gptj-99",
"gptj-99.9",
"llama-v2-70b-99",
Contributor

I believe the model name should be llama2-70b according to the official website: https://huggingface.co/docs/transformers/main/model_doc/llama2

Contributor

Same for all occurrences

Contributor

I think we should change all to llama2. The llama-v2 name will cause confusion.
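
One way to apply the rename consistently is to rewrite the prefix wherever it appears in the model lists. This is only a sketch: the `rename_models` helper and the sample list are hypothetical, and the actual change in this PR may simply have been a search-and-replace.

```python
OLD_PREFIX = "llama-v2-"
NEW_PREFIX = "llama2-"


def rename_models(models):
    """Return a copy of models with the old Llama 2 prefix replaced."""
    return [
        NEW_PREFIX + name[len(OLD_PREFIX):] if name.startswith(OLD_PREFIX) else name
        for name in models
    ]


# Sample list mirroring the diff excerpt above.
models = ["3d-unet-99.9", "gptj-99", "gptj-99.9", "llama-v2-70b-99"]
renamed = rename_models(models)
print(renamed)  # ['3d-unet-99.9', 'gptj-99', 'gptj-99.9', 'llama2-70b-99']
```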

@pgmpablo157321 pgmpablo157321 force-pushed the submission_checker_4.0 branch 2 times, most recently from f7c755a to 14efc69 Compare January 16, 2024 22:44
Contributor

@arjunsuresh arjunsuresh left a comment

Thank you Pablo for the changes

@pgmpablo157321 pgmpablo157321 marked this pull request as ready for review January 16, 2024 23:29
@pgmpablo157321 pgmpablo157321 requested a review from a team as a code owner January 16, 2024 23:29
@nvzhihanj
Contributor

Thank you Pablo! Can we merge this ASAP so logs can be collected 🙏

@mrmhodak mrmhodak merged commit 8e36925 into mlcommons:master Jan 17, 2024
10 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Jan 17, 2024