-
Notifications
You must be signed in to change notification settings - Fork 216
Export example with dynamo=True and onnxscript fusion #1766
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
type: OnnxScriptFusion | ||
log_severity_level: 1 | ||
output_dir: llama3_output_model | ||
cache_dir: cache |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove cache?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes. There is a default location of cache. Cache is useful while working on a model. For example, it'd help if the user is rerunning the flow by just changing parameters of fusion pass then Olive will skip the conversion step and pick it up from the cache.
Do you see GQA fusion happening here? |
I didn't see it. I am checking with @shubhambhokare1 to see what's going on |
Describe your changes
Checklist before requesting a review
lintrunner -a
(Optional) Issue link