New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Introduce prescription for deepsparse inference engine from Neural Magic #18515
Introduce prescription for deepsparse inference engine from Neural Magic #18515
Conversation
a4122b9
to
5d9b392
Compare
ecdbd3d
to
8199b6c
Compare
8199b6c
to
697e36b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm unsure about the approach taken here. If the processor will be AVX512 enabled but not AVX2 enabled, the DeepSparseAVX2Sieve
will sieve all deepsparse<=0.7.0
and vice versa. It might be a good idea to merge these prescriptions for the desired functionality.
Perfectly make sense, I will merge them! |
697e36b
to
d40cf7d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm, thanks 👍🏻
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: fridex The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/hold |
I need to add AMD family and models https://en.wikichip.org/wiki/amd/cpuid, Zen 2 and Zen 3 |
4d6c4f0
to
0021dbd
Compare
0021dbd
to
2f36fcd
Compare
Signed-off-by: Francesco Murdaca <fmurdaca@redhat.com>
2f36fcd
to
d375a89
Compare
/unhold |
Signed-off-by: Francesco Murdaca fmurdaca@redhat.com
What type of PR is this?
/kind feature
Related issues or additional information of the supplied change
Related-To: neuralmagic/deepsparse#186
Description
Deepsparse engine to this version 0.7.0 only supports
avx2
andavx512
(andavx512
with optionalvnni
instructions): https://github.com/neuralmagic/deepsparse#hardware-supportThoth resolution engine will fail if a user is asking for a recommendation for a different CPU family/model or the build to deploy a model is happening in a cluster without correct cpu family/model.
cc @bnellnm