Introduce prescription for deepsparse inference engine from Neural Magic #18515

pacospace · 2021-10-07T09:41:02Z

Signed-off-by: Francesco Murdaca <fmurdaca@redhat.com>

What type of PR is this?

/kind feature

Related issues or additional information of the supplied change

Description

As of version 0.7.0, the DeepSparse engine supports only AVX2 and AVX-512 (AVX-512 optionally with VNNI instructions): https://github.com/neuralmagic/deepsparse#hardware-support

With this prescription, the Thoth resolution engine will fail if a user asks for a recommendation for an unsupported CPU family/model, or if the build that deploys a model runs in a cluster without a supported CPU family/model.
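As a rough illustration of the hardware requirement above, the following minimal sketch classifies a set of CPU flags into the instruction-set levels named in the description. It assumes Linux, where the `flags` line of `/proc/cpuinfo` lists `avx2`, `avx512f`, and `avx512_vnni`. The function names are hypothetical; this is not how the prescription or DeepSparse performs the check.

```python
from typing import Iterable, Optional, Set

# Flag names as they appear on the "flags" line of /proc/cpuinfo on Linux.
# Ordered from most to least capable level.
_LEVELS = (
    ("avx512_vnni", {"avx512f", "avx512_vnni"}),
    ("avx512", {"avx512f"}),
    ("avx2", {"avx2"}),
)


def best_instruction_set(flags: Iterable[str]) -> Optional[str]:
    """Return the most capable supported level, or None if none is present."""
    present = set(flags)
    for name, required in _LEVELS:
        if required <= present:
            return name
    return None


def read_cpu_flags(path: str = "/proc/cpuinfo") -> Set[str]:
    """Read the flags of the first processor entry (Linux only)."""
    with open(path, encoding="utf-8") as cpuinfo:
        for line in cpuinfo:
            if line.startswith("flags"):
                return set(line.split(":", 1)[1].split())
    return set()
```

On a Linux host, `best_instruction_set(read_cpu_flags())` returning `None` would correspond to the case in which the resolution is expected to fail.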

cc @bnellnm

fridex

I'm unsure about the approach taken here. If the processor is AVX512-enabled but not AVX2-enabled, the DeepSparseAVX2Sieve will sieve all deepsparse<=0.7.0, and vice versa. It might be a good idea to merge these prescriptions to get the desired functionality.

prescriptions/de_/deepsparse/deepsparse_avx2.yaml

pacospace · 2021-10-07T12:44:53Z

> I'm unsure about the approach taken here. If the processor is AVX512-enabled but not AVX2-enabled, the DeepSparseAVX2Sieve will sieve all deepsparse<=0.7.0, and vice versa. It might be a good idea to merge these prescriptions to get the desired functionality.

That makes perfect sense, I will merge them!
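The concern raised in review can be illustrated with a small toy model. It is only a sketch: it uses plain feature labels (`"avx2"`, `"avx512"`) rather than the actual prescription schema, and all function names are hypothetical. Two independent sieves each remove deepsparse<=0.7.0 when their own feature is missing, so together they require both features, whereas a merged sieve removes the package only when neither feature is present.

```python
from typing import Callable, FrozenSet, List, Tuple

Package = Tuple[str, Tuple[int, int, int]]  # (name, version)
Sieve = Callable[[List[Package], FrozenSet[str]], List[Package]]


def is_affected(package: Package) -> bool:
    """True for deepsparse<=0.7.0, the range discussed in the review."""
    name, version = package
    return name == "deepsparse" and version <= (0, 7, 0)


def single_feature_sieve(feature: str) -> Sieve:
    """Toy sieve: drop affected packages if this one feature is missing."""
    def sieve(packages, cpu_features):
        if feature in cpu_features:
            return list(packages)
        return [p for p in packages if not is_affected(p)]
    return sieve


def merged_sieve(features: FrozenSet[str]) -> Sieve:
    """Toy sieve: drop affected packages only if no listed feature is present."""
    def sieve(packages, cpu_features):
        if features & cpu_features:
            return list(packages)
        return [p for p in packages if not is_affected(p)]
    return sieve


def resolve(sieves, packages, cpu_features):
    """Apply every sieve in turn, as a resolver pipeline would."""
    for sieve in sieves:
        packages = sieve(packages, cpu_features)
    return packages


candidates = [("deepsparse", (0, 7, 0)), ("numpy", (1, 21, 2))]
avx512_only = frozenset({"avx512"})

separate = resolve(
    [single_feature_sieve("avx2"), single_feature_sieve("avx512")],
    candidates,
    avx512_only,
)
merged = resolve(
    [merged_sieve(frozenset({"avx2", "avx512"}))],
    candidates,
    avx512_only,
)
```

With the two separate sieves, `deepsparse` is removed for the AVX512-only processor even though it is supported there; the merged sieve keeps it.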

fridex

lgtm, thanks 👍🏻

sesheta · 2021-10-07T13:03:22Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: fridex

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [fridex]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

pacospace · 2021-10-07T13:05:30Z

/hold

pacospace · 2021-10-07T13:08:36Z

/hold

I need to add AMD family and models https://en.wikichip.org/wiki/amd/cpuid, Zen 2 and Zen 3

Signed-off-by: Francesco Murdaca <fmurdaca@redhat.com>
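For context on the family/model matching mentioned above, here is a minimal sketch of how the vendor, CPU family, and model can be read from `/proc/cpuinfo`-style text on Linux (`vendor_id`, `cpu family`, and `model` are the field names used there). The function name is hypothetical, and the sample values below are illustrative only; they are not taken from this pull request.

```python
from typing import Dict, Optional, Union

Value = Optional[Union[str, int]]


def parse_cpu_identity(cpuinfo_text: str) -> Dict[str, Value]:
    """Extract vendor_id, cpu family and model of the first processor entry."""
    wanted = {"vendor_id": str, "cpu family": int, "model": int}
    identity: Dict[str, Value] = {key: None for key in wanted}
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        key = key.strip()
        # "model name" is a different key and is deliberately ignored.
        if sep and key in wanted and identity[key] is None:
            identity[key] = wanted[key](value.strip())
    return identity


# Illustrative sample only; real values come from /proc/cpuinfo on the host.
SAMPLE = (
    "processor\t: 0\n"
    "vendor_id\t: AuthenticAMD\n"
    "cpu family\t: 23\n"
    "model\t\t: 49\n"
    "model name\t: Example CPU\n"
)
sample_identity = parse_cpu_identity(SAMPLE)
```

The resulting family and model numbers are the kind of values that can be compared against the CPUID tables linked above.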

pacospace · 2021-10-08T14:32:57Z

/unhold

sesheta added the kind/feature label (categorizes issue or PR as related to a new feature) Oct 7, 2021

sesheta requested review from harshad16 and KPostOffice October 7, 2021 09:41

sesheta added the size/M label (denotes a PR that changes 30-99 lines, ignoring generated files) Oct 7, 2021

pacospace requested a review from fridex October 7, 2021 09:41

pacospace force-pushed the add-deepsparse-prescription branch 2 times, most recently from a4122b9 to 5d9b392 October 7, 2021 09:43

pacospace changed the title ~~Introduce prescription for deepsparse inference engine from Neural Magic~~ WIP: Introduce prescription for deepsparse inference engine from Neural Magic Oct 7, 2021

sesheta added the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) Oct 7, 2021

pacospace force-pushed the add-deepsparse-prescription branch 3 times, most recently from ecdbd3d to 8199b6c October 7, 2021 09:55

pacospace changed the title ~~WIP: Introduce prescription for deepsparse inference engine from Neural Magic~~ Introduce prescription for deepsparse inference engine from Neural Magic Oct 7, 2021

sesheta removed the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) Oct 7, 2021

pacospace force-pushed the add-deepsparse-prescription branch from 8199b6c to 697e36b October 7, 2021 10:33

fridex reviewed Oct 7, 2021

prescriptions/de_/deepsparse/deepsparse_avx2.yaml (3 review comments, outdated)

pacospace force-pushed the add-deepsparse-prescription branch from 697e36b to d40cf7d October 7, 2021 12:51

fridex approved these changes Oct 7, 2021

sesheta added the approved label (indicates a PR has been approved by an approver from all required OWNERS files) Oct 7, 2021

sesheta added the do-not-merge/hold label (indicates that a PR should not merge because someone has issued a /hold command) Oct 7, 2021

pacospace mentioned this pull request Oct 7, 2021

Extend Elyra tutorial with Neural Magic version. AICoE/elyra-aidevsecops-tutorial#297

Closed

7 tasks

pacospace force-pushed the add-deepsparse-prescription branch 2 times, most recently from 4d6c4f0 to 0021dbd October 7, 2021 15:34

sesheta added the size/L label (denotes a PR that changes 100-499 lines, ignoring generated files) and removed the size/M label (denotes a PR that changes 30-99 lines, ignoring generated files) Oct 7, 2021

pacospace force-pushed the add-deepsparse-prescription branch from 0021dbd to 2f36fcd October 7, 2021 15:35

pacospace requested a review from fridex October 7, 2021 15:35

Introduce prescription for deepsparse inference engine from Neural Magic

d375a89

Signed-off-by: Francesco Murdaca <fmurdaca@redhat.com>

pacospace force-pushed the add-deepsparse-prescription branch from 2f36fcd to d375a89 October 7, 2021 15:36

fridex mentioned this pull request Oct 7, 2021

Use higher level abstraction for CPU features thoth-station/adviser#2114

Closed

sesheta removed the do-not-merge/hold label (indicates that a PR should not merge because someone has issued a /hold command) Oct 8, 2021

sesheta merged commit 78e7c5e into thoth-station:master Oct 8, 2021

fridex mentioned this pull request Oct 12, 2021

sprint production release for 2021.10.25 thoth-station/thoth-application#2033

Closed
