Add Ascend backend support #11477

wangxiyuan · 2022-05-10T03:44:46Z

Is your feature request related to a problem? Please describe.

Base on the processor, Huawei build a series AI related hardwares which shown in blue rectangles. They’re called Atlas. Here I’d like to say more abort Atlas 300. It’s a kind of PCI card and used widely on data/ai process servers. Our develop and test work is base on it as well.

Then, base on the hardware, Ascend ecosystem also provides a software layer called CANN. It’s the yellow rectangles in the picture. CANN provides APIs to help developers quickly build AI applications and services based on the Ascend platform. It’s similar with CUDA in Nvidia ecosystem.

In ONNX case, users need convert it to Ascend model first using a transport tool called ATC . It's a little complex. And sometimes, the performance may be poor or accuracy maybe drops.

It's good that if onnxruntime can support Ascend processor as a backend. If so, users can uses onnxruntime on Ascend processor directly.

For software, CANN is the main point that both developer and AI framework should know. Let’s focus on CANN. This is the CANN Technical Stack view in Ascend ecosystem. Last year, my colleague, zhipeng had shared the CANN stack already in the onnx meetup. Well, it was based on CANN 3.0 version which is out of date. The picture here shows the newest version called CANN 5.0. As you can see, there are multi layer in CANN. It contains service layer, compilation layer ,execution layer and the base layer. For example. service layer provide operator library, optimization engine and framework adapter.

In general, developers do not need to know them. You need only focus on Ascend Computing Language, ie ACL. It’s the APIs part to help you control Ascend hardware via CANN.

System information

ONNX Runtime version (you are using):
ONNX Runtime master branch

Describe the solution you'd like

Currently, if a user want to run onnx model on Ascend hardware, he should first use the model translation tool provided by CANN to translate the model from onnx to ascend . The flow is a little complex. And the translated model may lost some precision, and the performance may poor. Even in some case, the model may can’t work correctly.

To solve the problem, a better way is find a way that onnx model can work on Ascend directly. So In onnxruntime, we’d like to add CANN as a new execution provider. Once it’s done, users can use onnx model on Ascend hardware via onnxruntime. Of cause, we’ll add the related CI as well. For example, we can donate VM resoucres which contains Ascend hardware to the community.

The line below the our roadmap. First we’ll push the basic code to upstream. The end to end flow will be done in it. And the ResNet model should work correctly on CANN EP. At the end of year, we’ll finish all the onnx operator support and make sure all the models in onnx model zoo works well on Ascend.

In the next year, we’ll focus on optimizing work. Like performance improvement and so on

Basing the Execution provider mechanism in onnxruntime. It's easy to integrate Ascend processor as a new EP in onnxruntime.

The new EP can be named as CANN. CANN is the AI-oriented heterogeneous compute architecture in Ascend ecosystem. It provides hierarchical APIs to help users quickly build AI applications. Frankly speaking, it's similar with CUDA in GPU ecosystem.

Additionally, we'll add the CI supports as well. We can donate the VM which supports Ascend processor to onnxruntime CI system. Then the community can keep testing the new EP CANN easily.

We hope that the community can accept this feature request. Wish to get your feedback.

Thanks.

Describe alternatives you've considered
Use the library provied by Ascend without using onnxruntime

Additional context
Ascend official website
CANN

The text was updated successfully, but these errors were encountered:

KnightYao · 2022-05-10T11:12:31Z

you think too much

wangxiyuan · 2022-05-11T01:00:06Z

you think too much

what do you mean?

wangxiyuan · 2022-05-13T02:42:42Z

This basic PR is ready for review now.

There are about 10 operators are added in the PR. With this change, the ResNet-v1.12 can runs well on Ascend backend with onnxruntime.

Any committer can take a look? Thanks

I'd like to know what should I do to push it forward.

About the test environment, We can donate ascend based VM to community as well.

**Description**: This PR adds Ascend CANN execution provider support. **Motivation and Context** - Why is this change required? What problem does it solve? As the info shown in the issue. CANN is the API layer for Ascend processor. Add CANN EP can allow user run onnx model on Ascend hardware via onnxruntime The detail change: 1. Added CANN EP framework. 2. Added the basic operators to support ResNet and VGG model. 3. Added C/C++、Python API support - If it fixes an open issue, please link to the issue here. #11477 Author: lijiawei <lijiawei19@huawei.com> wangxiyuan <wangxiyuan1007@gmail.com> Co-authored-by: FFrog <ljw1101.vip@gmail.com>

johnnynunez · 2023-11-28T11:42:31Z

how is going on?

FFFrog · 2023-11-29T03:56:41Z

how is going on?

Hey! Refer to the doc first if you have any questions, please. And CI releated is here

ashbhandare added the feature request request for unsupported feature or enhancement label May 10, 2022

wangxiyuan mentioned this issue Aug 2, 2022

Add CANN EP #12416

Merged

github-actions bot added ep:ACL issues related to ACL execution provider ep:CUDA issues related to the CUDA execution provider labels Jun 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Ascend backend support #11477

Add Ascend backend support #11477

wangxiyuan commented May 10, 2022 •

edited

KnightYao commented May 10, 2022

wangxiyuan commented May 11, 2022

wangxiyuan commented May 13, 2022 •

edited

johnnynunez commented Nov 28, 2023

FFFrog commented Nov 29, 2023 •

edited

Add Ascend backend support #11477

Add Ascend backend support #11477

Comments

wangxiyuan commented May 10, 2022 • edited

KnightYao commented May 10, 2022

wangxiyuan commented May 11, 2022

wangxiyuan commented May 13, 2022 • edited

johnnynunez commented Nov 28, 2023

FFFrog commented Nov 29, 2023 • edited

wangxiyuan commented May 10, 2022 •

edited

wangxiyuan commented May 13, 2022 •

edited

FFFrog commented Nov 29, 2023 •

edited