feat: Support WasmEdge and its ggml plugin on ARM NPU #3411

Open
alabulei1 opened this issue May 17, 2024 · 2 comments
Labels
c-WASI-NN · enhancement (New feature or request) · help wanted (Extra attention is needed)

Comments

@alabulei1
Contributor

Summary

One of the advantages of using WasmEdge as an LLM inference runtime is that it is portable across different CPUs and GPUs, so supporting more chips matters.

ARM NPUs are popular AI accelerators that WasmEdge should support.

Details

Support running LLM inference with WasmEdge on ARM NPU
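For context, the ggml plugin is reached from Wasm guest code through the WASI-NN interface, so an ARM NPU backend would ideally slot in behind the same API with no changes to applications. Below is a rough sketch of what such guest code looks like today, modeled on the existing wasi-nn ggml examples; the `wasmedge-wasi-nn` crate usage and the `"default"` model alias are assumptions, and NPU selection would presumably happen on the host side (e.g. in the plugin) rather than in this code.

```rust
use wasmedge_wasi_nn::{ExecutionTarget, GraphBuilder, GraphEncoding, TensorType};

fn main() {
    let prompt = "Once upon a time";

    // Load the model registered on the host side
    // (e.g. via --nn-preload default:GGML:AUTO:model.gguf).
    let graph = GraphBuilder::new(GraphEncoding::Ggml, ExecutionTarget::AUTO)
        .build_from_cache("default")
        .expect("failed to load the ggml graph");
    let mut ctx = graph
        .init_execution_context()
        .expect("failed to create an execution context");

    // Feed the prompt as a UTF-8 byte tensor and run inference.
    ctx.set_input(0, TensorType::U8, &[1], prompt.as_bytes())
        .expect("failed to set input");
    ctx.compute().expect("inference failed");

    // Read back the generated text.
    let mut output = vec![0u8; 4096];
    let size = ctx.get_output(0, &mut output).expect("failed to get output");
    println!("{}", String::from_utf8_lossy(&output[..size]));
}
```

If the NPU backend is exposed the same way, only the host-side plugin and model preload would change, which is what makes this feature attractive for portability.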

Appendix

No response

alabulei1 added the enhancement, help wanted, and c-WASI-NN labels on May 17, 2024
@Wck-iipi
Contributor

I would like to work on this issue. I would be grateful if you could point me to some references. Thanks.

@hangedfish
Collaborator

I think we can start with the Rockchip RK3588 SoC, which has become a popular chip recently. It supports 32 GB of memory, which is enough for LLM inference. There are also many SBC (single-board computer) products available for testing, such as the Radxa ROCK 5B/5C and the Orange Pi 5 Plus.

https://github.com/airockchip/rknn-toolkit2

Good luck
