
Implement optimizations as in ane_transformers #8

Open · dimitry12 opened this issue Feb 12, 2023 · 2 comments
dimitry12 commented Feb 12, 2023

ane_transformers (https://github.com/apple/ml-ane-transformers and https://machinelearning.apple.com/research/neural-engine-transformers) suggests weight-compatible changes to transformer models that let their ops map better onto the Apple Neural Engine (ANE), resulting in significant performance improvements.
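For context, one of the weight-compatible rewrites described in Apple's article is replacing nn.Linear with an equivalent 1×1 nn.Conv2d, so activations can flow through the model in the ANE-friendly (B, C, 1, S) layout instead of (B, S, C). Here's a minimal sketch of that idea; the `linear_to_conv2d` helper is hypothetical, not an ane_transformers or Exporters API:

```python
import torch
import torch.nn as nn

def linear_to_conv2d(linear: nn.Linear) -> nn.Conv2d:
    """Rewrite an nn.Linear as an equivalent 1x1 nn.Conv2d.

    A Linear weight of shape (out, in) reshapes to a Conv2d weight of
    shape (out, in, 1, 1), so the swap is weight-compatible.
    """
    conv = nn.Conv2d(
        in_channels=linear.in_features,
        out_channels=linear.out_features,
        kernel_size=1,
        bias=linear.bias is not None,
    )
    with torch.no_grad():
        conv.weight.copy_(linear.weight[:, :, None, None])
        if linear.bias is not None:
            conv.bias.copy_(linear.bias)
    return conv

# Sanity check: both modules compute the same projection.
linear = nn.Linear(64, 128)
conv = linear_to_conv2d(linear)
x = torch.randn(2, 10, 64)                # (B, S, C) as in standard transformers
x_ane = x.transpose(1, 2).unsqueeze(2)    # (B, C, 1, S) as preferred by the ANE
out_conv = conv(x_ane).squeeze(2).transpose(1, 2)
assert torch.allclose(linear(x), out_conv, atol=1e-5)
```

Because the Conv2d weight is just the Linear weight reshaped, a pretrained checkpoint can be loaded into the rewritten model without retraining.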

@hollance do you think these optimizations "belong" in 🤗 Exporters? If yes, how do you envision their implementation: within the CoreMLConfig abstraction or somewhere else?

hollance (Contributor) commented
Good question! Hugging Face has a library, Optimum, that can do optimizations such as pruning and quantization. It seems to me that these kinds of optimizations that require "model surgery" really belong in Optimum, but I'm not aware of any plans to add these particular optimizations. But it's definitely something worth considering (and it could be prototyped in Exporters).

dimitry12 (Author) commented
> these kinds of optimizations that require "model surgery" really belong in Optimum, but I'm not aware of any plans to add these particular optimizations. But it's definitely something worth considering (and it could be prototyped in Exporters)

Makes sense! I can't commit to a particular timeline, but I definitely plan to work in this direction. If that's OK, I'll keep this issue open to post updates.
