Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

llama model is not fully lowered to ANE (coreml backend) #4091

Open
cccclai opened this issue Jun 29, 2024 · 0 comments
Open

llama model is not fully lowered to ANE (coreml backend) #4091

cccclai opened this issue Jun 29, 2024 · 0 comments
Labels
module: coreml Issues related to Apple's Core ML delegation

Comments

@cccclai
Copy link
Contributor

cccclai commented Jun 29, 2024

As title, the model definition is here: https://github.com/pytorch/executorch/blob/main/examples/models/llama2/llama_transformer.py

There are two parts not lowered to ANE, including embedding and kv cache update

@cccclai cccclai added the module: coreml Issues related to Apple's Core ML delegation label Jun 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: coreml Issues related to Apple's Core ML delegation
Projects
None yet
Development

No branches or pull requests

1 participant