
AttributeError: 'LlamaModel' object has no attribute '_use_flash_attention_2' #10

Open
raghavgarg97 opened this issue May 20, 2024 · 1 comment

Comments

@raghavgarg97

raghavgarg97 commented May 20, 2024

I was running speedup.sh with a Llama model but got the following error trace.

[Screenshot (2024-05-20): traceback ending in AttributeError: 'LlamaModel' object has no attribute '_use_flash_attention_2']

The error comes from the file Consistency_LLM/cllm/cllm_llama_modeling.py, in this check:

if self.model._use_flash_attention_2:

The code needs to be updated to:

if self.model.config._attn_implementation == 'flash_attention_2':

Do I need to change the model config to check the speed of the base model with Jacobi iteration?
base model = "meta-llama/Meta-Llama-3-8B-Instruct"
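For reference, a minimal sketch of a version-tolerant check (not the repository's actual code; the helper name uses_flash_attention_2 is hypothetical) that reads config._attn_implementation when present and falls back to the older private flag otherwise:

```python
def uses_flash_attention_2(model) -> bool:
    """Return True if the loaded model is running with flash-attention-2.

    Newer transformers releases record the choice on config._attn_implementation;
    older releases set a private _use_flash_attention_2 flag on the model itself.
    """
    config = getattr(model, "config", None)
    attn_impl = getattr(config, "_attn_implementation", None) if config is not None else None
    if attn_impl is not None:
        return attn_impl == "flash_attention_2"
    # Fall back to the legacy private attribute for older transformers versions.
    return bool(getattr(model, "_use_flash_attention_2", False))
```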

@snyhlxde1
Collaborator

Did you use the package versions we provided in requirements.txt? If not, which PyTorch and transformers versions are you using?
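A quick, generic way to report those versions for comparison against requirements.txt:

```python
# Print the installed torch and transformers versions.
import torch
import transformers

print("torch:", torch.__version__)
print("transformers:", transformers.__version__)
```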
