forked from huggingface/optimum-habana
Encapsulate FSDPA in GaudiLlamaAttention #129
Merged
Conversation
ulivne previously requested changes on Mar 21, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
0a09511 to d584181
MrGeva approved these changes on Mar 24, 2024
dudilester added a commit that referenced this pull request on Mar 31, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
astachowiczhabana pushed a commit that referenced this pull request on Apr 5, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
astachowiczhabana pushed a commit that referenced this pull request on Apr 5, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
astachowiczhabana pushed a commit that referenced this pull request on Apr 19, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
astachowiczhabana pushed a commit that referenced this pull request on Apr 22, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
astachowiczhabana pushed a commit that referenced this pull request on Apr 24, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
astachowiczhabana pushed a commit that referenced this pull request on Apr 24, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
dudilester added a commit that referenced this pull request on May 7, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
dudilester added a commit that referenced this pull request on May 8, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
dudilester added a commit that referenced this pull request on May 13, 2024
* Done to allow quantization using HQT * Added use_flash_attention and flash_attention_recompute to run_lm_eval
Summary of changes:
* Encapsulated FSDPA in GaudiLlamaAttention to allow quantization using HQT
* Added use_flash_attention and flash_attention_recompute to run_lm_eval
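For context on the first bullet: HQT (the Habana Quantization Toolkit) quantizes a model by patching `torch.nn.Module` instances, so a fused scaled-dot-product-attention (FSDPA) kernel that is only called functionally inside the attention forward is invisible to it. The sketch below illustrates the encapsulation pattern named in the PR title, assuming `FusedSDPA` is importable from `habana_frameworks.torch.hpex.kernels`; the class and attribute names (`ModuleFusedSDPA`, `fused_scaled_dot_product_attention`), the simplified attention class, and the exact argument lists are illustrative assumptions, not necessarily the code merged in this PR.

```python
import torch

try:
    # Gaudi fused scaled-dot-product-attention kernel (assumed import path).
    from habana_frameworks.torch.hpex.kernels import FusedSDPA
except ImportError:
    FusedSDPA = None


class ModuleFusedSDPA(torch.nn.Module):
    """nn.Module wrapper around the FSDPA kernel.

    HQT patches nn.Module instances, so holding the kernel in a registered
    submodule (instead of calling FusedSDPA.apply() inline) makes the op
    visible to, and quantizable by, the toolkit.
    """

    def __init__(self, fused_sdpa=FusedSDPA):
        super().__init__()
        self._hpu_kernel_fsdpa = fused_sdpa

    def forward(self, *args, **kwargs):
        # Pass-through: the exact argument list (query, key, value, attn_mask,
        # dropout_p, is_causal, ...) is whatever the installed kernel expects.
        return self._hpu_kernel_fsdpa.apply(*args, **kwargs)


class GaudiLlamaAttentionSketch(torch.nn.Module):
    """Simplified stand-in for GaudiLlamaAttention showing the encapsulation."""

    def __init__(self):
        super().__init__()
        # Registered as a submodule so HQT sees (and can quantize) the FSDPA op.
        self.fused_scaled_dot_product_attention = (
            ModuleFusedSDPA() if FusedSDPA is not None else None
        )

    def forward(self, query, key, value, attention_mask=None, use_flash_attention=False):
        if use_flash_attention and self.fused_scaled_dot_product_attention is not None:
            # Argument list is illustrative; match the FusedSDPA version in use.
            return self.fused_scaled_dot_product_attention(
                query, key, value, attention_mask, 0.0, False, None
            )
        # Eager fallback when the fused kernel is unavailable or disabled.
        return torch.nn.functional.scaled_dot_product_attention(
            query, key, value, attn_mask=attention_mask
        )


if __name__ == "__main__":
    # Runs anywhere: on a machine without the Habana stack it takes the eager path.
    attn = GaudiLlamaAttentionSketch()
    q = k = v = torch.randn(1, 8, 16, 64)  # (batch, heads, seq_len, head_dim)
    print(attn(q, k, v).shape)
```

For the second bullet, the two new options would be passed to the `run_lm_eval.py` example like any other generation flag, e.g. `python run_lm_eval.py --model_name_or_path <model> --bf16 --use_flash_attention --flash_attention_recompute`; flags other than the two added here are assumed from the existing text-generation examples.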