
filter flash_attn optional imports loading remote code #30954

Merged

Conversation

eaidova
Contributor

@eaidova eaidova commented May 22, 2024

What does this PR do?

In the code of some models available via remote code, the flash_attn module is optionally imported without a try-except block.
Examples:
phi3-vision - https://huggingface.co/microsoft/Phi-3-vision-128k-instruct/blob/main/modeling_phi3_v.py#L52
orion-14b - https://huggingface.co/OrionStarAI/Orion-14B-Chat/blob/main/modeling_orion.py#L36
deepseek-moe - https://huggingface.co/deepseek-ai/deepseek-moe-16b-base/blob/main/modeling_deepseek.py#L54
nanoLLaVA - https://huggingface.co/qnguyen3/nanoLLaVA/blob/main/modeling_llava_qwen2.py#L861

Loading such a model in an environment where the flash_attn package is not installed fails when the trust_remote_code flag is set. Installing and importing this package can be problematic in some environments (e.g. CPU-only environments without CUDA, where torch is installed for CPU only). This PR updates the dependency-search logic used when checking dynamic modules for loading.
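The fix boils down to stripping such guarded imports from the remote module's source before its dependencies are collected. A minimal sketch of the idea (the function name and the exact regex here are illustrative, not the merged implementation):

```python
import re


def filter_flash_attn_imports(content: str) -> str:
    """Drop `from flash_attn ...` imports guarded by an
    is_flash_attn_*_available() check, so flash_attn is not treated
    as a required dependency of the remote module."""
    return re.sub(
        r"if is_flash_attn_[0-9]+_available\(\):\s*(from flash_attn\s*.*\s*)+",
        "",
        content,
        flags=re.MULTILINE,
    )


remote_source = (
    "if is_flash_attn_2_available():\n"
    "    from flash_attn import flash_attn_func\n"
    "\n"
    "x = 1\n"
)
# The guarded import block is removed; the rest of the module is untouched.
print(filter_flash_attn_imports(remote_source))
```

With the guard stripped, the dependency scanner no longer sees `flash_attn` as a required package, so the model can be loaded on CPU-only machines.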

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@amyeroberts
Collaborator

cc @Rocketknight1 as I think you were working on a related issue

@andrei-kochin

@amyeroberts @Rocketknight1 do you have any good news for us here?

@huggingface huggingface deleted a comment from github-actions bot Jun 28, 2024
@itikhono

itikhono commented Jul 2, 2024

@amyeroberts @Rocketknight1 Hi! any updates on this?
we are working on improving SDPA operation support on openvino side, and using these models for testing our changes:

phi3-vision - https://huggingface.co/microsoft/Phi-3-vision-128k-instruct/blob/main/modeling_phi3_v.py#L52
orion-14b - https://huggingface.co/OrionStarAI/Orion-14B-Chat/blob/main/modeling_orion.py#L36
deepseek-moe - https://huggingface.co/deepseek-ai/deepseek-moe-16b-base/blob/main/modeling_deepseek.py#L54
nanoLLaVA - https://huggingface.co/qnguyen3/nanoLLaVA/blob/main/modeling_llava_qwen2.py#L861

But unfortunately we encountered the same issue as described in the ticket. Do you have any plans to merge the fix?

@amyeroberts
Collaborator

Gentle ping @Rocketknight1

Comment on lines 152 to 153
# filter out imports under is_flash_attn_2_available block for avoid import issues in cpu only environment
content = re.sub(r"if is_flash_attn_*available\(\):\s*(from flash_attn\s*.*\s*)+", "", content, flags=re.MULTILINE)

@CuriousPanCake CuriousPanCake Aug 2, 2024


I believe the regex can be improved to r"if is_flash_attn_[0-9]+_available\(\):\s*(from flash_attn\s*.*\s*)+" to capture the import properly.
I was trying to use this change locally for OrionStarAI/Orion-14B-Chat and the regex did not match the flash_attn imports, hence the imports were not removed.
Most probably the same holds for the other models mentioned in the ticket, as the import strings are the same.
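The distinction matters because `_*` in the original pattern matches only a run of underscores, so a digit such as the `2` in `is_flash_attn_2_available` breaks the match. A quick check of the two pattern variants discussed here:

```python
import re

# Pattern from the PR as originally written vs. the suggested fix.
old_pattern = r"if is_flash_attn_*available\(\):"
new_pattern = r"if is_flash_attn_[0-9]+_available\(\):"

guard = "if is_flash_attn_2_available():"

# `_*` only matches underscores, so the digit `2` prevents a match.
print(bool(re.search(old_pattern, guard)))  # False
print(bool(re.search(new_pattern, guard)))  # True
```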

Collaborator


Let's try to make it work for as many as possible! AST may be of help here as well.

Contributor Author


Sorry, but I lost all motivation to finish this PR after 3 months of silence... I'm not good with regular expressions, and I have no time to explore all models on the hub to find all possible patterns...

I'll apply the suggestion from @CuriousPanCake, but I can't promise to look at anything beyond that unless you have specific suggestions.

Collaborator


Sorry on our side for the late review; I'll help you get this merged. It already fixes most of the models, which is good IMO!

@eaidova eaidova force-pushed the ea/avoid_flash_attn_remote_code branch from c29889a to 844ca87 on August 6, 2024 08:19
@ArthurZucker
Collaborator

Ping me if you need help to fix the CI / a review 🤗

@eaidova
Contributor Author

eaidova commented Aug 7, 2024

@ArthurZucker thank you for the review. I fixed the code-style CI issue, but I have no idea about the flax tests (I do not think they are in any way related to my changes).

Member

@Rocketknight1 Rocketknight1 left a comment


Hey! The fault for the extremely slow review is mine, and I'm sorry, especially because this issue affected other users besides you, and I really should have gotten it merged before now!

The tl;dr is that I think it's ready for merge as-is, because the only relevant test in our codebase is is_flash_attn_2_available. However, I've suggested a change that should catch anything that looks like is_flash_attn[...]available, which should cover us if any custom code is calling something similar, but without causing unwanted side effects.

Let me know if you're happy with the change, and then we'll try to merge this ASAP - you definitely don't need to trawl through the code after having to wait all this time. Thank you for the PR!
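For illustration, a broadened guard pattern in the spirit of that suggestion (the character class here is an assumption for the sketch, not necessarily the exact merged regex) would accept any call of the form `is_flash_attn<...>available()`:

```python
import re

# Hypothetical broadened pattern: match anything that looks like
# is_flash_attn<something>available(), not only numbered variants.
pattern = r"if is_flash_attn[a-zA-Z0-9_]*available\(\):"

for guard in (
    "if is_flash_attn_available():",
    "if is_flash_attn_2_available():",
    "if is_flash_attn_greater_or_equal_2_10():",  # no trailing "available"
):
    print(guard, "->", bool(re.match(pattern, guard)))
```

The third guard does not end in `available()`, so a pattern scoped this way leaves unrelated version checks alone, avoiding the unwanted side effects mentioned above.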

Review comment on src/transformers/dynamic_module_utils.py (outdated, resolved)
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
@Rocketknight1
Member

@eaidova thanks for committing the suggestion, merging now!

@Rocketknight1 Rocketknight1 merged commit cc832cb into huggingface:main Aug 8, 2024
21 checks passed
@eaidova eaidova deleted the ea/avoid_flash_attn_remote_code branch August 18, 2024 04:41