Add aya #36521

ArthurZucker · 2025-03-03T20:54:48Z

What does this PR do?

Adds the Aya Vision model.

ArthurZucker

LGTM

ArthurZucker · 2025-03-03T21:37:09Z

src/transformers/models/aya_vision/modeling_aya_vision.py

@yonigozlan todo refactor with modular!

saurabhdash2512 · 2025-03-03T21:47:55Z

@ArthurZucker @yonigozlan Thank you so much for all your help! Y'all legends!

HuggingFaceDocBuilderDev · 2025-03-03T22:17:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

* initial commit
* small fix
* move stuff to image processing file
* remove stuff in validate turn and fix return tensor
* remove liquid stuff
* in the process of addressing comments
* changes to get the right tokenization
* new __init__ works
* fixing default std and mean
* works
* small testing script -- to be deleted before merge
* remove redundant code
* addressing comments
* fix inits, add docs templates
* refactor processor, switch to gotocr image processor
* remove image proc from init
* refactor to working llava-style architecture
* Change AyaVisionModel to AyaVisionForConditionalGeneration
* add tests
* fixups
* update doc
* Adding logits_to_keep explicitly in ayavision forward to enable compatibility with cohere model
* better variable names + remove code paths
* Updates to aya_vision.md
* address comments
* adding copied from
* make style and remove unused projector_hidden_act from config
* sort init
* include usage of fast image proc and proc on cuda in doc
* update checkpoint in test processor
* update checkpoint in test processor 2
* remove test_model and update docstring
* skip failing tests

---------

Co-authored-by: Saurabh Dash <saurabh@cohere.com>
Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>
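One commit above makes `logits_to_keep` an explicit argument of the Aya Vision forward so that it is passed on to the Cohere language model. The snippet below is only a rough sketch of that idea, not the code from this PR: `toy_language_model` and `toy_vision_wrapper_forward` are hypothetical names, plain Python lists stand in for tensors, and the slicing rule (an integer keeps the last N positions, `0` keeps all of them, a list selects explicit positions) is an assumption about the usual convention.

```python
def toy_language_model(hidden_states, logits_to_keep=0):
    """Hypothetical stand-in for a decoder that only projects the kept positions."""
    if isinstance(logits_to_keep, int):
        # slice(-0, None) equals slice(0, None), so 0 keeps every position.
        kept = hidden_states[slice(-logits_to_keep, None)]
    else:
        # Otherwise treat the argument as explicit position indices.
        kept = [hidden_states[i] for i in logits_to_keep]
    # sum() is a placeholder for the real output projection.
    return [sum(vector) for vector in kept]


def toy_vision_wrapper_forward(hidden_states, logits_to_keep=0, **kwargs):
    """Hypothetical wrapper forward: names the argument and passes it through."""
    return toy_language_model(hidden_states, logits_to_keep=logits_to_keep, **kwargs)


hidden = [[1, 2], [3, 4], [5, 6]]
all_positions = toy_vision_wrapper_forward(hidden)
last_position = toy_vision_wrapper_forward(hidden, logits_to_keep=1)
picked_positions = toy_vision_wrapper_forward(hidden, logits_to_keep=[0, 2])
```

Naming the argument in the wrapper signature, rather than leaving it inside `**kwargs`, keeps the wrapper's interface aligned with the inner model's, which is presumably the compatibility the commit message refers to.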

saurabhdash2512 and others added 27 commits March 1, 2025 18:23

initial commit

82bce70

small fix

2318466

move stuff to image processing file

8ab75e3

remove stuff in validate turn and fix return tensor

8a704f3

remove liquid stuff

de98227

in the process of addressing comments

d976592

changes to get the right tokenization

df39794

new __init__ works

0ce3008

fixing default std and mean

34d9de7

works

3da8472

small testing script -- to be deleted before merge

0c7c349

remove redundant code

c1d3071

addressing comments

52dba3a

fix inits, add docs templates

5ee2bad

refactor processor, switch to gotocr image processor

80aaaa6

remove image proc from init

4890c35

refactor to working llava-style architecture

062756a

Change AyaVisionModel to AyaVisionForConditionalGeneration

7af4551

add tests

8403b2c

fixups

78788b9

update doc

cabebe7

Adding logits_to_keep explicitly in ayavision forward to enable compatibility with cohere model

156762f

better variable names + remove code paths

a8a6f71

Updates to aya_vision.md

0634d3b

address comments

24f4b3b

adding copied from

05bec86

Merge branch 'main' into add-aya

f960476

ArthurZucker added the New model label Mar 3, 2025

yonigozlan added 2 commits March 3, 2025 21:13

make style and remove unused projector_hidden_act from config

3bc9178

sort init

4df1aab

yonigozlan marked this pull request as ready for review March 3, 2025 21:20

yonigozlan approved these changes Mar 3, 2025

include usage of fast image proc and proc on cuda in doc

085bd78

ArthurZucker commented Mar 3, 2025

update checkpoint in test processor

c9bcd6f

yonigozlan added 2 commits March 3, 2025 21:48

update checkpoint in test processor 2

8a2193f

remove test_model and update docstring

8b3325c

skip failing tests

ce63421

yonigozlan merged commit 84f0186 into main Mar 4, 2025
24 checks passed

yonigozlan deleted the add-aya branch March 4, 2025 11:24
