
Add Yi support + benchmark results #27

Merged · 5 commits · Nov 21, 2023

Conversation

MekkCyber
Contributor

I noticed that there is no implementation of mpt_pos_shift_attention_forward. I know it isn't strictly necessary, since MPT has no positional encoding and the forward pass therefore needs no changes, but for consistency I think it's better to have it. Feel free to accept this pull request or not :). I will try working on adding other models to the library. Thank you for your time.
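
For context, MPT uses ALiBi attention biases rather than rotary position embeddings, so a pos-shift forward has nothing to shift. A minimal sketch (illustrative only, not necessarily the exact code added in this PR) could simply delegate to the stock attention forward:

from transformers.models.mpt.modeling_mpt import MptAttention

def mpt_pos_shift_attention_forward(self, *args, **kwargs):
    # MPT applies ALiBi biases inside attention and has no rotary embeddings,
    # so there are no position ids to re-shift when the KV cache is windowed;
    # just defer to the original attention forward.
    return MptAttention.forward(self, *args, **kwargs)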

@MekkCyber
Contributor Author

Hello @tomaarsen

Do you have any suggestions for models to implement attention_sinks for?

@tomaarsen
Owner

Perhaps the very recent Yi models?

@MekkCyber
Contributor Author

MekkCyber commented Nov 6, 2023

I tried to add Yi support. I think the Yi tokenizer is not integrated into AutoTokenizer yet, so to test it I used the YiTokenizer code provided with the model, with tokenizer.model as the vocab_file. If you have any remarks, please let me know.

import torch
# AutoModelForCausalLM here comes from the attention_sinks package,
# which wraps the transformers class with attention-sink support
from attention_sinks import AutoModelForCausalLM

model_id = "01-ai/Yi-6B"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    # for efficiency:
    device_map="auto",
    torch_dtype=torch.float16,
    # `attention_sinks`-specific arguments:
    attention_sink_size=4,
    attention_sink_window_size=252,  # <- Low for the sake of faster generation
    trust_remote_code=True,
)
model.eval()
# YiTokenizer is the custom tokenizer class shipped with the Yi model repository,
# instantiated directly from the downloaded `tokenizer.model` vocab file
tokenizer = YiTokenizer("tokenizer.model")
tokenizer.pad_token_id = tokenizer.eos_token_id

@tomaarsen
Owner

Hello!

Apologies for delaying this for a while. Regarding the tokenizer, I think that is because the AutoTokenizer also requires trust_remote_code=True, e.g.:

model_id = "01-ai/Yi-6B"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    # for efficiency:
    device_map="auto",
    torch_dtype=torch.float16,
    # `attention_sinks`-specific arguments:
    attention_sink_size=4,
    attention_sink_window_size=252,  # <- Low for the sake of faster generation
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
tokenizer.pad_token_id = tokenizer.eos_token_id

And then it should be fine!
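
For a quick sanity check, generation then works as with any transformers model; a small illustrative example (the prompt and generation settings below are placeholders):

import torch

prompt = "Attention sinks let a model keep generating beyond its context window because"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64, pad_token_id=tokenizer.pad_token_id)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))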

I've added some experiments, run them, and put the results in the README. I also credited you for this addition there!

  • Tom Aarsen

@tomaarsen tomaarsen changed the title add mpt_pos_shift_attention_forward Add Yi support + benchmark results Nov 21, 2023
@tomaarsen tomaarsen merged commit 34d071c into tomaarsen:main Nov 21, 2023