
Support top Chinese language models #116

Closed
wuxibin89 opened this issue Oct 10, 2023 · 15 comments
Assignees: catqaq
Labels: enhancement (New feature or request), P0 (High priority)

Comments

@wuxibin89 (Collaborator)

No description provided.

@catqaq (Collaborator) commented Oct 13, 2023

List of expected supported models:

  • GLM
  • Baichuan2
  • Qwen
  • InternLM
  • Mistral

@catqaq changed the title from "Support Baichuan2/Qwen chinese language model" to "Support top Chinese language models" on Oct 13, 2023
@catqaq (Collaborator) commented Oct 15, 2023

Transformers officially supports Flash Attention 2, but only for a few architectures such as Llama. Do we need to add Flash Attention 2 support for the other models ourselves?
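For reference, enabling Flash Attention 2 through Transformers on an already-supported architecture is a one-line change at load time. A minimal sketch, assuming a Llama checkpoint and a recent transformers release (older releases used `use_flash_attention_2=True` instead):

```python
import torch
from transformers import AutoModelForCausalLM

# Flash Attention 2 requires fp16/bf16 weights on a CUDA device and
# only works for architectures Transformers has ported (e.g. Llama).
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",               # example checkpoint
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",  # older releases: use_flash_attention_2=True
)
```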

@wuxibin89 (Collaborator, Author)

huggingface/transformers#26350 — the Transformers community is adding support for more architectures.

@catqaq (Collaborator) commented Oct 19, 2023

List of expected supported models:

  • GLM
  • Baichuan2
  • Qwen
  • InternLM

Done for Baichuan (e770404), but it still needs performance testing; more testing with other models is also welcome.

Test data (a loading sketch follows the table):

  • SFT: alpaca+alpaca_cn
  • RM: hh-rlhf
  • PPO: alpaca-cleaned+full-hh-rlhf+oasst1_pairwise_rlhf_reward
| model                 | SFT | RM | PPO | Notes |
| --------------------- | --- | -- | --- | ----- |
| Baichuan2-7B-Chat     | 1h  | 4h | 71h |       |
| chatglm2-6b           |     |    |     |       |
| Qwen-7B-Chat          |     |    |     |       |
| internlm-chat-7b-v1_1 |     |    |     |       |
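For anyone reproducing these runs, the mixture above can be pulled from the Hugging Face Hub roughly as follows. A sketch only: the repo ids for alpaca_cn and the OASST1 pairwise set are assumptions based on common Hub names, and the thread does not pin exact versions:

```python
from datasets import load_dataset

# SFT: alpaca (+ a Chinese Alpaca variant; its repo id is not given in this thread)
sft_alpaca = load_dataset("tatsu-lab/alpaca")

# RM: Anthropic helpful/harmless preference pairs
rm_hh = load_dataset("Anthropic/hh-rlhf")

# PPO prompts: cleaned alpaca + hh-rlhf + OASST1 pairwise rewards (assumed repo id)
ppo_alpaca = load_dataset("yahma/alpaca-cleaned")
ppo_oasst = load_dataset("tasksource/oasst1_pairwise_rlhf_reward")
```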

@catqaq self-assigned this Oct 19, 2023
@catqaq added the enhancement (New feature or request) label Oct 19, 2023
@catqaq (Collaborator) commented Oct 23, 2023

List of expected supported models:

  • GLM
  • Baichuan2
  • Qwen
  • InternLM

Done (e770404), but it still needs performance testing; more testing with other models is also welcome.

Test data:

  • SFT: alpaca+alpaca_cn
  • RM: hh-rlhf
  • PPO: alpaca-cleaned+full-hh-rlhf+oasst1_pairwise_rlhf_reward

| model                 | SFT | RM | PPO | Notes |
| --------------------- | --- | -- | --- | ----- |
| Baichuan2-7B-Chat     | 1h  | 4h | 71h |       |
| chatglm2-6b           |     |    |     |       |
| Qwen-7B-Chat          |     |    |     |       |
| internlm-chat-7b-v1_1 |     |    |     |       |

More tests, please!

@catqaq added the P0 (High priority) label Oct 23, 2023
@pikaqqqqqq (Contributor) commented Oct 24, 2023

Support for Qwen-14B is in progress; open issues:

  1. The model architecture differs from the Llama-style models already supported.
  2. Qwen cannot add extra special tokens.
  3. Qwen ships without an eos_token setting, which causes errors during dataset processing (see the sketch after this list).
  4. Qwen supports flash_attention, but the parameter is passed differently.
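A common workaround for items 2 and 3 is to reuse Qwen's existing <|endoftext|> token as eos/pad instead of adding new special tokens. A minimal sketch, not necessarily the fix that landed in OpenRLHF:

```python
from transformers import AutoTokenizer

# Qwen's tokenizer requires trust_remote_code; the checkpoint name is an example.
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-14B-Chat", trust_remote_code=True)

# Qwen ships without eos_token/pad_token; reuse the <|endoftext|> token
# already in its vocabulary rather than resizing the embedding matrix.
if tokenizer.eos_token is None:
    tokenizer.eos_token = "<|endoftext|>"
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
```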

@jovany-wang (Contributor)

What's the progress?

@Triang-jyed-driung

Add RWKV

If you would like to implement RLHF for RWKV, here are some suggestions:

  1. As a non-transformer architecture, RWKV does not use the HF transformers library.
  2. RWKV uses its own tokenizer, rwkv_vocab_v20230424, and tokenization has to go through it rather than a HF tokenizer (see the sketch after this list).
  3. The official repositories are https://github.com/BlinkDL/RWKV-LM and https://github.com/BlinkDL/ChatRWKV; see train.py for the training process.
  4. RWKV relies on many custom tricks and will fail if any of them is configured incorrectly.
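To illustrate point 2, inference and tokenization go through RWKV's own runtime rather than transformers. A sketch assuming the rwkv pip package, with a placeholder checkpoint path:

```python
# pip install rwkv -- RWKV's standalone runtime, separate from HF transformers
from rwkv.model import RWKV
from rwkv.utils import PIPELINE

# Placeholder path; weights come from the BlinkDL releases.
model = RWKV(model="/path/to/rwkv-4-world-7b", strategy="cuda fp16")

# The pipeline bundles the rwkv_vocab_v20230424 tokenizer.
pipeline = PIPELINE(model, "rwkv_vocab_v20230424")
tokens = pipeline.encode("Hello, RWKV!")
print(tokens, pipeline.decode(tokens))
```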

@catqaq (Collaborator) commented Oct 26, 2023

Add RWKV

If you would like to implement RLHF for RWKV, here are some suggestions:

  1. As a non-transformer architecture, RWKV does not use the HF transformers library.
  2. RWKV uses its own tokenizer, rwkv_vocab_v20230424, and tokenization has to go through it rather than a HF tokenizer.
  3. The official repositories are https://github.com/BlinkDL/RWKV-LM and https://github.com/BlinkDL/ChatRWKV; see train.py for the training process.
  4. RWKV relies on many custom tricks and will fail if any of them is configured incorrectly.

It seems it would not be easy to make OpenRLHF compatible with RWKV.

@catqaq mentioned this issue Nov 29, 2023
@ftmtk commented Dec 7, 2023

I am wondering if Mistral has been supported?

@catqaq (Collaborator) commented Dec 11, 2023

I am wondering if Mistral has been supported?

Not yet.

@hijkzzz (Collaborator) commented Dec 27, 2023

I am wondering if Mistral has been supported?

OpenRLHF supports Mistral, since its model architecture is the same as Llama 2's.
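Because Mistral reuses the Llama-style decoder, the same generic loading path works without model-specific code. A minimal sketch using the public base checkpoint:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Mistral is a Llama-style decoder, so the code path that already
# handles Llama 2 loads it unchanged.
name = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name, torch_dtype=torch.bfloat16)
```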

@hijkzzz closed this as completed Dec 27, 2023
@wxhheian

Is Qwen supported now?

@wxhheian

I see that Qwen's special token <|endoftext|> doesn't appear anywhere in the code?

@catqaq (Collaborator) commented Feb 23, 2024

I see that Qwen's special token <|endoftext|> doesn't appear anywhere in the code?

It should be supported; I'll test it over the weekend~
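A quick check is to load the tokenizer and confirm the token resolves. A sketch (whether eos_token comes pre-set depends on the Qwen and OpenRLHF versions in use):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-7B-Chat", trust_remote_code=True)

# <|endoftext|> exists in Qwen's vocabulary even when eos_token itself
# is unset; training code may still need to assign it explicitly.
print(tokenizer.convert_tokens_to_ids("<|endoftext|>"))
print("eos:", tokenizer.eos_token, "pad:", tokenizer.pad_token)
```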
