MPT support #290

Closed
2 tasks done
generalsvr opened this issue May 6, 2023 · 8 comments · Fixed by #514

Comments

@generalsvr

Model description

Could you add the new MPT models? They look very promising, especially the ability to extend the context length to up to 85K tokens.

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

https://github.com/mosaicml/llm-foundry
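
For reference, a minimal sketch of how the linked weights can already be loaded through the `transformers` Auto API (the `mosaicml/mpt-7b` checkpoint here is just one example of the weights published alongside llm-foundry); the custom modeling code bundled with the checkpoint requires `trust_remote_code=True`:

```python
# Minimal sketch: load an MPT checkpoint via transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mosaicml/mpt-7b"  # one of the published MPT checkpoints
tokenizer = AutoTokenizer.from_pretrained(model_id)
# MPT ships custom modeling code, so trust_remote_code is required.
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

inputs = tokenizer("MosaicML MPT is", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output[0]))
```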

@vvsotnikov

Probably a related issue: huggingface/transformers#23174

@monuminu

Any plans to support mosaicml/mpt-30b-instruct?

@ccasimiro88

I am also interested in deploying the mosaicml/mpt-30b-chat model. It would be really useful for the community! 🙏

@tim-a-davis

Yes, I am also interested in getting support for MPT models. I would love to assist in any way I can.

@mantrakp04

+1

@alanxmay

Please 🙏

@mantrakp04

> Yes, I am also interested in getting support for MPT models. I would love to assist in any way I can.

Hey, if you want to work on the implementation, we can do a PR together. Add me on Discord: mantrakp (proud Logitech controller owner).

@Narsil
Collaborator

Narsil commented Jul 1, 2023

#514

Narsil mentioned this issue Jul 1, 2023
OlivierDehaene pushed a commit that referenced this issue Jul 3, 2023
# What does this PR do?


This adds a non-flash version of MPT. Flash is harder because we would need a flash-attention CUDA kernel that supports an attention bias.

Fixes #361
Fixes #491
Fixes #290
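
For context, a rough sketch of why the non-flash path is the easy one: in eager attention, MPT's per-head ALiBi bias is simply added to the score matrix before the softmax, whereas standard flash-attention kernels fuse the softmax and, at the time, did not accept an arbitrary additive bias, hence the need for a custom CUDA kernel. The function names below are illustrative, not TGI's actual implementation:

```python
# Sketch: eager (non-flash) attention with an ALiBi bias added to the scores.
import math
import torch

def alibi_slopes(n_heads: int) -> torch.Tensor:
    # Slopes as defined in the ALiBi paper (power-of-two head counts).
    start = 2.0 ** (-8.0 / n_heads)
    return torch.tensor([start ** (i + 1) for i in range(n_heads)])

def eager_attention_with_alibi(q, k, v):
    # q, k, v: [batch, heads, seq, head_dim]
    bsz, n_heads, seq_len, head_dim = q.shape
    scores = q @ k.transpose(-1, -2) / math.sqrt(head_dim)  # [b, h, s, s]
    # ALiBi: a per-head linear penalty on key distance, added as a bias
    # before softmax. This is the step fused flash kernels couldn't absorb.
    distance = torch.arange(seq_len)[None, :] - torch.arange(seq_len)[:, None]
    bias = alibi_slopes(n_heads)[:, None, None] * distance  # [h, s, s]
    scores = scores + bias
    # Causal mask: each position attends only to itself and the past.
    causal = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), 1)
    scores = scores.masked_fill(causal, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v
```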