
support starcoder2 #1587

Closed
rburgstaller opened this issue Feb 29, 2024 · 20 comments

@rburgstaller

https://techcrunch.com/2024/02/28/starcoder-2-is-a-code-generating-ai-that-runs-on-most-gpus

https://github.com/bigcode-project/starcoder2

@wsxiaoys wsxiaoys transferred this issue from TabbyML/registry-tabby Feb 29, 2024
@wsxiaoys wsxiaoys added the good first issue (Good for newcomers) and enhancement (New feature or request) labels Feb 29, 2024
@wsxiaoys
Member

ref: #1230

@wsxiaoys
Member

also pending on ggerganov/llama.cpp#5795

@wsxiaoys
Member

wsxiaoys commented Mar 2, 2024

added https://github.com/wsxiaoys/registry-tabby, runnable with --model wsxiaoys/StarCoder-3B for nightly build.

@aaronstevenson408

@wsxiaoys I'm having trouble getting it to run. I can run the normal models on the nightly build, but for some reason can't get StarCoder2 to work. I used your repo and my fork, and I've also tried manually downloading it.

https://huggingface.co/brittlewis12/starcoder2-3b-GGUF
https://huggingface.co/second-state/StarCoder2-3B-GGUF
are the repos I've downloaded from.
It keeps erroring with exit code 1, but that isn't much to go on.
I'm running a Docker nightly image updated 8 hours ago.
Any ideas on getting it to run? I'm excited to try it out.

@wsxiaoys
Member

wsxiaoys commented Mar 2, 2024

Hey @aaronstevenson408, could you try this Docker image: ghcr.io/tabbyml/tabby:main-ef15f97? I just realized the Docker tag for the nightly build hasn't been updated yet.

@aaronstevenson408

Seems like it worked, thank you!

@rudiservo

Is the prompt_template the same as StarCoder 1?
It seems to have trouble autocompleting in the middle of a word, only triggering on a new line.
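For context: StarCoder2 keeps the same fill-in-the-middle special tokens as StarCoder 1 (`<fim_prefix>`, `<fim_suffix>`, `<fim_middle>`), so a registry entry's prompt template would look something like the fragment below. The field names follow the registry-tabby convention; treat the exact schema as an assumption and check the registry repo for the authoritative layout.

```json
{
  "name": "StarCoder2-3B",
  "prompt_template": "<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>"
}
```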

@CleyFaye

CleyFaye commented Mar 4, 2024

added https://github.com/wsxiaoys/registry-tabby, runnable with --model wsxiaoys/StarCoder-3B for nightly build.

I am not sure if this is the right place to report it, but all three sizes of StarCoder2 have the name "StarCoder2-3B" there.

@wsxiaoys
Member

wsxiaoys commented Mar 4, 2024


Thanks for reporting - fixed

@wsxiaoys wsxiaoys added the fixed-in-next-release label and removed the good first issue (Good for newcomers) label Mar 6, 2024
@wsxiaoys wsxiaoys self-assigned this Mar 6, 2024
@wsxiaoys wsxiaoys closed this as completed Mar 7, 2024
@xunfeng1980

The server exits during startup:

docker run --rm -it -p 8001:8000  --gpus='"device=7"' -v /mnt/data/tabby-data:/data  tabbyml/tabby:main-60310d4 serve --model /data/starcoder2-7b --device cuda
2024-03-11T10:28:20.357969Z  INFO tabby::services::model: crates/tabby/src/services/model/mod.rs:121: Loading model from local path /data/starcoder2-7b
2024-03-11T10:28:20.358014Z  INFO tabby::serve: crates/tabby/src/serve.rs:118: Starting server, this might take a few minutes...
ggml_init_cublas: GGML_CUDA_FORCE_MMQ:   no
ggml_init_cublas: CUDA_USE_TENSOR_CORES: yes
ggml_init_cublas: found 1 CUDA devices:
  Device 0: NVIDIA GeForce RTX 3090, compute capability 8.6, VMM: yes

@xunfeng1980

Figured it out - the local model needs to be in GGUF format.

@jeromepl

jeromepl commented Apr 3, 2024

Hi @wsxiaoys, is it now possible to run StarCoder v2 from tabby directly with something like:
tabby serve --device metal --model TabbyML/StarCoderV2-3B? If not, what is the best way to run StarCoder v2 today?

@rudiservo

@jeromepl You can either make your own registry-tabby repo or use one that's already available:
https://github.com/wsxiaoys/registry-tabby

tabby serve --device metal --model wsxiaoys/StarCoder2-15B
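A forked registry's models.json entry for the 15B might look roughly like this. The URL path and checksum are placeholders, and the field names are an assumption based on the registry-tabby layout; the nold/starcoder2-15b-GGUF repo is the one mentioned later in this thread.

```json
{
  "name": "StarCoder2-15B",
  "prompt_template": "<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>",
  "urls": [
    "https://huggingface.co/nold/starcoder2-15b-GGUF/resolve/main/<quantized-file>.gguf"
  ],
  "sha256": "<checksum of the file above>"
}
```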

@RiQuY

RiQuY commented Apr 8, 2024

Is this still pending to be implemented on the official model registry?

@wsxiaoys
Member

wsxiaoys commented Apr 8, 2024

Just added StarCoder2-3B / StarCoder2-7B to the official registry. Enjoy!

@rudiservo

@wsxiaoys could you add the 15B? It's actually quite usable on a 4090 or a 7900XTX.

@wsxiaoys
Member

wsxiaoys commented Apr 9, 2024

@rudiservo you could still maintain that in a forked registry. For the Tabby official registry, we prefer not to include models beyond 10B at the moment.

@rudiservo

@wsxiaoys Yes, I already have one; I was just suggesting it since more people might be interested, and in my experience the quality difference between 7B and 15B is noticeable.

It is quite usable with Tabby, but you do need more than 16GB of VRAM; currently the 7900XTX handles it quite well.

@den-run-ai

I see StarCoder2-7B and StarCoder2-15B in the Tabby leaderboard, but only 7B is available on Mac:

tabby serve --device metal --model StarCoder2-15B
thread 'main' panicked at crates/tabby-common/src/registry.rs:92:32:
Invalid model_id <TabbyML/StarCoder2-15B>
note: run with RUST_BACKTRACE=1 environment variable to display a backtrace

@den-run-ai

It looks like a quantized StarCoder2 15B model is available on Hugging Face:

https://huggingface.co/nold/starcoder2-15b-GGUF
