Add the deepseek model to the library #1040

Nan-Do · 2023-11-08T07:56:05Z

The DeepSeek Coder model is currently the best open-source coding model on the HumanEval dataset, second only to GPT-4 by a small margin.
https://www.deepseek.com/
https://huggingface.co/deepseek-ai
https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct
https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct
https://evalplus.github.io/leaderboard.html

There are 6.7B and 33B model variants; the quantized versions can be found here:
https://huggingface.co/TheBloke/deepseek-coder-6.7B-instruct-GGUF
https://huggingface.co/TheBloke/deepseek-coder-33B-instruct-GGUF

This is a possible modelfile, including a valid prompt template:

FROM ./deepseek-coder-33b-instruct.Q4_K_M.gguf

# set the temperature to 0.2 [higher is more creative, lower is more coherent]
PARAMETER temperature 0.2

# set the prompt template
TEMPLATE """{{ .System }}

### Instruction:
{{ .Prompt }}

### Response:
"""

SYSTEM """You are an advanced AI programming assistant."""
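To make the template concrete, here is a minimal sketch of what the final prompt text looks like once the two placeholders are filled in. It uses plain string substitution, not Ollama's actual template engine (Modelfile templates use Go template syntax), and `render_prompt` is a hypothetical helper introduced only for illustration.

```python
# Illustrative only: this mimics the two placeholders used in the modelfile
# above with simple string replacement. It is not how Ollama renders templates.
TEMPLATE = """{{ .System }}

### Instruction:
{{ .Prompt }}

### Response:
"""

SYSTEM = "You are an advanced AI programming assistant."


def render_prompt(template: str, system: str, prompt: str) -> str:
    """Fill in the system and user prompt (hypothetical helper, not an Ollama API)."""
    return template.replace("{{ .System }}", system).replace("{{ .Prompt }}", prompt)


rendered = render_prompt(TEMPLATE, SYSTEM, "Write a function that reverses a string.")
print(rendered)
```

The model then continues the text after the "### Response:" line, which is why the template ends there.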

The authors propose a longer, more restrictive version of this template, as well as variants for other kinds of inference:
https://github.com/deepseek-ai/deepseek-coder#3-chat-model-inference


eramax · 2023-11-08T18:01:10Z

Yes please add it.

valentimarco · 2023-11-17T21:22:05Z

+1

daniel-a-diaz · 2023-11-19T05:06:57Z

I was just about to write this feature request. Please add.

kapral18 · 2023-11-19T20:50:46Z

+1

gururise · 2023-11-19T23:17:07Z

Any movement on this? I would love to use DeepSeek Coder as a coding assistant with Ollama as the server. It would work great with the Continue VS Code extension!

Nan-Do · 2023-11-20T02:52:52Z

Just a reminder for anyone interested in using this model: you can still download it and use the ollama create command to add it to your local repository of models.
https://github.com/jmorganca/ollama/blob/main/docs/modelfile.md

This is a short recipe to run the 6.7B model:

wget https://huggingface.co/TheBloke/deepseek-coder-6.7B-instruct-GGUF/resolve/main/deepseek-coder-6.7b-instruct.Q4_K_M.gguf

Create a file named modelfile with the following contents, in the same directory where you downloaded the model.

FROM ./deepseek-coder-6.7b-instruct.Q4_K_M.gguf

# set the temperature to 0.1 [higher is more creative, lower is more coherent]
PARAMETER temperature 0.1

# set the prompt template
TEMPLATE """{{ .System }}

### Instruction:
{{ .Prompt }}

### Response:
"""

SYSTEM """You are an advanced AI programming assistant."""

Run the ollama create command.

ollama create deepseek-7B -f ./modelfile

Use the model.

ollama run deepseek-7B
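Once the model has been created, it can also be queried over Ollama's local HTTP API instead of the interactive prompt. The following is a minimal sketch, assuming the server is listening on its default local address (`http://localhost:11434`) and that the model was created under the name `deepseek-7B` as in the recipe above. The function names are hypothetical helpers, not part of Ollama.

```python
import json
import urllib.request

# Assumed default address of a locally running Ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model: str, prompt: str) -> urllib.request.Request:
    """Build a non-streaming POST request for the /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str) -> str:
    """Send the request and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_generate_request(model, prompt)) as resp:
        return json.loads(resp.read())["response"]


# Example call, left commented out because it requires a running Ollama server:
# print(generate("deepseek-7B", "Write a function that reverses a string."))
```

Building the request separately from sending it keeps the payload easy to inspect without a server running.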

mxyng · 2023-11-21T00:05:01Z

DeepSeek Coder is now available in the Ollama library

Nan-Do · 2023-11-21T04:06:26Z

Just a comment for people interested in using this model: with the current configuration you'll need a graphics card with at least 16GB of VRAM (for the 6.7B model) in order to use it with GPU acceleration.
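As a rough back-of-the-envelope check on VRAM figures like this, the memory needed for the weights alone is about the parameter count times the bits per weight, divided by 8. The sketch below is only that arithmetic. The thread does not say which quantization the library configuration uses, so the 16-bit and 4-bit values are assumptions for illustration, and the estimate ignores the KV cache and runtime overhead, which add more on top.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


# Assumed precisions, for illustration only.
fp16_gb = weight_memory_gb(6.7, 16)  # unquantized 16-bit weights
q4_gb = weight_memory_gb(6.7, 4)     # idealized 4-bit quantization

print(f"6.7B at 16 bits/weight: ~{fp16_gb:.1f} GB")
print(f"6.7B at 4 bits/weight:  ~{q4_gb:.1f} GB")
```

Under these assumptions, 16-bit weights for a 6.7B model come to roughly 13.4 GB before any overhead, while a 4-bit quantization needs far less.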

ex3ndr · 2023-11-22T16:47:44Z

It would be very interesting to have the base models too:
https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-base
https://huggingface.co/deepseek-ai/deepseek-coder-33b-base

jmorganca added the "model request" label Nov 8, 2023

mxyng closed this as completed Nov 21, 2023

daniel-a-diaz mentioned this issue Nov 21, 2023

Feature: Add DeepSeek Coder to models davila7/code-gpt-docs#202 (Open)
