New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add the deepseek model to the library #1040
Comments
Yes please add it. |
+1 |
I was just about to write this feature request. Please add. |
+1 |
Any movement on this? Would love to use deepseek coder as a coding assistant and Ollama as the server. Would work great with the 'continue' vscode extension! |
Just a reminder for anyone interested in using this model, you can still download the model and use the ollama create command to add it to your local repository of models. This is a short recipe to run the 7B model:
Create the modelfile, with the following contents, in the same directory you downloaded the model.
Run the ollama create command.
Use the model.
|
DeepSeek Coder is now available in the Ollama library |
Just a comment for people interested in using this model, with the current configuration you'll need a graphic card with at least 16GB of VRAM (for the 6.7GB) in order to be able to use this model with GPU acceleration |
Base models are very interesting to have too |
The deepseek model is currently the best coding open source model on the HumanEval dataset second only to ChatGPT4 by a little margin.
https://www.deepseek.com/
https://huggingface.co/deepseek-ai
https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct
https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct
https://evalplus.github.io/leaderboard.html
There are 7b and 33b model variants, the quantized versions can be found here:
https://huggingface.co/TheBloke/deepseek-coder-6.7B-instruct-GGUF
https://huggingface.co/TheBloke/deepseek-coder-33B-instruct-GGUF
This is a possible valid modelfile including a valid prompt template:
The authors propose a longer version of this template, which is more restrictive, as well as other variants for other kinds of inference
https://github.com/deepseek-ai/deepseek-coder#3-chat-model-inference
The text was updated successfully, but these errors were encountered: