Draft: feat: Support DBRX model in Llama #462

reneleonhardt · 2024-04-15T11:22:30Z

The new Open Source model DBRX sounds amazing, is this enough and correct to integrate it into Llama?
ggerganov/llama.cpp#6515
https://huggingface.co/collections/phymbert/dbrx-16x12b-instruct-gguf-6619a7a4b7c50831dd33c7c8
https://www.databricks.com/blog/announcing-dbrx-new-standard-efficient-open-source-customizable-llms
https://github.com/databricks/dbrx
https://huggingface.co/collections/databricks/

llama.cpp seems to support splitted/sharded files, but I would need to download all of them first I suppose... 😅

carlrobertoh · 2024-04-15T11:47:34Z

src/main/java/ee/carlrobert/codegpt/completions/llama/LlamaModel.java

+          + "Generation speed is significantly faster than LLaMA2-70B, while at the same time "
+          + "beating other open source models, such as, LLaMA2-70B, Mixtral, and Grok-1 on "
+          + "language understanding, programming, math, and logic.",
+      PromptTemplate.LLAMA,


I think it uses ChatML prompt template - PromptTemplate.CHAT_ML

carlrobertoh · 2024-04-15T11:48:13Z

Since the change was recent, we need to update the llama.cpp submodule as well

reneleonhardt · 2024-04-15T12:08:37Z

Since the change was recent, we need to update the llama.cpp submodule as well

Done

carlrobertoh · 2024-04-15T12:15:00Z

I'll try running the model locally soon and see if any other changes are necessary

reneleonhardt · 2024-04-15T12:45:48Z

I'll try running the model locally soon and see if any other changes are necessary

Great! But in this PR I have to implement downloading all 10 files first I guess... 😅

reneleonhardt · 2024-04-23T06:03:09Z

@phymbert I can download https://huggingface.co/phymbert/dbrx-16x12b-instruct-iq3_xxs-gguf without login in the browser, but inside the plugin I get 403 Forbidden, is this to be expected with the databricks-open-model-license (other) license?
Do you think DBRX is not particularly suited as a coding assistant? The smallest is 53 GB huge 😅

phymbert · 2024-04-23T07:22:05Z

Dbrx is a gated model, so I believe you have to pass a read token. There is an issue open on llama.cpp to support this.

reneleonhardt force-pushed the support-llama-model-dbrx branch 3 times, most recently from f852b16 to 27b9d62 Compare April 15, 2024 11:41

carlrobertoh reviewed Apr 15, 2024

View reviewed changes

reneleonhardt force-pushed the support-llama-model-dbrx branch from 27b9d62 to 3bc8480 Compare April 15, 2024 11:54

reneleonhardt force-pushed the support-llama-model-dbrx branch 2 times, most recently from 4fb52b2 to 7aa08d9 Compare April 16, 2024 05:54

reneleonhardt mentioned this pull request Apr 18, 2024

chore: Convert utils to Kotlin #473

Merged

carlrobertoh force-pushed the master branch from 279777c to bcb33ae Compare April 20, 2024 22:09

reneleonhardt added 2 commits April 21, 2024 08:46

feat: Support DBRX model in Llama

b9421ad

upgrade llama submodule

d3a610c

reneleonhardt force-pushed the support-llama-model-dbrx branch from 7aa08d9 to 05cdeed Compare April 21, 2024 07:00

Download 10 split files for DBRX

c87c1b1

reneleonhardt force-pushed the support-llama-model-dbrx branch from 05cdeed to c87c1b1 Compare April 21, 2024 07:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Draft: feat: Support DBRX model in Llama #462

Draft: feat: Support DBRX model in Llama #462

reneleonhardt commented Apr 15, 2024 •

edited

carlrobertoh Apr 15, 2024

reneleonhardt Apr 15, 2024

carlrobertoh commented Apr 15, 2024

reneleonhardt commented Apr 15, 2024

carlrobertoh commented Apr 15, 2024

reneleonhardt commented Apr 15, 2024

reneleonhardt commented Apr 23, 2024

phymbert commented Apr 23, 2024

Draft: feat: Support DBRX model in Llama #462

Are you sure you want to change the base?

Draft: feat: Support DBRX model in Llama #462

Conversation

reneleonhardt commented Apr 15, 2024 • edited

carlrobertoh Apr 15, 2024

Choose a reason for hiding this comment

reneleonhardt Apr 15, 2024

Choose a reason for hiding this comment

carlrobertoh commented Apr 15, 2024

reneleonhardt commented Apr 15, 2024

carlrobertoh commented Apr 15, 2024

reneleonhardt commented Apr 15, 2024

reneleonhardt commented Apr 23, 2024

phymbert commented Apr 23, 2024

reneleonhardt commented Apr 15, 2024 •

edited