
feat(speculative-sampling): allow to specify a draft model in the model config #1052

Merged
merged 2 commits into master from spec_sampling on Sep 14, 2023

Conversation

mudler (Owner) commented Sep 14, 2023

Description

This PR fixes #1013.

It adds draft_model and n_draft to the model YAML config in order to load models with speculative sampling. It should also be compatible with grammars.

Example:

backend: llama
context_size: 1024
name: my-model-name
parameters:
  model: foo-bar
n_draft: 16
draft_model: model-name
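For context, the technique this config enables can be sketched roughly as follows: a small draft model proposes up to n_draft tokens per step, and the main model verifies them, keeping the accepted prefix and substituting its own token at the first mismatch. This is a minimal illustrative sketch only; the target/draft callables and the speculative_decode helper are hypothetical, not LocalAI or llama.cpp APIs, and real implementations verify all drafted tokens in a single batched forward pass.

```python
def speculative_decode(target, draft, prompt, n_draft=16, max_tokens=64):
    """Toy speculative-sampling loop (greedy variant).

    target, draft: callables mapping a token list to the next token.
    Hypothetical interface for illustration only.
    """
    tokens = list(prompt)
    while len(tokens) - len(prompt) < max_tokens:
        # Draft model cheaply proposes n_draft candidate tokens.
        proposed = []
        ctx = list(tokens)
        for _ in range(n_draft):
            t = draft(ctx)
            proposed.append(t)
            ctx.append(t)
        # Target model checks each proposal in order; accept until the
        # first mismatch, then emit the target's own token instead.
        accepted = []
        for t in proposed:
            expected = target(tokens + accepted)
            if t == expected:
                accepted.append(t)
            else:
                accepted.append(expected)
                break
        tokens.extend(accepted)
    return tokens[len(prompt):]
```

When the draft model agrees with the target often, whole runs of n_draft tokens are accepted at the cost of roughly one target-model step, which is where the speedup comes from.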

…el config

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
@mudler mudler added the enhancement New feature or request label Sep 14, 2023
@mudler mudler merged commit 8ccf5b2 into master Sep 14, 2023
14 checks passed
@mudler mudler deleted the spec_sampling branch September 14, 2023 15:44
Labels
enhancement New feature or request
Development

Successfully merging this pull request may close these issues.

feat: speculative sampling
1 participant