feat: add /models/apply endpoint to prepare models #286

Merged: 11 commits into master on May 18, 2023

Conversation

@mudler (Owner) commented May 17, 2023

This change allows using the API to programmatically download models from a "gallery".
It does not introduce the full concept of a gallery; rather, it introduces a way to install models from pre-defined YAML files that would be part of one, so it provides the "base" building block.

It exposes an API endpoint to apply new models from gallery YAML files. A gallery file includes the information needed to retrieve the model, the prompt templates to use, and its configuration file.

For instance, consider:

name: "groovy"
description: |
    LocalAI GPT4ALL-J Model
license: "Apache 2.0"
urls:
- gpt4all.io

config_file: |
    parameters:
      model: groovy
      top_k: 80
      temperature: 0.2
      top_p: 0.7
    context_size: 1024
    stopwords:
    - "HUMAN:"
    - "GPT:"
    roles:
      user: ""
      system: ""
    template:
      completion: "groovy-completion"
      chat: groovy-chat

files:
    - filename: "groovy"
      sha: ""
      uri: "https://gpt4all.io/models/ggml-gpt4all-j-v1.3-groovy.bin"

prompt_templates:
    - name: "groovy-completion"
      content: |
        Complete the prompt
        ### Prompt:
        {{.Input}}
        ### Response:
    - name: "groovy-chat"
      content: |
        The prompt below is a question to answer, a task to complete, or a conversation to respond to; decide which and write an appropriate response.
        ### Prompt:
        {{.Input}}
        ### Response:
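
The files section above pairs each download with a sha field (left empty in this example). A minimal sketch of how a client could verify a downloaded file against that field, under the assumption (not confirmed by this PR) that sha is a SHA-256 hex digest and that an empty value skips verification:

```python
# Sketch: verify a downloaded model file against a gallery entry's `sha` field.
# Assumption: `sha` is a SHA-256 hex digest; an empty string (as in the
# example above) means "no checksum, skip verification".
import hashlib


def verify_sha(path: str, expected_sha: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's SHA-256 matches, or if no digest is given."""
    if not expected_sha:
        return True  # the example gallery file leaves sha empty
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in chunks so multi-GB model files don't need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha
```

Chunked hashing matters here because model binaries such as ggml-gpt4all-j-v1.3-groovy.bin are several gigabytes.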

uploaded as a gist, it can be used like so to configure LocalAI to download all the necessary files at runtime and start using the model:

$ curl http://localhost:8080/models/apply -H "Content-Type: application/json" -d '{
     "url": "https://gist.githubusercontent.com/mudler/6112666e77061fe35ca3a535d25ccddc/raw/45eabb442f25db5bc991f2d2a616c15d4c26be67/gpt4all-j-localai.yaml"
   }'
{"uid":"1059474d-f4f9-11ed-8d99-c4cbe106d571","status":"http://localhost:8080/models/jobs/1059474d-f4f9-11ed-8d99-c4cbe106d571"}
$ curl http://localhost:8080/models/jobs/1059474d-f4f9-11ed-8d99-c4cbe106d571      
{"error":null,"processed":true,"message":"completed"}

This will:

  • Start a batch job to download the model and the relevant files
  • Reload the models once finished to pick up the model

In a bash script, you can wait for the operation to finish by checking the processed field:

model_url="https://gist.githubusercontent.com/mudler/6112666e77061fe35ca3a535d25ccddc/raw/45eabb442f25db5bc991f2d2a616c15d4c26be67/gpt4all-j-localai.yaml"

response=$(curl -s http://localhost:8080/models/apply -H "Content-Type: application/json" -d "{\"url\": \"$model_url\"}")

job_id=$(echo "$response" | jq -r '.uid')

while [ "$(curl -s http://localhost:8080/models/jobs/"$job_id" | jq -r '.processed')" != "true" ]; do 
  sleep 1
done

echo "Job completed"
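
The same wait loop translates directly to other languages. A minimal Python sketch, assuming only the response fields shown above (uid, processed, error, message); the helper job_done and the function names here are illustrative, not part of the API:

```python
# Sketch of the /models/apply polling flow in Python. Assumes only the
# response fields shown above: uid, processed, error, message.
import json
import time
import urllib.request

BASE = "http://localhost:8080"  # adjust to your LocalAI address


def job_done(status: dict) -> bool:
    """A job is finished when `processed` is true; raise if it reported an error."""
    if status.get("error"):
        raise RuntimeError(f"model apply failed: {status['error']}")
    return status.get("processed") is True


def apply_model(gallery_url: str) -> str:
    """POST the gallery file URL to /models/apply and return the job uid."""
    body = json.dumps({"url": gallery_url}).encode()
    req = urllib.request.Request(
        f"{BASE}/models/apply",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["uid"]


def wait_for(uid: str, poll_seconds: float = 1.0) -> None:
    """Poll /models/jobs/<uid> until the job reports processed=true."""
    while True:
        with urllib.request.urlopen(f"{BASE}/models/jobs/{uid}") as resp:
            if job_done(json.load(resp)):
                return
        time.sleep(poll_seconds)
```

Building the JSON body with json.dumps also sidesteps the shell-quoting pitfalls of embedding a URL inside a hand-written JSON string.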

Related to: #100

@mudler mudler marked this pull request as draft May 17, 2023 17:29
@mudler mudler force-pushed the batch_download branch 2 times, most recently from d095f4d to 945a9c3 Compare May 17, 2023 20:20
@mudler mudler marked this pull request as ready for review May 17, 2023 23:04
@mudler mudler changed the title feat: batch download feat: add /models/apply endpoint to prepare models May 18, 2023
@mudler mudler merged commit cc9aa9e into master May 18, 2023
@mudler mudler deleted the batch_download branch May 18, 2023 13:59
@mudler mudler mentioned this pull request May 26, 2023