
Support loading model into pipeline from local filesystem #308

Merged — 9 commits merged into EricLBuehler:master on May 15, 2024

Conversation

@Jeadie (Contributor) commented May 14, 2024

Motivation

  • Support loading models from the local filesystem, to make it easier to productionize with mistral.rs.

Changes

  • Change the `Loader` trait: split `load_model` into two methods (see the sketch below):
    • `load_model_from_hf`
    • `load_model_from_path`
  • Both can share the same logic once the HF files have been downloaded locally.
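
A rough sketch of the shape this gives the trait. The two method names come from this PR; everything else (`ModelFiles`, `Pipeline`, `download_from_hf`, the parameter and return types) is a simplified placeholder, not the actual mistral.rs signatures:

```rust
use std::path::PathBuf;

/// Placeholder for the resolved local files of a model; the real mistral.rs
/// types carry more information (device mapping, quantization, etc.).
pub struct ModelFiles {
    pub weights: Vec<PathBuf>,
    pub tokenizer: PathBuf,
    pub config: PathBuf,
}

/// Placeholder for the loaded pipeline object.
pub struct Pipeline;

pub trait Loader {
    /// Resolve and download files from the Hugging Face Hub, then hand the
    /// resulting local paths to `load_model_from_path`.
    fn load_model_from_hf(&self, model_id: &str) -> Result<Pipeline, Box<dyn std::error::Error>> {
        let files = self.download_from_hf(model_id)?;
        self.load_model_from_path(&files)
    }

    /// Build the pipeline directly from files already on the local filesystem.
    fn load_model_from_path(&self, files: &ModelFiles) -> Result<Pipeline, Box<dyn std::error::Error>>;

    /// Fetch the model files from the Hub into a local cache directory.
    fn download_from_hf(&self, model_id: &str) -> Result<ModelFiles, Box<dyn std::error::Error>>;
}
```

The point of the split is that the HF path reduces to "download, then call the local path", so both entry points converge on one loading routine.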

@EricLBuehler (Owner) left a comment


Thank you for adding this! I think it's a great addition to the Rust API and just clears up the code a bit. I left a few minor comments regarding docs and one small question.

Review comments on:
  • mistralrs-server/src/main.rs
  • mistralrs-core/src/pipeline/mod.rs
  • mistralrs-core/src/pipeline/speculative.rs
@EricLBuehler (Owner)

@Jeadie, I think there are some formatting/linting issues. After those are fixed, I think this should be ready to merge.


Code Metrics Report
  ===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 Dockerfile              1           34           25            0            9
 Happy                   1          442          369            0           73
 JSON                    5            9            9            0            0
 Python                 21          741          622           21           98
 TOML                   16          419          378            1           40
-------------------------------------------------------------------------------
 Jupyter Notebooks       1            0            0            0            0
 |- Markdown             1           60           30           22            8
 |- Python               1           96           87            1            8
 (Total)                            156          117           23           16
-------------------------------------------------------------------------------
 Markdown               16         1026            0          758          268
 |- BASH                 6          205          192            0           13
 |- Python               6          121          110            0           11
 |- Rust                 3          185          172            9            4
 (Total)                           1537          474          767          296
-------------------------------------------------------------------------------
 Rust                   81        26428        24327          334         1767
 |- Markdown            38          359            0          354            5
 (Total)                          26787        24327          688         1772
===============================================================================
 Total                 143        29099        25730         1114         2255
===============================================================================
  

@Jeadie (Contributor, Author) commented May 15, 2024

Fixed, and ran both `cargo fmt --all --check` and `cargo clippy --workspace --tests --examples -- -D warnings` locally.

@EricLBuehler merged commit 3a79bc8 into EricLBuehler:master on May 15, 2024. 11 checks passed.
@EricLBuehler (Owner)

Thank you!

@polarathene (Contributor)

@Jeadie Is the local filesystem support only a few days old? Can you clarify whether the HF API is mandatory?

Active issue: #326

  • It's much simpler to run a local GGUF model via llama-cpp.
  • The expected parameters (some redundant?) can also be a tad confusing. This GGUF presumably needs the additional files sourced from here, yet there is no tokenizer.json? From what I've read, all the relevant metadata should already be available in the GGUF file itself, and llama-cpp happily runs with only the GGUF file provided.
  • Without an HF token provided (`--token-source none`), the HF API is still hit and returns an "Unauthorized" (401) response; only 404 responses are special-cased, so this triggers a panic.

Perhaps there is some benefit to the local-filesystem path going through the HF API? Otherwise it would probably be better to avoid it, or at least offer a way to opt out if possible (a rough sketch of that idea follows below).
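
One way to read this request: resolve every required file locally first, and only touch the HF API when something is actually missing (and a usable token source exists). A minimal, hypothetical sketch of that opt-out check, with made-up helper names, not the current mistral.rs behaviour:

```rust
use std::path::{Path, PathBuf};

/// Return the path to `name` under `local_dir` only if it already exists.
fn resolve_file(local_dir: &Path, name: &str) -> Option<PathBuf> {
    let candidate = local_dir.join(name);
    candidate.exists().then_some(candidate)
}

/// Resolve every required file locally; only when something is missing would
/// a fallback to the HF API (gated on the token source) even be considered.
fn resolve_model_files(local_dir: &Path, required: &[&str]) -> Result<Vec<PathBuf>, String> {
    let mut found = Vec::new();
    for name in required {
        match resolve_file(local_dir, name) {
            Some(path) => found.push(path),
            None => return Err(format!("{name} missing locally; would need the HF API")),
        }
    }
    Ok(found)
}
```

With something like this in front of the loader, `--token-source none` plus a complete local model directory would never issue an HF API request at all.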
