[Models] Add Phi3-mini, StableLM 1.6B, Qwen 1.8B, update MLC runtime #433

CharlieFRuan · 2024-05-29T21:47:50Z

This PR updates models to v0.2.39 compiled with mlc-ai/binary-mlc-llm-libs#123

The main change is the new MLC-LLM runtime, which supports grammar (i.e. json mode) for Llama3.

Hence we now read the `tokenizer_info` field (or the deprecated `token_table_postproc_method`) from `mlc-chat-config.json` when post-processing the token table for grammar.
- If neither is available, we use the default `byte_fallback`.
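
The lookup order described above can be sketched roughly as follows. This is only an illustration, not the code in this PR: the interface names, the `resolvePostProcMethod` helper, and the inner `token_postproc_method` field are assumptions made for the example. The PR text itself only names `tokenizer_info`, `token_table_postproc_method`, and the `byte_fallback` default.

```typescript
// Hypothetical slice of mlc-chat-config.json; only the two top-level
// field names come from the PR description.
interface TokenizerInfoSketch {
  // Assumed inner field name, used here purely for illustration.
  token_postproc_method?: string;
}

interface ChatConfigSketch {
  tokenizer_info?: TokenizerInfoSketch;
  // Deprecated field, still honored when tokenizer_info is absent.
  token_table_postproc_method?: string;
}

// Default named in the PR description.
const DEFAULT_POSTPROC_METHOD = "byte_fallback";

// Prefer tokenizer_info, then the deprecated field, then the default.
function resolvePostProcMethod(config: ChatConfigSketch): string {
  const fromInfo = config.tokenizer_info?.token_postproc_method;
  if (fromInfo !== undefined) {
    return fromInfo;
  }
  if (config.token_table_postproc_method !== undefined) {
    return config.token_table_postproc_method;
  }
  return DEFAULT_POSTPROC_METHOD;
}
```

With this ordering, a config that carries both fields resolves to the value under `tokenizer_info`, and an older config with neither field still gets a usable method.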

New prebuilt models introduced:

- Phi3-mini-4k
- Hermes-2-Pro-Llama-3-8B
- Qwen1.5-1.8B
- StableLM-2-zephyr_1.6B

Updates on examples:

- `json-mode` and `json-schema` now use Llama3 for their demonstrations
- Function calling inside `json-schema` now uses Hermes-2-Pro-Llama-3-8B instead of Hermes-2-Pro-Mistral
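
For readers unfamiliar with JSON mode, the following is a minimal sketch of what an OpenAI-style JSON-mode request body looks like. It is not taken from the examples in this PR. The `buildJsonModeRequest` and `parseJsonReply` helpers and the model id string are hypothetical, and passing the schema as a string under `response_format.schema` is an assumption about the request shape.

```typescript
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

// Assumed request shape, modeled on the OpenAI chat completions format.
interface JsonModeRequest {
  model: string;
  messages: ChatMessage[];
  response_format: { type: "json_object"; schema?: string };
}

// Build a request asking the model to reply with JSON only.
// When a schema object is given, it is serialized into the request (assumption).
function buildJsonModeRequest(
  model: string,
  prompt: string,
  schema?: object,
): JsonModeRequest {
  return {
    model,
    messages: [{ role: "user", content: prompt }],
    response_format: {
      type: "json_object",
      ...(schema ? { schema: JSON.stringify(schema) } : {}),
    },
  };
}

// Parse a reply defensively; returns undefined when the text is not valid JSON.
function parseJsonReply(reply: string): unknown {
  try {
    return JSON.parse(reply);
  } catch {
    return undefined;
  }
}

// Usage: the model id below is a placeholder, not a confirmed prebuilt id.
const request = buildJsonModeRequest(
  "hypothetical-llama3-model-id",
  "List two fruits as a JSON object with a `fruits` array.",
);
```

Grammar support in the runtime is what constrains the reply to valid JSON, so a defensive parse like `parseJsonReply` mainly guards against truncated output.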

### Changes

Main changes include:
- New prebuilt models:
  - Phi3-mini
  - StableLM-2-zephyr-1.6B
  - Qwen1.5-1.8B
  - Hermes2-Pro-Llama-3-8B to prebuilt models
- Updates on `ModelRecord` fields
  - For detail see: #435
- Update all WASMs
  - For detail see: #433
  - Update all WASMs to v0.2.39
  - Support grammar for Llama3, hence update examples/json-mode to use `Llama3` and `Hermes2-pro-Llama3-8B` for function calling in `examples/json-schema`
- Use `loglevel` package:
  - For details see #427
- Fix `index.js.map` issue for Vite
  - #420
- Enhance error handling and ServiceWorker

### TVMjs

TVMjs compiled at apache/tvm@71f7af7
- Main changes include:
  - apache/tvm#17031
  - apache/tvm#17028
  - apache/tvm#17021

### WASM version

- All wasms updated to 0.2.39 via mlc-ai/binary-mlc-llm-libs#123 for new MLC-LLM runtime (mainly grammar)

[Models] Add Phi3-mini, StableLM 1.6B, Qwen 1.8B, update MLC runtime

a4e18db

CharlieFRuan force-pushed the pr-0529-llama3-grammar branch from 1b946ed to a4e18db on May 29, 2024 21:48

CharlieFRuan merged commit 3731fe6 into mlc-ai:main May 29, 2024
1 check passed

This was referenced May 29, 2024

- Model request: Phi 3 mini 128K #432 (Closed)
- [Tracking][WebLLM] Runtime updates #429 (Closed)
- [Version] Bump version to 0.2.39, update prebuilt WASMs #436 (Merged)
