This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Conversation

@nguyenhoangthuan99
Contributor

@nguyenhoangthuan99 nguyenhoangthuan99 commented Sep 18, 2024

Fix #1244

This is the resulting tinyllama.yaml file after a successful download from cortexso:
[screenshot: tinyllama.yaml contents]

@nguyenhoangthuan99 nguyenhoangthuan99 marked this pull request as ready for review September 18, 2024 07:50
@dan-menlo
Contributor

dan-menlo commented Sep 18, 2024

@nguyenhoangthuan99 Can I just check:

  • I think from our earlier discussion, model.yaml is optional (we can load params directly from gguf)
  • However, is this done by us auto-creating a model.yaml for every GGUF file (e.g. upon pull or run)

The reason why I'm asking is:

  • Do we need a model method to create a model.yaml by parsing from a GGUF file?

@nguyenhoangthuan99
Contributor Author

nguyenhoangthuan99 commented Sep 18, 2024

I think we still need it now because:

  • There is one field, chat_template, that we still need to parse from the GGUF file so we can render the chat template. The cortex.llamacpp implementation requires a rendered chat template.
  • Stop tokens also need to be parsed from the GGUF file, because cortex.llamacpp requires that field when running chat completion.
  • For future features like a model-compatibility API, we need llama.embedding_length, llama.attention.head_count, and llama.attention.head_count_kv to calculate the memory constraint for the KV cache and recommend a suitable ctx_len to the user.
  • ngl <> llama.block_count can also be read from the GGUF file and used to recommend a suitable ngl for the user's hardware.
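The KV-cache sizing reasoning above can be sketched roughly as follows. This is a minimal illustration, not cortex's actual implementation: the helper name and the fp16 (2 bytes per element) default are my assumptions; the parameter names mirror the GGUF keys mentioned in the list.

```python
# Sketch: estimate KV-cache memory from GGUF metadata fields.
# Illustrative only; not the cortex.llamacpp implementation.

def kv_cache_bytes(block_count: int,
                   ctx_len: int,
                   embedding_length: int,
                   head_count: int,
                   head_count_kv: int,
                   bytes_per_elem: int = 2) -> int:
    """Two tensors (K and V) per layer, each of shape
    ctx_len x head_count_kv x head_dim, assuming fp16 by default."""
    head_dim = embedding_length // head_count
    return 2 * block_count * ctx_len * head_count_kv * head_dim * bytes_per_elem

# TinyLlama-1.1B-style metadata (llama.block_count=22, llama.embedding_length=2048,
# llama.attention.head_count=32, llama.attention.head_count_kv=4), ctx_len=2048:
print(kv_cache_bytes(22, 2048, 2048, 32, 4))  # 46137344 bytes = 44 MiB
```

Halving ctx_len halves the estimate, which is the lever a recommender would turn when the user's free memory is tight; likewise, llama.block_count bounds how many layers (ngl) can be offloaded.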

@nguyenhoangthuan99
Contributor Author

I'll add unit tests for this PR.

@dan-menlo
Contributor

Got it. So we will still create a model.yaml whenever we pull, run, or import a model?

@nguyenhoangthuan99
Contributor Author

Yes, we need to create model.yaml in the following cases:

  • pulling a model from other sources
  • importing an arbitrary local model

Models from cortexso already ship with a model.yml, so we don't need to create one; we just update the file location of the binary inside the model.yml.
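For illustration, an auto-generated model.yaml could carry the fields discussed in this thread. The exact schema isn't shown in this conversation, so every field name and value below is hypothetical, included only to make the shape concrete:

```yaml
# Hypothetical auto-generated model.yaml (illustrative only; not the real schema)
name: tinyllama
chat_template: "<|user|>\n{prompt}</s>\n<|assistant|>\n"   # rendered from GGUF metadata
stop:               # stop tokens parsed from the GGUF file
  - "</s>"
ctx_len: 2048       # recommended from KV-cache memory constraint
ngl: 22             # derived from llama.block_count
```

For cortexso models, only the binary's file-location entry would be rewritten; the rest of the file is kept as downloaded.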

Contributor

@dan-menlo dan-menlo left a comment


Approving to unblock. Will let @vansangpfiev review for code quality.

@nguyenhoangthuan99 nguyenhoangthuan99 merged commit f558899 into dev Sep 19, 2024
@nguyenhoangthuan99 nguyenhoangthuan99 deleted the feat/gguf-yaml-parser branch September 19, 2024 04:38


Development

Successfully merging this pull request may close these issues.

Update GGUF parser and yaml parser for new model.yml

4 participants