Table of Best Results #2755
SilvaRaulEnrique started this conversation in Ideas
-
Running on an M2, built from scratch, I get this error. Does anyone know why this is happening?
-
"-gqa used to be needed when loading Llama2 70B model, but in the current code with the new GGUF format it is no longer used" |
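For illustration, a minimal sketch of the difference (the model file names and quantization levels below are placeholders, not taken from this thread):

# Old GGML v3 file: the grouped-query-attention factor had to be passed explicitly for 70B
./main -m models/llama-2-70b.ggmlv3.q4_0.bin -gqa 8 -p "Hello"

# New GGUF file: the GQA configuration is stored in the file's metadata, so no flag is needed
./main -m models/llama-2-70b.Q4_0.gguf -p "Hello"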
-
The idea of this post is to collaboratively build and maintain a complete, up-to-date ranking table of the best commands for compiling and running, covering the models that give the best results for the longest context while keeping the responses fully coherent. For example:
For an NVIDIA 3090 (with CUDA and 24 GB of VRAM), the best command so far is the following (or is there a better one?):
./train-text-from-scratch \
  --vocab-model ServidorIA/models/s3nh/longchat-7b-v1.5-32k.ggmlv3.q8_0.bin \
  --ctx 64 \
  --embd 256 \
  --head 8 \
  --layer 16 \
  --checkpoint-out chk-Ultimas_Acordadas_y_Circulares-256x16.bin \
  --model-out ggml-Ultimas_Acordadas_y_Circulares-256x16-f32.bin \
  --train-data "Ultimas_Acordadas_y_Circulares.txt" \
  -t 4 \
  -b 8 \
  -n 32 \
  --seed 1 \
  --adam-iter 16 \
  --print-details-interval 0 \
  --predict 16 \
  --use-flash \
  --mem-compute 8
Note: the TheBloke and s3nh models are available on https://huggingface.co/
Are these the best parameters, or is there a better set for now? And what about other hardware configurations?
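To seed the inference side of such a table, here is a hedged sketch for the same 3090, assuming a GGUF conversion of longchat-7b-v1.5-32k is available; the file name, context size, and offload count are guesses to be benchmarked, not verified best values:

# Hypothetical long-context run on a 24 GB RTX 3090 (all values are assumptions to be tuned)
./main \
  -m models/longchat-7b-v1.5-32k.Q8_0.gguf \
  -c 32768 \
  -ngl 35 \
  -n 512 \
  -p "Summarize the following text: ..."

# -c sets the context window, -ngl offloads layers to the GPU, -n limits the number of generated tokens.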