[gguf] Add types #562
Conversation
```ts
export type { MetadataValue, Version, GGUFMetadata, GGUFTensorInfo, GGUFParseOutput } from "./types";
export { GGUFValueType, GGMLQuantizationType } from "./types";
```
Is this the correct way to re-export?
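For context, a single-file sketch of the difference between the two re-export forms used above (in the real package these are re-exports from `./types` in the entry point; the names below are just stand-ins):

```typescript
// `export type { ... }` is erased at compile time, so it only works for
// pure types; runtime values such as enums need a plain `export { ... }`
// (or a direct `export`, as here).
export type Version = 1 | 2 | 3; // type-only: erased, no runtime footprint
export enum GGUFValueType {
	// value export: compiles to a real runtime object
	UINT8 = 0,
	INT8 = 1,
}
```

Splitting the two forms also keeps the module safe under `isolatedModules`, where the compiler must know which exports can be dropped.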
```ts
export enum GGMLQuantizationType {
	F32 = 0,
	F16 = 1,
	Q4_0 = 2,
	Q4_1 = 3,
	Q5_0 = 6,
	Q5_1 = 7,
	Q8_0 = 8,
	Q8_1 = 9,
	Q2_K = 10,
	Q3_K = 11,
	Q4_K = 12,
	Q5_K = 13,
	Q6_K = 14,
	Q8_K = 15,
	IQ2_XXS = 16,
	IQ2_XS = 17,
	IQ3_XXS = 18,
	IQ1_S = 19,
	IQ4_NL = 20,
	IQ3_S = 21,
	IQ2_S = 22,
	IQ4_XS = 23,
}
```
Enums are not, strictly speaking, types, since they expose objects at runtime.
End of a not-super-useful pedantic note, haha. cc @coyotte508
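A small sketch of the pedantic point above: a numeric TypeScript enum compiles to a real object at runtime, with reverse mappings from values back to names, whereas a pure type is fully erased.

```typescript
// Abbreviated stand-in for the quantization enum:
enum Quant {
	F32 = 0,
	F16 = 1,
}

// Forward mapping (name -> value) and reverse mapping (value -> name)
// both exist on the emitted object:
console.log(Quant.F32); // 0
console.log(Quant[1]); // "F16"
console.log(typeof Quant); // "object" — unlike a pure type, which is erased
```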
```ts
export type RWKV = ModelBase<"rwkv"> & { "rwkv.architecture_version": number };
export type LLM = TransformerLLM | RWKV;
export type Whisper = ModelBase<"encoder.whisper"> & ModelBase<"decoder.whisper">;
export type Model = (LLM | Whisper) & Partial<Tokenizer>;
```
very neat types (though they make my head hurt a bit, lol)
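To unpack how these unions and intersections compose, here is a hedged sketch; `ModelBase` and `Tokenizer` below are simplified stand-ins (not the real definitions), keyed the way GGUF metadata keys look:

```typescript
// Simplified stand-ins, for illustration only:
type ModelBase<TArch extends string> = { [K in `${TArch}.context_length`]: number };
type Tokenizer = { "tokenizer.ggml.model": string };

type RWKV = ModelBase<"rwkv"> & { "rwkv.architecture_version": number };
type Whisper = ModelBase<"encoder.whisper"> & ModelBase<"decoder.whisper">;
type Model = (RWKV | Whisper) & Partial<Tokenizer>;

// The RWKV branch; the tokenizer keys stay optional via Partial<Tokenizer>:
const rwkv: RWKV = { "rwkv.context_length": 4096, "rwkv.architecture_version": 4 };
// The Whisper branch must carry both encoder and decoder keys:
const whisper: Whisper = {
	"encoder.whisper.context_length": 1500,
	"decoder.whisper.context_length": 448,
};
// Both satisfy the combined Model type:
const models: Model[] = [rwkv, whisper];
console.log(models.length); // 2
```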
"llama", | ||
"mpt", | ||
"gptneox", | ||
"gptj", | ||
"gpt2", | ||
"bloom", | ||
"falcon", | ||
"gemma", | ||
"rwkv", | ||
"whisper", |
"llama", | |
"mpt", | |
"gptneox", | |
"gptj", | |
"gpt2", | |
"bloom", | |
"falcon", | |
"gemma", | |
"rwkv", | |
"whisper", | |
"llama", | |
"falcon", | |
"baichuan", | |
"gpt2", | |
"gptj", | |
"gptneox", | |
"mpt", | |
"starcoder", | |
"persimmon", | |
"refact", | |
"bert", | |
"nomic-bert", | |
"bloom", | |
"stablelm", | |
"qwen", | |
"qwen2", | |
"phi2", | |
"plamo", | |
"codeshell", | |
"orion", | |
"internlm2", | |
"minicpm", | |
"gemma", | |
"starcoder2", | |
"mamba", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(optional, but it's the current list from the llama.cpp source of truth IIUC)
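One way to keep such a list as the single source of truth (sketch only, list abbreviated and the names hypothetical): declare it `as const` and derive the string-literal union type from it, so adding an architecture to the array automatically widens the type.

```typescript
// Abbreviated const list; the real one would mirror llama.cpp's set.
const LLM_ARCHITECTURES = ["llama", "falcon", "gpt2", "gemma", "mamba"] as const;

// Union type derived from the array: "llama" | "falcon" | "gpt2" | ...
type LlmArchitecture = (typeof LLM_ARCHITECTURES)[number];

const arch: LlmArchitecture = "gemma"; // typechecks; an unknown name would not
console.log(LLM_ARCHITECTURES.includes(arch)); // true
```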
I would merge as-is and iterate later.
GGUF: add types. Follow-up to #540 (comment).
No validation of any kind, just types.
cc: @biw also