Activity

Update git repository

countzeropushed 1 commit to main • 264c978…fdf541b •

on Feb 17

Fix ValidateSet in server.ps1

countzeropushed 1 commit to main • 2d04c87…264c978 •

on Feb 13

Update OpenBLAS to v0.3.29

countzeropushed 1 commit to main • f28666f…2d04c87 •

on Feb 10

Add speculative decoding example

countzeropushed 1 commit to main • 2bd590c…f28666f •

on Jan 22

Update changelog

countzeropushed 1 commit to main • c0325d2…2bd590c •

on Jan 9

Fix path to gguf_dump.py

countzeropushed 2 commits to main • 44c688d…c0325d2 •

on Jan 9

[Server] Update changelog

countzeropushed 1 commit to main • d2c5d17…44c688d •

on Nov 26, 2024

[Server] Add -additionalArguments option

countzeropushed 1 commit to main • a1e9524…d2c5d17 •

on Nov 26, 2024

Remove n-predict

countzeropushed 1 commit to main • 1887a24…a1e9524 •

on Nov 13, 2024

Update changelog

countzeropushed 1 commit to main • 1fb472e…1887a24 •

on Oct 14, 2024

Remove deprecated self extending context window from server example

countzeropushed 1 commit to main • c894c4d…1fb472e •

on Oct 14, 2024

Remove --log-disable from llama-server example

countzeropushed 1 commit to main • 2ea8e7b…c894c4d •

on Sep 26, 2024

Update Changelog and use gemma-2-9b-it-IQ4_XS.gguf model across all e…

countzeropushed 1 commit to main • 417d993…2ea8e7b •

on Aug 3, 2024

Update python instructions

countzeropushed 1 commit to main • 029f7b8…417d993 •

on Jul 27, 2024

Fix wikitext URI

countzeropushed 1 commit to main • 57e5e3f…029f7b8 •

on Jul 24, 2024

Update OpenBLAS to 0.3.27

countzeropushed 1 commit to main • 6314038…57e5e3f •

on Jul 24, 2024

Add chat template option

countzeropushed 1 commit to main • 89577c2…6314038 •

on Jul 23, 2024

Remove chrome startup

countzeropushed 1 commit to main • c275774…89577c2 •

on Jul 23, 2024

Add -help option

countzeropushed 1 commit to main • dfc8339…c275774 •

on Jul 22, 2024

Add -help option

countzeropushed 1 commit to main • 8c8be19…dfc8339 •

on Jul 22, 2024

Add human readable file size

countzeropushed 1 commit to main • eab0da7…8c8be19 •

on Jul 22, 2024

Add human readable file size

countzeropushed 1 commit to main • b9bbe4b…eab0da7 •

on Jul 22, 2024

Fix benchmark order

countzeropushed 1 commit to main • 0b397cf…b9bbe4b •

on Jul 10, 2024

Add llama-bench example

countzeropushed 1 commit to main • 4889372…0b397cf •

on Jul 10, 2024

Add missing tiktoken package to support GLM models

countzeropushed 1 commit to main • 82cb614…4889372 •

on Jul 8, 2024

Fix renaming of gguf_dump.py

countzeropushed 1 commit to main • d9cf0c7…82cb614 •

on Jul 5, 2024

Default KV cache type to f16

countzeropushed 1 commit to main • 71ad08f…d9cf0c7 •

on Jun 27, 2024

Fix CUDA build

countzeropushed 1 commit to main • 5f647e6…71ad08f •

on Jun 27, 2024

Fix torch package 2.2.1+cu121

countzeropushed 1 commit to main • 3cc80e7…5f647e6 •

on Jun 21, 2024

Update torch package to 2.2.1+cu121

countzeropushed 1 commit to main • dd6d6db…3cc80e7 •

on Jun 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update git repository

Fix ValidateSet in server.ps1

Update OpenBLAS to v0.3.29

Add speculative decoding example

Update changelog

Fix path to gguf_dump.py

[Server] Update changelog

[Server] Add -additionalArguments option

Remove n-predict

Update changelog

Remove deprecated self extending context window from server example

Remove --log-disable from llama-server example

Update Changelog and use gemma-2-9b-it-IQ4_XS.gguf model across all e…

Update python instructions

Fix wikitext URI

Update OpenBLAS to 0.3.27

Add chat template option

Remove chrome startup

Add -help option

Add -help option

Add human readable file size

Add human readable file size

Fix benchmark order

Add llama-bench example

Add missing tiktoken package to support GLM models

Fix renaming of gguf_dump.py

Default KV cache type to f16

Fix CUDA build

Fix torch package 2.2.1+cu121

Update torch package to 2.2.1+cu121