Activity

Update git repository

countzero pushed 1 commit to main • 264c978…fdf541b
on Feb 17

Fix ValidateSet in server.ps1

countzero pushed 1 commit to main • 2d04c87…264c978
on Feb 13

Update OpenBLAS to v0.3.29

countzero pushed 1 commit to main • f28666f…2d04c87
on Feb 10

Add speculative decoding example

countzero pushed 1 commit to main • 2bd590c…f28666f
on Jan 22

Update changelog

countzero pushed 1 commit to main • c0325d2…2bd590c
on Jan 9

Fix path to gguf_dump.py

countzero pushed 2 commits to main • 44c688d…c0325d2
on Jan 9

[Server] Update changelog

countzero pushed 1 commit to main • d2c5d17…44c688d
on Nov 26, 2024

[Server] Add -additionalArguments option

countzero pushed 1 commit to main • a1e9524…d2c5d17
on Nov 26, 2024

Remove n-predict

countzero pushed 1 commit to main • 1887a24…a1e9524
on Nov 13, 2024

Update changelog

countzero pushed 1 commit to main • 1fb472e…1887a24
on Oct 14, 2024

Remove deprecated self extending context window from server example

countzero pushed 1 commit to main • c894c4d…1fb472e
on Oct 14, 2024

Remove --log-disable from llama-server example

countzero pushed 1 commit to main • 2ea8e7b…c894c4d
on Sep 26, 2024

Update Changelog and use gemma-2-9b-it-IQ4_XS.gguf model across all e…

countzero pushed 1 commit to main • 417d993…2ea8e7b
on Aug 3, 2024

Update python instructions

countzero pushed 1 commit to main • 029f7b8…417d993
on Jul 27, 2024

Fix wikitext URI

countzero pushed 1 commit to main • 57e5e3f…029f7b8
on Jul 24, 2024

Update OpenBLAS to 0.3.27

countzero pushed 1 commit to main • 6314038…57e5e3f
on Jul 24, 2024

Add chat template option

countzero pushed 1 commit to main • 89577c2…6314038
on Jul 23, 2024

Remove chrome startup

countzero pushed 1 commit to main • c275774…89577c2
on Jul 23, 2024

Add -help option

countzero pushed 1 commit to main • dfc8339…c275774
on Jul 22, 2024

Add -help option

countzero pushed 1 commit to main • 8c8be19…dfc8339
on Jul 22, 2024

Add human readable file size

countzero pushed 1 commit to main • eab0da7…8c8be19
on Jul 22, 2024

Add human readable file size

countzero pushed 1 commit to main • b9bbe4b…eab0da7
on Jul 22, 2024

Fix benchmark order

countzero pushed 1 commit to main • 0b397cf…b9bbe4b
on Jul 10, 2024

Add llama-bench example

countzero pushed 1 commit to main • 4889372…0b397cf
on Jul 10, 2024

Add missing tiktoken package to support GLM models

countzero pushed 1 commit to main • 82cb614…4889372
on Jul 8, 2024

Fix renaming of gguf_dump.py

countzero pushed 1 commit to main • d9cf0c7…82cb614
on Jul 5, 2024

Default KV cache type to f16

countzero pushed 1 commit to main • 71ad08f…d9cf0c7
on Jun 27, 2024

Fix CUDA build

countzero pushed 1 commit to main • 5f647e6…71ad08f
on Jun 27, 2024

Fix torch package 2.2.1+cu121

countzero pushed 1 commit to main • 3cc80e7…5f647e6
on Jun 21, 2024

Update torch package to 2.2.1+cu121

countzero pushed 1 commit to main • dd6d6db…3cc80e7
on Jun 21, 2024