v0.5.2

@jagmarques tagged this 13 Jun 16:47
KV cache dtype fix (quant_only on fp32/CPU) + fp32 eviction guard