Skip to content

ternative v1.0.0

Latest

Choose a tag to compare

@michelangeloromerochisco michelangeloromerochisco released this 19 May 01:32
· 24 commits to main since this release

ternative v1.0.0 - First release. Runtime LoRA merge, I2_S support, OpenAI-compatible server, GPU decode ~6-7 tok/s on RTX 3050. See README for full details.