Skip to content

xllama 0.2.1

Choose a tag to compare

@gianlucamazza gianlucamazza released this 22 May 22:47

What's Changed

Fixed

  • ChatML stop sequence: add <|im_end|> as stop sequence in UI inference path. SmolLM2-360M does not always emit EOS naturally (bench reaches n=990 with max_length=1024); without this fix the model could generate filler or hallucinate the next user turn up to n_predict=512 tokens beyond end-of-turn. Bench path (run\_inference) unchanged.
  • CHANGELOG structure: collapsed duplicate ### Added blocks in 0.2.0 section; recovered missing OrtModelPtr → OgaModelPtr fix entry.

Tests

  • Add tests/test_session.cpp: two smoke tests for Session::create error paths (non-existent path, empty path). Covers the Linux/llama.cpp LlamaSession constructor branch previously unexercised by CI.

Full Changelog: v0.2.0...v0.2.1