xllama 0.2.1
What's Changed
Fixed
- ChatML stop sequence: add
<|im_end|>as stop sequence in UI inference path. SmolLM2-360M does not always emit EOS naturally (bench reaches n=990 with max_length=1024); without this fix the model could generate filler or hallucinate the next user turn up to n_predict=512 tokens beyond end-of-turn. Bench path (run\_inference) unchanged. - CHANGELOG structure: collapsed duplicate
### Addedblocks in 0.2.0 section; recovered missingOrtModelPtr → OgaModelPtrfix entry.
Tests
- Add
tests/test_session.cpp: two smoke tests forSession::createerror paths (non-existent path, empty path). Covers the Linux/llama.cppLlamaSessionconstructor branch previously unexercised by CI.
Full Changelog: v0.2.0...v0.2.1