Skip to content

Model v3

Choose a tag to compare

@avafloww avafloww released this 25 Feb 07:13
· 2 commits to main since this release

Model v3 — EOS Fix

Training improvements:

  • Added EOS token to completions — model now learns when to stop generating, eliminating hallucinated multi-line garbage
  • Aligned prompt format — leading space moved from completion into prompt (OP: instead of OP: + )

Results:

  • Single-op cases (typos, sudo) produce clean single results
  • Multi-alt cases (ambiguous errors) still produce valid alternatives
  • No more nonsensical hallucinations after the first valid op

Known limitations (targeted for v4):

  • Augmentation data has low diversity — myapp placeholder overrepresented (~7% of training data), causing hallucinations on unfamiliar commands