Model v3

avafloww released this 25 Feb 07:13

· 2 commits to main since this release

d29b3d3

Model v3 — EOS Fix

Training improvements:

Added EOS token to completions — model now learns when to stop generating, eliminating hallucinated multi-line garbage
Aligned prompt format — leading space moved from completion into prompt (OP: instead of OP: + )

Results:

Single-op cases (typos, sudo) produce clean single results
Multi-alt cases (ambiguous errors) still produce valid alternatives
No more nonsensical hallucinations after the first valid op

Known limitations (targeted for v4):

Augmentation data has low diversity — myapp placeholder overrepresented (~7% of training data), causing hallucinations on unfamiliar commands

Assets 4