Releases: avafloww/shit
Releases · avafloww/shit
Model v4
Model v4 — Diverse Augmentation
Training data now uses real-world package names from PyPI (800+), npm (1300+), crates.io (500+), plus curated docker images, repo names, and system packages. No more myapp hallucinations.
Improvements
pytohn→python— previously hallucinatedmyapp:v1.0, now correctrm mydir/→rm -rf mydir/— previously garbled, now correctpip installon PEP 668 — now suggestsuvx,pipx, and venv creationdocker pspermission — now suggests bothsudoandusermod -aG docker- All v3 fixes retained (clean EOS stopping, multi-alt where appropriate)
Stats
- 60K training examples with 2600+ unique package/project names
- No single placeholder exceeds 0.1% of training data (was 7% for
myapp) - Train loss: 0.099, eval loss: 0.068
Model v3
Model v3 — EOS Fix
Training improvements:
- Added EOS token to completions — model now learns when to stop generating, eliminating hallucinated multi-line garbage
- Aligned prompt format — leading space moved from completion into prompt (
OP:instead ofOP:+)
Results:
- Single-op cases (typos, sudo) produce clean single results
- Multi-alt cases (ambiguous errors) still produce valid alternatives
- No more nonsensical hallucinations after the first valid op
Known limitations (targeted for v4):
- Augmentation data has low diversity —
myappplaceholder overrepresented (~7% of training data), causing hallucinations on unfamiliar commands