v0.7.0
What's Changed
- Fix typos by @omahs in #380
- filters: fix C4BadWordsFilter _get_badwords lang shadowing (fixes #377) by @dipampaul17 in #379
- chore: fixed a broken link in the documentation summary_stats by @Olexandr88 in #382
- [BUG Fix] Launching dependent
LocalPipelineExecutors withskip_completed=Falselead to waiting by @silverriver in #300 - Allow the postprocess_fn to take self as a parameter by @JoelNiklaus in #391
- fix: typos by @DeVikingMark in #386
- docs: fixed a broken link in the documentation stats by @Olexandr88 in #383
- Add support to load HF dataset from disk by @iamgroot42 in #385
- ensure folder_path has consistent usage by @hynky1999 in #366
- fixes #388 by @zinccat in #389
- Fix sentinel condition in rust mh3 by @jordane95 in #394
- bugfixes + warnings + callback option for inference runner by @guipenedo in #395
- general bugfixes by @guipenedo in #396
- add additional verification after minhash step1 by @guipenedo in #404
- Inference runner refactoring: rollouts, gen params, etc by @guipenedo in #398
- Fix missing param by @shallyan in #407
- Multi-Node Distributed Inference Support by @hynky1999 in #406
- ray nits by @hynky1999 in #403
- fix parquet last batch + slurm srun option by @guipenedo in #409
New Contributors
- @omahs made their first contribution in #380
- @dipampaul17 made their first contribution in #379
- @Olexandr88 made their first contribution in #382
- @DeVikingMark made their first contribution in #386
- @iamgroot42 made their first contribution in #385
- @zinccat made their first contribution in #389
- @shallyan made their first contribution in #407
Full Changelog: v0.6.0...v0.7.0