v0.9.6.2 mixture-of-experts training

@bghira bghira released this 06 Jun 00:53
0e53ca5

What's Changed

Mixture-of-Experts

Mixture-of-Experts training is now available, along with a brief tutorial on how to accelerate your training and start producing mind-blowing results.

  • DeepSpeed fix (#424)
  • Parquet backend fixes for different dataset sources
  • Parquet backend JSON / JSONL support
  • Updated check for aspect ratio mismatch to be more reliable by @bghira in #427
  • Minor bugfixes for SD 2.x / ControlNet / SDXL refiner training by @bghira in #428
  • Mixture-of-experts training via Segmind models by @bghira in #429

Full Changelog: v0.9.6.1...v0.9.6.2