Skip to content

LEONA -Ubasti- AI for DiffSinger v0.0.1b (without reflow)

Pre-release
Pre-release
Compare
Choose a tag to compare
@lottev1991 lottev1991 released this 23 Apr 19:56

Initial release.

This model was trained with the old method, so it does not make use of reflow. The sound quality should be quite good, and the model's performance is fairly stable. Still, as it's still in beta, there might be some unpredictable quirks.

This model was also trained on multispeaker with Hanami's dataset; LEONA's base vocal modes contain under 1 minutes of data each. Despite this, the model is of fairly good quality.

No other parameters are currently planned to be supported, as they're considered redundant.

Note that the voicebank currently lacks art, this will be added later. The art will be all-new and is currently WIP. It uses a beta icon as a placeholder for now, and lacks a piano roll portrait.

Voice features

  • Soprano voice type (female character);
  • Cute character-style voice tone;
  • 3 vocal modes:
    • Core (normal);
    • CatNip (power);
    • CatNap (soft).

Supported languages

  • English (primary) - approx. 2 hours of data;
  • Japanese (secondary) - approx 40 min. of data.

Supported parameters

  • Random pitch shifting (gender curve);
  • Duration;
  • Auto-pitch.

Other features

  • Custom vocoder.