Skip to content

@psibre psibre released this Aug 21, 2014 · 1610 commits to master since this release

This is a feature release, adding new features while maintaining compatibility with existing 4.x voices.

This release marks the final results of work on MARY TTS in the ​PAVOQUE project, in which we experimented with different technologies for adding expressivity to unit selection synthesis.
The release makes available those project results that may be of interest to a wider audience.

New features for expressive unit selection synthesis

  • selecting style from a multi-style database using a symbolic style feature;
  • imposing target prosody using FD-PSOLA signal modification.

Style can be selected using RAWMARYXML's <prosody style="..."> markup (see new expressive voice, below).

Prosody modification is available for all unit selection voices, including older ones;
to activate it, click the checkbox "Apply prosody modification" in the ​web interface.
This feature should be considered experimental, and the quality depends on many factors, including the accuracy of the pitchmarks used for building the unit selection voice.
While this feature is likely to lead to reduced quality, it enables research on expressive prosody with unit selection voices.

For more information on the MaryXML <prosody> markup which can now be applied to all types of MARY voices, see ProsodySpecificationSupport.

New expressive voice

  • we release the multi-style expressive German voice 'dfki-pavoque-styles' (660 MB) built from the full PAVOQUE corpus;
    see ​Steiner et al. (2010) for a description of this corpus.
    The different styles can be requested using RAWMARYXML <prosody style="A_STYLE">...</prosody>, where A_STYLE is one of happy, angry, sad, and poker.

New language: Russian

  • Nickolay Shmyrev has kindly made available language support for Russian, as well as the Russian male unit selection voice voxforge-ru-nsh.
    Thanks Nickolay!


Assets 3
You can’t perform that action at this time.