faseAlign Documentation

faseAlign is a Python-based suite for automatic forced alignment of Spanish transcriptions to audio, built on the HTK Speech Recognition Toolkit. The included acoustic models and dictionary are appropriate for many varieties of Latin American Spanish (see :ref:`development` for more details).

Features

  • force-aligned .TextGrid output files
  • support for .txt or .TextGrid input transcriptions
  • custom dictionaries
  • support for stereo audio
  • automatic phonemicization of unknown words
  • automatic syllabification and tonicity

Citation

Wilbanks, E. (|year|). faseAlign (Version |version|) [Computer software]. Retrieved |today| from https://github.com/EricWilbanks/faseAlign.

Contribute

Contact

If you run into issues or have questions about installing or using faseAlign, please feel free to contact me at wilbanks.ericw@gmail.com.

.. toctree::
   :hidden:
   :maxdepth: 2
   :caption: Contents:
   :numbered:

   installation
   preparation
   aligning
   development
   changelog