Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

No Praat TextGrid file output #9

Closed
loretoparisi opened this Issue · 18 comments

2 participants

Loreto Parisi Kyle Gorman
Loreto Parisi

Running a training and a sample test alignement, the procedure ends up with no TextGrid data, and no errors:

macbookproloreto:Prosodylab-Aligner loreto$ ./align.py -s 44010 -t data data
Nearest viable SR is 40000 Hz
Initializing...
Training...
Modeling silence...
More training...
Realigning...
WARNING [-8221] InitPronHolders: Total of 296 duplicate pronunciations removed in HVite
More training...
Final aligning...
Making TextGrids...
Alignment complete.

No TextGrid file was found in ./data/ directory

Kyle Gorman
Owner
Loreto Parisi

When using the bash script it ends up with

macbookproloreto:Prosodylab-Aligner loreto$ ./align_ex.sh data/River.wav data/River.lab
Initializing...
Aligning...
ERROR [+8522] LatFromPaths: Align have dur<=0
FATAL ERROR - Terminating program HVite
Making TextGrids...
Alignment complete.
mv: rename .dat/River.TextGrid to ./River.TextGrid: No such file or directory
Output is in River.TextGrid.

I've found that the error

LatFromPaths: Align have dur<=0

was due to a error in a conditional check in HTKLib/HRec.c, lines 1626 and 1651 where

labid != splabid

should be replaced with

labpr != splabid

After patching and compiling HTK again, it worked in some cases, but now it stopped working in almost all other alignment tests.

Kyle Gorman
Owner
Loreto Parisi

Hi Kyle,
regarding the bug on HTK, here is the citation:

http://speechtechie.wordpress.com/2009/06/12/using-htk-3-4-1-on-mac-os-10-5/

I'm new to HTK, so I cannot say if it has been accepted, but I will look into.

Regarding, the pronunciation dictionary, I guess I miss something in my workflow. I'm going to build the dictionary again.

I will be back to you as soon as I generate the dictionary again.

Thanks!

Kyle Gorman
Owner

I don't see any reason to assume that that bug is real. If changing a line makes it work, but breaks other things, it's probably not a bug. (And who is Felix, and why didn't he submit the bug to the HTK bugtracker?).

How much audio do you have for training?

Loreto Parisi

Yes I suppose you're right :+1: and I don't know who is Felix. The audio is about 4 minutes.

Kyle Gorman
Owner

FYI I just pushed some new features: try the newest version when you get a chance.

Loreto Parisi

Thanks I tried it and now (without improving the pronunciation dictionary) it capture the error

macbookproloreto:Prosodylab-Aligner loreto$ ./align_ex.sh data/River.wav data/River.lab
Initializing... done.
Aligning... ERROR [+8522] LatFromPaths: Align have dur<=0
FATAL ERROR - Terminating program HVite
Traceback (most recent call last):
File "./align.py", line 733, in
scores_txt))
File "./align.py", line 436, in align_and_score
raise CalledProcessError(retcode, 'HVite')
subprocess.CalledProcessError: Command 'HVite' returned non-zero exit status 74
Alignment failed.
macbookproloreto:

Kyle Gorman
Owner
Kyle Gorman
Owner
Loreto Parisi

Hi Kyle,
as a test, I tried to use The Penn Phonetics Lab Forced Aligner, http://www.ling.upenn.edu/phonetics/p2fa/
and it gave me a similar forced alignment that I was able to do in my first tests with Prosodylab, that I was not able to reproduce later. Actually I achieved the first result with HTK 3.4.1 and the previous update of Prosodylab-Aligner, but I guess I was lucky, because when I changed something in the configuration, I've got the error on HVite.

Considering that p2f seems to work (no errors coming from the HVite) we could say that the patched version of HVite worked out (Felix patch), but I cannot be sure of that in any case.

Going to send you these results by email.

Kyle Gorman
Owner
Kyle Gorman
Owner
Loreto Parisi

dont know why, but sending by email it was poste by yourself :8ball:

Kyle Gorman
Owner

I have a good guess why: GitHub passes around fancy reply-to headers and you happened to include the ones associated with my username when you replied. But interesting engineering!

Kyle Gorman
Owner

Loreto, okay if I close this issue?

Loreto Parisi

Hi Kyle,
yes I suppose so.
My idea is to apply further analysis to make a good training set and apply Prosodylab scripts after that to this training set. For my knowledge Goto et alia applied this solution (HVite at the basis but different specific training set for the HMMs phonemes for cluster of artists/genres/etc). He also applies the harmonic melody media/variance calculations of the F0 function to discretize the audio source in order to have a better knowledge of the SIL intervals.

It could be interesting to fork this process and continue the work with Prosodylab in the specific case of the Music if it makes sense of course.

Of course this comments is outside the specific problem I raised here, it could be a specific discussione somewhere else maybe.

Thanks! Let's see what happens.

Kyle Gorman
Owner

Best of luck , and feel free to fork it!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Something went wrong with that request. Please try again.