Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Visualizing the predicted alignments #33

Open
snalyami opened this issue Mar 25, 2023 · 1 comment
Open

Visualizing the predicted alignments #33

snalyami opened this issue Mar 25, 2023 · 1 comment

Comments

@snalyami
Copy link

snalyami commented Mar 25, 2023

Thank you so much for releasing your code!

I'm trying to visualize the predicted alignments but the timings in the out.output-hypothesis-dev.ctm.sgml and the out.output-hypothesis-test.ctm.sgml files don’t make sense. The timings t1+t2 indicate the start time + end time? This means that the gloss es overlap. Also I get the same timings for all the samples e.g.
0.000+0.010
0.010+0.030
0.020+0.050
0.030+0.07

Example from the out.output-hypothesis-test.ctm.sgml file:

<SYSTEM title="./work_dir/baseline_res18_SD_VAC_Phoenix/out.output-hypothesis-test.ctm" ref_fname="./work_dir/baseline_res18_SD_VAC_Phoenix/tmp.stm" hyp_fname="./work_dir/baseline_res18_SD_VAC_Phoenix/out.output-hypothesis-test.ctm" creation_date="Sun Dec 11 22:13:30 2022" format="2.4" frag_corr="FALSE" opt_del="FALSE" weight_ali="FALSE" weight_filename="">
<SPEAKER id="signer04">
<PATH id="(signer04-000)" word_cnt="6" file="01april_2010_thursday_heute_default-5" channel="1" sequence="0" R_T1="0.000" R_T2="inf" word_aux="h_t1+t2">
C,"aber","aber",0.000+0.010:S,"freuen","woche",0.010+0.030:C,"morgen","morgen",0.020+0.050:C,"sonne","sonne",0.030+0.070:S,"selten","kaum",0.040+0.090:C,"regen","regen",0.050+0.110
</PATH>
<PATH id="(signer04-001)" word_cnt="7" file="01april_2010_thursday_tagesschau_default-7" channel="1" sequence="1" R_T1="0.000" R_T2="inf" word_aux="h_t1+t2">
C,"samstag","samstag",0.000+0.010:C,"wechselhaft","wechselhaft",0.010+0.030:C,"besonders","besonders",0.020+0.050:C,"freundlich","freundlich",0.030+0.070:D,"nordost",,:S,"bisschen","nord",0.040+0.090:S,"bereich","ix",0.060+0.130
</PATH>
<PATH id="(signer04-002)" word_cnt="7" file="01april_2010_thursday_tagesschau_default-8" channel="1" sequence="2" R_T1="0.000" R_T2="inf" word_aux="h_t1+t2">
C,"sonntag","sonntag",0.000+0.010:C,"regen","regen",0.010+0.030:C,"teil","teil",0.020+0.050:C,"gewitter","gewitter",0.030+0.070:C,"suedost","suedost",0.040+0.090:D,"durch",,:C,"regen","regen",0.050+0.110
</PATH>
@ycmin95
Copy link
Collaborator

ycmin95 commented Mar 27, 2023

Hi, I'd like to clarify that we do not currently utilize SGML files to visualize the alignment. However, you may refer to the discussion about the alignment in this issue. If you have any further inquiries or difficulties, please do not hesitate to post them here - we'd be happy to help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants