Skip to content

Code for "Lost in Interpretation: Predicting Untranslated Terminology in Simultaneous Interpretation" at NAACL 2019

Notifications You must be signed in to change notification settings

nvog/lost-in-interpretation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lost-in-interpretation

Code for "Lost in Interpretation: Predicting Untranslated Terminology in Simultaneous Interpretation" at NAACL 2019

Untranslated Term Annotations

Untranslated term annotations for the NAIST Simultaneous Translation Corpus will be provided upon request (via email) after confirmation that you have access to the corpus, available at https://ahcweb01.naist.jp/resource/stc/.

Other Requirements

The feature code also requires the EIJIRO English-Japanese bilingual dictionary, which you'll need to purchase here: http://www.eijiro.jp/get-144.htm. Alternatively, you could construct your own bilingual dictionary using NLP tools.

Ngram word frequencies are obtained using the Google 1T Ngrams corpus: https://catalog.ldc.upenn.edu/LDC2006T13.

About

Code for "Lost in Interpretation: Predicting Untranslated Terminology in Simultaneous Interpretation" at NAACL 2019

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages