Skip to content

adliska/parallel_text_cleaning

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 

Cleaning of Parallel Texts for Machine Translation

Code for my BSc thesis. More info at http://www.adliska.com/publications/#theses.

Abstract: The aim of the thesis is to design, implement and manually evaluate filters for parallel data cleaning, focused on statistical machine translation. Annotated sets of parallel texts to be used during development of new filters in the future are another result of this work. Several tools facilitating work with these sets and allowing for automatic evaluation of filter outputs are also developed.

About

Code for my BSc thesis: Cleaning of Parallel Texts for Machine Translation

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages