Skip to content


  • Arctic Code Vault Contributor


@paracrawl @bitextor


  1. Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

    Python 64 10

  2. Tool to fix bitexts and tag near-duplicates for removal

    Python 5

  3. Anonymizer module for Bicleaner's pipeline (WIP)

    Python 3

  4. Repository for data models, dictionaries and more resources for Bitextor


  5. Forked from loomchild/segment

    Program used to split text into segments

    Java 1

  6. Tool for manual evaluation of parallel sentences.

    PHP 7 1

99 contributions in the last year

Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Mon Wed Fri
Activity overview
Contributed to bitextor/bicleaner, bitextor/bifixer, bitextor/biroamer and 2 other repositories

Contribution activity

October 2020

mbanon has no activity yet for this period.

September 2020

Seeing something unexpected? Take a look at the GitHub profile guide.

You can’t perform that action at this time.