
Highlights

  • Arctic Code Vault Contributor

Organizations

@paracrawl @bitextor

Pinned

  1. Bicleaner is a parallel corpus classifier/cleaner that aims to detect noisy sentence pairs in a parallel corpus.

    Python 64 10

  2. Tool to fix bitexts and tag near-duplicates for removal

    Python 5

  3. Anonymizer module for Bicleaner's pipeline (WIP)

    Python 3

  4. Repository for data models, dictionaries and more resources for Bitextor

    2

  5. Forked from loomchild/segment

    Program used to split text into segments

    Java 1

  6. Tool for manual evaluation of parallel sentences.

    PHP 7 1

99 contributions in the last year

Activity overview
Contributed to bitextor/bicleaner, bitextor/bifixer, bitextor/biroamer and 2 other repositories

Contribution activity

October 2020

mbanon has no activity yet for this period.

