Bot-generated sentences #1492

Open
trang opened this Issue Aug 13, 2017 · 4 comments

trang commented Aug 13, 2017

Problem

Lately a large number of sentences have been added, seemingly by a bot.

https://tatoeba.org/eng/sentences/of_user/VITAE
https://tatoeba.org/eng/sentences/of_user/Strategos
https://tatoeba.org/eng/sentences/of_user/Alva

Here's a sample of the kind of sentences added:

Il désire gamberger.
Miou-Miou s'échappe.
Le cuisinier jongle.
Des scouts pleurent.
Vous alliez partout.
L'architecte dérive.
Des fakirs mèneront.
J'aplatis ces mises.
Des bébés embraient.
La voyageuse stagne.
Un groupe se coiffe.
Nous allons à Kyoto.
Des fumeurs aboient.
Le traitre guerroie.

Many of these sentences don't make much sense. While they aren't all incorrect, overall they add little value to the corpus.

Possible solution

Just as we've put a limit on the number of private messages that new users can send per day, we could put a limit on how many sentences a new contributor can add per day. This would at least give admins more time to react and prevent thousands of nonsensical sentences from being added.
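
To make the idea concrete, here is a minimal sketch of a per-day cap for new contributors. It is written in Python for illustration only and is not taken from the Tatoeba codebase. The names `Contributor` and `DailySentenceLimiter`, the 14-day "new user" window and the limit of 20 sentences per day are all assumptions; the real values would need to be discussed.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date, timedelta

# Assumed values, for illustration only.
NEW_USER_PERIOD = timedelta(days=14)
NEW_USER_DAILY_LIMIT = 20


@dataclass(frozen=True)
class Contributor:
    """Hypothetical stand-in for a user record."""
    username: str
    joined: date


class DailySentenceLimiter:
    """Caps how many sentences a new contributor can add per calendar day."""

    def __init__(self, limit=NEW_USER_DAILY_LIMIT, period=NEW_USER_PERIOD):
        self.limit = limit
        self.period = period
        # (username, day) -> number of sentences added on that day
        self._counts = defaultdict(int)

    def is_new(self, user, today):
        """A contributor counts as new until `period` has passed since joining."""
        return today - user.joined < self.period

    def try_add(self, user, today):
        """Record one sentence and return True, or return False if the cap is hit."""
        key = (user.username, today)
        if self.is_new(user, today) and self._counts[key] >= self.limit:
            return False
        self._counts[key] += 1
        return True
```

Established contributors are never blocked in this sketch; only accounts younger than the assumed window are counted against the cap, and the count resets each day.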

Ppjet6 commented Aug 16, 2017

halfdan commented Oct 19, 2017

It might also be a good idea to think about a level system where you have to contribute a couple of sentences and get those reviewed by another member before being allowed to continue.
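
A rough sketch of how such a gate could work, in Python for illustration only. Nothing here reflects existing Tatoeba code: `ProbationGate`, `Sentence`, and the threshold of 5 reviewed sentences are hypothetical, and the rule that a member cannot review their own sentences is an assumption.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed threshold, for illustration only.
PROBATION_SENTENCES = 5


@dataclass
class Sentence:
    """Hypothetical sentence record with an optional reviewer."""
    text: str
    author: str
    approved_by: Optional[str] = None


class ProbationGate:
    """Lets a new contributor add a few sentences, then waits for reviews."""

    def __init__(self, required=PROBATION_SENTENCES):
        self.required = required
        self.sentences = []

    def _count(self, author, approved):
        return sum(
            1 for s in self.sentences
            if s.author == author and (s.approved_by is not None) == approved
        )

    def can_add(self, author):
        approved = self._count(author, approved=True)
        if approved >= self.required:
            return True  # probation passed, no further gating
        pending = self._count(author, approved=False)
        # On probation: at most `required` sentences in total until reviewed.
        return approved + pending < self.required

    def add(self, author, text):
        """Store the sentence and return it, or return None if the author is blocked."""
        if not self.can_add(author):
            return None
        sentence = Sentence(text=text, author=author)
        self.sentences.append(sentence)
        return sentence

    def approve(self, sentence, reviewer):
        """Mark a sentence as reviewed by another member."""
        if reviewer == sentence.author:
            raise ValueError("a contributor cannot review their own sentence")
        sentence.approved_by = reviewer
```

Once the required number of sentences has been approved by someone else, `can_add` always returns True for that author, so the gate only affects the first few contributions.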

Ppjet6 commented Oct 20, 2017

trang added the enhancement label May 23, 2018

ckjpn commented Jan 29, 2019