Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a reference dataset to perform quality checks #905

Open
1 task
Tracked by #5538
teolemon opened this issue Oct 10, 2017 · 1 comment
Open
1 task
Tracked by #5538

Create a reference dataset to perform quality checks #905

teolemon opened this issue Oct 10, 2017 · 1 comment
Labels
🧽 Data quality https://wiki.openfoodfacts.org/Quality dataset creation

Comments

@teolemon
Copy link
Member

teolemon commented Oct 10, 2017

What

  • Create a reference dataset to perform quality checks

Part of

@teolemon teolemon added the 🧽 Data quality https://wiki.openfoodfacts.org/Quality label Oct 10, 2017
@teolemon teolemon added this to the Data Quality Checks milestone Oct 10, 2017
@CharlesNepote
Copy link
Member

CharlesNepote commented Jan 7, 2019

Do you mean a test dataset to test quality checks tools?

Why? Production dataset returns too many answers? Production dataset is too big to use locally? Why not create an extract based on a random extraction of 1 or 2% of the production database? 1% of the production database should be around 3500 products and 10 Mb.

This dataset could be implemented inside the docker container?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🧽 Data quality https://wiki.openfoodfacts.org/Quality dataset creation
Projects
Status: To discuss and validate
Status: To do
Development

No branches or pull requests

2 participants