Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🧽 Data quality (tracker) #5538

Open
19 of 93 tasks
Tracked by #5529 ...
teolemon opened this issue Aug 23, 2021 · 1 comment
Open
19 of 93 tasks
Tracked by #5529 ...

🧽 Data quality (tracker) #5538

teolemon opened this issue Aug 23, 2021 · 1 comment
Labels
🧽 Data quality https://wiki.openfoodfacts.org/Quality 🧽 Quality facet One of the facets available in Open Food Facts is /quality & allows us to spot products w/ bad data ⏰ Stale This issue hasn't seen activity in a while. You can try documenting more to unblock it.

Comments

@teolemon
Copy link
Member

teolemon commented Aug 23, 2021

What

Before-the-fact data quality (Prevent mistakes before they happen)

Data destroying issues in production right now

List of data destroying issues in production right now

Product addition

  1. product addition products without barcodes 🧽 Data quality

Spellcheck

  1. Spellcheck 🥗 Ingredients 🧽 Data quality

On-the-fly checks

  1. Fixable via Userscript Hacktoberfest JavaScript Outreachy Perl help wanted ✨ Feature 🧽 on-the-fly quality checks 🧽 Data quality
    jnsereko
  2. ⭐ top issue ⏰ Stale 🧽 Data quality
    MonalikaPatnaik
  3. ✨ Feature 🧽 on-the-fly quality checks 🧽 API - Quality

Edit rules

  1. Fixed ? automation 🧽 Data quality 🧽 Data quality - edit rules
    stephanegigandet
  2. ⏲️ 5 minute fix 🐛 bug 🧽 Data quality 🧽 Data quality - edit rules
  3. Data destroying issue ⏲️ 5 minute fix 🐛 bug 🧽 Data quality 🧽 Data quality - edit rules
  4. 🧽 Data quality 🧽 Data quality - edit rules

Edit rules - Nutrition

  1. API WRITE Nutrition facts 🎯 Big Bang for Your Time 🎯 P1 candidate 🧽 Data quality - Nutrition 🧽 Data quality 🧽 Data quality - edit rules
    benbenben2
  2. Fixed ? Nutrition facts 🐛 bug 🧽 Data quality - Nutrition 🧽 Data quality

Ingredient list blocklist

  1. ✨ Feature 🥗 Ingredients 🧽 Data quality

Taxonomy special characters

  1. P3 categories 🐛 bug 🧬 Taxonomies 🧽 Data quality
    stephanegigandet

Ingredients consistency (languages)

  1. 🥗 Ingredients 🧽 Data quality

Old data

  1. Product Page ✨ Feature 🧽 Data quality

Verification

  1. Checked products Fixed ? 🧽 Data quality

Making quality more visible

  1. Admin tools Fixable via Userscript 🎨 Mockup available 🧽 Data quality

Remediation

No tasks being tracked yet.

Quality facets

  1. Fixable via Userscript 🧽 Data quality

Images

  1. Admin tools 🧽 Data quality

Anonymous edits & UUIDs

  1. Admin tools comments 🧽 Data quality
  2. ⏰ Stale ✏️ Editing - anonymous edits 🐛 bug 🧽 Data quality
  3. 🎯 P1 candidate 🐛 bug 🧽 Data quality
  4. 🧽 Data quality

In place edit

image rotation

  1. Admin tools Product Page inline edit ✏️ Editing - Images ✏️ Editing - drip editing 🎨 Mockup available 🖼️ Images 🧽 Data quality

Tackle variable barcodes

  1. variable-barcodes 🧽 Data quality

Ingredient analysis (false positives)

  1. Tests - ToDo 🌾 gluten 🎯 P1 candidate 🐛 bug 🥗 Ingredients 🥗🔍 Ingredients analysis 🧽 Data quality

Ingredients/Label coherence

  1. labels 🌱 Vegan 🥗 Ingredients 🧽 Data quality

Ingredients

  1. ingredient-list-cutting ✨ Feature 🌍 Multilingual products 🥗 Ingredients 🧽 Data quality
  2. 🥗 Ingredients 🧽 Data quality

Golden set

  1. dataset creation 🧽 Data quality

Search

  1. ✨ Feature 🔎 Search 🥗 Ingredients 🧽 Data quality

Producer platform

  1. ⏰ Stale ✨ Feature 🏭 Producers Platform 🧽 Data quality

Producer - One-off

  1. ⏰ Stale 🎯 P1 🏭 Producers Platform 🐛 bug 🧽 Data quality
    stephanegigandet

Countries

  1. scan statistics 🌐 i18n 🐛 bug 🧽 Data quality
    stephanegigandet
  2. ⏰ Stale 🌍 Multilingual products 🐛 bug 🧽 Data quality
    stephanegigandet

Improve the Recent changes screen

  1. Recent changes ✨ Feature 🎨 Mockup available 🧽 Data quality
  2. Recent changes ✨ Feature 🧽 Data quality
  3. ✏️ Editing ✨ Feature 🧽 Data quality

Nutrition

  1. 🐛 bug 🧽 Data quality
  2. good first issue osd'22 portions ⏰ Stale ⚖️ Quantity ✨ Feature 🧽 Data quality 🧽 Quality facet
  3. Outreachy portions ✨ Feature 🧽 Data quality 🧽 Quality facet
    benbenben2
  4. Nutrition facts ✨ Feature 🧽 Data quality 🧽 Quality facet
    stephanegigandet
  5. ⭐ top issue 🍬 Sugar 🧽 Data quality 🧽 Quality facet
    benbenben2
  6. ⏰ Stale 🐛 bug 🧽 Data quality
  7. ✨ Feature 🧽 Data quality

Nutrition - Averages

  1. 📖 Knowledge Panels 🧂 Salt 🧽 Data quality - Nutrition 🧽 Data quality

Absence of values

  1. frontend ✨ Feature 🎯 P1 candidate 🚦Nutri-Score 🧽 Data quality
    stephanegigandet

Photo/Data discrepancy

  1. Nutrition facts ✨ Feature 🧽 Data quality - Nutrition 🧽 Data quality
  2. status system ✨ Feature 🧽 Data quality

Grasp opportunities for improvement

Ingredients

  1. slack-notifications ✨ Feature 🧽 Data quality

Nutrition

  1. old-products 📊 Charts 🧽 Data quality

Checkbot

  1. checkbot 🧽 Data quality
    CharlesNepote
  2. P4 🧽 Data quality

Top-scan excellence (slack warnings)

  1. scan statistics slack-notifications 🧽 Data quality

Data augmentation to detect specific products

  1. Can be done by Robotoff automatic-data-augmentation 🧽 Data quality
    stephanegigandet

Data quality measurement

Tasks

No tasks being tracked yet.

General

  1. 🧽 Data quality 🧽 Quality facet

User reports

  1. Fixable via Userscript Product Page good first issue mockup-required 👮 Moderation 🧽 Data quality
  2. API WRITE Admin tools Fixable via Userscript P2 🎨 Mockup available 🧽 Data quality
  3. ⭐ top issue Admin tools ✨ Feature 🎯 P1 candidate 👍 Top 10 Issue! 👮 Moderation 🖼️ Images 🧽 Data quality
  4. barcodes good first issue ✨ Feature 👮 Moderation 🧽 Data quality
  5. Can be done by Robotoff OCR ⏰ Stale ✨ Feature 🤳🥫 blocking mobile apps 🧽 Data quality

NSFW and vandalism detection

  1. 🎯 Big Bang for Your Time 🎯 P1 candidate 🧽 Data quality 🧽 Data quality - edit rules 🧽 Quality facet

Barcodes

  1. barcodes 🧽 Data quality 🧽 Quality facet
  2. 🧽 Data quality 🧽 Quality facet

Ingredients

  1. Stale ⏰ Stale 💥 Merge Conflicts 🥗 Ingredients 🧽 Quality facet
    teolemon
  2. ingredients analysis ✨ Feature 🥗 Ingredients 🧽 Quality facet
  3. ✨ Feature 🧽 Data quality 🧽 Quality facet
  4. 🐛 bug 🧽 Quality facet
  5. ⭐ 🐛 top bug Hacktoberfest Outreachy Perl 🐛 bug 🥗 Ingredients 🧽 Data quality 🧽 Quality facet
    matheus-de
  6. ✨ Feature 🥗 Ingredients 🧽 Data quality 🧽 Quality facet
  7. organic products 🥗 Ingredients 🧽 Data quality 🧽 Quality facet
  8. 🐠 Fishing 🧽 Data quality 🧽 Quality facet
  9. 🌱 Eco-Score 📍 Origins 🧽 Data quality 🧽 Quality facet

Packaging

  1. ⏰ Stale ✨ Feature 📦 Packaging 🧽 Quality facet

Duplicates

  1. ✨ Feature 🧽 Data quality - Nutrition 🧽 Data quality 🧽 Quality facet

Products stored in the wrong language

  1. ✨ Feature 🧽 Quality - foreign-products-stored-in-french 🧽 Quality facet

Countries

  1. barcodes 🧽 Data quality 🧽 Quality facet

Quantity

  1. ⚖️ Quantity 🐛 bug 🧽 Data quality 🧽 Quality facet

Facets

Labels

  1. Outreachy organic products 🧪 additives 🧽 Data quality 🧽 Quality facet

Common name

  1. 🧽 Data quality 🧽 Quality facet

Image freshness

  1. data-freshness ✨ Feature 🖼️ Images 🧽 Data quality 🧽 Quality facet

Data quality remediation

Tools to fix

Ingredients

  1. 🥗 Ingredients 🧽 Data quality

Getting contributors to own their mistakes

  1. checkbot ✨ Feature 📨 Emails 🕹️ Gamification 🧽 Data quality
    CharlesNepote

Other projets / Non-Food products

  1. help wanted 🧴 Open Beauty Facts 🧽 Data quality 🧽 Quality facet
    CharlesNepote benbenben2
  2. help wanted 🧽 Data quality 🧽 Quality facet
    CharlesNepote benbenben2
  3. Fixable via Userscript Nutrition facts 🧴 Open Beauty Facts 🧽 Data quality
  4. barcodes ✨ Feature 🐾 Open Pet Food Facts 📸 Open Products Facts 🧴 Open Beauty Facts 🧽 Data quality
    stephanegigandet

Additives detection

  1. 🐛 bug 🥗 Ingredients 🧪 additives 🧽 Data quality
    stephanegigandet
  2. 🐛 bug 🧪 additives 🧽 Data quality

Hunger games

Part of

@teolemon teolemon added 🧽 API - Quality 🧽 Data quality https://wiki.openfoodfacts.org/Quality 🧽 Quality facet One of the facets available in Open Food Facts is /quality & allows us to spot products w/ bad data and removed 🧽 API - Quality labels Aug 23, 2021
@teolemon teolemon changed the title Data quality (tracker) 🧽 Data quality (tracker) Aug 16, 2023
Copy link
Contributor

This issue has been open 90 days with no activity. Can you give it a little love by linking it to a parent issue, adding relevant labels and projets, creating a mockup if applicable, adding code pointers from https://github.com/openfoodfacts/openfoodfacts-server/blob/main/.github/labeler.yml, giving it a priority, editing the original issue to have a more comprehensive description… Thank you very much for your contribution to 🍊 Open Food Facts

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🧽 Data quality https://wiki.openfoodfacts.org/Quality 🧽 Quality facet One of the facets available in Open Food Facts is /quality & allows us to spot products w/ bad data ⏰ Stale This issue hasn't seen activity in a while. You can try documenting more to unblock it.
Projects
Status: To discuss and validate
Status: To do
Development

No branches or pull requests

1 participant