Quality facets - barcodes that have a strange number of digits, or a bad last digit key #1806

Open
Tracked by #10273
stephanegigandet opened this issue May 26, 2019 · 3 comments
Labels
🧽 Data quality - Measure - Quality facets: One of the facets available in Open Food Facts is /quality & allows us to spot products w/ bad data
🧽 Data quality: https://wiki.openfoodfacts.org/Quality

Comments

@stephanegigandet
Contributor

stephanegigandet commented May 26, 2019

What

From @cquest:

length | count
--------+--------
0 | 1
1 | 10
2 | 77
3 | 183
4 | 269
5 | 385
6 | 468
7 | 381
8 | 49154
9 | 210
10 | 530
11 | 2855
12 | 1643
13 | 808324
14 | 696
15 | 290
16 | 50
17 | 12
18 | 36
19 | 5
20 | 47
21 | 7
22 | 26
23 | 2
24 | 70
25 | 3
26 | 35
27 | 4
28 | 5
29 | 1
30 | 5
31 | 5
32 | 13
33 | 1
34 | 7
35 | 4
36 | 2
37 | 4
38 | 4
39 | 2
40 | 6
41 | 2
42 | 3
43 | 2
44 | 2
45 | 4
46 | 2
48 | 3
49 | 2
65 | 1
69 | 1
| 130
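
The query behind the table above is not shown in this issue. As a rough sketch only, a similar length distribution could be computed from a list of product codes like this (the function name `barcode_length_counts` and the handling of missing codes as `None` are assumptions, not the project's actual code):

```python
from collections import Counter


def barcode_length_counts(codes):
    """Count how many codes have each length (hypothetical helper).

    Missing codes (None) are counted under the key None, which may be
    what the row with an empty length in the table corresponds to.
    """
    counts = Counter(len(code) if code is not None else None for code in codes)
    # Sort by length, keeping the None bucket last, as in the table.
    return dict(sorted(counts.items(), key=lambda kv: (kv[0] is None, kv[0] or 0)))


if __name__ == "__main__":
    sample = ["12345678", "87654321", "4006381333931", "", None]
    for length, count in barcode_length_counts(sample).items():
        print(length, "|", count)
```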

Part of

@teolemon teolemon added the 🧽 Data quality https://wiki.openfoodfacts.org/Quality label May 27, 2019
@stephanegigandet stephanegigandet added the 🧽 Data quality - Measure - Quality facets One of the facets available in Open Food Facts is /quality & allows us to spot products w/ bad data label Jun 17, 2019
@bredowmax

Have you already tested how many barcodes have a correct check digit? I've found quite a few typos where I assume the check digit must be incorrect. For example, the first digit of the barcode of this product appears to be a typo: https://world.openfoodfacts.org/product/5337226537658/nougat-pockets-k-classic

More:
https://en.wikipedia.org/wiki/Check_digit
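
A minimal sketch of such a test, assuming the standard GS1 mod-10 check digit used by EAN-8, UPC-A, EAN-13 and GTIN-14 (the function name `has_valid_gs1_check_digit` is hypothetical, and this is not necessarily how Open Food Facts would implement it):

```python
def has_valid_gs1_check_digit(code: str) -> bool:
    """Return True if `code` is 8, 12, 13 or 14 ASCII digits and its
    last digit matches the GS1 mod-10 check digit of the others."""
    if not (code.isascii() and code.isdigit()) or len(code) not in (8, 12, 13, 14):
        return False
    payload, check = code[:-1], int(code[-1])
    # Weights alternate 3, 1, 3, 1, ... starting from the digit
    # immediately to the left of the check digit.
    total = sum(
        int(digit) * (3 if index % 2 == 0 else 1)
        for index, digit in enumerate(reversed(payload))
    )
    return (10 - total % 10) % 10 == check


if __name__ == "__main__":
    print(has_valid_gs1_check_digit("4006381333931"))
    print(has_valid_gs1_check_digit("4006381333932"))
```

Note that a check like this only catches some typos: a mistyped digit that happens to keep the weighted sum unchanged modulo 10, or a code with several errors that cancel out, would still pass.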

@bredowmax

bredowmax commented Feb 1, 2020

Some of the 6-, 7- or 11-digit barcodes might be the result of a leading 0 being dropped when these numbers were processed.
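
As a sketch of how that could be checked, the helper below pads a short all-digit code with leading zeros up to the next common length (8, 12 or 13 digits). The name `restore_leading_zeros` and the choice of target lengths are assumptions for illustration; a padded code is only a candidate and would still need to be confirmed, for example against its check digit.

```python
def restore_leading_zeros(code: str, target_lengths=(8, 12, 13)) -> str:
    """Left-pad `code` with zeros to the smallest target length that is
    at least as long as the code (hypothetical helper).

    Codes that are not plain ASCII digits, or that are longer than every
    target length, are returned unchanged.
    """
    if not (code.isascii() and code.isdigit()):
        return code
    for length in sorted(target_lengths):
        if len(code) <= length:
            return code.zfill(length)
    return code


if __name__ == "__main__":
    print(restore_leading_zeros("1234567"))      # 7 digits, padded to 8
    print(restore_leading_zeros("12345678901"))  # 11 digits, padded to 12
```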

@bredowmax

The barcodes that are longer than 13 digits might be Code 128 or GS1-128 codes. These are not necessarily wrong input, just longer barcodes that are sometimes used for fresh products that vary in weight or price.

https://en.wikipedia.org/wiki/Code_128
https://en.wikipedia.org/wiki/GS1-128

See also openfoodfacts/openfoodfacts-ios#563
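
Putting the length observations from this thread together, a quality facet could start from a simple length-based classification. This is only a sketch: the tag names below are made up for illustration and are not existing Open Food Facts quality tags.

```python
def barcode_length_tag(code: str) -> str:
    """Classify a barcode by its number of characters (hypothetical tags)."""
    length = len(code)
    if length in (8, 12, 13, 14):
        # EAN-8, UPC-A, EAN-13 and GTIN-14 lengths.
        return "standard-gtin-length"
    if length > 14:
        # May be a Code 128 / GS1-128 code rather than a typo.
        return "longer-than-gtin-possibly-gs1-128"
    # Shorter or in-between lengths, e.g. 6, 7 or 11 digits, which may
    # be missing a leading 0.
    return "unusual-length"


if __name__ == "__main__":
    for sample in ["4006381333931", "1234567", "0" * 24]:
        print(len(sample), barcode_length_tag(sample))
```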

@teolemon teolemon changed the title from "Add quality facets for barcodes that have a strange number of digits, or a bad last digit key" to "Quality facets - barcodes that have a strange number of digits, or a bad last digit key" Oct 11, 2021
Projects
Status: To discuss and validate
Status: To do