The dataset includes 246 food products from Australian retailers in 2025. Each record contains about 20 variables covering nutrients, metadata, and Health Star Ratings (HSR). It enables analysis of nutritional quality and labeling diversity while following FAIR principles for accessibility and reusability.
- Nutrient Composition: Energy (kJ and kcal), protein, total fat, saturated fat, carbohydrates, sugars, fiber, and sodium (per 100 g).
- Metadata: Product name, product type, category, country of origin, retailer, date collected, allergens, ingredients, and data source.
- Health Star Rating (HSR): A score from 0.5 to 5 stars, representing overall nutritional quality.
The dataset is structured to support analysis of nutritional quality, HSR modeling, and food labeling research.
All data were manually collected from Australian retail products in 2025. Efforts were made to ensure accuracy and consistency, including verification against product packaging and standardized variable naming.
The dataset can be used for:
- Nutritional analysis
- Machine learning models predicting HSR
- Policy research or consumer behavior studies
Please cite this dataset if used in your research (see Citation below).
This dataset is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). Users are free to share and adapt the dataset, provided proper attribution is given.
If you use this dataset, please cite:
N'kam Suguem, F., & Lafargue, V. (2025). Open Health Star Rating (OpenHSR) (Version v1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.17469191
The dataset follows the FAIR principles:
- Findable: Each product has a unique identifier; DOI assigned.
- Accessible: Open access via GitHub under CC BY 4.0.
- Interoperable: Standardized units, consistent variable names, and harmonized categories.
- Reusable: Complete metadata, data dictionaries, and documentation support reproducibility and further research.
For questions or feedback, please contact: [Valentin Lafargue/valentin.lafargue@math.univ-toulouse.fr]