Skip to content

FloAI/OpenHSR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

OpenHSR

Description

The dataset includes 246 food products from Australian retailers in 2025. Each record contains about 20 variables covering nutrients, metadata, and Health Star Ratings (HSR). It enables analysis of nutritional quality and labeling diversity while following FAIR principles for accessibility and reusability.

  • Nutrient Composition: Energy (kJ and kcal), protein, total fat, saturated fat, carbohydrates, sugars, fiber, and sodium (per 100 g).
  • Metadata: Product name, product type, category, country of origin, retailer, date collected, allergens, ingredients, and data source.
  • Health Star Rating (HSR): A score from 0.5 to 5 stars, representing overall nutritional quality.

The dataset is structured to support analysis of nutritional quality, HSR modeling, and food labeling research.

Data Source

All data were manually collected from Australian retail products in 2025. Efforts were made to ensure accuracy and consistency, including verification against product packaging and standardized variable naming.

Usage

The dataset can be used for:

  • Nutritional analysis
  • Machine learning models predicting HSR
  • Policy research or consumer behavior studies

Please cite this dataset if used in your research (see Citation below).

License

This dataset is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0). Users are free to share and adapt the dataset, provided proper attribution is given.

Citation

If you use this dataset, please cite:

N'kam Suguem, F., & Lafargue, V. (2025). Open Health Star Rating (OpenHSR) (Version v1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.17469191

FAIR Principles

The dataset follows the FAIR principles:

  • Findable: Each product has a unique identifier; DOI assigned.
  • Accessible: Open access via GitHub under CC BY 4.0.
  • Interoperable: Standardized units, consistent variable names, and harmonized categories.
  • Reusable: Complete metadata, data dictionaries, and documentation support reproducibility and further research.

Contact

For questions or feedback, please contact: [Valentin Lafargue/valentin.lafargue@math.univ-toulouse.fr]

About

The dataset includes 246 food products from Australian retailers in 2025. Each record contains about 20 variables covering nutrients, metadata, and Health Star Ratings (HSR). It enables analysis of nutritional quality and labeling diversity while following FAIR principles for accessibility and reusability.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors