Skip to content
#

dataset-creation

Here are 76 public repositories matching this topic...

This repository contains Jupyter notebooks detailing the experiments conducted in our research paper on Ukrainian news classification. We introduce a framework for simple classification dataset creation with minimal labeling effort, and further compare several pretrained models for the Ukrainian language.

  • Updated Aug 12, 2023
  • Jupyter Notebook

Through this project, ONC in partnership with National Institutes of Health (NIH) National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), advanced the application of AI/ML in patient-centered outcomes research (PCOR) by generating high quality training datasets for a chronic kidney disease (CKD) use case – predicting mortality …

  • Updated Sep 17, 2021
  • Jupyter Notebook

A simple project that creates a dataset of News Headlines with Primary Category, Secondary Category, Date, Day, Month,Year, Sentiment, SentimentPolarity, Emotion and Url. All News Headlines are scraped from punch newspaper and sorted into a csv file.

  • Updated Mar 9, 2023
  • Python

Improve this page

Add a description, image, and links to the dataset-creation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dataset-creation topic, visit your repo's landing page and select "manage topics."

Learn more