Skip to content
View danhively's full-sized avatar

Highlights

  • Pro

Block or report danhively

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Datasets

22 repositories

datasets for database research

12 5 Updated Aug 25, 2023

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 19,772 2,788 Updated Mar 12, 2025

Code for fine-tuning Platypus fam LLMs using LoRA

Python 628 60 Updated Feb 4, 2024
Jupyter Notebook 127 17 Updated Aug 31, 2024

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Python 4,367 1,567 Updated Mar 12, 2025

Datasets used in Plotly examples and documentation

HTML 674 1,620 Updated Mar 12, 2025

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

5,853 975 Updated Feb 15, 2023

Techniques for deep learning with satellite & aerial imagery

9,104 1,539 Updated Nov 19, 2024

Training Materials for R and Microsoft R Server

Jupyter Notebook 1 130 Updated Jul 18, 2016

Data archive of identifiable COVID-19 related public projects on GitHub

528 182 Updated Mar 30, 2023

Repository containing Reproducility Material of "Bayesian Transfer Learning for Artificially Intelligent Geospatial Systems: A Predictive Stacking Approach" (Presicce and Banerjee, 2024).

R 1 Updated Oct 14, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,672 355 Updated Dec 7, 2024

The full dataset behind paperswithcode.com

337 34 Updated Oct 8, 2021

COVID-19 data from the repo CSSEGISandData/COVID-19> https://github.com/CSSEGISandData/COVID-19

15 19 Updated Mar 10, 2023

Maps following the #MapPromptMonday social mapping prompts

R 6 1 Updated Jul 30, 2023

https://github.com/CSSEGISandData/COVID-19

Jupyter Notebook 3 Updated Apr 5, 2020

Visualization of confirmed and recovered Corona cases, data from https://github.com/CSSEGISandData/COVID-19

R 5 1 Updated Apr 2, 2020

CSV files of COVID-19 total daily confirmed cases and deaths in the USA by state and county. All data from Johns Hopkins & NYT..

Jupyter Notebook 36 20 Updated Jun 21, 2021

Replication files for "The Effects of Historical Pandemics: The Black Death". Published in the Journal of Economic Literature.

PostScript 1 2 Updated Jan 11, 2021

A Collection of Covid Death Files.

1 Updated Feb 5, 2024

This R file shows the investigation of differentially expressed genes between cocaine addict deaths and non-cocaine addict deaths.

R 1 Updated Feb 20, 2021

R Markdown file analyzing US leading causes of death in the United States

1 Updated Jan 2, 2024