huggingface-datasets

Automate metadata extraction for Parquet & ORC datasets (schema, outliers, contextual, skewness, semanto) with this toolkit. Compatible with Google Gemma and Meta Llama frameworks.

python google hugging huggingface-models huggingface-datasets genai genai-usecase gemma-7b-it meta-llama meta-llama-3-70b-instruct

Updated May 9, 2024
Python

shunk031 / cookiecutter-huggingface-datasets

Star

cookiecutter for huggingface datasets

cookiecutter cookiecutter-template huggingface-datasets

Updated Apr 26, 2024
Python

louisbrulenaudet / legalkit-pipeline

Sponsor

Star

Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.

python open-source legal data parquet datasets legifrance legaltech huggingface huggingface-datasets piste-api

Updated Mar 31, 2024
Python

creative-graphic-design / huggingface-datasets_Rico

Star

Rico: A Mobile App Dataset for Building Data-Driven Design Applications for huggingface datasets

rico huggingface huggingface-datasets

Updated Mar 24, 2024
Python

creative-graphic-design / huggingface-datasets_PubLayNet

Star

PubLayNet for huggingface datasets

huggingface publaynet huggingface-datasets

Updated Apr 1, 2024
Python

creative-graphic-design / huggingface-datasets_CGL-Dataset-v2

Star

CGL-Dataset v2 for huggingface datasets

huggingface huggingface-datasets

Updated Feb 12, 2024
Python

npuichigo / tarzan

Star

High-level API for tar-based dataset

tar tensorflow-datasets data-loading webdataset huggingface-datasets torchdata

Updated Feb 3, 2024
Python

eljandoubi / huggingface_image_classifier

Star

Fine-tune the Vision Transformer (ViT) using LoRA and Optuna for hyperparameter search.

numpy pytorch hyperparameter-optimization image-classification huggingface-transformers visual-transformer huggingface-datasets huggingface-peft huggingface-optuna

Updated Feb 2, 2024
Python

JohnBogdan1 / Historical-Conversational-Agent

Star

A simple conversational agent that answers Wikipedia questions about anyone.

python ai deep-learning wikipedia transformers gpt historical conversational-ai huggingface-transformers huggingface-datasets

Updated Jan 8, 2024
Python

ItzCrazyKns / Dataset-Converter

Star

A Python script for converting URL-based datasets into image datasets.

python machine-learning ai ml artificial-intelligence datasets url-to-image huggingface dataset-converter huggingface-datasets

Updated Jan 8, 2024
Python

obendidi / st-visium-datasets

Star

Spatial transcriptomics datasets from 10XGenomices (spatial-gene-expression datasets)

bioinformatics biology python3 dataset spatial-transcriptomics huggingface-datasets 10xgen

Updated Jan 7, 2024
Python

creative-graphic-design / huggingface-datasets_PosterErase

Star

PosterErase for huggingface datasets

huggingface huggingface-datasets

Updated Nov 19, 2023
Python

creative-graphic-design / huggingface-datasets_PKU-PosterLayout

Star

PKU-PosterLayout for huggingface datasets

huggingface huggingface-datasets layout-generation

Updated Apr 18, 2024
Python

Improve this page

Add a description, image, and links to the huggingface-datasets topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the huggingface-datasets topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

huggingface-datasets

Here are 42 public repositories matching this topic...

vTuanpham / Large_dataset_translator

BUAADreamer / MLLM-Finetuning-Demo

shunk031 / huggingface-datasets_cocoapi-tools

shunk031 / huggingface-datasets_MSCOCO

BUAADreamer / Chinese-LLaVA-Med

shunk031 / huggingface-datasets_JGLUE

ksgr5566 / AutoTuneNLP

varunajmera0 / MetaGenAI

shunk031 / cookiecutter-huggingface-datasets

louisbrulenaudet / legalkit-pipeline

creative-graphic-design / huggingface-datasets_Rico

creative-graphic-design / huggingface-datasets_PubLayNet

creative-graphic-design / huggingface-datasets_CGL-Dataset-v2

npuichigo / tarzan

eljandoubi / huggingface_image_classifier

JohnBogdan1 / Historical-Conversational-Agent

ItzCrazyKns / Dataset-Converter

obendidi / st-visium-datasets

creative-graphic-design / huggingface-datasets_PosterErase

creative-graphic-design / huggingface-datasets_PKU-PosterLayout

Improve this page

Add this topic to your repo