Translate large dataset to any language with google translation api and multithread processing, no key required !
-
Updated
Jun 28, 2024 - Python
Translate large dataset to any language with google translation api and multithread processing, no key required !
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
A helper library for easily converting MSCOCO format data using the loading script of huggingface datasets.
Microsoft COCO: Common Objects in Context for huggingface datasets
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
JGLUE: Japanese General Language Understanding Evaluation for huggingface datasets
A comprehensive toolkit for seamless data generation and fine-tuning of NLP models, all conveniently packed into a single block.
Automate metadata extraction for Parquet & ORC datasets (schema, outliers, contextual, skewness, semanto) with this toolkit. Compatible with Google Gemma and Meta Llama frameworks.
cookiecutter for huggingface datasets
Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.
Rico: A Mobile App Dataset for Building Data-Driven Design Applications for huggingface datasets
PubLayNet for huggingface datasets
CGL-Dataset v2 for huggingface datasets
High-level API for tar-based dataset
Fine-tune the Vision Transformer (ViT) using LoRA and Optuna for hyperparameter search.
A simple conversational agent that answers Wikipedia questions about anyone.
A Python script for converting URL-based datasets into image datasets.
Spatial transcriptomics datasets from 10XGenomices (spatial-gene-expression datasets)
PKU-PosterLayout for huggingface datasets
Add a description, image, and links to the huggingface-datasets topic page so that developers can more easily learn about it.
To associate your repository with the huggingface-datasets topic, visit your repo's landing page and select "manage topics."