🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Open source annotation tool for machine learning practitioners.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
pix2code: Generating Code from a Graphical User Interface Screenshot
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Search all Chinese NLP datasets, with commonly used English NLP datasets included
🪐 End-to-end NLP workflows from prototype to production
Benchmark datasets, data loaders, and evaluators for graph machine learning
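Data interfaces: Baidu, Google, Toutiao, and Weibo indices; macroeconomic data; interest rate data; currency exchange rates; Qianlima and unicorn companies; Xinwen Lianbo transcripts; film and TV box office data; university lists; epidemic data…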
An extension of Open3D to address 3D Machine Learning tasks
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Papers and Datasets about Point Cloud.
Colour Science for Python
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Large datasets for conversational AI