The 🌏 data science library you've been waiting for~
Updated Apr 12, 2024 - Python
SMASHED is a toolkit for applying transformations to dataset samples, such as field extraction, tokenization, prompting, and batching. It supports Hugging Face datasets, torchdata iterables, and plain lists of dictionaries.
High-level API for tar-based datasets
Change detection for Burned area Delineation (ChaBuD) ECML/PKDD 2023 challenge
A Lightning-style reusable library for fine-ranking models