Compute clustering on your data in a visual, intuitive way with FiftyOne and Sklearn!
-
Updated
Apr 5, 2024 - Python
Compute clustering on your data in a visual, intuitive way with FiftyOne and Sklearn!
Quilt: Robust Data Segment Selection against Concept Drifts (AAAI 2024)
Filter a float-valued field on two ranges simultaneously with this FiftyOne Plugin!
Semantically search through OCR text blocks with Qdrant, Sentence Transformers, and FiftyOne!
Customer churn train/prediction library with automatic dataset size optimisation features.
Hugging Face Plugins for FiftyOne
🧼🔎 A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates and label errors.
Enhancing Efficiency in Multidevice Federated Learning through Data Selection
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
Chat with your images using GPT-4 Vision!
Quickly set up an image labelling web application for manually tagging images for machine learning tasks.
Official Python SDK for Kern AI refinery.
Unsupervised classification to improve the quality of a bird song recording dataset. https://doi.org/10.1016/j.ecoinf.2022.101952
Frontiers in Neuroinformatics 2022: Local Label Point Correction for Edge Detection of Overlapping Cervical Cells
Client interface for all things Cleanlab Studio
[ECCV 2022] Official Implementation for Unsupervised Selective Labeling for More Effective Semi-Supervised Learning
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
This data-centric AI repository implements a robust deep learning method (LFBNet) for fully automated tumor segmentation in whole-body [18]F-FDG PET/CT images.
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Add a description, image, and links to the data-centric-ai topic page so that developers can more easily learn about it.
To associate your repository with the data-centric-ai topic, visit your repo's landing page and select "manage topics."