data-annotation

Here are 31 public repositories matching this topic...

diffgram / diffgram

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

kubernetes data-science data machine-learning deep-learning image-annotation annotation video-annotation annotations data-analytics labeling datastore datasets annotation-tool data-annotation training-data

Updated Jun 21, 2024
Python

thepanacealab / SMMT

Social Media Mining Toolkit (SMMT) main repository

tweets annotation twitter-api data-acquisition spacy data-preprocessing gathering data-annotation

Updated Nov 11, 2022
Python

yihong1120 / Construction-Hazard-Detection

An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel are detected beneath these hazards. Dataset sourced from Taiwan's construction industry.

construction machine-learning computer-vision deep-learning image-processing artificial-intelligence safety object-detection post-processing hazard-detection data-annotation model-training real-time-detection yolov8 alert-system

Updated Jun 9, 2024
Python

ziliHarvey / smart-annotation-pointrcnn

A PointRCNN version of SAnE, which is a web-based semi-automatic annotation tool for point cloud data.

deep-learning point-cloud webapp data-annotation pointrcnn

Updated Jul 29, 2020
Python

BatsResearch / alfred

A system for prompted weak supervision.

data weak-supervision annotation-tool vlm data-annotation llm programmatic-weak-supervision prompting

Updated Jun 11, 2024
Python

rbsathish / Data-annotation

Convert your annotated data from one format to another format

converter csv xml xmltojson data-annotation xmltocsv

Updated Jan 29, 2021
Python

fastent / fastent

custom models for named-entity recognition

nlp natural-language-processing spacy named-entities named-entity-recognition data-generation data-annotation

Updated Mar 31, 2021
Python

saran9991 / llm-data-annotation

Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance.

nlp gpt bert active-learning data-annotation fine-tuning dvc confident-learning noisy-labels mlflow cleanlab gpt-4 llm gpt-3-5-turbo

Updated Sep 11, 2023
Python

liamtoran / flippers

Flippers is a weak supervision library for creating high-quality labels using your domain knowledge and weak supervision sources.

python data-science machine-learning annotation weak-supervision data-annotation

Updated Apr 9, 2024
Python

monatis / asr-annotation-bot

Simple Telegram bot to annotate and verify automatic speech recognition datasets

machine-learning telegram-bot automatic-speech-recognition data-annotation

Updated Mar 30, 2021
Python

pixano / pixano-inference

Inference models for Pixano

python machine-learning computer-vision deep-learning data-visualization data-annotation

Updated Mar 18, 2024
Python

fensorechase / LLMs_SDOH_Integration

Supplemental code: Large Language Models for Integrating Social Determinant of Health Data: A Case Study on Heart Failure 30-Day Readmission Prediction

nlp text-classification data-annotation sdoh social-determinants-of-health

Updated Apr 24, 2024
Python

superannotateai / generated_text_detector

SuperAnnotate HTTP service for Generated Text Detection

nlp detection data-annotation llm generated-text-detection

Updated May 8, 2024
Python

joactr / AnnoTheia

AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.

languages data-annotation fine-tuning active-speaker-detection speech-technologies

Updated May 29, 2024
Python

minnesotanlp / infoVerse

Jaehyung Kim et al.'s ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"

nlp active-learning dpp data-annotation data-centric data-pruning

Updated Jun 28, 2023
Python

kosmolebryce / shyft

`Shyft` is a time-tracking and data-logging utility designed to assist data annotators with managing and monitoring their service records. It represents the first programmatic offering from ENCLAIM, a burgeoning workers' union dedicated to promoting and protecting data annotators' labor interests as the industry continues to evolve.