A system for quickly generating training data with weak supervision
Updated May 2, 2024 - Python
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Collection of casual conversations that can be used with the Rasa Stack
Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.
skweak: A software toolkit for weak supervision applied to NLP tasks
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Generating training data from the Carla driving simulator in the KITTI dataset format
COVID-19 cough files for training AI models
Covid19 Face Mask Detector
Augmenty is an augmentation library based on spaCy for augmenting texts.
Natural Language Data Augmentation Tool for Conversational Systems
🔎 Classification helper for sex classification feature of InstaPy
A command-line interface that combines text from subtitles with the voice data in a video. Provides a convenient way to generate training data for speech recognition.
A sentiment analyzer for a set of hotel reviews using the Naive Bayes algorithm
A simple implementation of TransE, the ML algorithm published in 2013
Machine learning training data for Arknights (明日方舟)
Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling
Full resources supporting the publication "A Pragmatic Guide to Geoparsing Evaluation."
PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuation of Data" by Amirata Ghorbani and James Zou [ICML 2019]