RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
-
Updated
Oct 12, 2024 - Python
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
a delightful machine learning tool that allows you to train, test, and use models without writing code
MLBox is a powerful Automated Machine Learning python library.
Automated Time Series Forecasting
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Audio processing by using pytorch 1D convolution network
A Deep Learning Python Toolkit for Healthcare Applications.
High performance model preprocessing library on PyTorch
✔️Contextual word checker for better suggestions
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
🎯 Personal data science and machine learning toolbox
A full pipeline AutoML tool for tabular data
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
ACE 2005 corpus preprocessing for Event Extraction task
16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis.
[WIP] VoiceSmith makes training text to speech models easy.
Preprocessing pipeline on Brain MR Images through FSL and ANTs, including registration, skull-stripping, bias field correction, enhancement and segmentation.
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.
Automated rejection and repair of bad trials/sensors in M/EEG
Add a description, image, and links to the preprocessing topic page so that developers can more easily learn about it.
To associate your repository with the preprocessing topic, visit your repo's landing page and select "manage topics."