Stars
OCR, layout analysis, reading order, table recognition in 90+ languages
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
The web-based visual programming editor.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
可循环值守和多人录制的直播录制软件,支持抖音、TikTok、Youtube、快手、虎牙、斗鱼、B站、小红书、pandatv、sooplive、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、17Live、Twitch、Acfun、CHZZK、shopee等40+平台直播录制
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
新闻网页正文通用抽取器 Beta 版.
A library for efficient similarity search and clustering of dense vectors.
State-of-the-art 2D and 3D Face Analysis Project
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Generative Models by Stability AI
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
This repo contains the Hugging Face Deep Reinforcement Learning Course.
OpenChat: Advancing Open-source Language Models with Imperfect Data
MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
Quill is a modern WYSIWYG editor built for compatibility and extensibility
A utility-first CSS framework for rapid UI development.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
🔊 Text-Prompted Generative Audio Model
Robust Speech Recognition via Large-Scale Weak Supervision