Stars
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Vision-based, scale- and rotation-invariant 1D barcode localization method
A Unified Toolkit for Deep Learning Based Document Image Analysis
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
🔥 Highlighting the top ML papers every week.
State-of-the-art 2D and 3D Face Analysis Project
A high-resolution face dataset for face editing purposes
Official Implementation of 'Fast AutoAugment' in PyTorch.
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
[CVPRW 2022] Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets
Real-time face swap for PC streaming or video calls
Four landmark detection algorithms, implemented in PyTorch.
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
Robust realtime face and facial landmark tracking on CPU with Unity integration
An arbitrary face-swapping framework on images and videos with one single trained model!
Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)
Face detection algorithms in PyTorch.
PyTorch implementation of PFLD for 68 facial landmarks
Practice on CIFAR-100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…
Official PyTorch implementation of StyleGAN3
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.