- United States
-
20:58
(UTC -04:00)
Highlights
- Pro
ML Projects & Research
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official implementation of "DCT-Net: Domain-Calibrated Translation for Portrait Stylization", SIGGRAPH 2022 (TOG); Multi-style cartoonization
GUI for a Vocal Remover that uses Deep Neural Networks.
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
State-of-the-art 2D and 3D Face Analysis Project
A latent text-to-image diffusion model
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Muzic: Music Understanding and Generation with Artificial Intelligence
This repository contains demos I made with the Transformers library by HuggingFace.
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Real-time face swap for PC streaming or video calls
FFCV: Fast Forward Computer Vision (and other ML workloads!)
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Robust Speech Recognition via Large-Scale Weak Supervision
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
【NeurIPS 2022 Spotlight】Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera
Neural Light Transport for Relighting and View Synthesis
MARVEL: Raster Gray-level Manga Vectorization via Primitive-wise Deep Reinforcement Learning
[CVPR 2021 Oral] Im2Vec Synthesizing Vector Graphics without Vector Supervision
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Lightning ⚡️ fast forecasting with statistical and econometric models.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Official Repository of ChatCaptioner
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)




