ai
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Official PyTorch repo for JoJoGAN: One Shot Face Stylization
Blendshape and kinematics calculator for Mediapipe/Tensorflow.js Face, Eyes, Pose, and Finger tracking models.
Image Captcha Solving Using TensorFlow and CNN Model. Accuracy 90%+
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
VNN是由欢聚集团(Joyy Inc.)推出的高性能、轻量级神经网络部署框架。目前已为Hago、VOO、VFly、马克相机等App提供20余种AI能力的支持,覆盖直播、短视频、视频编辑等泛娱乐场景和工程场景
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
Gender, Age, and Emotion for Flickr-Faces-HQ Dataset (FFHQ)
Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier
Machine Learning Yearning 中文版 - 《机器学习训练秘籍》 - Andrew Ng 著
The example project of inferencing Pose Estimation using Core ML
Face recognition with deep neural networks.
code for paper "Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis"
Torch implementation of neural style algorithm
Image-to-Image Translation in PyTorch
A neural network that transforms a design mock-up into a static website.
Using pix2pix to convert scribbles to Chinese calligraphy
Spectral segmentation described in Aksoy et al., "Semantic Soft Segmentation", ACM TOG (Proc. SIGGRAPH), 2018
《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning