-
Yonsei University, @FCAI-Lab
- Seoul, Republic of Korea
-
05:39
(UTC +09:00) - https://rangho.me
- @bsky.rangho.moe
- https://rangho.moe/@rangho_220
- rangho
Highlights
🧠 machine learnings
A latent text-to-image diffusion model
A simple deep learning library for estimating a set of tags and extracting semantic feature vectors from given illustrations.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋
PyTorch implementation of AANets (CVPR 2021) and Mnemonics Training (CVPR 2020 Oral)
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Stable Diffusion web UI
Singing Voice Conversion via diffusion model
Singing Voice Conversion via diffusion model
High-speed download of LLaMA, Facebook's 65B parameter GPT model
[ECCV 2022] Skeleton-free Pose Transfer for Stylized 3D Characters
Artificial intelligence for MapleStory that uses machine learning and computer vision to navigate challenging in-game environments
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs
Notice that there are no torch-related code about item2vec, I just want to provide a readable item2vec implementation for researchers
Audio generation using diffusion models, in PyTorch.
Source code for Twitter's Recommendation Algorithm
speech self-supervised representations
KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
[CVPR 2023] Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
A repo containing several methods for near eye gaze tracking in HMDs
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)





