- Jakarta, Indonesia
-
10:36
- 7h ahead - http://githuh.com/threeal
- @_threeal
- in/alfi-m-40546184
AI
a CLI utility/library for AnimateDiff stable diffusion generation
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Simple, safe way to store and distribute tensors
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Large-scale text-video dataset. 10 million captioned short videos.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
RIFE, Real-Time Intermediate Flow Estimation for Video Frame Interpolation implemented with ncnn library
Robust Speech Recognition via Large-Scale Weak Supervision
Faster Whisper transcription with CTranslate2
Fast inference engine for Transformer models
Measuring Massive Multitask Language Understanding | ICLR 2021
Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
A latent text-to-image diffusion model