Skip to content
View mkocabas's full-sized avatar

Block or report mkocabas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official PyTorch Implementation of Opt-CWM: Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals.

Python 13 1 Updated Mar 27, 2025

Simplifying reinforcement learning for complex game environments

C 1,875 103 Updated Mar 29, 2025

Physics-based Noise Modeling for Extreme Low-light Photography (CVPR 2020 Oral & TPAMI 2021)

Python 520 69 Updated Dec 9, 2024
Python 249 16 Updated Mar 20, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,294 288 Updated Mar 12, 2025

SpatialLM: Large Language Model for Spatial Understanding

Python 2,614 191 Updated Mar 28, 2025

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 1,027 61 Updated Mar 26, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 25,898 2,494 Updated Mar 27, 2025
889 25 Updated Mar 12, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 14,720 1,712 Updated Mar 29, 2025

Official implementation of TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 591 24 Updated Mar 23, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,675 4,322 Updated Mar 29, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,217 95 Updated Mar 28, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,680 714 Updated Mar 28, 2025

[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Python 869 39 Updated Mar 26, 2025

[CVPR 2025] Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering

Jupyter Notebook 580 27 Updated Mar 18, 2025

[CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Python 158 7 Updated Mar 1, 2025

Enjoy the magic of Diffusion models!

Python 8,158 731 Updated Mar 26, 2025

[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors

Python 1,691 134 Updated Mar 11, 2025

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 1,284 76 Updated Mar 30, 2025

PyTorch video decoding

Python 470 30 Updated Mar 30, 2025
Python 526 29 Updated Mar 22, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,917 182 Updated Mar 10, 2025

Official Implementation for our NeurIPS 2024 paper, "Don't Look Twice: Run-Length Tokenization for Faster Video Transformers".

Python 203 12 Updated Mar 29, 2025

Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"

Python 82 2 Updated Mar 16, 2025

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 1,026 81 Updated Mar 11, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 682 43 Updated Mar 21, 2025

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 2,079 174 Updated Jul 17, 2024
Next
Showing results