Skip to content
View fmu2's full-sized avatar

Highlights

  • Pro

Block or report fmu2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3,599 334 Updated Feb 24, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 890 117 Updated Mar 21, 2025

Official inference repo for FLUX.1 models

Python 21,018 1,488 Updated Feb 6, 2025

Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 35,757 2,747 Updated Mar 22, 2025

Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance" (NeurIPS 2024)

Python 287 11 Updated Dec 10, 2024

Code release for RICA^2: Rubric-Informed, Calibrated Assessment of Actions (ECCV 2024)

Python 11 Updated Nov 2, 2024

Demo code for "Revolutionizing Collection Service: How AI-Driven Personalization is Transforming Payment Recovery""

Jupyter Notebook 2 Updated Oct 6, 2024

Code release for "Deep Learning to Quantify Care Manipulation Activities in Neonatal Intensive Care Units"

Python 5 Updated May 29, 2024

Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"

Python 221 9 Updated Jul 3, 2024

Official Pytorch Implementation for "Splicing ViT Features for Semantic Appearance Transfer" presenting "Splice" (CVPR 2022 Oral)

Jupyter Notebook 382 32 Updated Nov 21, 2023
Jupyter Notebook 67 4 Updated Jan 27, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,056 254 Updated Mar 25, 2025

OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.

Python 238 18 Updated Feb 28, 2025

Official Implementation of SnAG (CVPR 2024)

Python 44 4 Updated Oct 29, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,962 2,408 Updated Aug 12, 2024

High-speed Large Language Model Serving for Local Deployment

C++ 8,164 426 Updated Feb 19, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,970 1,545 Updated Mar 25, 2025

The Triton TensorRT-LLM Backend

Python 811 118 Updated Mar 18, 2025

Structured state space sequence models

Jupyter Notebook 2,590 314 Updated Jul 17, 2024

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Python 922 68 Updated Mar 3, 2024

Mamba SSM architecture

Python 14,375 1,255 Updated Jan 18, 2025

✨✨Latest Advances on Multimodal Large Language Models

14,439 928 Updated Mar 21, 2025

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,082 96 Updated Jan 26, 2025

Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

Python 463 15 Updated Oct 21, 2024

T2I-Adapter

Python 3,625 219 Updated Jun 21, 2024

Make huge neural nets fit in memory

Python 2,773 271 Updated Apr 26, 2020

Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]

Python 518 25 Updated Jan 14, 2024

HF-NeuS: Improved Surface Reconstruction Using High-Frequency Details (NeurIPS 2022)

Python 222 9 Updated Oct 25, 2022

[ICCV 2023] Official code for NeuS2

Cuda 676 47 Updated Mar 22, 2024
Next
Showing results