- camel Public
  Forked from camel-ai/camel
  🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
  Python Apache License 2.0 Updated Mar 12, 2025
- DiffCLIP-1 Public
  Forked from hammoudhasan/DiffCLIP
  Official Implementation of DiffCLIP: Differential Attention Meets CLIP
  Python Updated Mar 11, 2025
- yoloe Public
  Forked from THU-MIG/yoloe
  YOLOE: Real-Time Seeing Anything
  Jupyter Notebook GNU Affero General Public License v3.0 Updated Mar 11, 2025
- HCMA Public
  Forked from YoujunZhao/HCMA
  Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection (AAAI 2025)
  Updated Mar 11, 2025
- LRS-VQA Public
  Forked from VisionXLab/LRS-VQA
  When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
  Python Updated Mar 11, 2025
- llm4ad Public
  Forked from Optima-CityU/llm4ad
  LLM4AD: A Platform for Algorithm Design with Large Language Model
  Python MIT License Updated Mar 11, 2025
- PE3R Public
  Forked from hujiecpp/PE3R
  PE3R: Perception-Efficient 3D Reconstruction. Take 2-3 photos with your phone, upload them, wait a few minutes, and then start exploring your 3D world via text!
  Python Creative Commons Zero v1.0 Universal Updated Mar 11, 2025
- AlphaDrive Public
  Forked from hustvl/AlphaDrive
  Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning
  Apache License 2.0 Updated Mar 11, 2025
- DiffusionAsShader Public
  Forked from IGL-HKUST/DiffusionAsShader
  [arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
  Python Apache License 2.0 Updated Mar 10, 2025
- PDSG-SDA Public
  Forked from ryime/PDSG-SDA
  Official implementation of "Towards Effective and Sparse Adversarial Attack on Spiking Neural Networks via Breaking Invisible Surrogate Gradients" (CVPR 2025)
  Python Updated Mar 10, 2025
- VisRL Public
  Forked from zhangquanchen/VisRL
  VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
  Python Updated Mar 10, 2025
- SHIFNet Public
  Forked from iAsakiT3T/SHIFNet
  Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
  Updated Mar 10, 2025
- VLRMBench Public
  Forked from JCruan519/VLRMBench
  This is a repository for VLRMBench.
  Apache License 2.0 Updated Mar 10, 2025
- Paper-AnyAnomaly Public
  Forked from SkiddieAhn/Paper-AnyAnomaly
  PyTorch Implementation of the Paper 'AnyAnomaly': Official Version
  Python MIT License Updated Mar 10, 2025
- UnifiedReward Public
  Forked from CodeGoat24/UnifiedReward
  Official implementation of Unified Reward Model for Multimodal Understanding and Generation.
  Python MIT License Updated Mar 10, 2025
- CLIP-Test-time-Counterattacks Public
  Forked from Sxing2/CLIP-Test-time-Counterattacks
  [CVPR-25🔥] Test-time Counterattacks (TTC) towards adversarial robustness of CLIP
  Python Updated Mar 10, 2025
- DualDiff Public
  Forked from yangzhaojason/DualDiff
  A dual-branch conditional diffusion model designed to enhance driving scene generation across multiple views and video sequences.
  Updated Mar 9, 2025
- OmniTrack Public
  Forked from xifen523/OmniTrack
  The official implementation of OmniTrack: Omnidirectional Multi-Object Tracking (CVPR 2025)
  Updated Mar 9, 2025
- JamMa Public
  Forked from leoluxxx/JamMa
  JamMa is a lightweight image matcher that enables fast internal and mutual interaction of images with joint Mamba.
  Python MIT License Updated Mar 9, 2025
- IMFine Public
  Forked from zhshi0816/IMFine
  IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement
  Python Updated Mar 7, 2025