Popular repositories
- vision_transformer Public Forked from google-research/vision_transformer
Jupyter Notebook 4
- UnOpticalFlow Public
Occlusion Aware Unsupervised Learning of Optical Flow From Video
Python 2
- DeepTracking Public Forked from pondruska/DeepTracking
Source code of DeepTracking research project
Lua 1
- hoecoseg Public Forked from shenjianbing/Higher-Order-Image-Co-segmentation
W. Wang and J. Shen, Higher-Order Image Co-segmentation, IEEE Trans. on Multimedia, 18(6):1011-1021, 2016
C++ 1
Repositories
- ViTamin Public Forked from Beckschen/ViTamin
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
- TSCM Public Forked from nubot-nudt/TSCM
[ICRA24] TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation
- Official_Remote_Sensing_Mamba Public Forked from walking-shadow/Official_Remote_Sensing_Mamba
Official code of Remote Sensing Mamba
- Awesome-Foundation-Models Public Forked from uncbiag/Awesome-Foundation-Models
A curated list of foundation models for vision and language tasks
- Q-Bench Public Forked from Q-Future/Q-Bench
①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.
- MDKNet Public Forked from csguomy/MDKNet
Modulating Domain-Specific Knowledge for Multi-domain Crowd Counting
- MobileAgent Public Forked from X-PLUG/MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception