JihoChoi
☘️
🪴 🌱

Organizations

@scone-snu

Starred repositories

Python implementation of EVM (Eulerian Video Magnification)

Python 235 90 Updated May 5, 2022

[CVPR 2025] VGGT: Visual Geometry Grounded Transformer

Python 3,545 218 Updated Mar 25, 2025

The official NetsPresso Python package.

Jupyter Notebook 44 1 Updated Mar 27, 2025

😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D

52 3 Updated Feb 25, 2025

Verifying Vision-Language alignment using DINO visualization techniques on cross-attention maps

Python 5 Updated Jun 12, 2022

Embodied Reasoning Question Answer (ERQA) Benchmark

Python 105 3 Updated Mar 12, 2025

Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"

Python 1,453 65 Updated Mar 19, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 40,590 6,804 Updated Mar 29, 2025

[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects

Python 82 4 Updated Jan 26, 2024

[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Python 86 4 Updated Feb 2, 2025

An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPRW 2024].

Python 189 17 Updated Oct 14, 2024

Fine-tuning CLIP Text Encoders with Two-step Paraphrasing (EACL 2024, Findings)

Python 9 1 Updated Nov 23, 2024

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 166 5 Updated Aug 5, 2024

[CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"

Python 85 3 Updated Mar 8, 2024

The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283

Jupyter Notebook 164 45 Updated Mar 1, 2017

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,637 4,872 Updated Feb 23, 2025

[AAAI2025] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints

13 Updated Dec 11, 2024

A lightweight codebase for referring expression comprehension and segmentation

Python 53 4 Updated May 21, 2022

An official PyTorch implementation of the CRIS paper

Python 269 38 Updated Jun 9, 2024

Python3 Referring Expression Datasets API

Jupyter Notebook 7 Updated Jan 20, 2025

Official implementation of "Can Language Understand Depth?"

Python 82 7 Updated Oct 21, 2022

This repo contains the official implementation of the ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video"

Python 84 12 Updated May 17, 2024

A single handwritten digit classifier, using the MNIST dataset. Pure NumPy.

Python 786 83 Updated Oct 12, 2019

Referring Expression Datasets API

Jupyter Notebook 504 80 Updated Aug 27, 2024

[AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation"

Python 40 3 Updated Dec 20, 2023

[MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation

Python 35 Updated Dec 15, 2024

[ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"

Python 69 3 Updated Oct 13, 2024

Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"

Python 232 13 Updated May 1, 2023

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 556 42 Updated May 8, 2024

[ICLR 2025] Duoduo CLIP: Efficient 3D Understanding with Multi-View Images

Python 50 3 Updated Mar 20, 2025