Skip to content
View imr555's full-sized avatar
🤖
There is no easy day. The only easy day was yesterday - Pritom Mojumder
🤖
There is no easy day. The only easy day was yesterday - Pritom Mojumder

Block or report imr555

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

chatgpt_learn

Chatgpt Related Stuff
9 repositories

clean_vllm

clean code_vllms
2 repositories

cuda_stuff

Learning Cuda resources
1 repository

extra_dl_stuff

stuff to look into
2 repositories

Fun_stuff

Just random fun stuff to do in my own time
23 repositories

interest_vis_research_1_3_24

Interesting papers for initial research plans for cac
5 repositories

javascript_learn_101

Repositories I am going to use to start learning Javascript
2 repositories

jax_nerf_101

Study jax and jax for starters
6 repositories

Starred repositories

Showing results

[CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"

Python 105 2 Updated Mar 25, 2025

Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model

Python 32 Updated Mar 28, 2025

An encyclopedia of jailbreaking techniques to make AI models safer.

Python 198 14 Updated Mar 30, 2025

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!

Python 91 3 Updated Mar 4, 2025

Scaling Vision Pre-Training to 4K Resolution

80 5 Updated Mar 26, 2025

[icml, iclr, cvpr, neurips, eccv, iccv]: browse through whole conference by just reading a book of abstract. My fingers hurt having to click each paper. We want to read everything lol. So, i took t…

TeX 7 2 Updated Mar 27, 2025
Python 5 Updated Mar 27, 2025

RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.

Python 1,438 124 Updated Mar 29, 2025

Implementation of paper EditCLIP: Representation Learning for Image Editing

Python 12 Updated Mar 26, 2025

Practical tips for effective AI-assisted software development

159 3 Updated Mar 29, 2025

Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"

Python 26 Updated Mar 29, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,074 255 Updated Mar 25, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 9,351 645 Updated Mar 27, 2025

History-Aware Transformation of ReID Features for Multiple Object Tracking

7 Updated Mar 15, 2025

A curated list of Person Re-Identification papers and BibTeX entries

TeX 17 2 Updated Feb 24, 2024

A personal list of papers and resources of image matching and pose estimation, including perspective images and panoramas.

302 31 Updated Mar 26, 2025

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

Python 3,560 701 Updated Aug 30, 2024

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,360 78 Updated Mar 20, 2025

Bringing BERT into modernity via both architecture changes and scaling

Python 1,300 102 Updated Mar 25, 2025
Python 63 4 Updated Mar 20, 2025

Assessing Context-Aware Creative Intelligence in MLLMs

JavaScript 11 Updated Mar 26, 2025

A curated list of temporal action localization/detection and related area (e.g. temporal action proposal) resources.

577 65 Updated Sep 22, 2022

Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation

476 36 Updated Mar 28, 2025

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,526 1,270 Updated Aug 14, 2024

Segment-anything related awesome extensions/projects/repos.

345 14 Updated Jun 28, 2023

Tracking and collecting papers/projects/others related to Segment Anything.

1,601 132 Updated Mar 13, 2025

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

1,769 98 Updated Nov 15, 2023

The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Python 27 Updated Mar 20, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,246 502 Updated Feb 26, 2025
Next
Showing results