[icml, iclr, cvpr, neurips, eccv, iccv]: browse through whole conference by just reading a book of abstract. My fingers hurt having to click each paper. We want to read everything lol. So, i took t…

TeX 7 2 Updated Mar 27, 2025

1y33 / DiT

Python 5 Updated Mar 27, 2025

roboflow / rf-detr

RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.

Python 1,438 124 Updated Mar 29, 2025

QianWangX / EditCLIP

Implementation of paper EditCLIP: Representation Learning for Image Editing

Python 12 Updated Mar 26, 2025

Qais-Hweidi / ai-assisted-development-guide

Practical tips for effective AI-assisted software development

159 3 Updated Mar 29, 2025

kijai / ComfyUI-WanVideoWrapper

Python 1,727 92 Updated Mar 30, 2025

ZichenWen1 / DART

Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"

Python 26 Updated Mar 29, 2025

NVlabs / VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,074 255 Updated Mar 25, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 9,351 645 Updated Mar 27, 2025

HELLORPG / HATReID-MOT

History-Aware Transformation of ReID Features for Multiple Object Tracking

7 Updated Mar 15, 2025

feifeiobama / Awesome-Person-ReID

A curated list of Person Re-Identification papers and BibTeX entries

TeX 17 2 Updated Feb 24, 2024

chicleee / Image-Matching-Paper-List

A personal list of papers and resources of image matching and pose estimation, including perspective images and panoramas.

302 31 Updated Mar 26, 2025

magicleap / SuperGluePretrainedNetwork

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

Python 3,560 701 Updated Aug 30, 2024

AnswerDotAI / rerankers

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,360 78 Updated Mar 20, 2025

AnswerDotAI / ModernBERT

Bringing BERT into modernity via both architecture changes and scaling

Python 1,300 102 Updated Mar 25, 2025

showlab / Impossible-Videos

Python 63 4 Updated Mar 20, 2025

open-compass / Creation-MMBench

Assessing Context-Aware Creative Intelligence in MLLMs

JavaScript 11 Updated Mar 26, 2025

Alvin-Zeng / Awesome-Temporal-Action-Localization

A curated list of temporal action localization/detection and related area (e.g. temporal action proposal) resources.

577 65 Updated Sep 22, 2022

zhenyingfang / Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation

Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation

476 36 Updated Mar 28, 2025

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,526 1,270 Updated Aug 14, 2024

JerryX1110 / awesome-segment-anything-extensions

Segment-anything related awesome extensions/projects/repos.

345 14 Updated Jun 28, 2023

Hedlen / awesome-segment-anything

Tracking and collecting papers/projects/others related to Segment Anything.

1,601 132 Updated Mar 13, 2025

VainF / Awesome-Anything

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

1,769 98 Updated Nov 15, 2023

sail-sg / SkyLadder

Forked from jzhang38/TinyLlama

The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Ifty Mohammad Rezwan imr555

Highlights

Lists (32)

chatgpt_learn

clean_vllm

cuda_stuff

extra_dl_stuff

Fun_stuff

interest_vis_research_1_3_24

javascript_learn_101

jax_nerf_101

Laion_Stuff

learn_ai

learn_all_grounding_open_set

learncpp

llm_local

LLM_Stuff

machine_learning_engineering

Medical_image_computing_ucf

mlops_Software_Design

neovim

open_source_lm

paper_review_advanced_cv

paper_review_medical_image

rust_start_101

segmentation_ucf_2024_1

shafin_sir_multi_label_few_shot_

software_design

software_engineering

Software_tooling_general

state_space_models

study

summer_24_stuff

super_hack_list

visual_language_models_ucf

Starred repositories

neural-radiance-fields

neural-fields

termux-hacking

video-summarization

information-retrieval

retrieval-model

document-retrieval

particle-swarm-optimization

finite-state-transducers

speech-recognition