Skip to content

deep-overflow/deep-overflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 

Repository files navigation

πŸ‘‹ Hi, I'm Seongchan Kim (κΉ€μ„±μ°¬)

πŸŽ₯ Video Generation & Multimodal Large Language Models (MLLM)
πŸ§‘β€πŸ’» Integrated M.S./Ph.D. @CVLAB in KAIST AI

I design next-generation video generation models and build evaluation frameworks for understanding and improving video diffusion models.
Currently exploring interaction-aware video generation and multimodal understanding of videos.


πŸ§ͺ Research Highlights

  • 🎬 Video Generation & Evaluation β€” Improving interaction fidelity and multi-instance understanding in video diffusion transformers
  • 🧩 Video Object Segmentation (VOS) β€” Multi-granularity & referring VOS with language and temporal reasoning
  • 🧠 MLLM for Video β€” Leveraging multimodal large language models to better understand and describe video content

πŸ“ Publications

  • Self-Evolving Neural Radiance Fields
    Wild3D Workshop @ ICCV 2025
    πŸ”— Project Page

  • MUG-VOS: Multi-Granularity Video Object Segmentation
    AAAI 2025
    πŸ”— Project Page

  • Referring Video Object Segmentation via Language Aligned Track Selection
    arXiv 2025
    πŸ”— Project Page

  • InterRVOS: Interaction-aware Referring Video Object Segmentation
    Under review at AAAI 2026
    πŸ”— Project Page

  • MATRIX: Mask Track Alignment for Interaction-Aware Video Generation
    Under review at ICLR 2026


🌎 Links

✨ β€œUnderstanding the World through Video and Multimodalities.”

Wave

πŸ”„ Last updated: 2025λ…„ 9μ›” 28일 | πŸ’» Made with ❀️ by Deep Overflow

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published