Skip to content

xid32/xid32

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

18 Commits
Β 
Β 

Repository files navigation

Hi there πŸ‘‹

I'm Xingjian Diao, a Ph.D. candidate in Computer Science at Dartmouth College 🌲, co-advised by Prof. Soroush Vosoughi and Prof. Jiang Gui. During my Ph.D. at Dartmouth, I interned twice at Amazon on computer vision and robotics (Summer 2025) and VLM systems (Summer 2026), and at Samsung Research America on agentic memories (Spring 2026).

Previously, I completed my M.S. in Computer Science at Northwestern University πŸ’œ, advised by Prof. Nabil Alshurafa. I received my B.S. in Computer Science from the University of Pittsburgh πŸ’™, graduating with Cum Laude honors.


πŸ” Research

My research focuses on multimodal learning for video, audio, and language understanding. I have developed methods for multimodal reasoning, efficient multimodal learning, and generative multimodal modeling, aiming to build scalable and generalizable multimodal models that advance multimodal question answering, video understanding, and audio–visual reasoning across complex real-world scenarios and dynamic environments. Highlights of my work include:


πŸ§‘β€πŸ’» Internship Experience

  • Amazon Science (Jun 2026 – Sept 2026)
    Applied Scientist Intern, Sunnyvale, CA
    Research on vision language models.

  • Samsung Research America (Mar 2026 – Jun 2026)
    NLP Research Intern, Mountain View, CA
    Research on agentic memories.

  • Amazon Science (Jun 2025 – Sept 2025)
    Applied Scientist Intern, Santa Cruz, CA
    Research on computer vision and robotics.


About

About Me

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors