Skip to content
View Jingkang50's full-sized avatar
πŸ’
Today's Fruit
πŸ’
Today's Fruit

Highlights

  • Pro

Block or report Jingkang50

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Jingkang50/README.md

Hi there! πŸ‘‹ I'm Jingkang Yang.

πŸŽ“ Currently pursuing a PhD in Visual Perception and Reasoning.

πŸ” My research interests revolve around Vision-Language Models 🧠, Embodied Agents πŸ€–, and Scene Graph Generation πŸ•Έ. I am passionate about creating generalist AI models capable of understanding and interacting with complex visual data.

πŸš€ My Ongoing Research Projects:

  • Visual Generalist Models: Developing models that process diverse visual data (e.g., images, videos, 3D, audio, IMU) to tackle various tasks in perception, reasoning, generation, robotics, and gaming. Notable projects include EgoLife, Octopus, FunQA, and Otter.

  • AI Safety for Foundation Models: Investigating how to mitigate hallucinations in large language models (LLMs) and multimodal models (LMMs). A key contribution is the introduction of UPD to withhold answers when faced with unsolvable questions.

πŸ† Previous Contributions:

  • PSG Series (2022-2023): Led the development of the PSG, PVSG, and PSG4D models, focusing on relation modeling for scene understanding. I also collaborated on works like Relate-Anything and PairNet.

  • OOD Detection (2021-2022): Led a comprehensive survey and developed OpenOOD, a popular codebase for Out-of-Distribution detection in AI safety.

  • Prompt Tuning (2022): Contributed to foundational works like CoOp and CoCoOp for prompt tuning in vision-language models.

πŸ“ˆ GitHub Stats

Jingkang50's GitHub stats


πŸ“¬ Get in Touch

Feel free to reach out for collaboration or just to chat about AI and technology!

Thanks for visiting my profile!

Pinned Loading

  1. EvolvingLMMs-Lab/EgoLife Public

    [CVPR 2025] EgoLife: Towards Egocentric Life Assistant

    Python 244 16

  2. dongyh20/Octopus Public

    [ECCV2024] πŸ™Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.

    Python 285 19

  3. OpenPSG Public

    Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

    Python 443 69

  4. OpenOOD Public

    Benchmarking Generalized Out-of-Distribution Detection

    Python 940 127

  5. EvolvingLMMs-Lab/RelateAnything Public

    Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.

    Python 454 21

  6. KaiyangZhou/CoOp Public

    Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

    Python 1.9k 212