Skip to content

ByteDance-BandAI/CodeVision

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Logo

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

license

Overview

  • Introduction: A framework leveraging code-as-tool and comprehensive SFT/RL datasets for "thinking with images".

  • Features: Supports multi-turn agent loops for the Qwen2.5-VL and Qwen3-VL series.

  • Datasets: Includes an SFT dataset constructed using GPT-5-High and an RL dataset covering diverse domains.

Overview

Getting Started

Coming soon...

About

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

Resources

Stars

Watchers

Forks