Skip to content

[Technical Report] A Comprehensive Evaluation of Nano Banana Pro on 14 Low-Level Vision Tasks and 40 Datasets

License

Notifications You must be signed in to change notification settings

Zplusdragon/LowLevelBanana

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

18 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Is Nano Banana Pro a Low-Level Vision All-Rounder? 🍌

Teaser Image

Jialong Zuo, Haoyou Deng, Hanyu Zhou, Jiaxin Zhu, Yicheng Zhang, Yiwei Zhang, Yongxin Yan, Kaixing Huang, Weisen Chen, Yongtai Deng, Rui Jin, Nong Sang, Changxin Gao

School of Artificial Intelligence and Automation, Huazhong University of Science and Technology (HUST)


πŸ“’ Introduction

This repository hosts the official resources for the technical report: "Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets.".

While commercial T2I models like Nano Banana Pro excel in creative synthesis, their potential as generalist solvers for traditional low-level vision challenges remains largely underexplored. In this study, we investigate the critical question: Is Nano Banana Pro a Low-Level Vision All-Rounder? We conducted a comprehensive zero-shot evaluation across 14 distinct low-level tasks spanning 40 diverse datasets.

Teaser Image

Figure 1: Exemplary zero-shot results of Nano Banana Pro across 14 low-level vision tasks.

πŸ”₯ Key Highlights

  • Massive Benchmark: Evaluated on 14 low-level vision tasks and 40 datasets.
  • Zero-Shot Setting: Utilized simple textual prompts without any fine-tuning.
  • The Dichotomy Discovery: We reveal a distinct performance dichotomy:
    • βœ… Superior Subjective Quality: Often hallucinates plausible high-frequency details that surpass specialist models.
    • ❌ Lower Reference-Based Metrics: Lags behind in PSNR/SSIM due to the inherent stochasticity of generative models.

πŸ“Š Evaluation Results

Detailed quantitative and qualitative comparisons can be found in our project page and full report.

Our extensive analysis identifies Nano Banana Pro as a capable zero-shot contender for low-level vision tasks. While it struggles to maintain the strict pixel-level consistency required by conventional metrics (PSNR/SSIM), it offers superior visual quality, suggesting a need for new perception-aligned evaluation paradigms.

We have released the evaluation datasets and corresponding inference results of Nano Banana Pro used in our study on HuggingFace to facilitate future research.

Download the Inference Results on HuggingFace

πŸ’» Evaluation Code

After downloading the inference results of Nano Banana Pro for each dataset from HuggingFace, you can use the evaluation code provided for each task to obtain quantitative results. Please refer to the eval folder.

πŸ”— Citation

If you find this work helpful for your research, please consider citing:

@misc{zuo2025nanobananaprolowlevel,
      title={Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets}, 
      author={Jialong Zuo and Haoyou Deng and Hanyu Zhou and Jiaxin Zhu and Yicheng Zhang and Yiwei Zhang and Yongxin Yan and Kaixing Huang and Weisen Chen and Yongtai Deng and Rui Jin and Nong Sang and Changxin Gao},
      year={2025},
      eprint={2512.15110},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.15110}, 
}

About

[Technical Report] A Comprehensive Evaluation of Nano Banana Pro on 14 Low-Level Vision Tasks and 40 Datasets

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published