Jailbreaking on Text-to-Video Models via Scene Splitting Strategy (ICLR 2026)

This is the official repository for the paper "Jailbreaking on Text-to-Video Models via Scene Splitting Strategy".

📢 News

[Jan 2026] 🎉 Code will be released soon.
[Jan 2026] 🎉 SceneSplit has been accepted to ICLR 2026.

🌟 Overview

Despite the rapid advancement of Text-to-Video (T2V) models, their safety vulnerabilities remain largely unexplored. SceneSplit is a novel black-box jailbreak method that bypasses safety filters by fragmenting a harmful narrative into multiple scenes that are individually benign. By sequentially combining these safe scenes, the method constrains the generative output space to an unsafe region, significantly increasing the likelihood of generating harmful content.

Core Mechanism: While each scene individually corresponds to a wide and safe space, their sequential combination collectively restricts this space to an unsafe region.

📝 Citation

@article{lee2025jailbreaking,
  title={Jailbreaking on Text-to-Video Models via Scene Splitting Strategy},
  author={Lee, Wonjun and Park, Haon and Lee, Doehyeon and Ham, Bumsub and Kim, Suhyun},
  journal={arXiv preprint arXiv:2509.22292},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
figs		figs
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Jailbreaking on Text-to-Video Models via Scene Splitting Strategy (ICLR 2026)

📢 News

🌟 Overview

📝 Citation

About

Uh oh!

Releases

Packages

velpegor/SceneSplit

Folders and files

Latest commit

History

Repository files navigation

Jailbreaking on Text-to-Video Models via Scene Splitting Strategy (ICLR 2026)

📢 News

🌟 Overview

📝 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages