Connecting Dreams with Visual Brainstorming Instruction

Yasheng Sun, Bohan Li, Mingchen Zhuge, Deng-Ping Fan, Salman Khan, Fahad Shahbaz Khan, Hideki Koike

Paper

We aim to develop a straightforward framework that uses other modalities, such as natural language, to translate the original “dreamland”. We present DreamConnect, employing a dual-stream diffusion framework to manipulate visually stimulated brain signals. By integrating an asynchronous diffusion strategy, our framework establishes an effective interface with human “dreams”, progressively refining their final imagery synthesis.

Table of Contents

News
Installation
Prepare Data
Pretrained Model
Testing
License
Acknowledgements

News

[2024/08]: Paper is on arXiv.
[2024/12]: Paper is accepted by Visual Intelligence.

Step-by-step Installation Instructions

a. Create a conda virtual environment and activate it. Python >= 3.7 is required as the base environment.

conda create -n sssp python=3.7 -y
conda activate sssp

b. Install PyTorch and torchvision following the official instructions.

conda install pytorch==1.10.0 torchvision==0.8.2 -c pytorch -c conda-forge

c. Install the other dependencies. The requirements.txt file is a frozen snapshot of our environment, provided for reference; other environments might also work.

pip install -r requirements.txt

Note that transformers==4.19.2 is strictly required.
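Because the pinned versions matter here, it can help to compare the installed packages against requirements.txt before running anything. The following is a minimal sketch, not part of the repository: the helper names `parse_pins` and `find_mismatches` are hypothetical, and it only understands plain `name==version` lines.

```python
import re
from pathlib import Path

try:  # Python >= 3.8
    from importlib.metadata import version as installed_version
except ImportError:  # Python 3.7 fallback; assumes setuptools is installed
    import pkg_resources

    def installed_version(name):
        return pkg_resources.get_distribution(name).version


def parse_pins(text):
    """Return {package: version} for every `name==version` line."""
    pins = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        match = re.match(r"^([A-Za-z0-9_.\-]+)==([^\s;]+)", line)
        if match:
            pins[match.group(1)] = match.group(2)
    return pins


def find_mismatches(pins, lookup=installed_version):
    """Return {package: (pinned, installed_or_None)} for pins that differ."""
    mismatches = {}
    for name, pinned in pins.items():
        try:
            found = lookup(name)
        except Exception:  # package is not installed
            found = None
        if found != pinned:
            mismatches[name] = (pinned, found)
    return mismatches


if __name__ == "__main__":
    req = Path("requirements.txt")
    if req.is_file():
        report = find_mismatches(parse_pins(req.read_text()))
        for name, (want, got) in report.items():
            print(f"{name}: pinned {want}, installed {got}")
```

The comparison is an exact string match, so a build with a local suffix (for example a CUDA-tagged PyTorch wheel) is reported as a mismatch even when the base version agrees.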

Prepare Data

Agree to the Natural Scenes Dataset's Terms and Conditions and fill out the NSD Data Access form.
Our customized dataset: the editing instructions are located in the third_party/StableDiffusionReconstruction/codes/utils/misc directory. The images obtained after applying the instructions can be downloaded from nsd_coco_output.tar.

Pretrained Model

Download the pretrained model and place it at logs/train_res_inject_idback_train_res_inject_idback_2024-04-22/checkpoints/ckpt_epoch_50/mp_rank_00_model_states.pt.
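The target path is deeply nested, so a small helper can create the directory and confirm that the file landed where expected. This is a sketch under the assumption that it is run from the repository root; the function names are hypothetical and only the path itself comes from the instruction above.

```python
from pathlib import Path

# Path quoted from the instruction above, relative to the repository root.
CKPT_RELATIVE = Path(
    "logs/train_res_inject_idback_train_res_inject_idback_2024-04-22"
    "/checkpoints/ckpt_epoch_50/mp_rank_00_model_states.pt"
)


def prepare_checkpoint_dir(repo_root="."):
    """Create the checkpoint directory if needed and return the expected file path."""
    target = Path(repo_root) / CKPT_RELATIVE
    target.parent.mkdir(parents=True, exist_ok=True)
    return target


def checkpoint_present(repo_root="."):
    """True if the checkpoint file exists at the expected path and is not empty."""
    target = Path(repo_root) / CKPT_RELATIVE
    return target.is_file() and target.stat().st_size > 0


if __name__ == "__main__":
    print("Place the downloaded file at:", prepare_checkpoint_dir())
    print("Checkpoint present:", checkpoint_present())
```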

Instructions for Testing the Model

Step 1: Update the Checkpoint Path

Open the configuration file located at configs/test/test_res_value_inject_idback_css15.yaml and update the checkpoint path to match the paths of the downloaded models.

Download the pre-trained language-based instruction model provided by InstructDiffusion.
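After editing the configuration, a quick scan can catch checkpoint paths that still point at missing files. The sketch below is an illustration only: it does not parse YAML or rely on any key name from the config, it simply applies a heuristic (an assumption) that any token ending in `.pt`, `.pth`, or `.ckpt` is a checkpoint path.

```python
import re
from pathlib import Path

# Heuristic (assumption): a token ending in one of these suffixes is a checkpoint path.
CKPT_PATTERN = re.compile(r"""[^\s'":,\[\]]+\.(?:pt|pth|ckpt)\b""")


def checkpoint_paths(yaml_text):
    """Return checkpoint-like paths found in the config text, in order, without duplicates."""
    found = []
    for raw in yaml_text.splitlines():
        line = raw.strip()
        if line.startswith("#"):  # skip full-line comments
            continue
        line = line.split(" #", 1)[0]  # drop trailing comments
        for match in CKPT_PATTERN.findall(line):
            if match not in found:
                found.append(match)
    return found


def missing_checkpoints(yaml_text, repo_root="."):
    """Return the checkpoint-like paths that do not exist relative to repo_root."""
    root = Path(repo_root)
    return [p for p in checkpoint_paths(yaml_text) if not (root / p).is_file()]


if __name__ == "__main__":
    config = Path("configs/test/test_res_value_inject_idback_css15.yaml")
    if config.is_file():
        for path in missing_checkpoints(config.read_text()):
            print("missing:", path)
```

An empty result only means that every path matching the heuristic exists; it does not validate the rest of the configuration.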

Step 2: Pre-Aligned Features for Convenience

For ease of use, we provide pre-aligned img_clip and text_clip features.

If you are interested in training an alignment model yourself, please follow the overall procedure of fMRI-reconstruction-NSD. We also provide our trained alignment models for img_clip and text_clip. Download them and place them in train_logs/latent_diffusion_image_fp32_resume/ and train_logs/latent_diffusion_text_fp32_resume2/, respectively. Then run the commands below to regenerate the provided img_clip and text_clip files.

bash experiments/diffusion_test.sh image
bash experiments/diffusion_test.sh text
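The two commands above depend on the alignment models being in place first. A possible wrapper, sketched here with hypothetical function names, checks both directories and then runs the script once per modality; the directory names and the script path are taken from this step, and the "directory exists and is not empty" test is an assumption about what counts as ready.

```python
import subprocess
from pathlib import Path

# Directories and script path quoted from the step above.
ALIGNMENT_DIRS = {
    "image": "train_logs/latent_diffusion_image_fp32_resume",
    "text": "train_logs/latent_diffusion_text_fp32_resume2",
}


def plan_alignment_runs(repo_root="."):
    """Return (mode, command, ready) triples, where ready means the model dir is non-empty."""
    plans = []
    for mode, rel_dir in ALIGNMENT_DIRS.items():
        model_dir = Path(repo_root) / rel_dir
        ready = model_dir.is_dir() and any(model_dir.iterdir())
        plans.append((mode, ["bash", "experiments/diffusion_test.sh", mode], ready))
    return plans


def run_alignment(repo_root="."):
    """Run the alignment script for each modality, failing early if a model is missing."""
    for mode, command, ready in plan_alignment_runs(repo_root):
        if not ready:
            raise FileNotFoundError(
                f"alignment model for '{mode}' not found in {ALIGNMENT_DIRS[mode]}"
            )
        subprocess.run(command, cwd=repo_root, check=True)
```

Calling `run_alignment()` from the repository root is then equivalent to the two bash commands, with an explicit error if either model directory is missing or empty.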

Step 3: Testing the Model

Once the paths are updated, you can test the model by running the following command:

bash experiments/test_language_control.sh

Acknowledgements

Many thanks to these excellent open source projects:

[InstructPix2Pix] (https://github.com/timothybrooks/instruct-pix2pix)
[fMRI-reconstruction-NSD] (https://github.com/MedARC-AI/fMRI-reconstruction-NSD)
[Versatile-Diffusion] (https://github.com/SHI-Labs/Versatile-Diffusion)
[Stable-Diffusion] (https://github.com/CompVis/stable-diffusion)
[InstructDiffusion] (https://github.com/cientgu/InstructDiffusion)

Citation

If you find our paper and code useful for your research, please consider citing:

@article{sun2024connectingdreamsvisualbrainstorming,
      title={Connecting Dreams with Visual Brainstorming Instruction},
      author={Yasheng Sun and Bohan Li and Mingchen Zhuge and Deng-Ping Fan and Salman Khan and Fahad Shahbaz Khan and Hideki Koike},
      year={2025},
      journal={Visual Intelligence}
}
