This repository is the official implementation of the paper "Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers" (Findings of ACL 2024).
To get started, clone this repository and install the required packages:

```shell
git clone https://github.com/opanhw/MM_Neurons.git
conda create -n MM_Neurons python=3.9
...
pip install -r requirements.txt
```
We evaluate three widely used multi-modal large language models: LLaVA, InstructBLIP, and mPLUG-Owl2. Model weights can be downloaded from each model's official repository.
Since we need to obtain and modify neuron activations, we have made some modifications to the model source code. Please replace `LLaVA/llava/model/language_model/llava_llama.py` in your LLaVA project path with `open_source_model/LLaVA/llava_llama.py`, and replace `mPLUG-Owl/mPLUG-Owl2/mplug_owl2/model/modeling_mplug_owl2.py` in your mPLUG-Owl2 project path with `open_source_model/mPLUG-Owl2/modeling_mplug_owl2.py`.
You should run `src/preparation.py` for some preparation work before finding neurons. A sample usage command is:

```shell
python src/preparation.py --model_type LLaVA --model_path YOUR_LLAVA_MODEL_PATH
```
- SBU Captions Dataset

See the dataset details on the SBU Captions project page. You should download the JSON file, which contains image URLs and their captions.
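The exact schema of the JSON varies by release, so as a minimal sketch (assuming each record carries `image_url` and `caption` keys — these names are hypothetical, rename them to match your file) you could materialize a local image folder like this:

```python
import json
import os
import urllib.request


def load_pairs(json_path):
    """Load (url, caption) pairs from the SBU JSON file.

    The field names 'image_url' and 'caption' are assumptions --
    adjust them to whatever keys your downloaded file actually uses.
    """
    with open(json_path) as f:
        records = json.load(f)
    return [(r["image_url"], r["caption"]) for r in records]


def download_images(pairs, out_dir, limit=10):
    """Fetch the first `limit` images into out_dir, skipping dead URLs."""
    os.makedirs(out_dir, exist_ok=True)
    for i, (url, _) in enumerate(pairs[:limit]):
        try:
            urllib.request.urlretrieve(url, os.path.join(out_dir, f"{i:05d}.jpg"))
        except OSError:
            continue  # many SBU URLs have gone stale; just skip them


# Usage (paths are examples, not part of this repo):
#   pairs = load_pairs("./datasets/sbu/sbu_captions.json")
#   download_images(pairs, "./datasets/sbu/images")
```

Expect a nontrivial fraction of URLs to be dead; the script above simply skips them rather than aborting.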
You can use `src/find_mm_neurons.py` to find multi-modal neurons. A sample usage command is:

```shell
python src/find_mm_neurons.py --model_type LLaVA --model_path YOUR_LLAVA_MODEL_PATH --save_to ./results --task_type sbu --data_path ./datasets/sbu --add_prompt --cal_activations
```
`src/find_mm_neurons.py` accepts the following arguments:

- `model_type`: model to evaluate; we support (case-insensitive) `LLaVA`, `InstructBLIP` and `mPLUG-Owl2`
- `model_path`: path to the model you choose
- `save_to`: path to save the results
- `task_type`: name of the dataset; we only support `sbu` now
- `data_path`: path to the dataset you choose
- `query`: prompt of the model; we use `Describe the image in few words.` in all experiments
- `max_num`: maximum number of samples, default is `1000`
- `start`: start index of the samples, default is `0`
- `end`: end index of the samples, default is `1000`
- `add_prompt`: whether to add the prefix "An image of" (store true)
- `cal_activations`: whether to return activations of neurons (store true)
- `shuffle`: whether to shuffle the input sequence of image tokens (store true)
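Since `start` and `end` bound the sample range, long runs can be split into smaller consecutive windows and launched one after another. The sketch below assumes the flags behave exactly as documented above; it is a convenience wrapper, not part of the repository:

```python
import subprocess


def shard_ranges(total, shard_size):
    """Split [0, total) into consecutive (start, end) windows."""
    return [(s, min(s + shard_size, total)) for s in range(0, total, shard_size)]


def run_shard(model_path, start, end):
    """Launch one find_mm_neurons.py run over samples [start, end).

    The flags mirror the sample command above; model_path is your
    local LLaVA checkpoint path.
    """
    cmd = [
        "python", "src/find_mm_neurons.py",
        "--model_type", "LLaVA",
        "--model_path", model_path,
        "--save_to", "./results",
        "--task_type", "sbu",
        "--data_path", "./datasets/sbu",
        "--start", str(start),
        "--end", str(end),
        "--add_prompt",
        "--cal_activations",
    ]
    subprocess.run(cmd, check=True)


# e.g. shard_ranges(1000, 250) -> [(0, 250), (250, 500), (500, 750), (750, 1000)]
```

Running shards sequentially keeps peak GPU memory bounded to a single model instance while still covering the full default range of 1000 samples.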
Some code is built upon Skill-Neuron. Thanks to @wonderful9462 for assisting with this work.
If you find this code useful, please cite our work as:

```bibtex
@misc{pan2024finding,
      title={Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers},
      author={Haowen Pan and Yixin Cao and Xiaozhi Wang and Xun Yang and Meng Wang},
      year={2024},
      eprint={2311.07470},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```