Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks

We provide the source code for the balanced multi-modal learning algorithm proposed in the above paper, along with implementations for the derived metrics, conditional utilization rate and conditional learning speed.

**Accepted by ICML 2022 ** [Paper]

Dependencies:

Python 3.8 / gin-config / numpy / pandas / pytorch / scikit-learn / scipy / torchvision / skimage / PIL

Workflow

We take the 3D object classification task using the ModelNet40 dataset as an example. One can train the multi-modal DNN via the balanced multi-modal learning algorithm :

python3 train.py $RESULTS_DIR/random configs/training_guided.gin

or its random version:

python3 train.py $RESULTS_DIR/random configs/training_random.gin

To analysis multi-modal DNNs' conditional utilization rate, run the following two scripts consecutively:

python3 eval.py $RESULTS_DIR/random configs/recording.gin
python3 eval.py $RESULTS_DIR/random configs/eval.gin

Citation

Please cite this work if you find the analysis or the proposed method useful for your research.

@misc{wu2022characterizing,
      title={Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks}, 
      author={Nan Wu and Stanisław Jastrzębski and Kyunghyun Cho and Krzysztof J. Geras},
      year={2022},
      eprint={2202.05306},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
configs		configs
src		src
README.md		README.md
env.sh		env.sh
eval.py		eval.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks

Dependencies:

Workflow

Citation

About

Releases

Packages

Languages

nyukat/greedy_multimodal_learning

Folders and files

Latest commit

History

Repository files navigation

Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks

Dependencies:

Workflow

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages