Moccasin

This repo implements Moccasin, a constraint programming (CP) method for rematerialization. It takes as input a computation graph in nx.DiGraph and solves a CP using OR-Tools' CP-SAT Solver.

Installation

Download the repo

git clone https://github.com/haoming-codes/moccasin.git
cd moccasin

Create the environment (e.g. using conda)

conda create -n moccasin_env python=3.9
conda activate moccasin_env
pip3 install -e .

Reproducing the paper

Run the experiments

python3 main.py -g "data/random_layered_n100_w0.27_nlv0.75_ed0.2_scd0.14.json" -m 0.9 -o "output/icml"
python3 main.py -g "data/random_layered_n100_w0.27_nlv0.75_ed0.2_scd0.14.json" -m 0.8 -o "output/icml"
python3 main.py -g "data/random_layered_n250_w0.43_nlv0.75_ed0.2_scd0.14.json" -m 0.9 -o "output/icml"
python3 main.py -g "data/random_layered_n250_w0.43_nlv0.75_ed0.2_scd0.14.json" -m 0.8 -o "output/icml"
python3 main.py -g "data/random_layered_n500_w0.36_nlv0.75_ed0.2_scd0.14.json" -m 0.9 -o "output/icml"
python3 main.py -g "data/random_layered_n500_w0.36_nlv0.75_ed0.2_scd0.14.json" -m 0.8 -o "output/icml"
python3 main.py -g "data/random_layered_n1000_w0.31_nlv0.75_ed0.2_scd0.14.json" -m 0.9 -o "output/icml"
python3 main.py -g "data/random_layered_n1000_w0.31_nlv0.75_ed0.2_scd0.14.json" -m 0.8 -o "output/icml"
python3 main.py -g "data/ResNet50 (MLSys)_256_(224, 224, 3)_train_nx.json" -m 0.9 -o "output/icml"
python3 main.py -g "data/ResNet50 (MLSys)_256_(224, 224, 3)_train_nx.json" -m 0.8 -o "output/icml"
python3 main.py -g "data/fcn_8_vgg (MLSys)_32_(416, 608, 3)_train_nx.json" -m 0.9 -o "output/icml"
python3 main.py -g "data/fcn_8_vgg (MLSys)_32_(416, 608, 3)_train_nx.json" -m 0.8 -o "output/icml"

Parse the results

python3 print_latex_table.py -o "output/icml"

Citing the paper

@InProceedings{pmlr-v202-bartan23a,
  title = 	 {Moccasin: Efficient Tensor Rematerialization for Neural Networks},
  author =       {Bartan, Burak and Li, Haoming and Teague, Harris and Lott, Christopher and Dilkina, Bistra},
  booktitle = 	 {Proceedings of the 40th International Conference on Machine Learning},
  pages = 	 {1826--1837},
  year = 	 {2023},
  editor = 	 {Krause, Andreas and Brunskill, Emma and Cho, Kyunghyun and Engelhardt, Barbara and Sabato, Sivan and Scarlett, Jonathan},
  volume = 	 {202},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {23--29 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v202/bartan23a/bartan23a.pdf},
  url = 	 {https://proceedings.mlr.press/v202/bartan23a.html}
}

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
data		data
moccasin.egg-info		moccasin.egg-info
src/moccasin		src/moccasin
LICENSE		LICENSE
README.md		README.md
main.py		main.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Moccasin

Installation

Reproducing the paper

Citing the paper

About

Releases

Packages

Languages

License

haoming-codes/moccasin

Folders and files

Latest commit

History

Repository files navigation

Moccasin

Installation

Reproducing the paper

Citing the paper

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages