Virtual Branching

vbranch is a deep-learning framework that improves the accuracy and efficiency of neural networks by sharing parameters between multiple branches.

Main idea: Partially share parameters between multiple models ("branches") to reduce the overall parameter footprint while maintaining feature diversity (increased feature diversity generally leads to improved ensemble performance). We also incorporate bagging to further improve performance.
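One way to realize the bagging component is to train each branch on its own bootstrap resample of the training set; the sketch below shows only that resampling step, and the function name, seed handling, and per-branch sampling scheme are assumptions rather than the repo's actual pipeline.

```python
# Illustrative only: per-branch bootstrap resampling for bagging.
# Assumes each branch trains on its own bootstrap sample of the data;
# the actual vbranch data pipeline may differ.
import numpy as np

def bootstrap_indices(n_samples, n_branches, seed=0):
    rng = np.random.default_rng(seed)
    # Sample indices with replacement, one index array per branch.
    return [rng.integers(0, n_samples, size=n_samples) for _ in range(n_branches)]

# Example: 3 branches over a 60,000-sample training set (e.g., MNIST).
branch_indices = bootstrap_indices(60000, n_branches=3)
```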

Datasets:

  • MNIST
  • CIFAR10
  • Omniglot (triplet loss)
  • Market 1501 (person re-identification) (triplet loss)

We analyze the strength of and correlation between branches to explore how varying the level of parameter sharing affects ensemble performance (following the analysis in the Random Forest paper).

The vbranch implementation is modeled after Keras. Currently supported layers include Dense, Batch Normalization, Conv2D, padding layers, and pooling layers. We provide implementations of ResNet and DenseNet.
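To make the parameter-sharing idea concrete, below is a minimal sketch of a Dense layer with partial weight sharing across branches, written as a custom tf.keras layer. The names VBDense, n_branches, and shared_frac are illustrative assumptions and are not necessarily vbranch's actual API.

```python
# Sketch of partial weight sharing across branches as a custom tf.keras layer.
# Names (VBDense, n_branches, shared_frac) are illustrative, not vbranch's API.
import tensorflow as tf

class VBDense(tf.keras.layers.Layer):
    """Dense layer whose output units are partly shared across branches."""

    def __init__(self, units, n_branches=2, shared_frac=0.5, **kwargs):
        super().__init__(**kwargs)
        self.units = units
        self.n_branches = n_branches
        self.n_shared = int(units * shared_frac)   # output units shared by all branches
        self.n_unique = units - self.n_shared      # output units owned by each branch

    def build(self, input_shape):
        d = int(input_shape[-1])
        self.w_shared = self.add_weight(
            name="w_shared", shape=(d, self.n_shared), initializer="glorot_uniform")
        self.w_unique = [
            self.add_weight(name=f"w_unique_{i}", shape=(d, self.n_unique),
                            initializer="glorot_uniform")
            for i in range(self.n_branches)]

    def call(self, x):
        # Each branch multiplies by the shared block concatenated with its own block,
        # so parameters grow sublinearly in the number of branches.
        return [tf.matmul(x, tf.concat([self.w_shared, w_i], axis=1))
                for w_i in self.w_unique]
```

With input dimension d, this uses d * (n_shared + n_branches * n_unique) weights instead of the n_branches * d * units required by an ensemble of independent Dense layers.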

ArXiv pre-print: "Virtual CNN Branching: Efficient Feature Ensemble for Person Re-Identification" https://arxiv.org/pdf/1803.05872.pdf

Results

MNIST

[Figures: fcn-results]

FCN = 784 (input) -> 512 -> 10 (output); batch norm + ReLU; softmax; 20 epochs; learning rate = 0.001; test metric = "before mean" accuracy
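For reference, a single-branch baseline matching this configuration might look roughly like the following in tf.keras; the Adam optimizer and sparse cross-entropy loss are assumptions beyond the spec above.

```python
# Baseline (single-branch) FCN for MNIST, sketched in tf.keras.
# Layer sizes, batch norm/ReLU, softmax, epochs, and learning rate follow the
# spec above; the optimizer and loss are assumptions.
import tensorflow as tf

fcn = tf.keras.Sequential([
    tf.keras.Input(shape=(784,)),
    tf.keras.layers.Dense(512),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.ReLU(),
    tf.keras.layers.Dense(10, activation="softmax"),
])
fcn.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
            loss="sparse_categorical_crossentropy",
            metrics=["accuracy"])
# fcn.fit(x_train, y_train, epochs=20)  # x_train flattened to shape (n, 784)
```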

[Training graph: mnist-val-acc]

CNN = 1 (input) -> 16 -> 16 -> 32 -> 32 filters; batch norm + ReLU; softmax; 30 epochs; learning rate = 0.001; test metric = "before mean" accuracy

The CNN converges by around epoch 30.

Correlation and strength (from the Random Forest paper):

[Figure: corr-strength]
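For context, this analysis follows Breiman's Random Forest paper, which bounds the ensemble generalization error PE* in terms of the strength s of the individual members and their mean correlation ρ̄:

```latex
% Generalization-error bound from Breiman (2001), "Random Forests"
PE^{*} \le \frac{\bar{\rho}\,(1 - s^{2})}{s^{2}}
```

Higher strength and lower correlation both tighten the bound, which is why both quantities are tracked as the level of parameter sharing varies.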

Omniglot

[Figures: omniglot-results]

CNN = 1 (input) -> 32 -> 32 -> 64 -> 64 -> 128 -> 128 -> 256 -> 256 filters; batch norm + ReLU; softmax; 90 epochs; learning rate = 0.001; branch embeddings are concatenated
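A minimal sketch of the "concatenate embeddings" step at evaluation time, assuming each branch produces its own embedding and the final descriptor is their concatenation; the L2 normalization is an added assumption (common for retrieval), not something stated above.

```python
# Sketch: combine per-branch embeddings into a single retrieval descriptor.
# The L2 normalization is an assumption, not confirmed by the README.
import tensorflow as tf

def concat_embeddings(branch_embeddings):
    """branch_embeddings: list of [batch, dim] tensors, one per branch."""
    emb = tf.concat(branch_embeddings, axis=-1)   # [batch, n_branches * dim]
    return tf.math.l2_normalize(emb, axis=-1)     # unit-length descriptor
```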

Notes

Can we reduce the batch size of individual branches when using vbranch and achieve the same performance improvement with less computation?

How can we decrease the correlation between virtual branches?

What is the effect of varying shared_frac throughout the model (e.g., could a lower level of sharing in the lower layers help propagate variance to the final layers)?

Person re-ID: ignore samples that OpenPose cannot detect properly.

Related Papers

Efficient Parametrization of Multi-Domain Deep Neural Networks (Rebuffi et al., CVPR 2018): http://openaccess.thecvf.com/content_cvpr_2018/papers/Rebuffi_Efficient_Parametrization_of_CVPR_2018_paper.pdf

https://arxiv.org/pdf/1511.02954.pdf

http://leemon.com/papers/2009scsb.pdf

Net2Net: https://arxiv.org/pdf/1511.05641.pdf

TF QueueRunners: https://adventuresinmachinelearning.com/introduction-tensorflow-queuing/

TF input pipelines with queues and multi-threading: https://blog.metaflow.fr/tensorflow-how-to-optimise-your-input-pipeline-with-queues-and-multi-threading-e7c3874157e0

TF Dataset API: https://towardsdatascience.com/how-to-use-dataset-in-tensorflow-c758ef9e4428

Bagging (with derivations): https://www.stat.berkeley.edu/~breiman/bagging.pdf

Random Forest: https://www.stat.berkeley.edu/~breiman/randomforest2001.pdf
