Custom

  • supernet : fbnet => mobilenet v2

  • LAT X

  • logs saved during architecture search: searched_result/{architecture_name}/architecture_search

  • (Architecture names must not overlap!)

    1. architecture search (default: 180 epochs)
    2. architecture sampling
  • example sh file:

#!/bin/sh
python supernet_main_file.py --train_or_sample train --architecture_name fluctuation_exp_4 --epoch 180 --warm_up 10 --gpu 3 --eta_min 1 --dataset cifar10
python supernet_main_file.py --train_or_sample sample --architecture_name fluctuation_exp_4 --gpu 3 --dataset cifar10
python architecture_main_file.py --architecture_name fluctuation_exp_4 --gpu 3 --dataset cifar10

FBNet

This repository reproduces the results of the following paper:

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
Bichen Wu, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming Wu, Yuandong Tian, Peter Vajda, Yangqing Jia, Kurt Keutzer (Facebook Research)

The layers to search are taken from a FacebookResearch repository; utility code is taken from the DARTS repository.

Advantages

  • Building blocks (searched layers) were taken from the FacebookResearch repository (quick note: their repo contains files named fbnet*, but doesn't include any FBNet architecture from their paper)
  • Latency calculation code (a minimal sketch follows this list)
  • Successfully tested on CIFAR-10
  • Logging: you can find all my logs and tensorboards in SAVED_LOGS/architectures_training (for architecture training)
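
To give a rough idea of what measuring a candidate block's latency for a lookup table can look like, here is a minimal sketch; the function name, shapes, and run counts below are my own assumptions, not the repository's actual latency-measurement API:

import time
import torch

def measure_latency_ms(block, input_shape, n_runs=100, n_warmup=10):
    # Rough CPU wall-clock latency of one candidate block, in milliseconds.
    x = torch.randn(*input_shape)
    block.eval()
    with torch.no_grad():
        for _ in range(n_warmup):   # warm-up runs are not timed
            block(x)
        start = time.time()
        for _ in range(n_runs):
            block(x)
    return (time.time() - start) / n_runs * 1000.0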

Disadvantages

  • Loss: $CE(a, w_a) \cdot \alpha \beta \log(LAT(a))$ is occasionally used instead of the paper's $CE(a, w_a) \cdot \alpha \log(LAT(a))^{\beta}$ (a sketch of the paper's loss follows this list)
  • SAVED_LOGS/supernet_training* (for supernet training) - logs where validation for the thetas optimization was run on the training data (a bug; the code has since been fixed)
  • No multi-GPU support yet
  • Training on CIFAR-10 only
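
For reference, a minimal sketch of the paper's latency-aware loss; the α and β values below are placeholders, not the repository's defaults:

import torch

def fbnet_loss(ce, latency, alpha=0.2, beta=0.6):
    # Paper's objective: CE(a, w_a) * alpha * log(LAT(a))^beta
    # ce: cross-entropy loss; latency: differentiable latency estimate (tensor)
    return ce * alpha * torch.log(latency) ** beta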

Good News!

FacebookResearch published weights for the resulting architectures: FBNet-A, FBNet-B & FBNet-C (trained on ImageNet):

https://github.com/facebookresearch/mobile-vision

Results, Cifar10

The architectures are not state of the art: we search only over filter sizes (the numbers are good for such a simple search space), and the goal is to reduce inference time on your device.

FacebookResearch didn't share the latencies of their test machines, so I couldn't verify their latency results, but I built and trained their proposed architectures:

| FBNet Architecture | top-1 validation accuracy | top-3 validation accuracy |
| --- | --- | --- |
| FBNet-A | 78.8% | 95.4% |
| FBNet-B | 82% | 96% |
| FBNet-C | 79.9% | 95.6% |
| FBNet-s8 (for Samsung Galaxy S8) | 79.6% | 95.7% |
| FBNet-iPhoneX | 76.2% | 94.3% |
| ------ | ------ | ------ |
| fbnet_cpu_sample1 | 82.8% | 98.9% |
| fbnet_cpu_sample2 | 80.6% | 95.7% |

Note: be cautious! These numbers are just the best validation accuracies (no confidence intervals, measured in a single run). Do not use them to make decisions; they are here to complement the tensorboards in the SAVED_LOGS directory. The reason I don't split the data into validation and test sets is given in the next note.

Note: as stated in the paper, and consistent with my results, training with small images (such as CIFAR's 32x32) yields many 'skip' layers in the resulting architecture. I feel that for CIFAR-10 we should search over fewer layers.

FBNet Optimization performance

We have no theoretical guarantees of convergence, so I ran a separate sanity-check experiment comparing the method with the pioneering gradient-based NAS when searching part of a network. See DARTS VS FBNet.md for the results.
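
The differentiable search itself relaxes the categorical choice of one block per layer with Gumbel-Softmax, as in the FBNet paper. Below is a minimal illustration of such a mixed operation; it is my own sketch, not the repository's code, and F.gumbel_softmax requires a newer PyTorch than the 0.4.1 image mentioned under Dependencies:

import torch
import torch.nn.functional as F

def mixed_op(x, blocks, theta, temperature=5.0):
    # Relax the categorical block choice so theta receives gradients
    # through the soft mixture weights.
    weights = F.gumbel_softmax(theta, tau=temperature)
    return sum(w * block(x) for w, block in zip(weights, blocks))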

Code Structure and Training Pipeline, Cifar10

The repository contains two neural net models:

(1) FBNet searched architectures. All tools are in the architecture_functions folder.

(2) A stochastic supernet used to search for new architectures. All tools are in the supernet_functions folder.

They use different functions and architecture specifications. Functions used by both nets are in the folders general_functions (utilities) and fbnet_building_blocks (modified code from the facebookresearch team).
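
After the supernet is trained, sampling an architecture reduces to picking one candidate per searched layer from the learned thetas. A minimal sketch, assuming thetas are stored as one 1-D tensor per layer (argmax shown here; stochastic sampling works the same way):

import torch

def sample_architecture(thetas):
    # thetas: list of 1-D tensors, one per searched layer;
    # pick the candidate block with the largest theta in each layer.
    return [int(torch.argmax(theta)) for theta in thetas]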

I encourage you to visit TRAINIG_DETAILS.md in this folder for details and instructions.

Dependencies

I have tested the code with the following dockerfile: DOCKERFILE (PyTorch 0.4.1 nightly).

That said, I think it should work well with PyTorch 0.4.0+.

License

MIT
