This project is the original Torch implementation of FReLU: Flexible Rectified Linear Units for Improving Convolutional Neural Networks. It is built on a clone of Facebook's ResNet implementation in Torch, which uses ReLU.
Other implementations (many thanks to the contributors):
- FReLU in Caffe by Dmytro Mishkin.
This is the first time I have shared experimental code, so it may be a little messy. Any comments or pull requests are appreciated.
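For quick reference, FReLU augments ReLU with a single learnable bias that is shared across a whole layer (following the paper's layer-wise formulation; $b_l$ below denotes the bias of layer $l$):

$$\mathrm{frelu}(x) = \mathrm{relu}(x) + b_l = \max(x, 0) + b_l,$$

where $b_l$ is learned jointly with the network weights.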
We list some questions about the method here; some come from the discussion on Reddit.
- What is the purpose of this work?
  We want to figure out: 1) the effect of negative activation values on networks, and 2) the compatibility between activation functions and batch normalization.
- Layer-wise biases or channel-wise biases?
  FReLU uses layer-wise biases. We do not recommend channel-wise biases, which can easily make training harder.
- How does FReLU compare to SELU?
  The bias in FReLU is not a constant value; we think it is hard to say which value is best.
- When should FReLU be used?
  FReLU is helpful for smaller networks and is also a good choice when using batch normalization.
- What are the failure cases of FReLU?
  By monitoring the bias value in FReLU, we observe that a positive bias harms training. For large networks with large capacity, FReLU may lose its advantage.
- What about the theory?
  The current paper gives only a limited theoretical analysis. The intuition mainly comes from ELU, from normalizing networks, and from the expressiveness of rectifier networks.
- Future work?
  - Different tasks (e.g., classification vs. regression) may need different activation functions. Exploring task-specific activation functions helps us understand the corresponding task and network architecture.
  - Theoretical developments on network architecture and learning behavior will better guide the design of activation functions.
We appreciate any comments or discussions about activation functions. More observations will get us closer to the truth.
Example training commands are available in the following scripts. Please read the corresponding script before running it. More scripts are in the `scripts` subfolder.
- PReLU: `run-prelu.sh`
- SReLU: `scripts/cifar100-pelu-smallnet-srelu-seed.sh`
- ELU: `run-elu.sh`
- ReLU: `run-relu.sh`
- ReLU with ResNet: `run-resnet.sh`
- FReLU: `run-possrelu.sh`
- FReLU with ResNet: `run-resnet-possrelu.sh`
The Torch implementation of FReLU is `models/frelu/PosSReLU.lua`.
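For orientation, the following is a minimal, hedged sketch of how such a layer can be written as a Torch `nn` module. It is not the repository's `PosSReLU.lua`; the module name `nn.FReLUSketch` is made up, and the code only illustrates the forward/backward logic of a ReLU plus one shared learnable bias.

```lua
require 'nn'

-- Sketch of a layer-wise FReLU: f(x) = max(x, 0) + b, with b learnable.
local FReLUSketch, parent = torch.class('nn.FReLUSketch', 'nn.Module')

function FReLUSketch:__init()
   parent.__init(self)
   -- a single bias shared by every unit in the layer (layer-wise, not channel-wise);
   -- naming the fields bias/gradBias lets the default nn.Module:parameters() pick them up
   self.bias = torch.Tensor(1):zero()
   self.gradBias = torch.Tensor(1):zero()
end

function FReLUSketch:updateOutput(input)
   self.output:resizeAs(input):copy(input)
   self.output[input:lt(0)] = 0       -- ReLU part: zero out negative inputs
   self.output:add(self.bias[1])      -- shift everything by the learnable bias
   return self.output
end

function FReLUSketch:updateGradInput(input, gradOutput)
   -- df/dx = 1 where x > 0 and 0 elsewhere; the bias does not change this
   self.gradInput:resizeAs(gradOutput):copy(gradOutput)
   self.gradInput[input:le(0)] = 0
   return self.gradInput
end

function FReLUSketch:accGradParameters(input, gradOutput, scale)
   -- df/db = 1 everywhere, so the bias gradient is the sum of gradOutput
   self.gradBias[1] = self.gradBias[1] + (scale or 1) * gradOutput:sum()
end
```

In use, such a module would simply replace `nn.ReLU` in a model definition, e.g. `model:add(nn.FReLUSketch())`.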
To monitor the bias values in FReLU, use `th show.lua -model $your_model_path`.
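If you prefer to inspect a checkpoint by hand, a sketch along the following lines should also work. The module name `nn.PosSReLU` and its `bias` field are assumptions about how the layer registers itself, so adjust them to the actual implementation; checkpoints saved on GPU may additionally need `require 'cunn'`.

```lua
-- inspect_bias.lua (hypothetical): print the learnable bias of every FReLU layer
require 'torch'
require 'nn'

local cmd = torch.CmdLine()
cmd:option('-model', '', 'path to a saved model checkpoint (.t7)')
local opt = cmd:parse(arg)

local model = torch.load(opt.model)
-- findModules collects every module of the given type from the container
local layers = model:findModules('nn.PosSReLU')   -- assumed module name
for i, m in ipairs(layers) do
   print(string.format('FReLU layer %d: bias = %.6f', i, m.bias[1]))
end
```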
The code in `draw` is used to read and plot curves from the log files.
To run the visualization experiment, just `cd mnist` and run `th *.lua`. FReLU may need several runs, since some initial parameters can lead to dead neurons.
Model files:

File | Network | Activation |
---|---|---|
resnet-possrelu | Original bottleneck | FReLU |
elu-resnet-possrelu | Without activation after the addition | FReLU |