Implements the Wasserstein Loss Layer (as described in Frogner et al. 2015) in Caffe. (Previously, the only open source implementation of the loss function was in Mocha.jl, which we used as a reference)
Check ./project.pdf
Check ./examples/mnist and ./examples/sample_wasserstein
Fix the test to check for difference in gradient.