add loss_mean_squared_per_channel by arrufat · Pull Request #1863 · davisking/dlib

arrufat · 2019-08-19T05:33:26Z

This pull request adds support for a loss that is computed independently on each plane (channel) of the output tensor.
This loss is useful to estimate keypoints (for pose estimation, etc).
The article that motivated me to add this kind of loss was Simple Baselines for Human Pose Estimation and Tracking.
I have successfully trained a network that replicates that article with this loss.
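
To make the idea of a loss computed on every plane concrete, here is a minimal sketch in plain C++ with no dlib dependency. The function name, the channel-major memory layout, and the normalization by `k * nr * nc` are assumptions made for illustration; this is not the code added in dlib/dnn/loss.h.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not dlib's API): mean squared error averaged over
// every element of a k x nr x nc output stored in channel-major order.
double mean_squared_per_channel_and_pixel(
    const std::vector<float>& output,
    const std::vector<float>& truth,
    std::size_t k, std::size_t nr, std::size_t nc)
{
    assert(output.size() == k * nr * nc);
    assert(truth.size() == output.size());

    const std::size_t plane_size = nr * nc;
    double loss = 0.0;
    for (std::size_t c = 0; c < k; ++c)               // each channel (plane)
    {
        for (std::size_t i = 0; i < plane_size; ++i)  // each pixel of that plane
        {
            const double diff = static_cast<double>(output[c * plane_size + i])
                              - static_cast<double>(truth[c * plane_size + i]);
            loss += diff * diff;
        }
    }
    // Assumed normalization: average over all channels and pixels.
    return loss / static_cast<double>(k * plane_size);
}
```

For keypoint estimation, each channel would hold the heatmap of one keypoint, so every plane contributes its own squared-error term.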

As a unit test, I added a small example that takes a matrix with 9 white dots as an input and outputs 9 channels, each one with a white dot if there was a dot at the associated position.

Sorry for the delay in making this PR... I couldn't think of a simple example (that can actually be used to learn how to use the loss)

davisking

Thanks for the PR :)

Just a few minor things and then it's ready to merge.

arrufat · 2019-08-19T14:31:44Z

I've updated with the requested changes. Your suggested name is more self-explanatory.

arrufat · 2019-08-21T00:49:36Z

My last commit fixed a build error in the Windows checks (it built fine on my machine with GCC 9.1).
Is there anything else I need to do?

davisking · 2019-08-21T23:12:24Z

Thanks, the code looks good. I tried running the test and it passes, but I notice the loss only drops a little. Is it possible to make the test so that it runs to a low loss? Otherwise it's not obvious that it's working. For instance, you could ask it to do something simple like train 10 linear functions rather than use this more complex network in the test now.

davisking · 2019-08-21T23:16:40Z

For instance, make up 10 random linear functions. Then use them to make training data. E.g. make a random 10x5 matrix w and a random 5x1000 matrix x, and set y to w*x. Train a network with just one linear layer to map x to y. It should be able to learn it to basically 0 error and do so very quickly.
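
The suggested setup can be sketched in plain C++ as follows. This is only an illustration under stated assumptions: it uses hand-written batch gradient descent instead of dlib's trainer and layers, and the helper names, the fixed seed, the learning rate of 0.5, and the 100 iterations are all hypothetical choices rather than values from this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <random>
#include <utility>
#include <vector>

using matrix = std::vector<std::vector<double>>;  // row-major, illustration only

// Fill a rows x cols matrix with standard normal samples.
matrix random_matrix(std::size_t rows, std::size_t cols, std::mt19937& rng)
{
    std::normal_distribution<double> dist(0.0, 1.0);
    matrix m(rows, std::vector<double>(cols));
    for (auto& row : m)
        for (auto& v : row)
            v = dist(rng);
    return m;
}

// Plain matrix product a * b.
matrix multiply(const matrix& a, const matrix& b)
{
    assert(!a.empty() && a[0].size() == b.size());
    matrix c(a.size(), std::vector<double>(b[0].size(), 0.0));
    for (std::size_t i = 0; i < a.size(); ++i)
        for (std::size_t k = 0; k < b.size(); ++k)
            for (std::size_t j = 0; j < b[0].size(); ++j)
                c[i][j] += a[i][k] * b[k][j];
    return c;
}

// Mean squared error between w_hat * x and y over all outputs and samples.
double mean_squared_error(const matrix& w_hat, const matrix& x, const matrix& y)
{
    const matrix pred = multiply(w_hat, x);
    double sum = 0.0;
    for (std::size_t i = 0; i < y.size(); ++i)
        for (std::size_t j = 0; j < y[0].size(); ++j)
            sum += (pred[i][j] - y[i][j]) * (pred[i][j] - y[i][j]);
    return sum / static_cast<double>(y.size() * y[0].size());
}

// One batch gradient-descent step (constant factors are folded into lr).
void gradient_step(matrix& w_hat, const matrix& x, const matrix& y, double lr)
{
    const matrix pred = multiply(w_hat, x);  // predictions before the update
    const double n = static_cast<double>(x[0].size());
    for (std::size_t o = 0; o < w_hat.size(); ++o)
        for (std::size_t i = 0; i < x.size(); ++i)
        {
            double grad = 0.0;
            for (std::size_t s = 0; s < x[0].size(); ++s)
                grad += (pred[o][s] - y[o][s]) * x[i][s];
            w_hat[o][i] -= lr * grad / n;
        }
}

// Returns {error_before, error_after} for the 10x5 / 5x1000 setup.
std::pair<double, double> run_linear_test()
{
    std::mt19937 rng(0);
    const matrix w = random_matrix(10, 5, rng);    // 10 random linear functions
    const matrix x = random_matrix(5, 1000, rng);  // 1000 random inputs
    const matrix y = multiply(w, x);               // exact targets, no noise
    matrix w_hat(10, std::vector<double>(5, 0.0)); // the single linear layer
    const double before = mean_squared_error(w_hat, x, y);
    for (int iter = 0; iter < 100; ++iter)
        gradient_step(w_hat, x, y, 0.5);
    return {before, mean_squared_error(w_hat, x, y)};
}
```

Because y is an exact linear function of x, a single linear map can fit it perfectly, which is why the error is expected to approach zero in this setting.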

arrufat · 2019-08-21T23:18:11Z

Ok, I'll try that. I also noticed the loss drops only a little, and I think it's because predicting everything as 0 already gives a pretty low error (since there's only one pixel that would differ). I'll think of a more obvious test.
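
The zero-prediction baseline can be made concrete with a little arithmetic. As a sketch, assume each target plane has $n_r \times n_c$ pixels $y_i$, exactly one of which is 1 while the rest are 0 (the plane size is not stated in this thread, so the symbols are placeholders). An all-zero prediction then has a per-plane mean squared error of:

```latex
\mathrm{MSE}_{\text{zero}}
  = \frac{1}{n_r n_c} \sum_{i=1}^{n_r n_c} (0 - y_i)^2
  = \frac{1}{n_r n_c}
```

For a hypothetical 16x16 plane this is 1/256 ≈ 0.0039, so the loss starts out small and has little room to drop.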

davisking · 2019-08-22T00:25:12Z

That’s better. But a little linear thing that dropped the loss really low would be nice.

The unit tests also aren’t a good way to do documentation. That kind of thing should go in a fully documented example program (you don’t have to make one for this though).

arrufat · 2019-08-22T00:26:47Z

Ok, I understand. I'll redo the tests then, probably during this weekend :)

Edit: I did this because I sometimes find myself reading the unit tests to better understand how to use some parts of the library.

arrufat · 2019-08-24T12:55:27Z

I've implemented the suggested linear test. I did not fiddle much with the parameters, but I'm not very happy with the results. Maybe learning that many random functions is not easy...

error_before = 2.03315
error_after = 0.216274

For the moment, I haven't removed the previous test. Please let me know what you think, or if I misunderstood your test.

davisking

The new tests look good. With some minor tweaks they will probably give really low error.

arrufat · 2019-08-25T11:59:28Z

Thank you for all the suggestions. I've simplified the MSE computation and reduced the dimensions of the data, but it still does not converge to 0 as fast as I'd like:

error_before: 2.95371
error_after: 0.137677

arrufat · 2019-08-25T12:13:12Z

I'm suspecting maybe the tests are too random? Maybe I need to generate a more structured weight matrix w?

davisking · 2019-08-25T19:22:57Z

Don’t call set_max_num_epochs. That’s probably telling it to stop early.

arrufat · 2019-08-25T23:00:58Z

The only reason I set the value to 160 is that training stops at epoch 157 (where it reaches a learning rate of 1e-7). I've also tried a smaller learning rate, but I get the same results.

arrufat · 2019-08-26T01:19:08Z

I've modified the network a little bit:

using net_type = loss_mean_squared_per_channel_and_pixel<num_channels,
                    extract<0, num_channels, 1, dimension,
                    fc<num_outputs,
                    relu<bn_fc<fc<500,
                    input<matrix<float>>>>>>>>;

Now, the errors look like this:

error_before: 2.25673
error_after: 0.0214044

davisking · 2019-08-28T03:29:20Z

Cool. That's good enough for me :)

Remove the non-simple version of the test and the PR is good to go.

arrufat · 2019-08-28T05:30:22Z

Great! Thank you! I'm probably more happy than you about this PR (my first serious one).
I wanted to contribute to dlib so badly!!

davisking · 2019-08-28T11:23:42Z

No problem, thanks for making the PR :)

* Fixed compiler warnings
* Include the Intel MKL's iomp dll in the output folder to reduce confusion for windows users.
* Fixed build error in newer clang on OpenBSD.
* Fixed constness for lapack functions (davisking#1737)
* disable annoying warning
* Fixed global_function_search's initialization being wrong if explicitly given an empty list of initial function evaluations.
* Suppress compiler warnings
* Make things work in visual studio.
* fix some pedantic warnings (davisking#1756)
* fix some pedantic warnings
* remove unneeded assert
* more pedantic silencing (davisking#1763)
* prevent GCC from complaining about this unused parameter
* Even more warning silencing (davisking#1766) These warnings occurred when building the semantic segmentation examples
* Ensures DLIB_FALLTHROUGH macro is only set for GCC>=7 (davisking#1770)
* Feature/upgrade libjpeg (davisking#1769)
* Upgrades dlib's included libjpeg to version 8d
* Overloads load_jpeg to read from memory buffer
* Removes "__inline__" define in jconfig, broke VC build
* Changes buffer size type to size_t
* Adds a comprehensive error message when jpeg loading fails.
* Disable use of non-memory based backing store in libjpeg. This fixes libjpeg not being able to open some types of jpeg file.
* Stop building parts of libjpeg we don't need.
* Add input_grayscale_image_pyramid, issue davisking#354 (davisking#1761) Add input_grayscale_image_pyramid
* Added methods for getting keyboard and mouse clicks to image_window's Python API.
* Fixed pytest broken dependencies
* Fix python setup warnings
* Revert "Fixed pytest broken dependencies" Apparently pytest is still sort of busted. This reverts commit 5e63d01.
* Fix setting a point's y coordinate changes x instead (Python bindings) (davisking#1795)
* Add point assignment test
* Fix setting points y coordinate changes x instead (issue davisking#1794)
* Push all include and link options needed for dlib to pkg-config. We do this by getting them from the same list cmake uses.
* Fixed incorrect return type
* Fixed grammar in comments
* Added missing include
* fixed typo in docs
* fix mismatch between documentation and implementation (davisking#1835)
* Fixed cmake warning
* fixing grammar
* Fix the CMake BUILDING_PYTHON_IN_MSVC variable not getting picked up where it should.
* pybind11: cmake: ignore the check between host-python and cross-compiler (davisking#1848) When dlib is compiling, cmake will compare python architecture and target architecture. So in the cross-compiling case, it is irrelevant because host and target architecture often differ. The main problem comes from checking python architecture on host and not on target. Here is an error when compiling dlib from x86_64 to arm 32-bit target: `Python config failure: Python is 64-bit, chosen compiler is 32-bit` So: skipping the comparison when cross-compiling is enabled. Signed-off-by: Romain Naour <romain.naour@smile.fr> Signed-off-by: Alexandre PAYEN <alexandre.payen@smile.fr>
* Const-correct a LAPACK declaration and add aarch64 as a 64-bit architecture (davisking#1859)
* Added aarch64 to list of 64-bit architectures
* Const-corrected declaration of ssyevr
* Fix davisking#1849 by calling device_global_buffer() unconditionally (davisking#1862)
* Hold on to the CUDA buffer - second try see: davisking#1855 (comment)
* Fix davisking#1849 by calling device_global_buffer() unconditionally
* Simplified the device_global_buffer() code and API.
* don't cast away constness (davisking#1865)
* dpoint mutates x-coord in y-property (see davisking#1794) (davisking#1866)
* add loss_mean_squared_per_channel (davisking#1863) add loss_mean_squared_per_channel_and_pixel
* Clear truth_idxs between samples (davisking#1870)
* Clear truth_idxs between samples
* Move truth_idxs inside loop body after all
* Push to truth_idxs even when the box can't be detected; improve formatting
* Add an option to force static runtime (davisking#1847)
* dos2unix tell_visual_studio_to_use_static_runtime.cmake
* Add an option to force static runtime

add loss_mean_squared_per_channel_and_pixel

Adrià Arrufat added 2 commits June 22, 2019 20:23

add loss_mean_squared_per_channel

0ae7ae9

add test for loss_multiclass_per_channel

624dc1c

davisking requested changes Aug 19, 2019

Comment thread dlib/dnn/loss.h Outdated

Comment thread dlib/dnn/loss.h Outdated

Comment thread dlib/dnn/loss.h

rename loss and fix asserts

d78e4d5

add capture for num_channels in lambda expression

230c400

arrufat commented Aug 22, 2019

Comment thread dlib/test/dnn.cpp Outdated

set background values to make sure loss decreases significantly

d7407dc

add linear test for loss_mean_squared_per_channel_and_pixel

0b045d1

davisking requested changes Aug 25, 2019

Comment thread dlib/test/dnn.cpp Outdated

Comment thread dlib/test/dnn.cpp Outdated

Comment thread dlib/test/dnn.cpp Outdated

reduce dimension of loss_mean_squared_per_channel_and_pixel

37cea91

add an extra fc layer to loss_mean_squared_per_channel_and_pixel test

c9267e1

remove non-simple test for loss_mean_squared_per_channel_and_pixel

1000b30

davisking merged commit 170877d into davisking:master Aug 28, 2019

arrufat mentioned this pull request Jul 22, 2020

How to train CNN learn 2D coordinate information? (like shape_predicter_traier) #2136

Closed

nidegen pushed a commit to kapanu/dlib that referenced this pull request Sep 23, 2020

add loss_mean_squared_per_channel (davisking#1863)

ea1c92d

add loss_mean_squared_per_channel_and_pixel
