
Accuracy issue #21

Open
taewookim opened this issue Nov 20, 2017 · 16 comments

@taewookim

When I run this model on video, I've noticed the prediction changes pretty wildly from frame to frame, depending on the face dimensions, lighting, position, etc. But as soon as the face gets close to the camera, the predictions become more consistent.

Two questions:

  1. Why was the default of 32x32 chosen?
  2. Could prediction be helped by training/running on larger dimensions (64x64, or even much larger)?
@yu4u
Owner

yu4u commented Nov 20, 2017

Hi @taewookim,

  1. I simply chose 32x32 for efficiency in development and testing. It is not appropriate for deployment because of its poor accuracy, as you pointed out.
  2. I think the image size should be larger if accuracy matters. The size 64x64 is used in the sample command in README.md: python3 create_db.py --output data/imdb_db.mat --db imdb --img_size 64. In the original paper [1], VGGNet is used, with an input image size of 224x224.

[1] R. Rothe, R. Timofte, and L. Van Gool, "DEX: Deep EXpectation of Apparent Age from a Single Image," ICCV Workshops, 2015.

@taewookim
Author

Ah, oops... I completely forgot about that. Thank you @yu4u!

@taewookim
Author

By the way, how do you figure out the upper limit on how far this architecture can be taken? I'm sure it's partly a hardware limitation and partly a question of how long I'm willing to wait, but are there any guidelines?

@yu4u
Owner

yu4u commented Nov 27, 2017

Did you mean how to determine the size of the network?
I like WideResNet with a moderate depth (16-22); it provides a good trade-off between accuracy and computational cost.
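
For concreteness, a minimal sketch of building such a model with this repo's wide_resnet.py (the WideResNet(img_size, depth=..., k=...)() call is assumed from train.py; double-check the constructor there):

```python
# Sketch: WideResNet-16-8 on 64x64 inputs, assuming the WideResNet class
# in this repo returns a two-head Keras model (gender: 2-way softmax,
# age: 101-way softmax) when the instance is called.
from wide_resnet import WideResNet

img_size, depth, k = 64, 16, 8  # depth in the moderate 16-22 range
model = WideResNet(img_size, depth=depth, k=k)()
model.compile(optimizer="sgd",
              loss=["categorical_crossentropy", "categorical_crossentropy"],
              metrics=["accuracy"])
model.summary()  # inspect how parameter count scales with depth/width
```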

@taewookim
Author

taewookim commented Dec 4, 2017

@yu4u

I tried changing img_size to 224 and the model performs really badly. For each of the 3 dense layers (i.e., classifying 3 things), the loss was anywhere between 9 and 20, and accuracy never got above 0.25. I didn't even bother finishing the 30 epochs (I stopped at 28).

I am re-training with img_size at 64, but do you have any idea where I might have gone wrong, and what are some things I can tweak?

EDIT: Lowering img_size to 32 helped lower the loss... currently on epoch 7:

Epoch 7/30 - loss: 3.7325
dense_3_loss: 1.4535 - dense_1_loss: 0.2372 - dense_2_loss: 1.5244
dense_3_acc: 0.3509 - dense_1_acc: 0.9 - dense_2_acc: 0.4345

@yu4u
Owner

yu4u commented Dec 5, 2017

Currently, my implementation is not suitable for larger image sizes (e.g. 224) for two reasons: 1) it saves and loads all raw images, which requires a large amount of memory, and 2) the overall stride of the network is only 4 (before the (8, 8) pooling) because a small input size is assumed.
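
To put rough numbers on the memory point, a back-of-envelope sketch (the ~460k figure assumes the IMDB subset of IMDB-WIKI stored as uint8; adjust for your database):

```python
# Approximate RAM needed to hold every raw face crop in memory at once,
# which is what loading the whole .mat database amounts to.
n_images = 460_000  # assumed size of the IMDB subset
for size in (32, 64, 224):
    gib = n_images * size * size * 3 / 2**30  # H x W x RGB, 1 byte each
    print("%dx%d: ~%.1f GiB" % (size, size, gib))
# -> 32x32: ~1.3 GiB, 64x64: ~5.3 GiB, 224x224: ~64.5 GiB
```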

If you use image size 224, it seems like a good idea to use one of the officially supported pre-trained models:
https://keras.io/ja/applications/
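
As an illustration, a minimal sketch of putting this project's two heads (2-way gender, 101-way age; the layer names here are made up) on an ImageNet-pretrained backbone from keras.applications:

```python
# Sketch: fine-tune a pretrained backbone at 224x224 instead of the
# repo's WideResNet. ResNet50 is an arbitrary choice; any model from
# keras.applications with include_top=False works the same way.
from keras.applications import ResNet50
from keras.layers import Dense
from keras.models import Model

base = ResNet50(include_top=False, weights="imagenet",
                input_shape=(224, 224, 3), pooling="avg")
gender_out = Dense(2, activation="softmax", name="pred_gender")(base.output)
age_out = Dense(101, activation="softmax", name="pred_age")(base.output)

model = Model(inputs=base.input, outputs=[gender_out, age_out])
model.compile(optimizer="adam",
              loss=["categorical_crossentropy", "categorical_crossentropy"],
              metrics=["accuracy"])
```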

@taewookim
Author

taewookim commented Dec 5, 2017

So what's the largest size you'd recommend with your implementation? I'm on Google Compute with a 12 GB K80 GPU and 30 GB of RAM.

@yu4u
Owner

yu4u commented Dec 6, 2017

I think 64x64 is good enough, because there seem to be many low-resolution images in the dataset.

@darshansharma

Hi, I tried this project with your pretrained model at the default depth and width, but it gives wrong predictions. For example, it predicted age 43 F for an 80-year-old lady. It predicts wrong on each and every image.

@yu4u
Owner

yu4u commented Dec 20, 2017

Please confirm that your backend and channel order are consistent with the pretrained model.
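
A quick way to check both (a sketch; compare the output against the settings in ~/.keras/keras.json that the weights were trained under):

```python
# Print the active Keras backend and channel ordering; both must match
# the configuration the pretrained weights were saved with.
from keras import backend as K

print(K.backend())            # e.g. "tensorflow" or "theano"
print(K.image_data_format())  # "channels_last" or "channels_first"
```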

@darshansharma

I am using your pretrained model, and my hardware specs are different from yours. Is that an issue?

@yu4u
Owner

yu4u commented Dec 22, 2017

What is your backend?

@ghost

ghost commented Sep 13, 2018

Hi, I'd like to ask where I can see how this model's accuracy is calculated, because its accuracy is bad: from 0.2 to 0.4.

@yu4u
Owner

yu4u commented Sep 13, 2018

@ghost

ghost commented Sep 13, 2018

Thank you for your reply, but I meant the train and test (validation) accuracy.
When I set the learning rate to 0.001 for all epochs, the model fit the training dataset, but val_loss increased, so it was over-fitting (val_loss did not decrease).

@yu4u
Owner

yu4u commented Sep 15, 2018

The validation accuracy is automatically calculated by Keras using validation_data=(X_test, [y_test_g, y_test_a]).
Overfitting would be alleviated by data augmentation.
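
For example, a hedged sketch of augmenting the two-output training data (X_train, y_train_g, y_train_a are placeholders for the arrays train.py already builds):

```python
# Sketch: geometric augmentation for a two-output Keras model.
# ImageDataGenerator.flow takes a single label array, so flow the sample
# indices alongside the images and look both label arrays up per batch.
import numpy as np
from keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(width_shift_range=0.1, height_shift_range=0.1,
                             zoom_range=0.1, horizontal_flip=True)

def two_head_flow(X, y_gender, y_age, batch_size=32):
    gen = datagen.flow(X, np.arange(len(X)), batch_size=batch_size)
    while True:
        X_batch, idx = next(gen)
        idx = idx.astype(int)  # indices of the samples in this batch
        yield X_batch, [y_gender[idx], y_age[idx]]

model.fit_generator(two_head_flow(X_train, y_train_g, y_train_a),
                    steps_per_epoch=len(X_train) // 32,
                    validation_data=(X_test, [y_test_g, y_test_a]),
                    epochs=30)
```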
