problem about loss value #18

Open
Cong222 opened this issue Oct 13, 2018 · 24 comments

@Cong222

Cong222 commented Oct 13, 2018

Epoch 1/60
97/97 [==============================] - 24s - loss: 1.0072 - mAP: 0.1649 - val_loss: 0.9624 - val_mAP: 0.1296
Epoch 2/60
97/97 [==============================] - 22s - loss: 1.0060 - mAP: 0.1959 - val_loss: 0.9647 - val_mAP: 0.0784
Epoch 3/60
97/97 [==============================] - 21s - loss: 1.0051 - mAP: 0.2268 - val_loss: 0.9851 - val_mAP: 0.1536
Epoch 4/60
97/97 [==============================] - 21s - loss: 1.0051 - mAP: 0.1650 - val_loss: 0.9519 - val_mAP: 0.1808
Epoch 5/60
97/97 [==============================] - 21s - loss: 1.0034 - mAP: 0.2474 - val_loss: 0.9696 - val_mAP: 0.3072
Epoch 6/60
97/97 [==============================] - 21s - loss: 1.0025 - mAP: 0.2577 - val_loss: 0.9895 - val_mAP: 0.3584
Epoch 7/60
97/97 [==============================] - 21s - loss: 1.0044 - mAP: 0.2990 - val_loss: 0.9717 - val_mAP: 0.5392
Epoch 8/60
97/97 [==============================] - 21s - loss: 1.0007 - mAP: 0.2784 - val_loss: 0.9902 - val_mAP: 0.4096

Hi, I found something wrong with the loss value.
The loss value barely changes during training.
The loss does change when I change the margin value: it stays approximately equal to the margin.

@omoindrot
Owner

This usually means that all the embeddings have collapsed on a single point.

One solution that might work is to lower your learning rate so that this collapse doesn't happen.
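For illustration, lowering the learning rate is just a matter of passing a smaller value to the optimizer. A minimal tf.keras sketch (not this repo's training code; model and triplet_loss are assumed to already exist):

from tensorflow.keras.optimizers import Adam

# Try e.g. 1e-4 or 1e-5 instead of the usual 1e-3 default.
optimizer = Adam(learning_rate=1e-4)
# model.compile(optimizer=optimizer, loss=triplet_loss)  # `model` and `triplet_loss` are assumptions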

@Cong222
Author

Cong222 commented Oct 13, 2018

Thanks, but I lowered my learning rate down to 1e-6 and the problem still exists. Maybe my learning rate needs to be even lower? My dataset is CIFAR-10 and the network is AlexNet.

@Cong222
Author

Cong222 commented Oct 13, 2018

I understand the problem now.
I use your loss function in Keras, but a Keras loss function needs to return a tensor of shape [batch_size, 1],
whereas your function returns a scalar tensor.
That is where the problem comes from.
Do you have any suggestions for handling this?

@omoindrot
Owner

You could just duplicate the loss so that it has the right shape:

loss = ...  # scalar triplet loss
loss = tf.ones([batch_size, 1]) * loss  # broadcast the scalar to shape [batch_size, 1]
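Spelled out as a Keras-style loss function, the suggestion would look roughly like the sketch below. This is illustrative only: batch_hard_triplet_loss is the function from this repository, while the wrapper name and margin value are made up for the example.

import tensorflow as tf

def keras_triplet_loss(y_true, y_pred, margin=0.5):
    # y_pred: embeddings of shape (batch_size, embed_dim); y_true: labels of shape (batch_size, 1)
    labels = tf.reshape(y_true, [-1])                        # flatten labels to shape (batch_size,)
    # batch_hard_triplet_loss is defined in this repository (import it from wherever you keep triplet_loss.py)
    loss = batch_hard_triplet_loss(labels, y_pred, margin)   # scalar loss
    batch_size = tf.shape(y_pred)[0]
    return tf.ones([batch_size, 1]) * loss                   # duplicate the scalar per sample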

@Cong222
Author

Cong222 commented Oct 14, 2018

No, I have tested it.
That approach doesn't fix it.
But thanks.

@qingchenwuhou

@Cong222 Hi, how did you solve the problem that triplet_loss returns a scalar tensor, which is inconsistent with what a Keras loss expects?

@virgile-blg

virgile-blg commented Jan 10, 2019

Hello, I think the problem mainly comes from the fact that in Keras, any custom loss should be designed this way:

"The function should takes the following two arguments:
y_true: True labels. TensorFlow/Theano tensor.
y_pred: Predictions. TensorFlow/Theano tensor of the same shape as y_true. "

Having y_pred and y_true with the same shape is in practice impossible for embedding learning tasks, but maybe there is a workaround for it...

@vijayanand-Git

@Cong222 did you find a way to use the triplet loss in Keras? I have the same issue with the loss value.

@Cong222
Author

Cong222 commented Jan 13, 2019

I couldn't find a way.
I gave up.

@ChristieLin

@vijayanand-Git

I encountered the same problem: the loss gets stuck at the margin value.
Then I tuned parameters including the learning rate, batch size, and even data normalization, and the loss finally converged.
I also modified the batch_hard_triplet_loss function as follows:
[screenshot of the modified batch_hard_triplet_loss function]

You can give it a try.

@swpucl

swpucl commented Jan 30, 2019

@Cong222 I met the same problem, but after setting a lower learning rate the loss converged.
I'd like to ask you a question: how do you calculate mAP on CIFAR-10?

@vijayanand-Git

(quoting @ChristieLin's suggestion above)

Thank you @ChristieLin. Changing the learning rate worked for me.

@xiaomingdaren123

(quoting the training log and description from the original post above)

I also have the same issue: the loss value is approximately equal to the margin. I found that the distances are close to 0, and I don't know what causes this. Does the output of the network need to be L2-normalized, and what is the role of L2 normalization?
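For reference, L2-normalizing the embeddings means rescaling each embedding vector to unit length, which bounds pairwise distances and often stabilizes triplet-loss training. A generic TensorFlow sketch (not code from this repo):

import tensorflow as tf

embeddings = tf.random.normal([4, 64])                 # stand-in for the network output
embeddings = tf.math.l2_normalize(embeddings, axis=1)  # each row now has unit L2 norm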

@Cong222
Author

Cong222 commented Mar 2, 2019

#18 (comment)
Hey, Google how to calculate mAP and you'll find it.
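For what it's worth, one common way to compute retrieval mAP from embeddings is to treat every sample as a query and count samples with the same label as relevant. A rough sketch (not necessarily how anyone in this thread computed it):

import numpy as np
from sklearn.metrics import average_precision_score

def retrieval_map(embeddings, labels):
    # embeddings: (n, d) array, labels: (n,) integer array
    dists = np.linalg.norm(embeddings[:, None, :] - embeddings[None, :, :], axis=-1)
    aps = []
    for i in range(len(labels)):
        mask = np.arange(len(labels)) != i                  # drop the query itself
        relevant = (labels[mask] == labels[i]).astype(int)  # same label => relevant
        if relevant.sum() == 0:
            continue
        aps.append(average_precision_score(relevant, -dists[i][mask]))
    return float(np.mean(aps))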

@parthnatekar

@Cong222 @ChristieLin Can you elaborate on how you used this loss function in Keras given the incompatible y_true and y_pred shapes?

@TuanAnhNguyen14111998

TuanAnhNguyen14111998 commented Aug 21, 2019

Hello, I have a similar problem. I use transfer learning on VGGFace with Keras, combined with triplet loss, and val_loss does not change: it always ends up at 0.500. Because the training data is too large, I store it in an ".h5" file and read each batch from that file during training. I then create a data generator that returns batch_x and batch_y and train the model with model.fit_generator; however, val_loss never moves from 0.500. My learning rate is 0.001.
I followed @omoindrot's and @ChristieLin's instructions, but it still doesn't work in my case. Do you have any ideas to solve this problem?
Should I change the learning rate, and if so, how should I change it? Thank you!

@aknakshay

I am facing a similar problem with my model: the training loss is stuck at the margin even with a very low learning rate. Is there any solution yet?

@ma1112

ma1112 commented Oct 28, 2019

(quoting @ChristieLin's answer above)

As @vijayanand-Git pointed out, the loss function introduced in this repository is not to be applied as-is in a Keras environment. A small enhancement is needed; in the answer above it is adding the line labels = tf.squeeze(y_true, axis=-1).

In Keras, the default shape for y_true is (batch_size, 1), while omoindrot's code is intended to be used with labels of shape (batch_size,). The difference may seem minimal; however, TensorFlow (and NumPy) functions behave very differently on these two shapes. So one should flatten the y_true tensor before applying the triplet loss function defined here.
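To see why the extra dimension matters, consider the pairwise "same label" comparison used for triplet mining. A small NumPy illustration (made-up labels, not code from the repo):

import numpy as np

labels_flat = np.array([0, 1, 0])        # shape (3,), what the repo's code expects
labels_col = labels_flat.reshape(3, 1)   # shape (3, 1), Keras-style y_true

# Pairwise equality mask, as built when mining positive/negative pairs:
print(np.equal(labels_flat[None, :], labels_flat[:, None]).shape)  # (3, 3)    -> correct mask
print(np.equal(labels_col[None, :], labels_col[:, None]).shape)    # (3, 3, 1) -> extra axis silently breaks the masking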

To elaborate a bit more on the expected shapes of the y_pred and y_true tensors in Keras, and how a loss function like this can work in Keras: I believe that the purpose of the loss function is to come up with a number (the loss) in such a way that the loss can later be backpropagated through the network. To my understanding, the y_pred tensor does not have to have the same shape as y_true, as long as the defined loss function is able to calculate the loss from these two tensors, whatever their shapes. It is true that many conventional loss functions expect the two shapes to match, but I don't see why one could not define a loss function that expects these two tensors to have different shapes.

For those still looking for a working example in Keras, I created a notebook that shows how omoindrot's triplet loss function can be used with Keras. Check it out here: https://github.com/ma1112/keras-triplet-loss

@shanmukh05

Adding labels = tf.squeeze(y_true, axis=-1) worked for me. Thanks @ma1112 for the detailed explanation.

@JJKK1313

But there are no labels in triplet loss; there are only the embeddings and the margin.
Which value did you choose for y_true then?

@ma1112

ma1112 commented Aug 14, 2021

But there are no labels in triplet loss; there are only the embeddings and the margin.
Which value did you choose for y_true then?

When using triplet loss, labels help the algorithm determine which pairs are positive and which pairs are negative, by inspecting whether the labels for two training examples are the same or not.

Two training examples with the same label are considered a positive pair and will have their embeddings close together in the embedding space.
Two training examples with different labels are considered a negative pair and will have their embeddings far away.

So the only important property of the labels is that they are the same for every example from a given class and different for examples from different classes. Keeping that in mind, you can use any numeric values as labels.

Particularly, if your dataset has N different classes, you can use label 1 for examples belonging to the first class, 2 for examples belonging to the second class, ..., N for examples belonging to the N-th class.
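For example, mapping arbitrary class names to consecutive integer labels (purely illustrative):

import numpy as np

y = np.array(["dog", "cat", "dog", "ship"])                   # hypothetical class names
label_of = {name: idx for idx, name in enumerate(sorted(set(y)))}
labels = np.array([label_of[name] for name in y])             # e.g. [1, 0, 1, 2]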

@JJKK1313

JJKK1313 commented Aug 14, 2021

@ma1112 thanks for the explanation, but if I understand you correctly, your samples are combined as pairs rather than triplets?
My samples are built from 3 images each: (anchor, positive, negative), all three in one sample. Is that incorrect, or less preferred for some reason? I'm asking because I'm trying to improve my failing model.

@ma1112

ma1112 commented Aug 22, 2021

@JJKK1313 Sorry for the confusing answer, let me elaborate further.

If you wish to use the triplet loss implementation found in this repo, your samples should be individual samples, just as if you trained a network without triplet loss. For example, with the MNIST dataset, which has 60k grayscale images of handwritten digits of size 28x28, you can use that dataset as-is to train a network with the triplet loss algorithm, so your input tensor has shape 60k x 28 x 28 x 1. (Note that you should keep labels as integers from 0 to 9 when working with triplet loss, whereas if you used softmax activation + cross-entropy loss, you would one-hot encode the labels.) A minimal data-preparation sketch follows below.
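Here is that data preparation sketched with tf.keras (the model and the triplet-loss wrapper are omitted; only the shapes and integer labels described above are shown):

import numpy as np
from tensorflow.keras.datasets import mnist

(x_train, y_train), _ = mnist.load_data()
x_train = x_train[..., np.newaxis].astype("float32") / 255.0  # (60000, 28, 28, 1)
y_train = y_train.astype("int32")                             # (60000,) integer labels 0-9, not one-hot
print(x_train.shape, y_train.shape)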

That is because the triplet loss implementation found in this repo implements online triplet mining, and picks the best triplets from a batch of images during the time the model is being trained. As triplets are created on-the-fly, the algorithm needs to know whether for a given anchor another sample is negative or positive. Hence you need to have labels for online triplet mining.

And you are quite right: if you used a model with offline triplet mining, i.e. if you fed the network triplets of samples during training, then you would not need to pass labels to the network. However, in that case you could not use the triplet loss function in this repo, and your model would probably be worse than one with online triplet mining.

@JJKK1313

JJKK1313 commented Aug 23, 2021

Ohhhhhh nnooowww I got it! Thank you very much for the explanation @ma1112!!
