
Chapter 09, encoder-decoder Data Preparation: test_points not used for test set #25

Closed
gsgxnet opened this issue Feb 1, 2022 · 3 comments


gsgxnet commented Feb 1, 2022

In the chunk for the generation of the test set (Data Generation — Test), full_test is derived from the points data structure, which is used for training, not from test_points.

test_points, test_directions = generate_sequences(seed=19)
full_test = torch.as_tensor(points).float()
source_test = full_test[:, :2]
target_test = full_test[:, 2:]

I do not think that is intended, so a simple correction is possible:

full_test = torch.as_tensor(test_points).float()
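A quick way to see the problem (just a sketch, assuming points, test_points and torch are already in scope, and that the training tensor is built the same way as in the training chunk, here called full_train):

# sketch: with the original line, the "test" tensor is just the training data again
full_train = torch.as_tensor(points).float()             # training data, as in the training chunk
full_test_original = torch.as_tensor(points).float()     # current line in the book
full_test_fixed = torch.as_tensor(test_points).float()   # corrected line
print(torch.equal(full_train, full_test_original))   # True  -> no independent test set
print(torch.equal(full_train, full_test_fixed))      # expected: False (different seed)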

Based on that change we get different performance figures.
Loss:
[figure: plot_loss]

and another prediction figure:

[figure: seq_pred]

with 8 of the 10 sequences showing "clashing" points.
If my results are right, this text chunk needs some adaptation as well:

The results are, at the same time, very good and very bad. In half of the sequences,
the predicted coordinates are quite close to the actual ones. But, in the other half,
the predicted coordinates are overlapping with each other and close to the
midpoint between the actual coordinates.

For whatever reason, the model learned
to make good predictions whenever the first corner is on the right edge of the
square, but really bad ones otherwise.

See the sequence pictures above; these statements need to be adapted, especially the second one.

The same issue can be found in the final "Putting It All Together" section:

# Validation/Test Set
test_points, test_directions = generate_sequences(seed=19)
full_test = torch.as_tensor(points).float()
source_test = full_test[:, :2]
target_test = full_test[:, 2:]
test_data = TensorDataset(source_test, target_test)
test_loader = DataLoader(test_data, batch_size=16)
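With the same one-line fix applied, that chunk would read:

# Validation/Test Set
test_points, test_directions = generate_sequences(seed=19)
full_test = torch.as_tensor(test_points).float()  # <- built from test_points now
source_test = full_test[:, :2]
target_test = full_test[:, 2:]
test_data = TensorDataset(source_test, target_test)
test_loader = DataLoader(test_data, batch_size=16)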

All of this is based on your 1.1 revision, assuming I did not make any mistakes when updating via git pull.


gsgxnet commented Feb 2, 2022

The change needed for the full_test data also leads to significantly different results in the

Encoder + Decoder + PE

section.
I now see this loss graph (when running the code with the independent test_points data):

[figure: Encoder + Decoder + PE loss]

It is quite different from the original (where the test set is filled with the same points as the training set):

[figure: original loss graph]

And now, in the updated test setup, we have one sequence with clashing points:

[figure: predicted sequences, updated test set]

Please compare to the sequences based on the original code:

[figure: predicted sequences, original code]

Doubling the size of the training set from 128 to 256 sequences gives results closer to the expectation:

points, directions = generate_sequences(n=256, seed=13)

(this has to be changed in several places; see the sketch at the end of this comment)

[figure: loss with the larger training set]

And the selected 10 validation sequences look good (depending on the seed; this run used 13):

[figure: validation sequences]
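For reference, here is a sketch of all the data-generation changes mentioned above in one place (the *_train names and the shuffle flag are my assumptions about the chapter's setup, not a quote from the book):

# sketch: training set doubled to n=256, test set built from its own points
# assumes generate_sequences, torch, TensorDataset and DataLoader are in scope
points, directions = generate_sequences(n=256, seed=13)
full_train = torch.as_tensor(points).float()
source_train = full_train[:, :2]
target_train = full_train[:, 2:]

test_points, test_directions = generate_sequences(seed=19)
full_test = torch.as_tensor(test_points).float()
source_test = full_test[:, :2]
target_test = full_test[:, 2:]

train_data = TensorDataset(source_train, target_train)
test_data = TensorDataset(source_test, target_test)
train_loader = DataLoader(train_data, batch_size=16, shuffle=True)
test_loader = DataLoader(test_data, batch_size=16)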


dvgodoy commented Feb 11, 2022

Thank you so much for pointing this out!

You're absolutely correct - it should be:
full_test = torch.as_tensor(test_points).float()

I will fix this and update the text to reflect the changes.

Thanks for supporting my work and helping to improve it :-)

Best,
Daniel


dvgodoy commented Feb 13, 2022

Hi @gsgxnet,

I've updated code, figures, and text in the book, and published the revised edition (v1.1.1) today :-)

Once again, thanks for pointing this out.

Best,
Daniel

dvgodoy closed this as completed Jun 17, 2022