
Incorrect prediction while testing after training network #7

Closed
ChrisZeThird opened this issue Jun 8, 2023 · 9 comments
Assignees: ChrisZeThird
Labels: bug (Something isn't working) · enhancement (New feature or request) · help wanted (Extra attention is needed)

Comments

@ChrisZeThird
Member

ChrisZeThird commented Jun 8, 2023

Currently the network doesn't give a correct output when running test_network.py: the output is always 0.4202.

  • The diagrams are not all identical
  • The lines are correctly calculated
  • The angles are correctly calculated

During training (simple_network.py), y_pred is always the same (line 89).

@ChrisZeThird
Member Author

ChrisZeThird commented Jun 8, 2023

At line 77, y_pred is constant within each iteration but changes between iterations.

@ChrisZeThird ChrisZeThird pinned this issue Jun 8, 2023
@ChrisZeThird ChrisZeThird added this to the Testing network milestone Jun 8, 2023
@ChrisZeThird
Member Author

ChrisZeThird commented Jun 8, 2023

The predicted angle tends to converge to 0.42... during training. The expected angle values and X_batch are all correct.

@ChrisZeThird
Member Author

The network might be learning the training data by heart. Possible solutions:

  • Resample the data (a rough sketch of what this could look like follows this list)
  • Use a smaller network
  • Use more data
  • Train on synthetic data and test on experimental data
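
Roughly, the resampling could look like this (file names and variable names below are placeholders, not the actual code in simple_network.py):

```python
import numpy as np
import torch
from torch.utils.data import DataLoader, TensorDataset, WeightedRandomSampler

# Placeholder arrays: stability diagrams and their normalized line angles.
diagrams = np.load("diagrams.npy")  # shape (N, H, W), assumed
angles = np.load("angles.npy")      # shape (N,), assumed

# Bin the continuous targets, then weight each sample by the inverse
# frequency of its bin so rare angles are drawn as often as common ones.
n_bins = 20
edges = np.linspace(angles.min(), angles.max(), n_bins)
bin_idx = np.digitize(angles, edges)
bin_counts = np.bincount(bin_idx, minlength=n_bins + 1)
weights = 1.0 / bin_counts[bin_idx]

sampler = WeightedRandomSampler(weights=torch.as_tensor(weights, dtype=torch.double),
                                num_samples=len(angles), replacement=True)

dataset = TensorDataset(torch.as_tensor(diagrams, dtype=torch.float32),
                        torch.as_tensor(angles, dtype=torch.float32))
loader = DataLoader(dataset, batch_size=32, sampler=sampler)
```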

@ChrisZeThird
Member Author

The data are indeed statistically imbalanced: most angles fall between 0.41 and 0.49, with a peak at approximately 0.47.


@ChrisZeThird ChrisZeThird added the enhancement New feature or request label Jun 9, 2023
@ChrisZeThird
Member Author

The problem may come from the normalization. Instead of dividing the angle by 2*pi, it could be divided by pi alone, since the angles are all between 0 and 180°. This would increase the spacing between the normalized angle values.
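
For clarity, the change would look like this (assuming the angles are stored in radians; the variable names are placeholders):

```python
import numpy as np

# Angles assumed to be in radians, all within [0, pi] (0 to 180 degrees).
angles_rad = np.array([0.3, 1.2, 2.8])

# Current normalization (assumed): dividing by 2*pi squeezes every target
# into [0, 0.5], leaving little spacing between them.
y_old = angles_rad / (2 * np.pi)

# Proposed normalization: dividing by pi spreads the same angles over the
# full [0, 1] range, doubling the spacing between targets.
y_new = angles_rad / np.pi
```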

@ChrisZeThird
Member Author

Changing normalization

Still the exact same issue, and the MSE is now very high.

@ChrisZeThird ChrisZeThird self-assigned this Jun 9, 2023
@ChrisZeThird ChrisZeThird added bug Something isn't working help wanted Extra attention is needed labels Jun 9, 2023
@ChrisZeThird
Member Author

New direction

After talking with @victor-yon, the right direction is to consider a CNN instead of a simple feed-forward network. A CNN should recognize the angle better among the noise of the stability diagram. The current network can't find the line, and therefore tries to minimize the error by returning a value close to the expected values. Here is the workflow for the coming week (it should be done in 2 days max); a minimal CNN sketch follows the notes below:

  • Implement a CNN and train it on synthetic data to validate the theory
  • Apply the trained network to experimental data
  • Repeat step 1 with experimental data for training

Additional note

  • Check which CNN is used in Victor's code
  • Use the derivative of the diagram to be closer to the synthetic data
  • Create folders in .\saved\model to store the CNN and FF models separately
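
To make the plan concrete, here is a rough sketch of what such a CNN could look like (this is not Victor's actual architecture; the patch size, channel counts, and layer sizes are placeholders):

```python
import torch
import torch.nn as nn

class AngleCNN(nn.Module):
    """Minimal CNN regressing a single normalized line angle from a
    stability diagram patch (or its derivative)."""

    def __init__(self, patch_size: int = 18):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 8, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(8, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        reduced = patch_size // 4  # two 2x2 poolings halve each dimension twice
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * reduced * reduced, 32), nn.ReLU(),
            nn.Linear(32, 1),  # one output: the normalized angle
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, 1, patch_size, patch_size)
        return self.head(self.features(x)).squeeze(1)
```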

@ChrisZeThird
Member Author

Accuracy paradox

There is a good chance the data imbalance is causing the issue. However, when testing with synthetic data, the standard deviation doesn't drop. The problem most likely comes from the network not being able to identify features in noisy/non-binary data, but exploring the data imbalance a bit more could also help.

This link explains the concept of the accuracy paradox, as does this post.
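
As a quick illustration of why a near-constant output can still score a low error on imbalanced targets, here is a toy baseline that always predicts the mean (the numbers below are made up to mimic the histogram above, not the real dataset):

```python
import numpy as np

# Toy targets mimicking the observed imbalance: most normalized angles
# cluster around 0.47, a minority are spread over the full range.
rng = np.random.default_rng(0)
y_true = np.concatenate([
    rng.normal(0.47, 0.02, size=900),
    rng.uniform(0.0, 1.0, size=100),
])

# A "model" that always predicts the mean already gets a small MSE,
# which is how a constant output near 0.42-0.47 can look acceptable.
constant_pred = np.full_like(y_true, y_true.mean())
print("baseline MSE:", np.mean((y_true - constant_pred) ** 2))
```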

@ChrisZeThird
Member Author

A new loss function

After computing the MSE (mean squared error) and the MAE (mean absolute error), another loss function was found: SmoothL1Loss. It uses a squared term when the absolute element-wise error falls below beta and an L1 term otherwise. It is less sensitive to outliers than torch.nn.MSELoss and in some cases prevents exploding gradients.
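
A minimal sketch of swapping the loss (the model, optimizer, and batch names below are placeholders, not the actual code in simple_network.py):

```python
import torch.nn as nn

# SmoothL1Loss: squared term when |error| < beta, L1 term otherwise.
# beta=1.0 is the PyTorch default; it is a hyper-parameter to tune.
criterion = nn.SmoothL1Loss(beta=1.0)

def training_step(model, optimizer, X_batch, y_batch):
    """One optimization step using SmoothL1Loss instead of MSELoss."""
    optimizer.zero_grad()
    y_pred = model(X_batch).squeeze()  # predicted normalized angles
    loss = criterion(y_pred, y_batch)  # compare to expected normalized angles
    loss.backward()
    optimizer.step()
    return loss.item()
```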

This loss function helped decrease the loss drastically and gave much more accurate results; by that I mean the network now predicts different values for different inputs. This issue can therefore be closed, as it's now only a matter of tweaking the hyper-parameters to decrease the standard deviation.

