Training error - huge difference (encog 3.1.0) #55

Closed
ghost opened this issue Feb 23, 2014 · 4 comments

ghost commented Feb 23, 2014

I used the common XOR neural-network sample (training method ResilientPropagation, training until Error < 0.001) and got a huge error after training (ideal 0, actual value 0,989125420071542); see the output:

Epoch #1 Error:0,403222760807917
Epoch #2 Error:0,326979855722731
...
Epoch #42 Error:0,00152763617214056
Epoch #43 Error:0,000498892283437333
Neural Network Results:
0,0, actual=0,00861768412365147,ideal=0
1,0, actual=0,982667334534116,ideal=1
0,1, actual=0,998007704200434,ideal=1
1,1, actual=0,989125420071542,ideal=0 (this looks like the error)

Part of the source code (I used encog-dotnet-core-3.1.0):

    public static double[][] XOR_INPUT = {
        new double[2] { 0.0, 0.0 },
        new double[2] { 1.0, 0.0 },
        new double[2] { 0.0, 1.0 },
        new double[2] { 1.0, 1.0 } };

    public static double[][] XOR_IDEAL = {
        new double[1] { 0.0 },
        new double[1] { 1.0 },
        new double[1] { 1.0 },
        new double[1] { 0.0 } };

    // Build a 2-10-1 network with sigmoid activations and bias neurons.
    BasicNetwork network = new BasicNetwork();
    network.AddLayer(new BasicLayer(new ActivationSigmoid(), true, 2));
    network.AddLayer(new BasicLayer(new ActivationSigmoid(), true, 10));
    network.AddLayer(new BasicLayer(new ActivationSigmoid(), true, 1));
    network.Structure.FinalizeStructure();
    network.Reset();

    IMLDataSet trainingSet = new BasicMLDataSet(XOR_INPUT, XOR_IDEAL);

    IMLTrain train = new ResilientPropagation(network, trainingSet);

    // Train until the reported error drops below 0.001 (or 10000 epochs pass).
    int epoch = 1;
    do
    {
        train.Iteration();
        Console.WriteLine("Epoch #" + epoch + " Error:" + train.Error);
        epoch++;
    } while ((epoch < 10000) && (train.Error > 0.001));

    Console.WriteLine("Neural Network Results:");
    bool hugeError = false; // was undeclared in the original snippet
    foreach (IMLDataPair pair in trainingSet)
    {
        IMLData output = network.Compute(pair.Input);

        Console.WriteLine(pair.Input[0] + "," + pair.Input[1]
            + ", actual=" + output[0] + ",ideal=" + pair.Ideal[0]);
        if (Math.Abs(pair.Ideal[0] - output[0]) > 0.2)
        {
            Console.WriteLine("Huge error");
            hugeError = true;
        }
    }

ghost commented Feb 23, 2014

BTW: It is not hard to reproduce the same error. It is enough to run the sample code about 100 times and you will hit the same problem (a small repro loop is sketched below).
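
A minimal sketch of such a repro loop; note that TrainXor() is a hypothetical helper (not part of the original sample) that wraps the code above and returns true when any output misses its ideal value by more than 0.2:

    // Sketch of a repro loop: run the XOR sample repeatedly and count
    // the runs that end with a huge error despite train.Error < 0.001.
    int badRuns = 0;
    for (int run = 0; run < 100; run++)
    {
        if (TrainXor()) // hypothetical helper wrapping the sample above
        {
            badRuns++;
        }
    }
    Console.WriteLine("Runs with huge error: " + badRuns + " / 100");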

ghost commented Feb 26, 2014

I think I know where the problem is. I used this (basic) code:
...
do
{
    train.Iteration();
    Console.WriteLine("Epoch #" + epoch + " Error:" + Format.FormatPercent(train.Error));
    Console.WriteLine("Evaluated error: " + Format.FormatPercent(network.CalculateError(trainingSet)));
    epoch++;
} while (train.Error > 0.001);
...

and I got these outputs:
...
Epoch #35 Error:0,125270%
Evaluated error: 0,058253%
Epoch #36 Error:0,058253%
Evaluated error: 22,434797%
Final evaluated error: 22,434797%
Neural Network Results:
0,0, actual=0,023279943378334,ideal=0
1,0, actual=0,351834582261846,ideal=1
0,1, actual=0,309542538954124,ideal=1
1,1, actual=7,84406348522279E-05,ideal=0

The problem is that in some situations train.Error and network.CalculateError can differ hugely, as in my sample: 0,058253% (train.Error) vs 22,434797% (CalculateError).

I didn't see the huge difference when I used a loop with "while (network.CalculateError(trainingSet) > 0.001);". Training takes more time, but the result is correct in all situations (it would be nice to have a final solution rather than this work-around; see the sketch below).
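
A minimal sketch of that work-around loop, assuming the same network, trainingSet, and train objects as in the first comment (the 10000-epoch cap is an added safety limit, not part of my quoted snippet):

    // Work-around: stop on the post-iteration evaluated error
    // (network.CalculateError) rather than the pre-iteration train.Error.
    int epoch = 1;
    double evaluatedError;
    do
    {
        train.Iteration();
        evaluatedError = network.CalculateError(trainingSet);
        Console.WriteLine("Epoch #" + epoch
            + " Error:" + Format.FormatPercent(train.Error)
            + " Evaluated: " + Format.FormatPercent(evaluatedError));
        epoch++;
    } while ((epoch < 10000) && (evaluatedError > 0.001));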

BTW: This problem also exists in the Java version; I tested both C# and Java.

jeffheaton (Owner) commented

This is the way the training code is designed. train.Error is the error at the beginning of a training iteration (before the weights are updated), whereas CalculateError is the error AFTER an iteration. They will always move in lockstep like you see there. Your results above follow this: epoch 35's evaluated error becomes the regular error for epoch 36, and the same holds from epoch 36 to the final result.

More info here:

http://www.jeffheaton.com/2014/03/when-is-a-models-training-error-calculated/

Also, sometimes, the random weights will produce a network that cannot be trained for XOR. If it takes 100 or so runs to see a large difference, you might be seeing that case.
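
One way to handle that case (a sketch, not taken from the Encog samples; maxRestarts and the epoch cap are illustrative values) is to reset the weights and retrain whenever the evaluated error stays high:

    // Sketch: recover from a bad random initialization by re-randomizing
    // the weights and retraining from scratch.
    int maxRestarts = 5;
    for (int attempt = 0; attempt < maxRestarts; attempt++)
    {
        network.Reset(); // re-randomize the weights
        IMLTrain train = new ResilientPropagation(network, trainingSet);
        int epoch = 0;
        do
        {
            train.Iteration();
            epoch++;
        } while ((epoch < 1000) && (network.CalculateError(trainingSet) > 0.001));

        if (network.CalculateError(trainingSet) <= 0.001)
        {
            break; // converged; stop retrying
        }
    }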

@jeffheaton jeffheaton added this to the Encog v3.2 milestone Mar 26, 2014

XeonMan commented Jan 14, 2016

Why did Epoch #36 jump from train.Error 0,058253% to a whopping evaluated error of 22,434797%, whereas Epoch #35 decreased from train.Error 0,125270% to an evaluated error of 0,058253%, which is more or less what you'd expect?
