
Adjust SPSA Logistic Regression test optimizer parameter #87

Merged
merged 3 commits into mlpack:master on Mar 5, 2019

Conversation


@zoq zoq commented Feb 28, 2019

Adjust SPSA Logistic Regression test optimizer parameter.

@favre49 favre49 mentioned this pull request Mar 3, 2019

rcurtin commented Mar 4, 2019

I found that this helps, but there are still occasional failures. I had better success running the test up to three times, like we do for some other tests, and requiring that it succeed at least once. Here's the code I used:

TEST_CASE("SPSALogisticRegressionTest", "[SPSATest]")
{
  bool success = false;
  // Training, test, and shuffled data, filled in by the test helper below.
  arma::mat data, testData, shuffledData;
  arma::Row<size_t> responses, testResponses, shuffledResponses;

  for (size_t trial = 0; trial < 3; ++trial)
  {
    LogisticRegressionTestData(data, testData, shuffledData,
        responses, testResponses, shuffledResponses);
    LogisticRegression<> lr(shuffledData, shuffledResponses, 0.5);

    SPSA optimizer(0.5, 1, 0.102, 0.16, 0.3, 1000, 1e-7);
    arma::mat coordinates = lr.GetInitialPoint();
    optimizer.Optimize(lr, coordinates);

    // Ensure that the error is close to zero.
    const double acc = lr.ComputeAccuracy(data, responses, coordinates);
    const double testAcc = lr.ComputeAccuracy(testData, testResponses,
        coordinates);
    if (acc == Approx(100.0).epsilon(0.003) &&
        testAcc == Approx(100.0).epsilon(0.006))
    {
      success = true;
      break;
    }
  }

  REQUIRE(success == true);
}
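For context (this is my reading of the SPSA header and of Catch2's Approx, so double-check me), the optimizer constructor arguments above should be:

    // SPSA(alpha, batchSize, gamma, stepSize, evaluationStepSize,
    //      maxIterations, tolerance) -- my reading of the header; verify.
    SPSA optimizer(0.5, 1, 0.102, 0.16, 0.3, 1000, 1e-7);

And Approx(100.0).epsilon(0.003) is a relative tolerance, so the first check passes when acc is within 0.3% of 100, i.e. roughly 99.7 or above (0.6% for the test accuracy).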

Note that I also found that each run of Optimize() took surprisingly long, so I reduced the number of iterations to only 1000. I think sometimes SPSA heads off in a bad direction and doesn't come back, so restarts are a reasonable solution.

However, I then dug into the SPSA implementation and found some other issues that I'm happy to fix but I want to get some confirmation on:

  1. SPSA is not meant for differentiable separable functions, despite what the documentation says. In fact it works on arbitrary functions, since it only ever calls Evaluate().
  2. SPSA isn't originally defined as something that works on separable functions, but the extension is straightforward: simply call Evaluate() on a small batch of points instead of all of them. Should we implement that extension? I'm okay with that if we can make it clear to users via, perhaps, two separate classes/typedefs like SPSA (full batch) and BatchSPSA (or some similar name).
  3. I'm confused by this block inside SPSA::Optimize():

    gradient.zeros();
    for (size_t b = 0; b < batchSize; b++)
    {
      // Stochastic directions.
      spVector = arma::conv_to<arma::mat>::from(
          arma::randi(iterate.n_rows, iterate.n_cols,
          arma::distr_param(0, 1))) * 2 - 1;

      iterate += ck * spVector;
      const double fPlus = function.Evaluate(iterate, 0, iterate.n_elem);

      iterate -= 2 * ck * spVector;
      const double fMinus = function.Evaluate(iterate, 0, iterate.n_elem);
      iterate += ck * spVector;

      gradient += (fPlus - fMinus) * (1 / (2 * ck * spVector));
    }

    gradient /= (double) batchSize;

That should be the approximation of the gradient. However, we're looping batchSize times to build it, and we're also calling Evaluate() very strangely, with a "batch size" of iterate.n_elem (the number of parameters, not the number of points). It seems to me that:

  • If we are doing full-batch SPSA (as defined earlier), we should compute the gradient as:

    spVector = arma::conv_to<arma::mat>::from(
        arma::randi(iterate.n_rows, iterate.n_cols,
        arma::distr_param(0, 1))) * 2 - 1;

    iterate += ck * spVector;
    const double fPlus = function.Evaluate(iterate);

    iterate -= 2 * ck * spVector;
    const double fMinus = function.Evaluate(iterate);
    iterate += ck * spVector;

    gradient = (fPlus - fMinus) * (1 / (2 * ck * spVector));

A key point there is that there is no loop over the batch size.

  • If we are doing small-batch SPSA, then we should call function.Evaluate(iterate, 0, batchSize) instead.
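For reference, Spall's simultaneous-perturbation estimator (if I have it right) computes each gradient component from exactly one pair of function evaluations, with no loop:

    g_i = (f(theta + ck * delta) - f(theta - ck * delta)) / (2 * ck * delta_i)

And since every entry of spVector is +/-1, each entry is its own inverse, so the Armadillo expression above could equivalently be written as:

    // Same as (fPlus - fMinus) * (1 / (2 * ck * spVector)): spVector's
    // entries are all +/-1, so element-wise 1 / spVector == spVector.
    gradient = (fPlus - fMinus) * spVector / (2 * ck);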

Let me know what you think. Like I said, I'm happy to make the changes, but I want to double-check that you agree first.


zoq commented Mar 4, 2019

Interesting, I tested the optimizer something like 500 times (using a new random seed each time) and saw only a single failure, but if you still see errors I'm fine with running the test multiple times; since you reduced the number of iterations, I guess it's still fast enough. About SPSA/BatchSPSA: I agree, switching to function.Evaluate(iterate) makes a lot more sense. I don't think we need to provide a batch variant at this point unless you see a use for one. Would you like to open a PR?

@rcurtin rcurtin left a comment

Right, I observed about the same thing: one failure every ~500-800 runs. But I generally shoot for ~1000 clean runs with no failures (maybe I'm too paranoid?). Mostly, I found that the change I proposed also made the test run a good bit faster. Anyway, I'm fine whichever way you want to go with this.


zoq commented Mar 4, 2019

We should definitely switch to function.Evaluate(iterate), and if it runs faster, I guess it makes sense to adjust the test as well. I can do this inside this PR or we can open a new one, but I don't think we should merge this one as it is. Let me know what you think.


rcurtin commented Mar 4, 2019

I'll work up a patch and get it to you shortly, and we can apply all the changes in this PR. 👍

rcurtin and others added 2 commits March 5, 2019 01:23
@rcurtin rcurtin left a comment

Ah, thanks, I forgot to remove the random seed. Feel free to merge whenever you're ready. 👍

@zoq zoq merged commit 64f15cf into mlpack:master Mar 5, 2019