
WIP Frank wolfe #67

Merged
merged 29 commits into pystruct:master from frank_wolfe on Aug 15, 2013

Conversation

amueller
Member

This is the implementation of the Block-Coordinate Frank-Wolfe (BCFW) solver by Xianghang Liu :)

  • rescale C to be consistent with the rest of pystruct
  • increase test coverage
  • use batch_psi and batch_loss_augmented_inference
  • add objective curves and timing information
  • write documentation
  • rescale objective to be consistent with the rest of pystruct
  • benchmark
  • rename estimator to BCFW (should we? that is the name used in the paper)
  • allow stopping with ctrl+c
  • implement averaging step
  • implement averaging for batch version
  • make verbosity actually control verbosity
  • Allow mini-batches for parallel inference on multi-core machines.
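
For reference, here is a minimal sketch of the single-sample block-coordinate Frank-Wolfe update with line search, roughly following Lacoste-Julien et al. (2013). The methods psi, loss and loss_augmented_inference mirror the pystruct model API; the function itself and its bookkeeping arguments are purely illustrative, not the code in this PR:

import numpy as np

def bcfw_step(model, w, w_i, ell, ell_i, x, y, n_samples, lmbda):
    # One illustrative BCFW step on sample (x, y); w_i and ell_i are the
    # per-sample block of w and its loss term, kept between passes.
    y_hat = model.loss_augmented_inference(x, y, w)  # most violating labeling
    delta_psi = model.psi(x, y) - model.psi(x, y_hat)
    # Frank-Wolfe corner for this block in the paper's scaling.
    w_s = delta_psi / (lmbda * n_samples)
    ell_s = model.loss(y, y_hat) / n_samples
    # Optimal step size by line search, clipped to [0, 1].
    diff = w_i - w_s
    gamma = (lmbda * np.dot(diff, w) - ell_i + ell_s) / (lmbda * np.dot(diff, diff) + 1e-12)
    gamma = min(max(gamma, 0.0), 1.0)
    # Update block i and propagate the change to the global iterate.
    w_i_new = (1.0 - gamma) * w_i + gamma * w_s
    ell_i_new = (1.0 - gamma) * ell_i + gamma * ell_s
    return w + (w_i_new - w_i), w_i_new, ell + (ell_i_new - ell_i), ell_i_new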

@amueller
Member Author

Ok, so I observed that the duality gap goes negative, even with exact inference. Also, there is a jump in the objective at the beginning of each epoch.
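
For anyone following along: the duality gap from the paper is g = λ(w − w_s)ᵀw − ℓ + ℓ_s, where (w_s, ℓ_s) come from one pass of loss-augmented inference over all samples; with exact inference it should be non-negative, so a negative value points at a scaling or bookkeeping bug. A hedged sketch (illustrative helper, not the PR's code):

import numpy as np

def duality_gap(model, X, Y, w, ell, lmbda):
    # Illustrative: one pass of (exact) loss-augmented inference over all
    # samples gives the Frank-Wolfe corner (w_s, ell_s); the gap should be >= 0.
    n_samples = len(X)
    w_s = np.zeros_like(w)
    ell_s = 0.0
    for x, y in zip(X, Y):
        y_hat = model.loss_augmented_inference(x, y, w)
        w_s += (model.psi(x, y) - model.psi(x, y_hat)) / (lmbda * n_samples)
        ell_s += model.loss(y, y_hat) / n_samples
    return lmbda * np.dot(w - w_s, w) - ell + ell_s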

@amueller
Member Author

I disabled the stopping criterion and just tried without line search as that seemed to be simpler.

@amueller
Member Author

Ok, the jump in the objective is probably caused by not using the line search and by the online updates. Will try the batch version ^^
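
For context, the paper's predetermined step sizes when no line search is done are gamma = 2 / (k + 2) for the batch version and gamma = 2n / (k + 2n) for the block-coordinate version; a tiny hedged sketch (hypothetical helper):

def predefined_step_size(k, n_samples, block_coordinate=True):
    # Hedged sketch of the paper's default step sizes without line search;
    # k counts the updates done so far.
    if block_coordinate:
        return 2.0 * n_samples / (k + 2.0 * n_samples)
    return 2.0 / (k + 2.0)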

@amueller
Member Author

So with line search, the batch version, and my change, the dual is increasing monotonically... that is a good sign ^^

@amueller
Member Author

The objective is converging to a different value than the other methods, though. And it is not off by a factor of n_samples :-/

@amueller
Member Author

Ok, so the meaning of C is different and the objective is scaled differently. I'll try to fix that. But other than that, I think it looks very good!
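
For the record, the usual way to reconcile the two formulations: the paper minimizes λ/2 ||w||² + (1/n) Σᵢ Hᵢ(w), while pystruct's other learners use 1/2 ||w||² + C Σᵢ Hᵢ(w); the two agree up to a constant factor of C·n when λ = 1/(C·n). A hedged sketch of that mapping (an assumption about the intended rescaling, not necessarily what ends up in the PR):

def lambda_from_C(C, n_samples):
    # Assumed mapping: lambda = 1 / (C * n) makes the paper's objective a
    # rescaled version of pystruct's C-weighted objective.
    return 1.0 / (C * n_samples)

def rescale_objective(paper_objective, C, n_samples):
    # Multiply by C * n to compare against e.g. the OneSlackSSVM objective curve.
    return paper_objective * C * n_samples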

@amueller
Member Author

It also needs to use "batch_loss_augmented_inference" and "batch_psi" in the batch version.
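
Roughly, the per-sample loop in the batch pass would be replaced by the vectorized model methods, which individual models can override with faster (e.g. parallel) implementations; a hedged sketch (the helper name is illustrative):

import numpy as np

def batch_oracle(model, X, Y, w):
    # Illustrative: compute the summed psi difference and total loss needed for
    # one batch Frank-Wolfe step using the batch_* model methods.
    Y_hat = model.batch_loss_augmented_inference(X, Y, w)
    delta_psi = model.batch_psi(X, Y) - model.batch_psi(X, Y_hat)
    total_loss = np.sum(model.batch_loss(Y, Y_hat))
    return delta_psi, total_loss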

@amueller
Member Author

Something is still odd. The objective on the OCR data is around 230 for BCFW but 250 for the one-slack cutting plane (using AD3, which should be exact as this is a chain).
Does anyone have an idea why that happens?

@amueller
Member Author

ping @xianghang :)

            self._frank_wolfe_batch(X, Y)
        else:
            self._frank_wolfe_bc(X, Y)
    except KeyboardInterrupt:
Contributor

Is it fine if the try/except is outside of the inner loop?

Member Author

I think so, yes. It is the same for SubgradientSSVM. What do you think could go wrong?

Contributor

Just checking. If it can be done at such a high level, can't it be made into a @decorator or maybe put in a base class? Anything that could go wrong?


In pystruct/learners/frankwolfe_ssvm.py:

                    print("p = %d, dual: %f, dual_gap: %f, primal: %f, positive slack: %d"
                          % (p, dual_val, dual_gap, primal_val, n_pos_slack))
                if dual_gap < self.tol:
                    return

def fit(self, X, Y):
    self.model.initialize(X, Y)
    self.objective_curve_, self.primal_objective_curve_ = [], []
    self.timestamps_ = [time()]
    self.w = getattr(self, "w", np.zeros(self.model.size_psi))
    try:
        if self.batch_mode:
            self._frank_wolfe_batch(X, Y)
        else:
            self._frank_wolfe_bc(X, Y)
    except KeyboardInterrupt:

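For what it's worth, the decorator variant would look roughly like this (a hypothetical helper, not something that exists in pystruct):

import functools

def catch_keyboard_interrupt(fit_method):
    # Hypothetical decorator: let Ctrl+C stop training early while keeping the
    # partially trained estimator state instead of raising.
    @functools.wraps(fit_method)
    def wrapper(self, *args, **kwargs):
        try:
            return fit_method(self, *args, **kwargs)
        except KeyboardInterrupt:
            return self
    return wrapper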

Member Author

Some classes have a finalization step in fit. I think currently it is mostly computing the objective once again. I didn't do that here, as it probably didn't move much since the last call to the duality-gap computation. In the OneSlackSSVM, on the other hand, the last inference might be from the cache and won't tell you much about the actual objective. It would probably be worthwhile to put the innermost loop in a _fit function and wrap the try around that, just to make the code easier to read.
As for decorators: I am not sure that having a single decorator instead of three lines of code is better, as the code is pretty clear and the decorator would not be very obvious to someone not familiar with the codebase.
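
The structure suggested above would look roughly like the following skeleton (a hypothetical class with elided method bodies, not the PR's actual FrankWolfeSSVM):

import numpy as np

class _FrankWolfeLikeLearner:
    def __init__(self, model, batch_mode=False):
        self.model = model
        self.batch_mode = batch_mode

    def fit(self, X, Y):
        # Setup plus the three-line Ctrl+C wrapper; the actual optimization
        # loop lives in _fit, which keeps fit short and easy to read.
        self.model.initialize(X, Y)
        self.w = getattr(self, "w", np.zeros(self.model.size_psi))
        try:
            self._fit(X, Y)
        except KeyboardInterrupt:
            pass  # stop early, keep the current w
        return self

    def _fit(self, X, Y):
        if self.batch_mode:
            self._frank_wolfe_batch(X, Y)
        else:
            self._frank_wolfe_bc(X, Y)

    def _frank_wolfe_batch(self, X, Y):
        pass  # batch Frank-Wolfe iterations (elided)

    def _frank_wolfe_bc(self, X, Y):
        pass  # block-coordinate Frank-Wolfe iterations (elided)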

@amueller
Member Author

So, for the name... should we leave it as Frank-Wolfe? It also implements the batch version, so BCFW would be a bit misleading.
We could also call it "LJJSP" (after the authors), but that is really hard to remember and spell ^^. So maybe FrankWolfeSSVM is fine?

@amueller amueller merged commit aafcfbc into pystruct:master Aug 15, 2013
@amueller
Member Author

Ok, I merged this guy :)

@amueller amueller deleted the frank_wolfe branch August 15, 2013 14:20