Conversation
chewxy
left a comment
left you some things to think about
model/glove/embedding.go
Outdated
P []tensor.Tensor
Q []tensor.Tensor

gradP []tensor.Tensor
To think about: is it necessary to keep a copy of the tensors of the gradients?
Yes. GloVe uses AdaGrad as the optimizer instead of SGD. AdaGrad also uses all past iterations' gradients to decrease the learning rate automatically.
Here: https://en.wikipedia.org/wiki/Stochastic_gradient_descent#AdaGrad
Yeah, I'm familiar with AdaGrad. I'm thinking maybe this shouldn't be put into the Embedding struct. Instead, have something like a Solver struct.
Ah I see, that's right. I'll try it.
(It would also be good to be able to select other optimizers: AdaDelta, Adam, and so on.)
model/glove/glove.go
Outdated
func (g *GloVe) train(pind, qind int, f float64) (err error) {
	// SGD
	inner, _ := tensor.Inner(g.emb.P[pind], g.emb.Q[qind])
Bad habit! You should always check for errors.
Perhaps create a utility struct somewhere that looks like this:
type maybe struct {
	err error
}

func (m *maybe) Do(fn func(a, b tensor.Tensor) (tensor.Tensor, error), a, b tensor.Tensor) (retVal tensor.Tensor) {
	if m.err != nil {
		return nil
	}
	retVal, m.err = fn(a, b)
	return
}

func (m *maybe) Error() error { return m.err }

then you can do this:

m := new(maybe)
inner := m.Do(tensor.Inner, g.emb.P[pind], g.emb.Q[qind])
bias := m.Do(tensor.Add, g.emb.biasP[pind], g.emb.biasQ[qind])
// and so on and so forth
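The same error-threading pattern can be shown self-contained, using a plain float64 operation instead of tensor ops so it runs on its own (div here is a stand-in for any fallible binary op):

```go
package main

import (
	"errors"
	"fmt"
)

// maybe records the first error from a chain of fallible binary ops,
// so call sites stay free of repeated `if err != nil` checks.
type maybe struct{ err error }

func (m *maybe) do(fn func(a, b float64) (float64, error), a, b float64) float64 {
	if m.err != nil {
		return 0 // short-circuit: once an error is recorded, do nothing
	}
	var ret float64
	ret, m.err = fn(a, b)
	return ret
}

func (m *maybe) Error() error { return m.err }

// div is a toy fallible binary op.
func div(a, b float64) (float64, error) {
	if b == 0 {
		return 0, errors.New("division by zero")
	}
	return a / b, nil
}

func main() {
	m := new(maybe)
	x := m.do(div, 6, 3) // 2
	y := m.do(div, x, 0) // error recorded here
	z := m.do(div, y, 2) // skipped: error already set
	fmt.Println(z, m.Error())
}
```

Only one error check is needed at the end of the chain, via m.Error().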
model/type.go
Outdated
}

// OnesTensor creates a tensor with 1 in all elements.
func (t *Type) OnesTensor(shape ...int) tensor.Tensor {
should be called Ones... it already returns a Tensor, so no need to repeat
Force-pushed: be9fc29 to ad90f44, abae4b5 to fd4d134, 38e3416 to fb6a8a8, 66d14b4 to f6192c6
cost = 0.5 * fdiff * diff
fdiff *= a.initLearningRate

for i := 0; i < a.dimension; i++ {
Has any work been done to compare this with an FMA function?
I did a small experiment against about 30 million records:
with FMA in tensor: 1 min per iteration
without FMA (this): 30 sec per iteration
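For reference, the two variants being compared look roughly like this. math.FMA is the standard-library fused multiply-add (Go 1.14+); the timings above are the author's measurements against the tensor package's FMA, not reproduced by this sketch, and the function names are illustrative:

```go
package main

import (
	"fmt"
	"math"
)

// dotPlain is the hand-rolled multiply-add loop, as in the PR.
func dotPlain(p, q []float64) float64 {
	var sum float64
	for i := range p {
		sum += p[i] * q[i]
	}
	return sum
}

// dotFMA uses math.FMA, which computes p[i]*q[i]+sum with a single
// rounding; whether it compiles to a hardware FMA is platform-dependent.
func dotFMA(p, q []float64) float64 {
	var sum float64
	for i := range p {
		sum = math.FMA(p[i], q[i], sum)
	}
	return sum
}

func main() {
	p := []float64{1, 2, 3}
	q := []float64{4, 5, 6}
	fmt.Println(dotPlain(p, q), dotFMA(p, q)) // 32 32
}
```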
model/glove/cofreq.go
Outdated
// CofreqMap stores the co-frequency between word-word.
type CofreqMap map[Pair]float64

// Pair stores the co-frequency pair words.
To make this even faster, check this out: https://blog.chewxy.com/2017/07/12/21-bits-english/. I'll help you convert when I find the time.
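One way to apply the idea from that post: pack both word IDs into a single integer map key instead of hashing a struct Pair. This is only a sketch; the 32/32 bit split below is arbitrary (per the post, 21 bits per ID is enough for English vocabularies), and the helper names are hypothetical:

```go
package main

import "fmt"

// packPair packs two word IDs into one uint64 map key,
// replacing a map[Pair]float64 with a map[uint64]float64.
func packPair(l1, l2 uint32) uint64 {
	return uint64(l1)<<32 | uint64(l2)
}

// unpackPair recovers the two word IDs from a packed key.
func unpackPair(k uint64) (uint32, uint32) {
	return uint32(k >> 32), uint32(k)
}

func main() {
	cofreq := map[uint64]float64{}
	cofreq[packPair(7, 42)] += 1.5
	l1, l2 := unpackPair(packPair(7, 42))
	fmt.Println(l1, l2, cofreq[packPair(7, 42)]) // 7 42 1.5
}
```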
model/glove/glove.go
Outdated
)

// GloVe stores the configs for GloVe models.
type GloVe struct {
While GloVe is the proper name (Global Vectors), I think it's quite terrible to have random uppercase letters in the middle of a name that is not a word. Perhaps stick with Glove? Just an opinion.
No problem, I'll rename it. By the way, why is it terrible to have uppercase letters in the middle?
Overview
Solver is prepared: SGD and AdaGrad.
Future Works