
Use only 1 hasher #2

Open
guillaume-chevalier opened this issue Jan 14, 2019 · 5 comments

@guillaume-chevalier
Owner

Instead of using a FeatureUnion over T=80 random hashers of d=14 dimensions each (80×14 = 1120 word features), use only one hasher of 1120 dimensions (1×1120), which results in a dramatic speedup.

You can see the fix here: https://github.com/guillaume-chevalier/NLP-TP3
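The before/after can be sketched as follows. This is an illustrative sketch assuming scikit-learn's `FeatureUnion` and `HashingVectorizer`; the hashers in the actual repo may differ (e.g. in how they are randomized), and the names here are hypothetical:

```python
from sklearn.pipeline import FeatureUnion
from sklearn.feature_extraction.text import HashingVectorizer

docs = ["book a flight to boston", "what is the cheapest fare"]

# Before: a FeatureUnion of 80 small hashers, 14 dimensions each
# (80 * 14 = 1120 features total).
union = FeatureUnion(
    [(f"hasher_{i}", HashingVectorizer(n_features=14)) for i in range(80)]
)
X_union = union.fit_transform(docs)  # shape: (2, 1120)

# After: a single hasher producing all 1120 features in one pass.
single = HashingVectorizer(n_features=1120)
X_single = single.fit_transform(docs)  # shape: (2, 1120)

assert X_union.shape == X_single.shape == (2, 1120)
```

Note that the two matrices have the same shape but not the same values: one 1120-bucket hash is not the concatenation of 80 independent 14-bucket hashes, though both serve as random projections of the token space.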

@guillaume-chevalier
Owner Author

(Note: the linked repo isn't public yet; it will become public soon.)

@guillaume-chevalier
Owner Author

I profiled the execution time of the pipeline. The speedup from using only 1 hasher rather than 80 is substantial.

Also, oddly, running the hashers on multiple threads performed worse than using a single thread. I tested on a 32-core computer with n_jobs=-1 on the FeatureUnion to run the hashers in parallel, and it was even slower than the serial version.
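For reference, the parallel configuration tried above looks roughly like this (a sketch, assuming scikit-learn's `FeatureUnion` with its `n_jobs` parameter; the per-hasher work is so cheap that joblib's task dispatch and result gathering can dominate, which would explain the slowdown):

```python
from sklearn.pipeline import FeatureUnion
from sklearn.feature_extraction.text import HashingVectorizer

docs = ["book a flight to boston"] * 100

hashers = [(f"h{i}", HashingVectorizer(n_features=14)) for i in range(80)]

serial = FeatureUnion(hashers, n_jobs=1)     # one worker
parallel = FeatureUnion(hashers, n_jobs=-1)  # all cores, via joblib

# Each hasher's transform takes microseconds, so the fixed per-task
# overhead of parallel dispatch can exceed the work being parallelized.
X_serial = serial.fit_transform(docs)
X_parallel = parallel.fit_transform(docs)
assert X_serial.shape == X_parallel.shape == (100, 1120)
```

This is the classic fine-grained-parallelism trap: with 80 tiny tasks, the overhead per task outweighs the savings, whereas a single 1120-dimension hasher does the same work in one cheap pass.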

@tnlin

tnlin commented Apr 16, 2019

Hi, have you replicated the model in their paper?
I found their experimental results too good to be true. In my experiments, I used BERT with context to achieve 81% accuracy, so I wonder how they achieved their score (SwDA acc = 83% without context?).
Besides, their EMNLP talk said ATIS has a "purchase" intent, but I've never seen any ATIS dataset with a "purchase" intent...

@guillaume-chevalier
Owner Author

@tnlin I did not implement the neural network layers on top of the projection layer. This repo is only the projection, and it also differs a little from the paper.

@glicerico

Hey @tnlin , did you manage to find the real performance of SGNN? I can only achieve 71% with their architecture.
