Add support for a convnet/pooling model #17

wpm · 2017-07-15T14:55:25Z

Another kind of neural model for sequences.

wpm · 2017-07-20T16:56:08Z

See

Kim, Y. Convolutional neural networks for sentence classification. In EMNLP, 2014.
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., and Kuksa, P. Natural language processing (almost) from scratch. In JMLR, 2011.

Section 3.1 of Kim's paper describes his hyperparameters. For regularization he uses dropout together with a constraint on the l₂ norms of the weight vectors.

wpm · 2017-07-26T21:10:17Z

From the Keras Slack Channel

Me

I'm trying to repro an experiment in which someone used a 1D convnet/max-pooling strategy over the words in a sentence (Yoon Kim, 2014, "Convolutional Neural Networks for Sentence Classification"). In that experiment the convolution filters represent sliding windows over consecutive tokens. They ran a model with window sizes 3, 4, and 5.

How would I build an equivalent model with multiple window sizes in Keras? Do I have my input feed into three different Conv1D layers (or pairs of Conv1D and MaxPooling1D layers) with different kernel_size values and then concatenate the results into a single vector?

dref306

yup
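
For reference, the approach confirmed above might be sketched as follows. This is a minimal sketch, not the code from this repository: the function name `build_multi_window_cnn` and all of its parameter names are hypothetical, the default sizes are borrowed from the model summary later in this thread, and the pooling is assumed to be global max-over-time pooling as in Kim's paper rather than `MaxPooling1D` with a fixed pool factor.

```python
from tensorflow.keras import Model, layers


def build_multi_window_cnn(sequence_length=22, vocab_size=20000,
                           embedding_dim=300, window_sizes=(3, 4, 5),
                           filters=100, dropout_rate=0.5, num_labels=2):
    """Hypothetical helper: one Conv1D branch per window size, concatenated."""
    tokens = layers.Input(shape=(sequence_length,), dtype="int32", name="tokens")
    embedded = layers.Embedding(vocab_size, embedding_dim, name="embedding")(tokens)

    pooled = []
    for size in window_sizes:
        # Each branch slides a window of `size` consecutive tokens over the sentence.
        conv = layers.Conv1D(filters, size, activation="relu",
                             name=f"conv_{size}")(embedded)
        # Max over time collapses the sequence axis: (batch, filters).
        pooled.append(layers.GlobalMaxPooling1D(name=f"pool_{size}")(conv))

    # One feature vector per sentence: len(window_sizes) * filters wide.
    features = layers.Concatenate(name="features")(pooled)
    features = layers.Dropout(dropout_rate, name="dropout")(features)
    probabilities = layers.Dense(num_labels, activation="softmax",
                                 name="softmax")(features)
    return Model(tokens, probabilities)


model = build_multi_window_cnn()
```

Because every branch is pooled down to a vector of length `filters`, the concatenated features have the same width regardless of sentence length, and the softmax layer produces one distribution per sentence.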

wpm · 2017-07-26T21:13:58Z

A model that isn't working right now.

Convolutional text sequence classifier: 2 labels, 100 filters, kernel size 3, pool factor 4, dropout rate 0.50
Text sequence embedder: core_web_sm, embedding matrix (20000, 300)

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
embedding (Embedding)        (None, 22, 300)           6000000   
_________________________________________________________________
convolution (Conv1D)         (None, 20, 100)           90100     
_________________________________________________________________
pooling (MaxPooling1D)       (None, 5, 100)            0         
_________________________________________________________________
softmax (Dense)              (None, 5, 2)              202       
_________________________________________________________________
dropout (Dropout)            (None, 5, 2)              0         
=================================================================
Total params: 6,090,302.0
Trainable params: 90,302.0
Non-trainable params: 6,000,000.0
_________________________________________________________________
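
The shapes and parameter counts in this summary can be checked by hand. The sketch below uses hypothetical helper names and assumes the Keras defaults of "valid" padding and stride 1 for `Conv1D`. It also makes visible that `Dense` acts on the last axis only, so the output here is one 2-way softmax per pooled position, `(None, 5, 2)`, rather than one per sentence, `(None, 2)`. That may be related to the model not working, though the thread does not say.

```python
def conv1d_output_length(input_length, kernel_size):
    """Sequence length after a 'valid', stride-1 1D convolution."""
    return input_length - kernel_size + 1


def conv1d_params(kernel_size, in_channels, filters):
    """Kernel weights plus one bias per filter."""
    return kernel_size * in_channels * filters + filters


def dense_params(in_units, out_units):
    """Weight matrix plus one bias per output unit."""
    return in_units * out_units + out_units


# Values taken from the summary above.
sequence_length, vocab_size, embedding_dim = 22, 20000, 300
filters, kernel_size, pool_factor, num_labels = 100, 3, 4, 2

embedding_count = vocab_size * embedding_dim                          # frozen embeddings
conv_length = conv1d_output_length(sequence_length, kernel_size)     # 22 -> 20
pooled_length = conv_length // pool_factor                            # 20 -> 5
conv_count = conv1d_params(kernel_size, embedding_dim, filters)
dense_count = dense_params(filters, num_labels)

# Dense is applied independently at each of the 5 pooled positions.
output_shape = (None, pooled_length, num_labels)
trainable = conv_count + dense_count
total = embedding_count + trainable
```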

This is a model with the topology described in Yoon Kim 2014. This addresses #17.

wpm added the enhancement label Jul 15, 2017

wpm self-assigned this Jul 15, 2017

wpm mentioned this issue Jul 20, 2017

Optional TF-IDF weighting #6

Closed

wpm added this to the Release 1.1.0 milestone Jul 21, 2017

wpm added a commit that referenced this issue Jul 26, 2017

Add support for a convnet/pooling model

fec2c77

This is a model with the topology described in Yoon Kim 2014. This addresses #17.

wpm closed this as completed Jul 26, 2017
