code for Learning Structured Text Representations
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
README.md Update README.md Dec 22, 2017
cli.py init Nov 5, 2017
data_structure.py init Nov 5, 2017
main.py fix Nov 6, 2017
models.py Update models.py Nov 8, 2017
neural.py init Nov 5, 2017
prepare_data.py init Nov 5, 2017
utils.py init Nov 5, 2017

README.md

Learning Structured Text Representations

Code for the paper:

Learning Structured Text Representations
Yang Liu and Mirella Lapata, Accepted by TACL

Dependencies

This code is implemented with Tensorflow and the data preprocessing is with Gensim

Document Classification

Data

The pre-processed YELP 2013 data can be downloaded at https://drive.google.com/open?id=0BxGUKratNjbaZjFIR1MtbkdzZVU

Preprocessing

To preprocess the data, run

python prepare_data.py path-to-train path-to-dev path-to-test

This will generate a pickle file, the format for the input data can be found in the sample folder

Training

python cli.py --data_file path_to_pkl --rnn_cell lstm --batch_size 16 --dim_str 50 --dim_sem 75 --dim_output 5 --keep_prob 0.7 --opt Adagrad
--lr 0.05 --norm 1e-4 --gpu -1 --sent_attention max --doc_attention max --log_period 5000

This will train the Tree-Matrix structured attention model in the paper on the training-set and present results on the devset/testset

License

MIT