Skip to content
tensorflow and theano cnn code for insurance QA(question Answer matching)
Branch: master
Clone or download
Latest commit 94971ec Sep 7, 2018
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
cnn fix some bugs Sep 5, 2018
lstm_cnn/theano 1. add precision in the code Jul 4, 2016
rnn_attention/tensorflow Update tf_rnn_char.py Mar 1, 2018
swem 1. add quora and swem Sep 7, 2018
README.md Update README.md Sep 7, 2018
config.py 1. add quora and swem Sep 7, 2018
gen.py 1. add quora and swem Sep 7, 2018
utils.py 1. add quora and swem Sep 7, 2018

README.md

Insurance-QA deeplearning model

This is a repo for Q&A Mathing, includes some deep learning models, such as CNN、RNN.

  1. CNN. Basic CNN model from 《Applying Deep Learning To Answer Selection: A Study And An Open Task》
  2. RNN. RNN seems the best model on Insurance-QA dataset.
  3. SWEM. SWEM is the fastest, and has good effect on other datasets, such as WikiQA ..., but is seems not so good on Insurance-QA dataset. I think that, SWEM is more suitable for Q&Q matching, not Q&A matching.

It's hard to say which model is the best in other datasets, you have to choose the most suitable model for you.

More models are on the way, pay attention to the updates.

Requirements

  1. tensorflow 1.4.0
  2. python3.5

Performance

margin loss version

Model/Score Ins_qa_top1_precision quora_best_prec
CNN 62% None
LSTM+CNN 68% None
SWEM <55% None

logloss version

Model/Score Insqa_top1_precision quora_best_prec
CNN None 79.60%
LSTM+CNN None None
SWEM <40% 82.69%

Running

Change configuration to your own environment, just like data pathes

vim config.py

Data processing

python3 gen.py

Run CNN model

cd ./cnn/tensorflow && python3 insqa_train.py

It will take few hours(thousands of epoches) to train this model on a single GPU.

Downloads

  1. You can get Insurance-QA data from here https://github.com/shuzi/insuranceQA
  2. You can get Quora data from here http://qim.ec.quoracdn.net/quora_duplicate_questions.tsv

Links

  1. CNN and RNN textual classification repo https://github.com/white127/TextClassification_CNN_RNN
  2. 《Applying Deep Learning To Answer Selection: A Study And An Open Task》
You can’t perform that action at this time.