No description or website provided.
Switch branches/tags
Nothing to show
Clone or download
abhishekkrthakur Merge pull request #2 from dfcf93/master
Updated README and deprecated Word2Vec Usage
Latest commit 08ad544 Mar 13, 2017

README.md

is_that_a_duplicate_quora_question

all the code for the article https://www.linkedin.com/pulse/duplicate-quora-question-abhishek-thakur will be available here..

How To

  1. Install Required Libraries
pip install pandas
pip install numpy
pip install scikit-learn
pip install nltk
pip install tqdm
pip install keras
pip install tensorflow
pip install pyemd
pip install fuzzywuzzy
pip install python-levenshtein
pip install --upgrade gensim
  1. Download Required Language libraries
mkdir data
cd data
wget http://www-nlp.stanford.edu/data/glove.840B.300d.zip
unzip glove.840B.300d.zip
rm glove.840B.300d.zip
wget http://qim.ec.quoracdn.net/quora_duplicate_questions.tsv
wget https://s3.amazonaws.com/dl4j-distribution/GoogleNews-vectors-negative300.bin.gz
sudo python -m nltk.downloader stopwords
cd ..
  1. Run
python feature_engineering.py
python deepnet.py