Deep Learning Methods for Quotable Text

This repository accompanies the article "Deep Learning Methods for Quotable Text", published on my blog.

Repository Structure

The base directory contains the model and the code required to train it
data/ contains the quotes and the GloVe embeddings
scraping/ contains the code for acquiring the dataset
reference-papers/ contains copies of the papers and additional reading material referenced in the post
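
For readers unfamiliar with the GloVe files kept in data/, the following is a minimal sketch of reading them, assuming the standard plain-text GloVe format (one token per line followed by its space-separated vector components). The function name `load_glove` and the file name in the usage line are hypothetical and do not come from this repository's code.

```python
def load_glove(path, vocabulary=None):
    """Read GloVe vectors from a text file into a dict of word -> list of floats.

    If `vocabulary` is given, only words contained in it are kept, which
    keeps memory usage down when the dataset uses a small vocabulary.
    """
    embeddings = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            parts = line.rstrip().split(" ")
            if len(parts) < 2:
                continue  # skip blank or malformed lines
            word, values = parts[0], parts[1:]
            if vocabulary is not None and word not in vocabulary:
                continue
            embeddings[word] = [float(value) for value in values]
    return embeddings


if __name__ == "__main__":
    # Hypothetical file name; use whichever GloVe file is placed in data/.
    vectors = load_glove("data/glove.6B.100d.txt")
    print(len(vectors), "vectors loaded")
```

The resulting dictionary can then be turned into an embedding matrix for a Keras Embedding layer; how this repository actually does that is defined in its own code, not in this sketch.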

Data Sources

LitQuotes.com - Over 2800 Literary Quotes website
QuotationsPage.com - Quotes and Famous Sayings
"You had me at hello: How phrasing affects memorability" by Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg, and Lillian Lee. Proceedings of ACL, 2012.
"Echoes of Persuasion: The Effect of Euphony in Persuasive Communication" by Marco Guerini, Gözde Özbal, and Carlo Strapparava. HLT-NAACL, pages 1483-1493. The Association for Computational Linguistics, 2015.
News Aggregator Dataset
Project Gutenberg

Dependencies

keras
tensorflow (CUDA and tensorflow-gpu for GPU training)
nltk/spacy (for dataset preprocessing and sentence pairing)
BeautifulSoup (for scraping the websites for quotes)
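
Before running anything, it can help to confirm that the packages above are importable. The sketch below is one way to do that with the standard library only. It assumes the usual import names (BeautifulSoup is imported as `bs4`) and treats nltk and spacy as alternatives, which is an interpretation of the "nltk/spacy" entry rather than something the repository states. The helper names are hypothetical.

```python
import importlib.util

# Import names for the dependencies listed above.
# BeautifulSoup is distributed as "beautifulsoup4" but imported as "bs4".
REQUIRED = ["keras", "tensorflow", "bs4"]
# Assumption: nltk and spacy are alternatives, so one of them is enough.
EITHER = ["nltk", "spacy"]


def is_installed(module_name):
    """Return True if a top-level module can be located (without importing it)."""
    return importlib.util.find_spec(module_name) is not None


def missing_dependencies(required=REQUIRED, either=EITHER):
    """Return human-readable names of the dependencies that were not found."""
    missing = [name for name in required if not is_installed(name)]
    if not any(is_installed(name) for name in either):
        missing.append(" or ".join(either))
    return missing


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All dependencies found.")
```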

Usage

Data Collection

python litquotes_scraper.py
python quotationspage_scraper.py

cd litquotes/
copy /b *.txt ../quotes.txt
cd ../quotationspage
copy /b *.txt ../quotes.txt
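
The `copy /b` commands above are specific to the Windows command prompt, and both write to the same ../quotes.txt. As a portable alternative, here is a minimal sketch that merges the scraped files from both directories in a single pass. The function name `merge_quotes` is hypothetical, the directory names are taken from the commands above, and, unlike `copy /b`, it appends a newline after any file that lacks one so quotes from adjacent files do not run together.

```python
from pathlib import Path


def merge_quotes(source_dirs, output_file):
    """Concatenate every .txt file found in source_dirs into output_file.

    Returns the number of files merged.
    """
    output_file = Path(output_file)
    count = 0
    with output_file.open("wb") as out:
        for directory in source_dirs:
            for path in sorted(Path(directory).glob("*.txt")):
                # Never read the output file back into itself.
                if path.resolve() == output_file.resolve():
                    continue
                data = path.read_bytes()
                out.write(data)
                # Keep the last quote of one file separate from the next file.
                if data and not data.endswith(b"\n"):
                    out.write(b"\n")
                count += 1
    return count


if __name__ == "__main__":
    # Run from the directory that contains litquotes/ and quotationspage/.
    merged = merge_quotes(["litquotes", "quotationspage"], "quotes.txt")
    print(f"Merged {merged} files into quotes.txt")
```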

Training

Configure the parameters in configuration.py, then run:

python main.py
