Create your own GitHub profile
Sign up for your own profile on GitHub, the best place to host code, manage projects, and build software alongside 28 million developers.
Full working examples for Text Mining and NLP. Gensim word2vec, phrase embeddings, keyword extraction with TF-IDF and sklearn, word count with PySpark
Scalable approach to phrase discovery for large text corpora using PySpark.
Examples of code in spark
Opinosis - Graph Based Summarization Framework. This repo contains Opinosis Demo Software & Dataset
OpinRank Dataset. Dataset containing user reviews for entities namely cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews)
in the last year
Press h to open a hovercard with more details.