
Full-scale search engine consisting of a front-end website (JavaScript) and an Apache Solr back end that indexes tweets. It uses LDA subtopic modelling and the Wiki and Pixabay APIs to create a dynamic summary of the input search query.


JunaidAShaikh/Topic_Summarization_Search_Engine


Topic_Summarization_Search_Engine

https://github.com/JunaidAShaikh/Tweet_Crawler

• Full-scale search engine consisting of a front-end website (JavaScript) and an Apache Solr back end that indexes unstructured Twitter data.

https://github.com/JunaidAShaikh/Latent_Dirichlet_Allocation_Subtopic_Modelling (LDA Topic Sub-Modelling) The search engine uses a Latent Dirichlet Allocation (LDA) sub-topic model that operates on half a million crawled tweets across 5 languages. The model is trained on this tweet data and then clusters tweets by topic similarity.
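
The clustering step can be illustrated with a minimal sketch. It assumes an LDA model has already produced a topic distribution for each tweet and simply groups tweets by their most probable topic. The function name, tweet ids, and probabilities are hypothetical and are not taken from the repository.

```python
from collections import defaultdict


def cluster_by_dominant_topic(tweet_topics):
    """Group tweet ids by their highest-probability topic.

    tweet_topics maps a tweet id to a list of topic probabilities,
    such as the per-document distribution an LDA model returns.
    """
    clusters = defaultdict(list)
    for tweet_id, probs in tweet_topics.items():
        # Index of the largest probability is the dominant topic.
        dominant = max(range(len(probs)), key=probs.__getitem__)
        clusters[dominant].append(tweet_id)
    return dict(clusters)


# Hypothetical distributions over 3 topics for 4 tweets.
example = {
    "t1": [0.7, 0.2, 0.1],
    "t2": [0.1, 0.8, 0.1],
    "t3": [0.6, 0.3, 0.1],
    "t4": [0.2, 0.2, 0.6],
}
clusters = cluster_by_dominant_topic(example)
```

Here `t1` and `t3` land in the same cluster because both are dominated by topic 0. A real pipeline might instead compare full distributions rather than only the top topic.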

• The Wiki and Pixabay APIs are used to generate static summaries and images, respectively, for user queries.

• The search engine presents the user with a web interface (hosted on Apache Solr) for entering a query. In response, it shows the static summary retrieved via the Wiki and Pixabay APIs, along with a dynamic summary composed from relevant tweets using the LDA model above.
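
As a rough sketch of how one user query could fan out to the three sources described above, the snippet below only builds the request URLs and sends nothing. Several details are assumptions rather than facts from the repository: the Solr host and the core name `tweets`, the indexed field name `text`, that "Wiki" refers to the Wikipedia REST summary endpoint, and the placeholder Pixabay key.

```python
from urllib.parse import quote, urlencode

# Assumed local Solr instance and core name (hypothetical).
SOLR_BASE = "http://localhost:8983/solr/tweets"


def build_request_urls(query, pixabay_key="YOUR_API_KEY", rows=10):
    """Return the URLs for the tweet search, static summary, and image lookup."""
    # Phrase query against an assumed "text" field in the Solr index.
    solr = SOLR_BASE + "/select?" + urlencode(
        {"q": f'text:"{query}"', "rows": rows, "wt": "json"}
    )
    # Wikipedia REST summary endpoint; page titles use underscores.
    wiki = "https://en.wikipedia.org/api/rest_v1/page/summary/" + quote(
        query.replace(" ", "_")
    )
    # Pixabay image search; the key here is a placeholder.
    pixabay = "https://pixabay.com/api/?" + urlencode(
        {"key": pixabay_key, "q": query, "image_type": "photo"}
    )
    return {"solr": solr, "wiki": wiki, "pixabay": pixabay}


urls = build_request_urls("climate change")
```

The front end would then fetch each URL and merge the results: the Wiki and Pixabay responses for the static summary, and the Solr hits as input to the LDA-based dynamic summary.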
