Skip to content

mimanshujain/News-Summarization

Repository files navigation

News Search Engine and Story Representation (Nodejs, javascript, jQuery, Java, Apache Solr)

• Reuters RCV 1 and NYT (2007) used as initial index database.
• Focused web crawl to retrieve and index recent news using news API for eg. Guardian.
• Timeline.js to represent data in chronological summarization.
• Implemented Adaptive LDA(Latent Dirichlet Allocation) to cluster and classify news data as topic summarization. Number of topic clusters is made adaptive to news data.
• LexRank algorithm to build a coherent news story through extraction.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •