Skip to content

koustubh25/crawl-google-results

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Google Search crawler using Jsoup- Java library

This program makes use of Jsoup library, which can crawl as well as parse HTML. It gives you the following:

  1. Title
  2. Site
  3. Short Abstract

Steps:

  1. Clone this repo.
  2. Add the jar in the build path.
  3. In the source file replace " Natural Language Processing" by your search query and "10" by the number of results you want.
  4. Run the program.

Sample Output

Title : The Stanford NLP (Natural Language Processing) Group
Site : nlp.stanford.edu/
Abstract : Stanford Natural Language Processing and Computational Linguistics Group.

Title : Natural Language Processing - Research at Google
Site : research.google.com/pubs/NaturalLanguageProcessing.html
Abstract : Most NLP applications such as information extraction, machine translation, sentiment analysis and question answering, require both .... Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing ( EMNLP '14).

Title : natural language processing blog
Site : nlpers.blogspot.com/
Abstract : 15 Nov 2014 ... This is often tough because in the current NLP atmosphere in which we're obsessed with shiny new techniques, it's not appealing to show that the new data gets you 90% of the gain and the new technique is only 10% on top ...

About

Google Search Results using Java library Jsoup

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages