Skip to content
/ Cosmos Public

Cosmos is a WebCrawler + SearchEngine written in Java

License

Notifications You must be signed in to change notification settings

YanSchw/Cosmos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cosmos is an experimental WebCrawler + SearchEngine written in Java. Using HTML WebScraping and a set of Seed URLs Cosmos crawls through the web to index URLs.

Screenshot

Important

Please keep in mind, that this is a short-term project. Written within a few days. If you look up the source code, you might find some nasty stuff... I only created this project, because I was interested in the idea of a WebCrawler and wanted to explore it... In the end, this is more of an experiment rather than a sufficient product!

Featuring

Indexes 20,000+ Webpages using 1,000,000+ Indicies within 15 Minutes! Screenshot2

Screenshot

Screenshot3

Run It Yourself

  • Clone the Repository
  • Build using Maven
  • Make sure a MySQL Database is running on "mysql://localhost:3306/", User: root, Password: 1234
  • Run the Project
  • Visit http://localhost:8080/