webcrawler

Small web crawler developed in Java and Spring Boot.

This crawler is a multi-threaded one which can start multiple crawling.

#How this works:

This is a REST based crawler and crawling a web site can be started using a REST end point.

Start with a seed web site to crawl and depth to crawl. This is a "POST" request

Once it is requested, it will generate a "token". This token can be used to query the status of the crawling operation

http://localhost:8080/status/ which will generate a JSON response.

This endpoint which will give you the result of the crawling.

http://localhost:8080/result/
The following endpoint will cancel the current crawling task

http://localhost:8080/stop/token

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
src		src
README.md		README.md
pom.xml		pom.xml

Provide feedback