Skip to content

SamuilG/simpleWebCrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Simple Web Crawler

Both Web Crawler programs accept user input and use as a starting/rood URL for the crawling.

The output is the 100 URLs visited during the crawling.

Web Crawler with regex

String urlRegex = "https?:\/\/(www\.)?[-a-zA-Z0-9@:%.\+~#=]{1,256}\.[a-zA-Z0-9()]{1,6}\b([-a-zA-Z0-9()@:%\+.~#?&//=]*)";

Web Crawler with jsoup

jsoup: Java HTML Parser https://jsoup.org/

About

My simple Web Crawler - Java

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages