This project was my final project for bachelor degree. It is a custom web crawler that crawls web pages, finds their category with naive Bayes algorithm and then, extraction of email addresses and phone numbers from each webpage. I have written it with java language in IntelliJ IDEA and Maven.
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.idea
src/main/java/ir/barasm
emailcrawler.iml
pom.xml