Web crawler for a search engine of multiple forums content, built with Java, MongoDB and Apache HttpComponents
-
Updated
Dec 1, 2016 - Java
Web crawler for a search engine of multiple forums content, built with Java, MongoDB and Apache HttpComponents
Given any page (URL), be able to classify the page, and return a list of relevant topics.
A Study project with the purpose of learning/practicing Java and it's libraries/tecnologies.
A web crawler that collects user data from social network MyLife (https://www.mylife.com/) made for Applied Topics in Data Structures and Databases class at ICSD of University of the Aegean. This project respects user's privacy and collects only public profile data.
A web crawler able to concurrently grab linked URLs with user defined depth-control
This java project is a multithreaded web crawler that uses three search engine, Bing, Yahoo, and Google to generate seeds to crawl the website.
This Project deals with the webcrawler code which help to find all the existing links in a site
An async web crawler for ads.txt project
An generic Web Crawler in Java 8
A general-purpose web crawler that extracts information from SEC filings based on detailed criteria provided by the user
A web crawler that implements breadth first search algorithm and built with maven.
Add a description, image, and links to the webcrawler topic page so that developers can more easily learn about it.
To associate your repository with the webcrawler topic, visit your repo's landing page and select "manage topics."