Crawler for bacalaureat.edu.ro for 2018 results. HTML parsing & caching, content stored in MongoDB. Built with Java, SpringBoot and Jsoup.
-
Updated
Mar 14, 2019 - Java
Crawler for bacalaureat.edu.ro for 2018 results. HTML parsing & caching, content stored in MongoDB. Built with Java, SpringBoot and Jsoup.
WebCrawler is a simple Java based framework which scans websites concurrently and stores data into persistent storage
A mini-implementation of a search engine built with Java and Spring Boot that uses web crawling,
web crawler allowing full page render crawl using HtmlUnit
🔍 A web crawling app written in java.
A Distributed Data Collection Framework on top of Spark
A Fast Multi-Threaded search engine implemented in Java, supporting Crawling, Indexing, Relevance scoring, trend analysis & in-memory caching
A console based web search engine developed in Java.
Efficient crawling & data extraction from web pages using concurrency in multiple programming languages.
Crawles all song files available on 'http://downloads.khinsider.com/'. Creates a list of direct download links for all such songs, intended for use with JDownloader or similar.
[JAVA] Web crawler using explicit threads, with /DanieleCali
This is a simple web crawler written in Java. Jsoup library is used for implementation of scrapper engine.
A basic web crawler that logs emails and URLs on webpages. CS-240 Final Project.
A Java-program which retrieves the full-texts or datasets from the Publication-Web-Pages.
Add a description, image, and links to the web-crawler topic page so that developers can more easily learn about it.
To associate your repository with the web-crawler topic, visit your repo's landing page and select "manage topics."