crawler-engine

WebCrawler is a C# console application that recursively scans a website starting from a given URL, collects all discovered links, and saves them to a file. It’s useful for site mapping, link analysis, and content discovery.

github git url cli commandline website crawler opensource web grabber web-crawler web-crawling commandline-interface commandline-tool crawler-engine crawling-sites crawler-csharp

Updated May 29, 2025
C#

supernebula / shark

Star

Shark (Plunder)可配置、插件化的爬虫引擎，二次开发框架。Configurable, pluginable crawler engine, secondary development framework.

downloader framework pipeline scheduler analyzer crawler-engine remove-duplicate

Updated Feb 10, 2022
C#

MCStreetguy / Crawler

Star

An advanced web-crawler written in PHP.

php crawler composer guzzle php-library web-crawler http-requests php-7 webcrawler composer-library crawler-engine

Updated Apr 5, 2019
PHP

hseghetti / simple-crawler

Star

Simple crawler using apache nutch and elasticsearch

docker elasticsearch crawler docker-compose nutch crawling cerebro crawlspider crawler-engine

Updated May 27, 2020
Shell

plugnsearch / plugnsearch

Star

The only real pluggable crawler / spider / webcrawler to search the web for stuff you need to know.

search-engine crawler scraper crawler-engine webpage-scraper

Updated Apr 23, 2023
JavaScript

Improve this page

Add a description, image, and links to the crawler-engine topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the crawler-engine topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

crawler-engine

Here are 52 public repositories matching this topic...

6677-ai / tap4-ai-crawler

nuhmanpk / WebScrapper

RevoltSecurities / SpideyX

namhong1412 / browser-clone-web

bkeepers / spiderman

web-extractors / arachnid-seo-js

fooock / robots.txt

Sobak / scrawler

wefindx / metadrive

wetrycode / tegenaria

spekulatius / spatie-crawler-cached-queue-example

crawlbase / crawlbase-ruby

lichang98 / visualize_spider

ShiqinHuo / wuhan_house_price_crawler

BaseMax / NetPHP

atymri / WebCrawler

supernebula / shark

MCStreetguy / Crawler

hseghetti / simple-crawler

plugnsearch / plugnsearch

Improve this page

Add this topic to your repo