web-crawler

Crawls a site concurrently and extracts, for each page, the page URL, the first title, the first paragraph, and all image and link URLs.
The results are saved to report.csv.

Contributing

Clone the repo

git clone https://github.com/wrelin/web-crawler
cd web-crawler

Build the project

go mod tidy
go build

Run the project

./web-crawler <site_url> <max_concurrent_thread> <max_pages_to_crawl>

Run the tests

go test ./...
