This repository has been archived by the owner on Dec 1, 2020. It is now read-only.


CHANGELOG.md

Changelog

This file documents all notable changes to the project. The project uses semantic versioning, and this changelog follows the Keep a Changelog format.

Added

  • Initial release after code audit
  • Added documentation, README, Code of Conduct, Contributing Guidelines, and Changelog
  • Basic features of the crawler are implemented:
    • the package is a module, but also contains a compilable app in app/crawl.go
    • public functions are FetchLinks(), StreamLinks(), and ScrapLinks()
    • documentation is available at https://godoc.org/github.com/bytemare/crawl
    • single domain scope
    • parallel scraping for speed, without critical code in concurrent goroutines
    • optional timeout
    • strips queries and fragments from URLs
    • control plane for signal interception and timeout
    • avoids loops by skipping links that are already visited or currently being visited
    • logging through logrus, with JSON logs written to a file for log aggregation
  • Added some code examples to the README
  • Integrated CI tools
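
The URL handling listed above (single domain scope, dropping queries and fragments, skipping links already seen) can be illustrated with a minimal sketch using only the Go standard library. This is not the package's actual code: the names `normalize`, `sameHost`, and `visited` are hypothetical and chosen only for illustration.

```go
package main

import (
	"fmt"
	"net/url"
	"sync"
)

// normalize is a hypothetical helper: it drops the query string and the
// fragment so that variants of the same page compare as equal.
func normalize(rawURL string) (string, error) {
	u, err := url.Parse(rawURL)
	if err != nil {
		return "", err
	}
	u.RawQuery = ""
	u.ForceQuery = false // avoid a dangling "?" when the query was empty
	u.Fragment = ""
	return u.String(), nil
}

// sameHost is a hypothetical scope check: a link is in scope only if it
// has the same hostname as the start URL.
func sameHost(start, link string) bool {
	s, err1 := url.Parse(start)
	l, err2 := url.Parse(link)
	return err1 == nil && err2 == nil && s.Hostname() == l.Hostname()
}

// visited is a hypothetical set of links that is safe for concurrent use.
type visited struct {
	mu   sync.Mutex
	seen map[string]bool
}

func newVisited() *visited {
	return &visited{seen: make(map[string]bool)}
}

// firstVisit records link and reports whether it had not been seen before.
func (v *visited) firstVisit(link string) bool {
	v.mu.Lock()
	defer v.mu.Unlock()
	if v.seen[link] {
		return false
	}
	v.seen[link] = true
	return true
}

func main() {
	start := "https://example.com/"
	v := newVisited()
	links := []string{
		"https://example.com/a?x=1#top",
		"https://example.com/a#bottom",
		"https://other.example.org/b",
	}
	for _, raw := range links {
		link, err := normalize(raw)
		if err != nil || !sameHost(start, link) {
			continue // out of scope or unparsable
		}
		if v.firstVisit(link) {
			fmt.Println("visit:", link)
		}
	}
}
```

In this sketch the first two links normalize to the same URL, so only one of them is visited, and the third is skipped because its host differs from the start URL.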