mislav/anemone

Anemone

Anemone is a web spider framework that can spider a domain and collect useful information about the pages it visits. It is versatile, allowing you to write your own specialized spider tasks quickly and easily.

Features:

  • Multi-threaded design for high performance

  • Tracks 301 HTTP redirects to understand a page’s aliases

  • Built-in breadth-first search (BFS) for determining page depth

  • Allows exclusion of URLs based on regular expressions
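
To make the depth and exclusion features concrete, here is a minimal sketch of the idea, not Anemone's actual implementation. It walks a hypothetical in-memory link graph breadth-first, records the depth at which each page is first reached, and skips any URL matching a regular expression. The names `LINKS` and `page_depths` and the sample URLs are assumptions made up for illustration; a real spider would fetch pages over HTTP and extract links from the HTML.

```ruby
# Hypothetical site map for illustration: each URL maps to the links on that page.
LINKS = {
  "/"            => ["/about", "/blog", "/login"],
  "/about"       => ["/", "/team"],
  "/blog"        => ["/blog/post-1", "/about"],
  "/blog/post-1" => ["/blog", "/team"],
  "/team"        => [],
  "/login"       => ["/admin"],
  "/admin"       => []
}.freeze

# Breadth-first traversal from `start`.
# Returns a hash of url => depth, where depth is the minimum number of links
# followed to reach that URL. URLs matching any pattern in `skip` are never
# visited, so pages reachable only through them are not discovered either.
def page_depths(links, start, skip: [])
  depths = { start => 0 }
  queue  = [start]

  until queue.empty?
    url = queue.shift
    links.fetch(url, []).each do |link|
      next if depths.key?(link)                             # already discovered
      next if skip.any? { |pattern| pattern.match?(link) }  # excluded by regex
      depths[link] = depths[url] + 1
      queue << link
    end
  end

  depths
end

# Crawl from the root, excluding login and admin pages.
DEPTHS = page_depths(LINKS, "/", skip: [/\A\/login/, /\A\/admin/])
DEPTHS.each { |url, depth| puts "#{depth}  #{url}" }
```

Because the queue is processed in first-in, first-out order, the first time a URL is seen is also the shallowest path to it, which is why a breadth-first walk is a natural fit for computing page depth.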

Examples

See the scripts under the lib/anemone/cli directory for examples of several useful Anemone tasks.

Requirements

  • nokogiri

About

Anemone web-spider framework

Languages

  • Ruby 100.0%