mislav / anemone forked from chriskite/anemone
Chris Kite (author)
Sun Oct 18 14:19:54 -0700 2009
anemone /
| name | age | message |
|---|---|---|
| CHANGELOG.md | Tue Oct 20 00:25:24 -0700 2009 | |
| LICENSE.txt | Tue Apr 14 12:14:47 -0700 2009 | |
| README.rdoc | Mon Oct 05 08:32:44 -0700 2009 | |
| anemone.gemspec | Mon Sep 07 14:30:40 -0700 2009 | |
| bin/ | Thu Oct 01 07:39:13 -0700 2009 | |
| lib/ | Tue Oct 20 00:25:38 -0700 2009 | |
| spec/ | Tue Oct 20 00:23:49 -0700 2009 | |
README.rdoc
Anemone
Anemone is a web spider framework that can spider a domain and collect useful information about the pages it visits. It is versatile, allowing you to write your own specialized spider tasks quickly and easily.
Features:
- Multi-threaded design for high performance
- Tracks 301 HTTP redirects to understand a page’s aliases
- Built-in BFS algorithm for determining page depth
- Allows exclusion of URLs based on regular expressions
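To make the last two features concrete, here is a minimal sketch of breadth-first depth tracking combined with regex-based URL exclusion. It is not Anemone's code and does not use its API: the `LINKS` graph, the `bfs_depths` method, and the `/admin` pattern are all hypothetical, and an in-memory hash stands in for pages that a real spider would fetch over HTTP.

```ruby
# Hypothetical in-memory link graph standing in for fetched pages.
# Keys are page URLs; values are the links found on that page.
LINKS = {
  '/'            => ['/about', '/blog', '/admin'],
  '/about'       => ['/'],
  '/blog'        => ['/blog/post-1', '/about'],
  '/blog/post-1' => ['/'],
  '/admin'       => ['/admin/secret']
}.freeze

# Breadth-first traversal that records the depth at which each page is
# first reached. Links matching any pattern in skip_patterns are never
# queued, so pages reachable only through them are not visited either.
def bfs_depths(start, links, skip_patterns = [])
  depths = { start => 0 }
  queue = [start]
  until queue.empty?
    url = queue.shift
    links.fetch(url, []).each do |link|
      next if depths.key?(link)                               # already seen
      next if skip_patterns.any? { |pattern| link =~ pattern } # excluded
      depths[link] = depths[url] + 1
      queue << link
    end
  end
  depths
end

# Crawl from the root, excluding everything under /admin (an assumed rule).
depths = bfs_depths('/', LINKS, [%r{\A/admin}])
depths.each { |url, depth| puts "#{depth} #{url}" }
```

Because the queue is processed in first-in, first-out order, the first time a page is reached is also along a shortest link path from the start page, which is why the recorded value can be read as the page's depth.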
Examples
See the scripts under the lib/anemone/cli directory for examples of several useful Anemone tasks.
REQUIREMENTS
- nokogiri
