Skip to content

emson/forager

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

forager

DESCRIPTION:

RubyGem that provides a library for scraping content from web sites. This library has the following emphasis:

  1. To be very very easy to maintain and change

  2. To provide an excellent tracking system, that will make it easy to modify when problems occur

  3. To easily scale to 1000’s of web sites

TODO:

  1. Move rules out of the main project

  2. Improve the tracker, for managing broken page links.

  3. Refactor Extractor so that it, extracts the value in a more robust way.

SYNOPSIS:

To use forager from the command line try:

forager ["uri"|<help>|<demo>|<test>|<testlive>]

e.g. 
forager help
forager demo
forager test  // this will test local HTML pages
forager testlive  // this will run the tests against the live (test) urls
forager "http://www.amazon.co.uk/Ruby-Way-Programming-Addison-Wesley-Professional/dp/0672328844/ref=sr_1_1?ie=UTF8&s=books&qid=1265974068&sr=1-1"

REQUIREMENTS:

  • FIX (list of requirements)

INSTALL:

sudo gem install forager

LICENSE:

Copyright © June 2009 B. F. B. Emson

About

Forager is Ruby gem for scraping content from web sites.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages