Skip to content

zeke/zhook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Zhook

Zhook is a rails webservice that scrapes nonprofit metadata such as phone numbers, website URLs, and email addresses from sites like Google, Facebook, LinkedIn, Idealist, and others. Zhook (жук) is the Russian word for bug; it sounds kinda like zook and it’s a crawler!

Setting up the Development Environment

sudo gem sources -a http://gems.github.com sudo gem install mislav-will_paginate fastercsv haml mechanize

sudo gem install mysql —with-mysql-config=/usr/local/mysql/bin/mysql_config
rake bootstrap:setup
rake db:migrate

Setting up the Production Environment

sudo gem sources -a http://gems.github.com sudo gem install mislav-will_paginate fastercsv haml mechanize

sudo gem install mysql —with-mysql-config=/usr/local/mysql/bin/mysql_config

Immediate To-Dos

Eventual To-Dos

  • If/When IRS database is updated, foreign keys should be switched from regular old incrementing ids to EINs.
  • Create a Search model that stores searches and displays links to recent ones under the search form
  • Configure XML respond_to for metadata output (currently only works for plain organizations with no metadata)
  • Add a ‘version’ field to factoids, so newer crawler implementations will know which sites to re-crawl

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published