This repository is private.
All pages are served over SSL and all pushing and pulling is done over SSH.
No one may fork, clone, or view it unless they are added as a member.
Every repository with this icon (
) is private.
Every repository with this icon (
This repository is public.
Anyone may fork, clone, or view it.
Every repository with this icon (
) is public.
Every repository with this icon (
Chris K Wensel (author)
Sat May 09 16:34:20 -0700 2009
bixo /
README
=============================== Introduction =============================== Bixo is an open source Java crawler that runs as a series of Cascading pipes. It is designed to be used as a tool for creating customized crawlers, thus each Cascading pipe implements a discrete operation. By building a customized Cascading pipe assembly, you can quickly create specialized crawlers that are optimized for a particular use case. Bixo borrows heavily from the Apache Nutch project, as well as many other open source projects at Apache and elsewhere. Bixo is released under the MIT license. =============================== Building =============================== You need Apache Ant 1.7 or higher. In the project root type: ant -p To clean, run the tests and integration tests and build a jar type: ant clean test it jar To build a distribution type: ant dist To build a eclipse project type: ant eclipse Than choose "import existing project" in eclipse.








