emi / bixo
Tree:
42e6be8
Ken Krugler (author)
Mon Oct 26 14:29:00 -0700 2009
commit 42e6be876bdb6d9995f2ef95380bbb8c41d0b9fc
tree be730427b73b5fa5375d1966bfbed21a34affe92
parent 55655d0258036a7f9dc030f49d25e6b690af3cf4
bixo /
| name | age | message |
|---|---|---|
| .gitignore | | |
| README | | |
| bin/ | | |
| build.xml | | |
| contrib/ | | |
| doc/ | | |
| lib/ | Mon Sep 21 12:40:47 -0700 2009 | |
| pom.xml | | |
| src/ | | |
README
===============================
Introduction
===============================

Bixo is an open source Java web mining toolkit that runs as a series of
Cascading pipes. It is designed to be used as a tool for creating customized
web mining apps. By building a customized Cascading pipe assembly, you can
quickly create a workflow using Bixo that fetches web content, parses it,
analyzes it, and publishes the results.

Bixo borrows heavily from the Apache Nutch project, as well as many other
open source projects at Apache and elsewhere.

Bixo is released under the MIT license.

===============================
Building
===============================

See http://bixo.101tec.com/documentation/building-bixo/ for full details.

You need Apache Ant 1.7 or higher.

To get a list of valid targets:

  % cd <project directory>
  % ant -p

To clean, run the tests, and build a jar:

  % ant clean test jar

To create Eclipse project files:

  % ant eclipse

Then choose "Import existing project" in Eclipse, and select the Bixo
project directory.

