Report or block aaronbinns
Contact Support about this user's behavior.Report abuse
- San Francisco, CA
Experimenting with Apache Pig.
Builds Lucene/Solr indexes out of NutchWAX segments and revisit records via Hadoop.
(T)he (N)ew (H)otness. Improved full-txt search of archival web data.
Full-text indexing pipeline of Pig scripts.
Clone of iof ia-hadoop-tools repo, but just zipnum branch with new features for zipnum and cluster merging.