Include fisheye info in README

1 parent 2dcc7d1, commit ae8627244569dff6c2d43e62d7001f4b0068e02c. mikecafarella committed Sep 13, 2012.
Showing with 20 additions and 1 deletion.
@@ -7,7 +7,26 @@ RecordBreaker is a project that automatically turns your text-formatted data (se
You can (and should!) read the full RecordBreaker tutorial here: [](
-The RecordBreaker repository is hosted at GitHub, here: [](
+The RecordBreaker repository is hosted at GitHub, here:
+One interesting part of RecordBreaker is the FishEye system. It's a
+web-based tool for examining and managing the diverse datasets likely
+to be found in a typical HDFS installation. It draws features from
+both filesystem management and database administration tools. Most
+interestingly, it uses RecordBreaker techniques to automatically
+figure out the structure of files it finds. You can run it by typing:
+ bin/learnstructure fisheye -run <portnum> <localstoragedir>
+Here, <portnum> is the HTTP port on which FishEye serves data to the
+user, and <localstoragedir> is the directory where it maintains
+information about a target filesystem.
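
The invocation described in the added README text could be sketched in shell as follows. The command line itself comes from the README; the port `8080` and the directory `/tmp/fisheye-store` are assumed example values, not defaults documented by the project, and creating the directory beforehand is a precaution rather than a stated requirement.

```shell
# Assumed example values; the README does not document any defaults.
PORTNUM=8080
LOCALSTORAGEDIR="/tmp/fisheye-store"

# Precaution (assumption): make sure the local storage directory exists.
mkdir -p "$LOCALSTORAGEDIR"

# Command line as given in the README, with the placeholders filled in.
FISHEYE_CMD="bin/learnstructure fisheye -run $PORTNUM $LOCALSTORAGEDIR"

# Launch only when run from a RecordBreaker checkout; otherwise just print.
if [ -x bin/learnstructure ]; then
  $FISHEYE_CMD
else
  echo "Would run: $FISHEYE_CMD"
fi
```

With these assumed values, the web interface would presumably be reachable on port 8080 of the machine where the command runs.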
