giant-squash

giant-squash collects data on HBase table sizes. This command-line tool does one thing: it polls the sizes of the HBase tables you name and, on shutdown, writes the collected data out to a JSON file. The usual story is that this data is then fed to the bloom-harvester for maximum enjoyment.
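To make the "collect, then write on shutdown" behavior concrete, here is a minimal sketch of the pattern in Java. It is not giant-squash's actual code: the class name `SizeCollector`, the JSON field names (`table`, `timestamp`, `bytes`), the table name `my_table`, the 60-second interval, and the placeholder size lookup are all assumptions made for illustration. The real output schema may differ.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.ToLongFunction;

// Hypothetical sketch: accumulates size samples in memory and dumps them as JSON on shutdown.
public class SizeCollector {
    private final List<String> samples = new ArrayList<>();

    // Store one sample as a JSON object. The field names are assumptions, and the
    // table name is not escaped, so this only suits simple identifiers.
    public synchronized void record(String table, long epochSeconds, long bytes) {
        samples.add(String.format(Locale.ROOT,
                "{\"table\":\"%s\",\"timestamp\":%d,\"bytes\":%d}", table, epochSeconds, bytes));
    }

    // Render all samples collected so far as a JSON array.
    public synchronized String toJson() {
        return "[" + String.join(",", samples) + "]";
    }

    public static void main(String[] args) {
        SizeCollector collector = new SizeCollector();
        // Placeholder lookup: a real collector would ask HDFS for the table's size.
        ToLongFunction<String> sizeOf = table -> 0L;

        // Poll at a fixed interval (60 seconds here, chosen arbitrarily).
        ScheduledExecutorService poller = Executors.newSingleThreadScheduledExecutor();
        poller.scheduleAtFixedRate(
                () -> collector.record("my_table",
                        System.currentTimeMillis() / 1000, sizeOf.applyAsLong("my_table")),
                0, 60, TimeUnit.SECONDS);

        // The JVM runs shutdown hooks on SIGINT (Ctrl-C or kill -2), so the data is written then.
        Runtime.getRuntime().addShutdownHook(new Thread(() -> {
            poller.shutdownNow();
            try {
                Files.write(Paths.get("giant-squashes.json"),
                        collector.toJson().getBytes(StandardCharsets.UTF_8));
            } catch (IOException e) {
                System.err.println("Could not write output: " + e.getMessage());
            }
        }));
    }
}
```

Because everything is held in memory until shutdown, a hard kill (kill -9) would skip the hook and lose the data in a design like this one.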

For a demo of how the data from this tool can be used, see The Story of the Big Data Elders and The Big Data Elders, Archeology Hour.

Requirements

You must be able to run this jar from a gateway node, or from any machine on which you can run hadoop fs -du -s /bla.
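The requirement above suggests that table sizes come from the same information hadoop fs -du -s reports. As an illustration only, the sketch below parses one line of that command's output in Java. It assumes the summary line starts with the size in bytes followed by whitespace and ends with the path (newer Hadoop versions add a second numeric column for space consumed across replicas). The class name `DuOutput`, the method `parseSizeBytes`, and the sample path are hypothetical, and whether giant-squash shells out or uses the HDFS API is not stated in this README.

```java
// Hypothetical helper: extracts the size in bytes from one line of `hadoop fs -du -s` output.
public class DuOutput {

    // Assumes the first whitespace-separated field is the size in bytes
    // and that at least one more field (the path) follows it.
    public static long parseSizeBytes(String duLine) {
        String[] fields = duLine.trim().split("\\s+");
        if (fields.length < 2) {
            throw new IllegalArgumentException("Unexpected du output: " + duLine);
        }
        return Long.parseLong(fields[0]);
    }

    public static void main(String[] args) {
        // Sample line in the three-column layout; the values are made up.
        System.out.println(parseSizeBytes("1048576  3145728  /hbase/my_table")); // prints 1048576
    }
}
```

Reading only the first field keeps the parser working with both the two-column and three-column layouts.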

Quick start

mvn clean install
java -Xmx3G -jar ./giant-squash-1.0-jar-with-dependencies.jar -output <giant-squashes.json> -interval <poll_interval_in_seconds> -tableNames <space delimited list of the table names>
Press Ctrl-C when done, or run kill -2 <pid> (SIGINT) if you started it with nohup in the background.

alexandre-normand/giant-squash
