GitHub - woopi/chukwa: Mirror of Apache Chukwa

woopi / chukwa Public

forked from apache/chukwa

Notifications You must be signed in to change notification settings
Fork 0
Star 0

Mirror of Apache Chukwa

Apache-2.0 license

0 stars 45 forks Branches Tags Activity

Star

Notifications

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 706 Commits
bin		bin
conf		conf
contrib		contrib
lib		lib
script/pig		script/pig
src		src
test/samples		test/samples
tools		tools
CHANGES.txt		CHANGES.txt
DISCLAIMER.txt		DISCLAIMER.txt
LICENSE.txt		LICENSE.txt
NOTICE.txt		NOTICE.txt
README.txt		README.txt
forrest.properties		forrest.properties
pom.xml		pom.xml

Repository files navigation

Chukwa 0.5 -- April 2010

This is the second formal release of Chukwa, an Apache Hadoop subproject 
dedicated to scalable log collection and processing. If you have large 
volumes of log data generated across a cluster, and you need to process 
them with MapReduce, Chukwa may be the tool for you.

The notes for this release are in docs/releasenotes.html

BUILDING CHUKWA

To build chukwa from source:

mvn clean package

To check that things are ok, run 'ant test'. It should take roughly fifteen minutes.

RUNNING CHUKWA

If you are unfamiliar with Chukwa, you should start by reading the design 
overview, in docs/design.html. This will tell you what each piece of Chukwa
does.  

If you're impatient, the following is the 30-second explanation:

The minimum you need to run Chukwa are agents on each machine you're 
monitoring, and a collector to write the collected data to HDFS.  The
basic command to start an agent is bin/chukwa agent.  The base command to
start a collector is bin/chukwa collector.

If you want to start a bunch of agents, you can use the
bin/start-agents.sh script. This just uses ssh to start agents on a
list of machines, given in conf/agents. It's exactly parallel to
Hadoop's start-hdfs and start-mapred scripts.  There's also a 
bin/start-collectors.sh that does the same to start collectors, on 
machines listed in conf/collectors.  One hostname per line.

There are stop scripts that do the exact opposite of the start commands. 

Full installation instructions are in docs/admin.html.