Jumbune is an open-source project to optimize both Yarn (v2) and older (v1) Hadoop-based solutions. It is a Linux-based framework that assists in fault-free analytic solution development, data quality validation, and efficient cluster utilization. Jumbune can be decoupled from the Hadoop cluster and run on a separate machine. Being a generic implementatio…
bin Merge branch 'master' into JUM-2.0 Jun 29, 2018
cluster-analysis Fixed jumbune-cluster-monitoring dependency in cluster-analysis module Jun 1, 2018
cluster-monitoring Refectored code Jun 1, 2018
common 1. Removed hadoop-auth dependency from pom. 2. Fixed auto updation of… Jun 4, 2018
datavalidation
debugger Refectored code Jun 1, 2018
deploy Refectored code Jun 1, 2018
distribution Refectored code Jun 1, 2018
docker-conf Updated the files with new corrected version Jun 22, 2016
examples Refactored JobConfig, Cluster, ClusterDefinition, multiple ui bug fixes. Apr 13, 2018
execution
remoting-common
remoting-jsch Refectored code Jun 1, 2018
remoting Refectored code Jun 1, 2018
utilities Refectored code Jun 1, 2018
web Removed jsp files Jun 15, 2018
.dockerignore JUM-100 Ability to build Jumbune docker image from the latest SNAPSHO… May 18, 2015
.gitignore 1. Removed hadoop-auth dependency from pom. 2. Fixed auto updation of… Jun 4, 2018
.travis.yml Updated Travis to use jdk8 Jun 1, 2018
CODE_OF_CONDUCT.md Added Base source for version 2.0. Apr 3, 2018
Dockerfile
LICENSE.txt Committed Initial version of Jumbune (1.0.0) Jun 5, 2014
NOTICE.txt Refectored code Jun 1, 2018
README.md Updated Organization Mar 29, 2018
assembly.xml
pom.xml 1. Removed hadoop-auth dependency from pom. 2. Fixed auto updation of… Jun 4, 2018

README.md

jumbune Build Status

Synopsis

Jumbune is an open-source product built for analyzing Hadoop clusters and MapReduce jobs. It provides development and administrative insights into Hadoop-based analytical solutions. It enables users to debug, profile, monitor, and validate analytical solutions hosted on decoupled clusters.

Website

http://jumbune.org

Issue Tracker

http://jumbune.org/jira/browse/JUM

Building

Jumbune can be built with Maven:

  • To build for a Yarn Hadoop cluster

mvn clean install -P yarn

  • To build for a non Yarn Hadoop cluster

mvn clean install
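The choice between the two builds can be sketched as a small shell helper. This is only an illustration: the function name `jumbune_build_cmd` and the `yarn` / `non-yarn` labels are hypothetical and not part of Jumbune; the Maven commands it prints are exactly the two listed above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with Jumbune): prints the Maven command
# for the given cluster type, using only the two invocations listed above.
jumbune_build_cmd() {
  case "$1" in
    yarn)     echo "mvn clean install -P yarn" ;;
    non-yarn) echo "mvn clean install" ;;
    *)        echo "usage: jumbune_build_cmd yarn|non-yarn" >&2; return 1 ;;
  esac
}

# Print the command for a Yarn cluster.
jumbune_build_cmd yarn
```

To actually run the build, the printed command can be executed from the root of the checked out repository, for example with `eval "$(jumbune_build_cmd yarn)"`.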

Installation

A detailed installation guide can be found at http://bit.ly/1kBG4Qo

The deployment planning guide can be found at http://bit.ly/1oiXGk2

  • Deploying Jumbune

java -jar <location of the distribution build jar>

  • Deploying Jumbune (in verbose mode)

java -jar <location of the distribution build jar> -verbose

  • Running Jumbune Agent

java -jar <jumbune agent jar> <port> <|verbose>

  • Running Jumbune

./startWeb

or

./runCli

Docker Image (Jumbune + Apache YARN )

  • Building from the root of the checked out repository $ sudo docker build -t "jumbune/pseudo-distributed:1.5.0" .

  • Building from the github.com repository $ sudo docker build -t "jumbune/pseudo-distributed:1.5.0" github.com/impetus-opensource/jumbune/

  • Getting the automated build from docker registry https://registry.hub.docker.com/u/jumbune/jumbune/

  • Running the built image $ docker run -d --name="jumbune" -h "jumbune-docker" -p 8080:8080 -p 5555 jumbune/pseudo-distributed:1.5.0
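As a rough sketch, the build and run steps can be tied to a single image tag so that both commands stay in sync. The helper names `docker_build_cmd` and `docker_run_cmd` are hypothetical and not shipped with Jumbune; the image name, version, ports, and container name are taken from the commands above. The helpers only print the commands and do not invoke Docker.

```shell
#!/bin/sh
# Image name and version as they appear in the commands above.
IMAGE="jumbune/pseudo-distributed"
VERSION="1.5.0"
TAG="$IMAGE:$VERSION"

# Hypothetical helpers (not part of Jumbune) that print the Docker commands.
docker_build_cmd() {
  # $1 is the build context: "." or the github.com repository path.
  echo "sudo docker build -t $TAG $1"
}

docker_run_cmd() {
  echo "docker run -d --name=jumbune -h jumbune-docker -p 8080:8080 -p 5555 $TAG"
}

# Print the build command for a local checkout, then the run command.
docker_build_cmd .
docker_run_cmd
```

Changing `VERSION` in one place then updates both the build tag and the image that gets run.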

Code Examples

Code examples are packaged inside the distribution:

  • For Flow analyzer: BankDefaulters, ClickStreamAnalysis, USRegionPortouts
  • For Profiling: MovieRating
  • For Data Validation: script

Documentation

License

Jumbune is licensed under the LGPLv3 license.