This repository holds the Amazon Elastic MapReduce sample bootstrap actions.
A big data platform monitoring tool based on the ELK stack
Integration of Apache Tajo with Elasticsearch
Converts a custom Tajo profiling result JSON file to a CSV file
Tajo is a distributed data warehouse system on Hadoop that provides low-latency, scalable ad-hoc queries and ETL on large data sets stored on HDFS and other data sources. This repository hosts a separate Tajo distribution based on CDH.
Mirror of Apache Tajo
Lightweight Hive query workbench
Oozie designer and job management system
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms an…
Cloud platform monitoring system
Log and data generator
Common libraries for Gruter's open-source projects
Distributed structured data store, NoSQL, Bigtable distributed database.