
Python, 351 stars, 31 forks

ibis

Productivity-centric Python big data framework for high performance at Hadoop scale, with first-class integration with Impala. Co-founded by the creator of pandas.

hue

Let’s Big Data. Hue is an open source Web interface for analyzing data with Apache Hadoop.

Impala

Real-time Query for Hadoop

Python, 109 stars, 28 forks

impyla

Python client and Numba-based UDFs for Impala

spark-dataflow

Provides a Spark backend for executing Dataflow pipelines.

spark-timeseries

A library for financial and time series calculations on Apache Spark

Java, 1 star, 2 forks

quince

Scalable genomics variant store and analytics

Python, 0 stars, 1 fork

thrift_sasl

Thrift SASL module that implements TSaslClientTransport

impala-lzo

oryx

Simple real-time large-scale machine learning infrastructure.

Python, 30 stars, 0 forks

ibis-notebooks

IPython notebooks and learning materials for Ibis

kitten

The fast and fun way to write YARN applications.

spark

forked from baeeq/incubator-spark

Mirror of Apache Spark

flume-ng

llama

Llama - Low Latency Application MAster

search

avro

forked from apache/avro

Mirror of Apache Avro

Java, 3 stars, 4 forks

datafu

sqoop

Sqoop has moved to Apache!

cdk

Cloudera Development Kit
