• Updated Sep 13, 2018
  • An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

    Python 133 Apache-2.0 Updated Jan 8, 2018
  • S3-backed pypi server implementation

    Python 87 MIT Updated Sep 26, 2017
  • hue

    Forked from cloudera/hue

    Hue is an open source Analytics Workbench for browsing, querying and visualizing data.

    Python 1,250 Apache-2.0 Updated Sep 19, 2017
  • DockerHub public images - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr / SolrCloud, Presto, Apache Drill, Nifi, Spark, Superset, H2O, Mesos, Serf, Consul, Riak, Alluxio, Jython, Advanced Nagios Plugins Collection / PyTools / Tools repos on CentOS / Ubuntu / Debian / Alpine

    Shell 132 Updated Sep 18, 2017
  • A lightweight server clone of Azure Blob Storage that simulates most of the commands supported by it with minimal dependencies.

    JavaScript 30 MIT Updated Sep 18, 2017
  • A docker container for spark standalone cluster mode, built on top of the openjdk8-jre container

    Shell Apache-2.0 Updated Aug 27, 2017
  • Terraform is a tool for building, changing, and combining infrastructure safely and efficiently.

    Go 4,115 MPL-2.0 Updated Jul 11, 2017
  • scikit-learn: machine learning in Python

    Python 1 15,132 Updated Nov 11, 2016
  • A light library that adds job scheduling capabilities to RQ (Redis Queue)

    Python 155 MIT Updated Oct 28, 2016
  • Mirror of Apache Spark

    Scala 16,966 Apache-2.0 Updated Sep 20, 2016
  • hadoop

    Forked from apache/hadoop

    Mirror of Apache Hadoop

    Java 5,036 Apache-2.0 Updated Jun 15, 2016
  • Pivotal Greenplum Database

    PLpgSQL 824 Updated Jun 3, 2016
  • Install and set up Docker

    SaltStack 245 Updated Apr 19, 2016
  • Java 93 Updated Oct 15, 2015
  • Java 17 Updated Oct 6, 2015
  • Studio project common to all MDM projects

    Java 12 Updated Oct 6, 2015
  • Java 29 Updated Oct 6, 2015
  • Studio open source projects related to Data Quality

    Java 19 Updated Oct 6, 2015
  • Java 93 Updated Oct 6, 2015
  • The Master repository (using gitslave) that define all public repositories required to build the Talend Open Studio.

    HTML 27 Updated Oct 5, 2015
  • Studio open source projects related to Big Data

    Java 36 Updated Oct 5, 2015
  • A sane VPC NAT instance chef cookbook

    Ruby 2 Updated Jun 15, 2015
  • Chef cookbook to deploy artifacts from standard maven repositories

    Ruby 3 MIT Updated Feb 2, 2015
  • Chef cookbook for Apache Cassandra, DataStax Enterprise (DSE) and DataStax agent

    Ruby 235 Updated Jan 24, 2015
  • A convenient Chef LWRP to manage user accounts and SSH keys

    Ruby 150 Updated Jan 20, 2015
  • chef cookbook to install Apache Spark

    Ruby 16 MIT Updated Jan 20, 2015
  • Chef Cookbook for R

    Ruby 50 Updated Jan 20, 2015
  • Chef cookbook for Kafka

    Ruby 97 Updated Mar 21, 2014
  • A JPA 2.0 compliant Object-Datastore Mapping Library for NoSQL Datastores. Please subscribe to:

    Java 246 Updated Mar 14, 2014