An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
S3-backed pypi server implementation
Hue is an open source Analytics Workbench for browsing, querying and visualizing data.
DockerHub public images - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr / SolrCloud, Presto, Apache Drill, Nifi, Spark, Superset, H2O, Mesos, Serf, Consul, Riak, Alluxio, Jython, Advanced Nagios Plugins Collection / PyTools / Tools repos on CentOS / Ubuntu / Debian / Alpine
A lightweight server clone of Azure Blob Storage that simulates most of the commands supported by it with minimal dependencies.
A Docker container for Spark standalone cluster mode, built on top of the openjdk8-jre container
Terraform is a tool for building, changing, and combining infrastructure safely and efficiently.
scikit-learn: machine learning in Python
A light library that adds job scheduling capabilities to RQ (Redis Queue)
Mirror of Apache Spark
Mirror of Apache Hadoop
Pivotal Greenplum Database
Install and set up Docker
Studio project common to all MDM projects
Studio open source projects related to Data Quality
The master repository (using gitslave) that defines all public repositories required to build Talend Open Studio.
Studio open source projects related to Big Data
A sane VPC NAT instance chef cookbook
Chef cookbook to deploy artifacts from standard maven repositories
Chef cookbook for Apache Cassandra, DataStax Enterprise (DSE) and DataStax agent
A convenient Chef LWRP to manage user accounts and SSH keys
Chef cookbook to install Apache Spark
Chef Cookbook for R
Chef cookbook for Kafka
A JPA 2.0-compliant Object-Datastore Mapping Library for NoSQL Datastores.