GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Create an RK storage account in Azure.
JSON Schema definition for the rubikloud metafile format
Docker image with Uvicorn managed by Gunicorn for high-performance web applications in Python 3.7 and 3.6 with performance auto-tuning. Optionally with Alpine Linux.
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
S3-backed pypi server implementation
Hue is an open source Analytics Workbench for browsing, querying and visualizing data.
DockerHub public images - Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr / SolrCloud, Presto, Apache Drill, Nifi, Spark, Superset, H2O, Mesos, Serf, Consul, Riak, Alluxio, Jython, Advanced Nagios Plugins Collection / PyTools / Tools repos on CentOS / Ubuntu / Debian / Alpine
A lightweight server clone of Azure Blob Storage that simulates most of the commands supported by it with minimal dependencies.
A docker container for spark standalone cluster mode, built on top of the openjdk8-jre container
Terraform is a tool for building, changing, and combining infrastructure safely and efficiently.
scikit-learn: machine learning in Python
A light library that adds job scheduling capabilities to RQ (Redis Queue)
Mirror of Apache Spark
Pivotal Greenplum Database
Install and set up Docker
Studio project common to all MDM projects
Studio open source projects related to Data Quality
The Master repository (using gitslave) that define all public repositories required to build the Talend Open Studio.
Studio open source projects related to Big Data
A sane VPC NAT instance chef cookbook
Chef cookbook to deploy artifacts from standard maven repositories
Chef cookbook for Apache Cassandra, DataStax Enterprise (DSE) and DataStax agent