A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2.0.
Knowledge base for Hortonworks Ambari Installations
Experiment in anonymization of text files
Run a REST service from a Jupyter Notebook
ReportServer Community Edition Dockerfile
Contains useful libraries for engineering projects on Big Data Infrastructures
Using reinforcement learning to solve the elevator problem. Like a boss
Cloud-provider agnostic deployment code for Big Data Infrastructure components
Experiments with CrateDB
Simple local Riak Vagrant with Jupyter notebooks
Java Magazine article: Machine Learning in Scala/Spark ML
Spawns JupyterHub user servers in Docker containers
Ansible playbooks to help deploy Hadoop CDH4 and Spark in High Availability with Automatic Failover, and lots of other cool stuff!