A library for performing data-science and machine-learning data preparation, aggregation, manipulation, and querying tasks at scale using Scalding or Spark
Humbug is a very simple scrooge replacement for Thrift structs with more than 254 fields
Feature Engineering as Composable Functions
HDFS tools, taps, sinks, sources, and the like for use with Cascading and Scalding.
A Cascading-Sqoop Integration
A micro-test framework for scalding pipes to make sure you don't get burnt
Monadic interface for Hive databases
Scalding and Cascading support for using scrooge with parquet.
Utilities around testing.
Scala utility library
HDFS utility library
Well-typed data pipeline framework for building robust ETL jobs
Wrangle different dev tools together and look good doing it.
An sbt plugin for maintaining a uniform approach to building cba components.
Composing Lenses and Prisms with stronger resulting properties
A functional wrapper for scalikejdbc.
CI scripts for use by Travis and Drone
Work space for open banking standards development in Australia
DEPRECATED. Old style CI scripts.
A cleaner version of the literate starter kit based on Emacs24
Jenkins plugin that improves build performance for transient slaves by caching files
Postgres pljava image based on openjdk
Rackspace Private Cloud Offering based on OpenStack
Ansible playbooks for deploying OpenStack.
Journalbeat is a log shipper from systemd/journald to Logstash/Elasticsearch
Curator: Tending your Elasticsearch indices
Combined repository for uPickle/PPrint
Deriving shapeless generics for scrooge-generated types
Fault-tolerant job scheduler for Mesos which handles dependencies and ISO 8601-based schedules