Scriptable scheduler for periodic Hadoop workflows
R package for binning potentially unevenly distributed values in a vector into evenly distributed groups (bins).
R package with extended timing functions tic/toc, as well as stack and list structures.
Sparse feature extraction with Spark
Beautiful static documentation for your API
Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark
Interactive Audience Analytics with Spark and HyperLogLog
Cantor provides utilities for estimating the cardinality of large sets.
Puppet module that generates and manages Kerberos host keytabs
Charming sonnet written by Carol Lin about an unnamed Collective employee.
Joda-Time is the widely used replacement for the Java date and time classes.
A modern testing and behavioural specification framework for Java 8
R package for generation of formatted text and code from templates.
R package with various utilities, some having to do with loading CSV files and timing.
R package containing convenience functions for working with SQL databases via ODBC and JDBC.
R package with utilities for local and remote command execution (un*x only).
Jasig CAS - Single Sign On for the Web
Java client and server implementation of Redis
An R package for running, timing and logging multistep processes.
Real²time Exploratory Analytics on Large Datasets
Common interface for Ruby's HTTP clients
Mirror of Apache Kafka
Utilities for asynchronous logging (files and Flume)
Our fork of Trove
Small benchmark utility based on Jetty's HTTP client
A command line tool for pushing Nagios host and service notifications to a HipChat room
Pulp .deb packages support!
Avro serialization format support for Node.js