Grow your team on GitHub
GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.Sign up
Provides Spotify specific TensorFlow helpers
Python virtualenvs in Debian packages
A Scala API for Apache Beam and Google Cloud Dataflow.
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Utilities for working with futures in Java 8
Ephemeral Hadoop clusters using Google Compute Platform
A Scala feature transformation library for data science and machine learning
A lightweight workflow definition library
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
Common library for serving TensorFlow, XGBoost and scikit-learn models in production.
DBeam extracts SQL tables using JDBC and Apache Beam
DNS record reconciliation for Gordon: Event-driven Cloud DNS
Homebrew formula for open-source software developed by Spotify
A tool for data sampling, data generation, and data diffing
A Java implementation of the FastForward metrics agent
Scala Aggregators used for ML Model metrics monitoring
The Heroic Time Series Database
Algebraic data types in Java.
A ffwd-http-client for Java
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Java library for working with Guava futures
A Giter8 template for scio
Community-supported add-ons for Scio
A functional reactive framework for managing state evolution and side-effects.
Android Architecture Blueprint sample app implementation using Mobius
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Docker container orchestration platform
GCP Plugin for Gordon: Event-driven Cloud DNS
A simple docker client for the JVM
Apache Cassandra cluster orchestration tool for the command line