GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
This repository contains the Streamlin platform. It is a fork of Apache Flink featuring side input implementation, which enables hybrid computation between data-at-rest and data-in-motion.
This repo is hosting all common ML pipeline abstractions built on Apache Flink.
Hopsworks - Hadoop for Humans
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive Windowing
a simple example on how to use the sideinput API
Parameter Server implementation in Apache Flink
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
This repository contains a fork of Emma ML library, which we extend with ML algorithms developed within Streamline