data-processing

Popular repositories

  1. kafka-storm-starter Public

    Forked from miguno/kafka-storm-starter

    Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+, while using Apache Avro as the data serialization format.

    Scala

  2. incubator-samza Public

    Forked from baeeq/incubator-samza

    Mirror of Apache Samza

    Scala

  3. exelixi Public

    Forked from ceteri/exelixi

    Exelixi is a distributed framework based on Apache Mesos, mostly implemented in Python using gevent for high-performance concurrency. It is intended to run cluster computing jobs (partitioned batch…

    Python

  4. spark-ec2 Public

    Forked from shivaram/spark-ec2

    Scripts used to set up a Spark cluster on EC2

    Shell

  5. cdk Public

    Forked from markgrover/cdk

    Cloudera Development Kit

    Java

  6. Impatient Public

    Forked from Cascading/Impatient

    Source examples to support the "Cascading for the Impatient" blog post series

    Java

Repositories

Showing 10 of 47 repositories
  • mpire Public Forked from sybrenjansen/mpire

    A Python package for easy multiprocessing, but faster than multiprocessing

    Python 0 MIT 37 0 0 Updated Sep 10, 2021
  • hydra Public Forked from facebookresearch/hydra

    Hydra is a framework for elegantly configuring complex applications

    Python 0 MIT 673 0 0 Updated Mar 25, 2021
  • klio Public Forked from spotify/klio

    Smarter data pipelines for audio.

    Python 0 Apache-2.0 51 0 0 Updated Oct 16, 2020
  • Neuraxle Public Forked from Neuraxio/Neuraxle

    Build neat pipelines with the right abstractions to do AutoML. Let your pipeline steps have hyperparameter spaces. Enable checkpoints to cut duplicate calculations. Go from research to production environment easily.

    Python 0 Apache-2.0 63 0 0 Updated Dec 19, 2019
  • faust Public Forked from robinhood/faust

    Python Stream Processing

    Python 0 582 0 0 Updated Aug 4, 2018
  • thredo Public Forked from dabeaz/thredo
    Python 0 MIT 19 0 0 Updated Aug 1, 2018
  • bloop Public Forked from scalacenter/bloop

    A hot bloop for your productivity

    Scala 0 Apache-2.0 205 0 0 Updated Jun 5, 2018
  • Stream-Framework Public Forked from tschellenbach/Stream-Framework

    Stream Framework is a Python library that lets you build news feeds, activity streams, and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology.

    Python 0 557 0 0 Updated Jan 21, 2018
  • fireant Public Forked from kayak/fireant

    Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.

    Python 0 Apache-2.0 20 0 0 Updated Nov 30, 2017
  • gain Public Forked from elliotgao2/gain

    Web crawling framework based on asyncio for everyone.

    Python 0 GPL-3.0 215 0 0 Updated Jun 19, 2017

People

This organization has no public members. You must be a member to see who’s a part of this organization.
