Skip to content
View rgruener's full-sized avatar

Organizations

@create-at-cooper
Block or Report

Block or report rgruener

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. uber/petastorm uber/petastorm Public

    Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch…

    Python 1.8k 281

  2. apache/arrow apache/arrow Public

    Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

    C++ 13.6k 3.3k

  3. spotify/scio spotify/scio Public

    A Scala API for Apache Beam and Google Cloud Dataflow.

    Scala 2.5k 509

  4. apache/beam apache/beam Public

    Apache Beam is a unified programming model for Batch and Streaming data processing.

    Java 7.6k 4.1k

  5. apache/parquet-mr apache/parquet-mr Public

    Apache Parquet

    Java 2.4k 1.4k

  6. spotify/simple-bigtable spotify/simple-bigtable Public

    Java 36 20