Skip to content
@data-commons

data-commons

Collection of Open Source libraries that enable working with data at scale

Popular repositories Loading

  1. prep-buddy prep-buddy Public

    A Scala / Java / Python library for cleansing, transforming and preparing large datasets for ML operations on Apache Spark.

    Scala 8 7

  2. protectr protectr Public

    A Scala / Java / Python library for anonymization, encryption and redaction operations for large datasets on Apache Spark.

    Scala 2

  3. pyts pyts Public

    A library for stats module in python

    Python 1

  4. spark-timeseries spark-timeseries Public

    Forked from sryza/spark-timeseries

    A library for time series analysis on Apache Spark

    Scala

  5. data-commons.github.io data-commons.github.io Public

    HTML 1

  6. ApacheWombat ApacheWombat Public

    Forked from justinmclean/ApacheWombat

    Apache worked LICENSE and NOTICE example

    HTML

Repositories

Showing 10 of 11 repositories

Top languages

Loading…

Most used topics

Loading…