Pinned repositories

  1. ladydi

    Code Less, Build More. Clean, automated Feature Generation and Selection for Apache Spark!

    Scala 14 3

  2. peapod

    Dependency and data pipeline management framework for Spark and Scala

    Scala 13 3

  3. factotum

    a dot repository

    Vim script 1 1

  • Wikipedia Data Parsing in Spark and Scala

    Java 3 2 Apache-2.0 Updated Mar 27, 2018
  • A Python wrapper for LibFFM

    C++ 363 Updated Nov 21, 2017
  • ūnus - one from union

    Scala 2 3 MIT Updated Jul 24, 2017
  • Docker build for Zeppelin, a web-based Spark notebook

    Shell 131 Updated Jul 1, 2017
  • a dot repository

    Vim script 1 1 Updated Jun 6, 2017
  • Code Less, Build More. Clean, automated Feature Generation and Selection for Apache Spark!

    Scala 14 3 Updated Apr 8, 2017
  • Dependency and data pipeline management framework for Spark and Scala

    Scala 13 3 MIT Updated Apr 8, 2017
  • Scala 1 MIT Updated Apr 8, 2017
  • quill

    Forked from getquill/quill

    Compile-time Language Integrated Queries for Scala

    Scala 186 Apache-2.0 Updated Nov 18, 2016
  • spark

    Forked from apache/spark

    Mirror of Apache Spark

    Scala 16,974 Apache-2.0 Updated Jul 3, 2016
  • This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.

    Scala 35 Apache-2.0 Updated May 19, 2016
  • Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)

    Scala 14 Apache-2.0 Updated Apr 26, 2016
  • Airflow is a system to programmatically author, schedule and monitor data pipelines.

    Python 3,195 Updated Apr 24, 2016
  • nak

    Forked from scalanlp/nak

    The Nak Machine Learning Library

    Scala 82 Apache-2.0 Updated Mar 18, 2016
  • DBSCAN clustering algorithm on top of Apache Spark

    HTML 91 Apache-2.0 Updated Mar 18, 2016
  • Java based GraphViz HTTP Server

    Java 9 Updated Jan 18, 2016
  • Mirror of Apache Zeppelin (Incubating)

    Java 1,922 Apache-2.0 Updated Dec 10, 2015
  • Use Apache Spark straight from the Browser

    JavaScript 571 Apache-2.0 Updated Dec 10, 2015
  • Scripts used to setup a Spark cluster on EC2

    Shell 283 Apache-2.0 Updated Dec 1, 2015