

@sparklingpandas @SparkTC @high-performance-spark

Popular repositories

  1. spark-testing-base

    Base classes to use when writing tests with Spark

    Scala 479 127

  2. learning-spark-examples

    Examples for learning Spark

    Java 206 164

  3. elasticsearchspark

    Elastic Search on Spark

    Scala 98 39

  4. fastdataprocessingwithsparkexamples

    Examples for Fast Data Processing with Spark

    Scala 53 32

  5. spark-validator

    A library you can include in your Spark job to validate the counters and perform operations on success. The goal is Scala/Java/Python support.

    Scala 40 13

  6. chef-cookbook-spark

    A Chef cookbook for deploying Spark

    Ruby 30 34

586 contributions in the last year


Contribution activity

May 2017

Created a pull request in apache/spark that received 8 comments

[SPARK-20627][PYSPARK] Drop the hadoop distribution name from the Python version

What changes were proposed in this pull request? Drop the hadoop distribution name from the Python version (PEP440 -…

Created an issue in pypa/packaging-problems that received 5 comments

PyPI size limit for PySpark

PySpark is a little under 200 MB because it needs all of the Java parts of Spark with it to be usable. The Spark PMC publishes a pip installable pac…
