Be notified of new releases
Create your free GitHub account today to subscribe to this repository for new releases and build software alongside 28 million developers.Sign up
Added features to sparksql so that it can be called like this:
val result = myDataFrame.Bin(100, -5, 5, myColumn)
and a Py4J-friendly interface so that it can be called from PySpark as well. Running SparkSQL Histogrammar in PySpark will actually call Histogrammar-Scala for better performance.