Implementation of Parallel Coordinate Descent for L1-Regularized Loss Minimization (arXiv, code from the authors of the paper) in Spark.
The report explains in detail how to run the Spark implementation on a SLURM-based (slurm.schedmd.com) Linux cluster.
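For orientation, the L1-regularized least-squares (Lasso) problem can be solved one coordinate at a time with a soft-thresholding update; parallel coordinate descent applies the same kind of update to several coordinates at once. The following is a minimal sequential sketch in plain Python, not the Spark implementation and not the authors' code. The function names `soft_threshold` and `lasso_coordinate_descent` and the parameter `n_sweeps` are hypothetical, chosen only for illustration.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam (the proximal map of lam * |x|)."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_coordinate_descent(A, y, lam, n_sweeps=100):
    """Minimize 0.5 * ||A x - y||^2 + lam * ||x||_1 one coordinate at a time.

    A is a list of rows (plain Python lists); y is a list of targets.
    This is an illustrative sequential version, not a parallel one.
    """
    n, d = len(A), len(A[0])
    x = [0.0] * d
    residual = list(y)  # residual = y - A x, and x starts at zero
    # Squared norm of every column of A, computed once up front.
    col_sq = [sum(A[i][j] ** 2 for i in range(n)) for j in range(d)]
    for _ in range(n_sweeps):
        for j in range(d):
            if col_sq[j] == 0.0:
                continue  # an all-zero column cannot reduce the loss
            # Correlation of column j with the residual that excludes x[j].
            rho = sum(A[i][j] * residual[i] for i in range(n)) + col_sq[j] * x[j]
            new_xj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_xj - x[j]
            if delta != 0.0:
                # Keep the residual consistent with the updated coefficient.
                for i in range(n):
                    residual[i] -= A[i][j] * delta
                x[j] = new_xj
    return x
```

With a 2x2 identity matrix as `A`, targets `[3.0, 0.5]` and `lam = 1.0`, each coefficient is simply the soft-thresholded target, so the second coefficient is driven exactly to zero.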
- Apache Spark 2.0.1
- PySpark
- Python 2.7
- NumPy
- SciPy (for loading the binary MATLAB data file used by the paper)
Apart from a working installation of Matlab, there is no other dependency.
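As a rough illustration of why SciPy is listed, a binary MATLAB file can be read into NumPy arrays with `scipy.io.loadmat`. The sketch below writes a tiny stand-in file first so that it runs on its own; the file name `example.mat` and the variable names `A` and `y` are assumptions and do not reflect the layout of the paper's actual data file.

```python
import os
import tempfile

import numpy as np
from scipy.io import loadmat, savemat

# Write a tiny stand-in .mat file so the example is self-contained.
# "A" and "y" are hypothetical names, not those used in the paper's data.
path = os.path.join(tempfile.mkdtemp(), "example.mat")
savemat(path, {"A": np.eye(3), "y": np.array([1.0, 2.0, 3.0])})

# loadmat returns a dict mapping MATLAB variable names to NumPy arrays.
data = loadmat(path)
A = data["A"]
y = data["y"].ravel()  # vectors come back as 2-D arrays, so flatten them
```

Note that `loadmat` does not read MATLAB v7.3 files, which are HDF5-based and need a different reader.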
by: Kunal Ghosh and Jussi Ojala