-
Notifications
You must be signed in to change notification settings - Fork 10
implementations of CX, PCA, and NMF factorizations in Spark and MPI
License
alexgittens/SparkAndMPIFactorizations
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
A collection of code for computing truncated PCAs (Spark and C/MPI), Nonnegative Matrix Factorization (Spark and C/MPI), and randomized CX (Spark), collated from the separate code-bases used to compile the experimental results in "Matrix Factorization at Scale: a Comparison of Scientific Data Analytics in Spark and C+MPI Using Three Case Studies" by Alex Gittens, Aditya Devarakonda, Evan Racah, Michael Ringenburg, Lisa Gerhardt, Jey Kottaalam, et al. (technical report available at https://arxiv.org/abs/1607.01335) Originally, this code was for timing and producing output for specific scientific problems, so there are places where extra code not relevant to general purpose use is present and places where code that would be relevant for general purpose use (e.g., storing the C and X from the CX decomposition) is missing. These issues are being worked on. One specific issue worth noting is that these codes were written to compile on the Cori NERSC System, so some of the compilation procedures will need to be changed for your system. Authors of the code: Alex Gittens (corresponding author: gittens@alumni.caltech.edu) Aditya Devarakonda Jey Kottalam
About
implementations of CX, PCA, and NMF factorizations in Spark and MPI
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published