Apache Spark project for Advanced Topics on Databases course
-
Updated
Mar 19, 2021 - Python
Apache Spark project for Advanced Topics on Databases course
An Apache Spark application to analyze word frequencies and compute TF-IDF weights across multiple text file sets using Spark's MLlib library.
Add a description, image, and links to the apachespark-rdd topic page so that developers can more easily learn about it.
To associate your repository with the apachespark-rdd topic, visit your repo's landing page and select "manage topics."