spark-sql
Here are 24 public repositories matching this topic...
Created a SparkML RandomForest model to predict total employee compensation. Queried data with SparkSQL, ran PySpark scripts to run EDA, pre-process data, and train model achieving with 0.98 R2 score.
-
Updated
Feb 4, 2024 - HTML
Apache Spark™ and Scala Workshops
-
Updated
Jan 8, 2023 - HTML
-
Updated
Dec 8, 2022 - HTML
-
Updated
Dec 8, 2022 - HTML
Music prediction using PySpark
-
Updated
Dec 5, 2022 - HTML
Explanatory Data Analysis and ML model building using Apache Spark and PySpark
-
Updated
Oct 12, 2022 - HTML
End-to-end real-time credit card transactions application. Made with Kafka, Spark, Bootstrap, ECharts, RxJS.
-
Updated
Jun 18, 2022 - HTML
This project will show an auto-updated map with the people interaction during COVID19 in the US using big data technologies to analysis a real-time stream of Twitter data.
-
Updated
May 20, 2022 - HTML
Ralph Winters Website
-
Updated
Feb 15, 2022 - HTML
Analysis for a streaming daily retail data using Spark structured streaming and querying this data to get insights
-
Updated
Jan 12, 2022 - HTML
Capstone Project in the Udacity Data Scientist Nanodegree program. We manipulate large and realistic datasets with Spark to engineer relevant features for predicting churn. We'll learn how to use Spark MLlib to build machine learning models with large datasets, far beyond what could be done with non-distributed technologies like scikit-learn.
-
Updated
Jul 29, 2021 - HTML
-
Updated
Jan 12, 2021 - HTML
Big Data Analytics for anazon.com using Spark Framework and Scala Programming Language
-
Updated
Aug 7, 2019 - HTML
An investigatory analysis of restaurant sales data using Apache Spark in an attempt to give some insights as to how to boost up the sales of less frequently sold items. This is a real-world dataset from an actual restaurant.
-
Updated
May 11, 2018 - HTML
Analysis of a dataset of flights, using the SparkSQL framework and extra web scraping techniques
-
Updated
May 7, 2018 - HTML
Recommendation System written in Python, using the pySpark framework and other Data Science libraries
-
Updated
May 7, 2018 - HTML
Improve this page
Add a description, image, and links to the spark-sql topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the spark-sql topic, visit your repo's landing page and select "manage topics."