pyspark
Here are 103 public repositories matching this topic...
Use spark to analyze user churn behaviour data from music app company as they move from paid and free tier services or cancel their subscription all together. The dataset contains two months of user activity logs.
-
Updated
Jan 8, 2020 - HTML
Binary classification project in PySpark on an AWS-EMR cluster to predict customer churn.
-
Updated
Jun 6, 2022 - HTML
Citibike data analysis for NYC using Hadoop MapReduce/Hive and Spark. Data visualization with Tableau.
-
Updated
Oct 31, 2022 - HTML
data enginerring project - visualize visa numbers by country, time issued from japan
-
Updated
Nov 22, 2023 - HTML
This repo contains scripts used for game analysis and recommendation for the project GamerHood
-
Updated
Aug 16, 2022 - HTML
From image to text - a handwriting recognition tool prototype using Image Classification - Deep Learning in DataBricks.
-
Updated
Feb 1, 2024 - HTML
•Achieved real-time analysis of over 10,000 Ethereum transactions scraped using Selenium analyzed with PySpark while also reducing the cost to store the massive data in a database by storing only the analytics instead of all the data stored as a container in Docker which can be pulled on any machine as a local image and run the service.
-
Updated
May 21, 2023 - HTML
-
Updated
Nov 10, 2019 - HTML
Wind energy prediction employing PySpark in Databricks.
-
Updated
Jul 4, 2022 - HTML
Distributed ML: Predicting Churn from Click Data with Apache Spark
-
Updated
Oct 25, 2019 - HTML
Use Machine Learning (NLP Transformers model) to identify negative-sentiment tweets about JWST space mission
-
Updated
Nov 22, 2022 - HTML
An assignment on preprocessing of text including tokenization, stop word removal
-
Updated
May 1, 2022 - HTML
An assignment on preprocessing of text including tokenization, stop word removal, noise reduction, and stemming
-
Updated
May 6, 2022 - HTML
Improve this page
Add a description, image, and links to the pyspark topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pyspark topic, visit your repo's landing page and select "manage topics."