Skip to content

danaelshrbiny10/pyspark-playstore

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Google Playstore Dataset Analysis

We have a Google Play Store dataset containing information about different apps, including installed ratings, versions, and other details. Let's perform some analysis based on the data we have.

Top 10 Reviews Given to the Apps

  • We will identify the top 10 reviews given to the apps.

Top 10 Installed Apps and Distribution of Type [Free/Paid]

  • We will determine the top 10 installed apps and analyze the distribution of app types (free/paid).

Category-wise Distribution of Installed Apps

  • We will examine the distribution of installed apps across different categories.

Top Paid Apps

  • We will identify the top paid apps based on their installation numbers.

Top Paid Rating Apps

  • We will determine the top paid apps with the highest ratings.

Quick Start

Setup

  • Git clone the repository on your local machine or download it.
  • Install requirements using pip install -r requirements/base.txt.

Usage

  • You can utilize the dataset located at data folder.
  • You can utilize the database located at db folder.
  • You can utilize the powerpoint located at ppt folder.

License Information

This project is licensed under the MIT License. For more details, see the LICENSE file.