Skip to content

gauthamp10/Google-Playstore-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Google-Playstore-Dataset -(Backup repo to store off Kaggle)

Google PlayStore App analytics. (2.3 Million App Data) and 24 attributes.

I've collected the data with the help of Python and Scrapy running on a cloud vm.The data was collected on June 2021.

For latest dataset updates. Download from Kaggle: https://www.kaggle.com/gauthamp10/google-playstore-apps/


Instructions to combine the datasets

git clone https://github.com/gauthamp10/Google-Playstore-Dataset.git

cd Google-Playstore-Dataset/dataset/

for f in *.tar.gz; do tar -xvf "$f"; done

cat Part?.csv > Googple-Playstore-Dataset.csv

Also checkout the Android App Permission dataset: https://github.com/gauthamp10/android-permissions-dataset


Author

Gautham Prakash

My other projects: github.com/gauthamp10

Website: gauthamp10.github.io

Blog: gauthamp10/blog

License

This project is licensed under the MIT License - see the LICENSE file for details