-
Notifications
You must be signed in to change notification settings - Fork 0
Home
jianhuashao edited this page Sep 17, 2012
·
5 revisions
Welcome to the AndroidAppsCollector wiki!
Scraping information of app in Android Market. It is part of JianhuaShao's PhD research data collection (Valuation of User Data).
- cate_read_**.py: try to get a list of app_id.
- app_read_**.py: try to get detail for each app with app_id.
- db_**.py: sqlite3 database file for the list of app.
#CMD:
1. python cate_read_**.py
2. python app_read_**.py ## needs to merge the db first
3. python main.py
- https://play.google.com/store/apps: it provides a list of top 504 apps in each category. However, given a app_id, it would tell you the detail of each app.
- http://www.androidzoom.com: it provides a nearly complete list of apps according to each category. The app_id needs to find out when turning app page in its domain. Some app_id are out of dates.
- http://www.androlib.com: it provides a nearly complete list of apps according to each category. It also specify the app distribution according to language. I needs to turn into app page to find out app_id. Some app_id are out of dates.
- play.google.com: 1
- www.androidzoom.com: 10
- www.andrlib.com: 1
- www.youtube.com: 1
- plusone.google.com: 1
- play.google.com/getreview: 10
- customer review for each app can only get from ajax to google.com.
- it is easy to delay the response from google server. So it is experience to set timeout=10 for httplib.