Skip to content

google app information ninja ,crawl google app information

License

Notifications You must be signed in to change notification settings

ycombanitor/gac4net

 
 

Repository files navigation

Gac4Net

Gac4Net is short of google app crawl for net.Simple scalable crawler for Apps data from the Play Store.

You don't have to input any of your Google Account credentials since this Crawler acts like a "Logged Out" user.

Exporting the Database

For the database this project used is mongodb(version 2.6 recommended,3.0 will be also supported),you can just install mongodb,and create the datebase

  • PlayStore, collections:ProcessedApps,QueuedApps
  • GOOGStore, collections:UserInfo

you do not need to create the table(the program will do it for you).

#what data crawled? Gac4Net crawl the below information

  • (1)The app detail information from Google app store
  • (2)Google+ User information who use google app,and the user app use history
  • (3)The user's Reviewers

these all information Guaranteed the info wholeness,these information will sufficient for research.In my machine,Gac4Net have crawled about 40M apps information ,20M Google + ,and 100M user Reviews.

Even though the program may down,you can start it again ,then it will work.

About me

My name is swinghu, i am a developer from China who study at WHU at the moment, Knowing C/C++,Java,C#,Liking backend developing.

Email: ogrecpp@gmail.com

Personal page:http://swinghu.github.com

What is this project about ?

The main idea of this project is to gather/mine data about apps of the Google Play Store and build a rich database so that developers, android fans and anyone else can use to generate statistics about the current play store situation

There are many questions we have no answer at the moment and we should be able to answer than with this database.

What do i need before i start?

  • Recommend you read all the pages of this wiki, which won`t take long.
  • Before ask,Read the code first ,Do Second!
  • Know C#, mongodb and visual studio.

The future work

Because this project only crawl the app from google play,it not include any other app store like apple,this is Insufficient. So the next step,Gac4Net will embrace apple store and any other local store(especial the store in China like 安卓市场 etc.).Suggestion is most welcome to me.

Refer to the Pages section of this wiki for individual information about each aspect of the project.

About

google app information ninja ,crawl google app information

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C# 78.6%
  • HTML 18.2%
  • PowerShell 3.2%