Online social media research and computational journalism project by the Journalism and Media Studies Centre at the University of Hong Kong. Led by Dr. King-wa Fu (Principal Investigator). Developer until August 2012: Cedric Sam
facebook_pull.sql occasional commit Jul 30, 2012
sinaweibo.sql added support for getting reposts in orig posts Jan 10, 2013


Code repository for the computational journalism project (Social) of the Journalism and Media Studies Centre at HKU

We developed tools to pull data from Sina Weibo, Twitter and Facebook, as well as blogs and HK forums.

Since the second half of 2011, the Sina Weibo tools have been Python scripts that use the library provided by Sina (weibopy) to access their API through OAuth2. Companion scripts perform various pull operations from Twitter and Facebook. The OAuth token and secret, as well as the Facebook ID, should be set in your own version of the settings file. Usage information is generally available when running a script. A separate script is used for searching (a way of discovering new content by keyword search).
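As an illustration of keeping the OAuth token and secret and the Facebook ID outside the scripts, here is a minimal sketch that reads them from a local INI file. The file layout, the section name `credentials`, the key names and the function `load_credentials` are all assumptions made for this example; they are not the names used in this repository.

```python
import configparser

# Hypothetical key names; the real settings file may use different ones.
REQUIRED_KEYS = ("oauth_token", "oauth_secret", "facebook_id")


def load_credentials(path, section="credentials"):
    """Read API credentials from an INI file and fail early if any are missing."""
    # Interpolation is disabled so that a '%' inside a token is kept literally.
    parser = configparser.ConfigParser(interpolation=None)
    if not parser.read(path):
        raise IOError("credentials file not found: %s" % path)
    if not parser.has_section(section):
        raise KeyError("missing section [%s] in %s" % (section, path))
    creds = dict(parser.items(section))
    missing = [key for key in REQUIRED_KEYS if not creds.get(key)]
    if missing:
        raise KeyError("missing credential(s): " + ", ".join(missing))
    return creds
```

A script could then call `load_credentials("my_settings.ini")` once at startup and pass the resulting dictionary to whichever API client it uses. Keeping this file out of version control avoids publishing the secrets.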

twitter_pull.sql and facebook_pull.sql create the PostgreSQL databases needed to store the data collected by these scripts.
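One possible way to load one of these schema files is through the standard PostgreSQL command-line tools `createdb` and `psql`. The sketch below assumes that both tools are on the PATH, that the current user may create databases, and that the .sql file contains definitions to be loaded into an already created database. The database name and the helper functions are hypothetical.

```python
import subprocess


def build_schema_commands(database, schema_file, create=True):
    """Return the command lines that create a database and load a schema file."""
    commands = []
    if create:
        # createdb is the standard PostgreSQL wrapper around CREATE DATABASE.
        commands.append(["createdb", database])
    # ON_ERROR_STOP makes psql abort on the first failing statement.
    commands.append(
        ["psql", "-d", database, "-v", "ON_ERROR_STOP=1", "-f", schema_file]
    )
    return commands


def apply_schema(database, schema_file, create=True):
    """Run the commands in order, raising CalledProcessError on failure."""
    for command in build_schema_commands(database, schema_file, create):
        subprocess.check_call(command)
```

For example, `apply_schema("social_twitter", "twitter_pull.sql")` would create a database with the assumed name `social_twitter` and load the Twitter schema into it. Pass `create=False` if the database already exists.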

In late November 2011, we added scripts to fetch data from the Google+ and QQ Weibo APIs. The database schemas are available in their respective sub-directories.

These scripts were developed under Linux (Ubuntu 10.04 Lucid). I used Python 2.6.5 and PostgreSQL 8.4 (with PostGIS 1.5).