Skip to content

NGC4258/SCI-9

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SCI-9

This is a crawler to crawling public data from Facebook

And this code will not be updated in the future

Notice of this crawler is used multiprocess to get data

So please take care of your operation, not my fault

This code is very ugly, I give suggestion to "Do Not Use"

How to use ?

  1. Prepare the OS enviroment, last test was in Ubuntu Xenial

This is common sense, you can choose which one that you love

  1. Prepare your account of Facebook development to use GraphAPI as following:
    https://developers.facebook.com/

  2. Prepare stored place, only support PostgreSQL https://www.postgresql.org/download/linux/ubuntu/

  3. Install packages, the example is Ubuntu Xenial as following:

sudo apt install git python-pip libffi-dev libssl-dev

  1. Install python libraries as following:

sudo pip install python-dateutil psycopg2 'requests[security]' elasticsearch

  1. Download SCI-9 to your machine:

git clone https://github.com/NGC4258/SCI-9.git

  1. Change ini file to ingratiate your environment:

vi SCI-9/sci-9.ini

  1. Insert group ID what you want to fetching:

vi SCI-9/groups

  1. Enjoy!

python SCI-9

About

This is a Facebook crawler

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages