FB-Spider

A program which accepts the id of a Facebook page and transforms into a table of the latest 5 posts and the respective latest 5 comments per post. The table will be in .html format

The number of posts and comments can be changed by editing the graph.py file.

Get Started

(Back to top)

You would require a Facebook developer account to get an access token : https://developers.facebook.com/

Register your app and replace the 'YOUR_ACCESS_TOKEN' in graph.py by your User Token : https://developers.facebook.com/tools/accesstoken/.

Requirements

(Back to top)

Other than requiring Python3.x , you require the following libraries:

facepy
json
json2html
webbrowser
PyQt4
SIP

Instructions for the CLI Version

(Back to top)

Clone the repository to your machine.
Open your terminal and change directory to your cloned project folder.
$ pip3 install --editable . This will install all the requirements listed.

Now run the program $ python3 graph.py
Enter the page name of the page you want to scrape. You see a display of 5 choices and then select one out.

Please note that this app will only work for public pages and not from profiles of other people.
That would require permission from the user.

Enjoy the contents in a table format. A file called 'Table.html' would be made in the folder itself. You can open it and see the table.

Instructions for the GUI Version

(Back to top)

For the GUI Version, follow the steps

Clone the repository to your local machine, open the terminal and change directory to your cloned project folder.
Use a virtual environment virtualenv --python=python3.5 --no-site-packages venv
PyQt4 is not supported in Virtual Environment, so you need to setup PyQt manually in the virtualenv.
Two dependencies need to be downloaded,PyQt4 and SIP.
Download the binaries from the given website - (Choose the Source Packages)

PYQT4
SIP
In the project folder, make a folder named build, and extract the PyQt4 and SIP, in the build folder.
Activate virtualenv using . venv/bin/activate
Install some prerequisites
- sudo apt-get install python2.7-dev libxext-dev python-qt4 qt4-dev-tools build-essential
Go to build/SIP directory, and type the following commands: python configure.py sudo make sudo make install
Similarily for PyQt4, go to the build/PYQT4 directory and type the commands: python configure.py sudo make sudo make install
After the installation, run the following file python3 main.py
A GUI will open, enter your Access Code, and press Go.
Enter the page you want to search, and at last the choice of page number.
Press Generate HTML, to generate the HTML File.

Thanks to the Youtube Tutorial for preparing the instruction manual: How to Install PyQt4 Inside Virtualenv

Contribute

(Back to top)

You can contribute to the repo via this really-simple steps:

Star and fork this repo.
Join the (slack channel) . Discuss your ideas on how to improve the project with members or ask your doubts regarding the project.
Clone your repo.
Tackle down the issues or add your own innovations.
Pull Request

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
data		data
README.md		README.md
experiment.py		experiment.py
graph.py		graph.py
main.py		main.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FB-Spider

Table of contents

Get Started

Requirements

Instructions for the CLI Version

Instructions for the GUI Version

Contribute

About

Releases

Packages

Languages

dibyasonu/FB-Spider

Folders and files

Latest commit

History

Repository files navigation

FB-Spider

Table of contents

Get Started

Requirements

Instructions for the CLI Version

Instructions for the GUI Version

Contribute

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages