fbscraper

Scraping posts, comments and replies from Facebook.

Installing

$ git clone https://github.com/utkarsh512/fbscraper.git
$ cd fbscraper
$ pip install . -r requirements.txt

How to use

Creating a session

Create a Session object for scraping:

from fbscraper import Session
sess = Session(
    credentials=(EMAIL, 
                 PASSWORD), 
    chromeDriverPath="chromedriver"
)

where (EMAIL, PASSWORD) are your facebook credentials and chromeDriverPath is the path to the chromedriver.

Fetching post URLs from the public pages

Then, you can extract recent post URLs of a public pages as

sess.getPage("nytimes")
sess.scroll(10)
postURLs = sess.getPostURLs()

Scraping posts using the fetched URLs

As you now have the list of URLs for the required posts, post data (including comments) can be scraped as

sess.getPost(
    postURL="https://mbasic.facebook.com/story.php?...",
    dump="posts.pkl",
    getComments=True,
    getReplies=True,
    nComments=1000,
    nReplies=10
)

where

postURL is the URL of the post
dump is the name of binary file used for dumping the post data
getComments should be True if you want to scrape comments to the post as well
getReplies should be True if you want to scrape replies to the comments as well
nComments is the upper-bound on number of comments per post
nReplies is the upper-bound on number of replies per comment

Note: Just make sure postURL starts with https://mbasic.facebook.com instead of https://www.facebook.com, https://mobile.facebook.com, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
data		data
fbscraper		fbscraper
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fbscraper

Installing

How to use

Creating a session

Fetching post URLs from the public pages

Scraping posts using the fetched URLs

About

Releases

Packages

Languages

License

utkarsh512/fbscraper

Folders and files

Latest commit

History

Repository files navigation

fbscraper

Installing

How to use

Creating a session

Fetching post URLs from the public pages

Scraping posts using the fetched URLs

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages