Skip to content
This repository has been archived by the owner on Nov 8, 2022. It is now read-only.

Scrape IMDb for Actors Details and Perform Sentiment Analysis on their Tweets.

License

Notifications You must be signed in to change notification settings

maneeshd/Twitter-Sentimental-Analysis

Repository files navigation

Twitter-Sentimental-Analysis

Project Description

IMDB provides a list of celebrities born on the current date. Below is the link: http://m.imdb.com/feature/bornondate

Get the list of these celebrities from this webpage using web scraping (the ones that are displayed i.e top 10). You have to extract the below information:

  • Name of the celebrity
  • Celebrity Image
  • Profession
  • Best Work

Once you have this list, run a sentiment analysis on twitter for each celebrity and finally the output should be in the below format

  • Name of the celebrity:
  • Celebrity Image:
  • Profession:
  • Best Work:
  • Overall Sentiment on Twitter: Positive, Negative or Neutral

Tools Used/Requirements

  1. Python >= 3.6.1 (64-bit)

  2. Beautifulsoup4 - Python library for pulling data out of HTML and XML files.

  3. Tweepy - OpenSource Twitter API for Python.

  4. Selenium - The webdriver kit emulates a web-browser and executes JavaScripts to load the dynamic content.

  5. Textblob - Python library using nltk to find polarity of text/tweet.

  6. lxml - A fast html and xml parser for beautifulsoup4

  7. Mozilla Firefox - Web Browser to perform web scraping.

  8. Gecko Driver - Driver for Selenium to invoke Firefox.

  9. API Keys for Twitter has to be put in /data/twitter_api_keys.json (Refer sample_twitter_api_keys.json for format.)

Running the application

  1. Make sure you have all the requirements installed. See requirements.txt or run

    pip install -r requirements.txt --upgrade

  2. Make sure you have the latest version of Mozilla Firefox installed and latest version of geckodriver in utils folder.

Run the application using:

python App.py

Author

Maneesh D - maneeshd77@gmail.com

About

Scrape IMDb for Actors Details and Perform Sentiment Analysis on their Tweets.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages