Skip to content
This repository has been archived by the owner on Jan 5, 2021. It is now read-only.
/ tomatopy Public archive

Web scraper for rotten tomatoes reviews and info

License

Notifications You must be signed in to change notification settings

sjmiller8182/tomatopy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TomatoPy - Rotten Tomatoes Scraper

GitHub

Easy web scraping for Rotten Tomatoes movie reviews and info.

Enjoy the recipe

Why Rotten Tomatoes?
Rotten Tomatoes is the world's most trusted source for entertainment recommendations. They provide users with indications of fresh or rotten by aggreation of reviews from critics. Learn more about Rotten Tomatoes.

Recent Versions

Initial Release 0.0.0

  • One line scraper
  • Critic review scraper
  • Main info scraper

Planned Updates for 0.1.0

  • Add code to get movie names from Wikipedia
  • Update secondary API functions to accept movie name instead of link
  • Check robots.txt for crawl rate in case of future addition

Planned Updates for 0.1.1

  • Type hints

Planned Updates for 0.2.0

  • Scrape user reviews
  • Scrape Tomatometer and audience score in main info.

Basic Usage

import tomatopy as rtp

# get main information and critic reivews for X2
main_info, reviews = rtp.scrape_movie_info('X2: X-Men United')

# just get main info
reivews = rtp.get_critic_reviews('https://www.rottentomatoes.com/m/x2_xmen_united')

# just get critic reviews
main_info = rtp.get_main_page_info('https://www.rottentomatoes.com/m/x2_xmen_united')

# change crawl rate to 1 second per request (default rate)
rtp.set_crawl_rate(1.0)

# get movie names from wikipedia [2008 in film] (https://en.wikipedia.org/wiki/2008_in_film)
names = rtp.scrape_movie_names(2008)

About

Web scraper for rotten tomatoes reviews and info

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages