Skip to content

Pulls TV season episode information for longest running TV series and stores in NoSQL database

Notifications You must be signed in to change notification settings

ErikaJacobs/TV-Series-NoSQL-Database

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 

Repository files navigation

A NoSQL Database of TV Show Episode Data

This project scrapes data from a Wikipedia table for longest running scripted U.S. primetime television series, and pulls episode information for those shows from a television API through TV Maze. This project then organizes this data with an appropriate key/value structure, and uploads the data to DynamoDB, an AWS NoSQL database.

Methods Used

  • Web Scraping
  • API Connection
  • Aggregation

Technologies Used

  • Python
  • DynamoDB
  • NoSQL

Packages Used

  • Requests
  • Boto3
  • bs4

How To Run

AWS Credentials

AWS Credentials will need to be saved locally in the .aws directory of an operating system in order for this project to successfully run. Click here to learn more about this process.

Install Requirements and Run

On the command line of your operating system, navigate to the repository directory (ideally using a Python virtual environment).

Run the following code on the command line to install requirements:

pip install -r requirements.txt 

Run the following code on the command line to run this project:

Python main.py

Featured Scripts or Deliverables

Other Repository Contents

Sources

About

Pulls TV season episode information for longest running TV series and stores in NoSQL database

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages