Scrapy Movie Subtitles Scraper

Description

The Scrapy Movie Subtitles Scraper is a Python-based project using Scrapy, a popular web crawling and web scraping framework. The project is designed to extract movie data, including titles, plots, and scripts, from the website subslikescript.com. The extracted data can be stored in either a MongoDB Atlas database or a SQLite database, showcasing the data dumping capabilities of the project.

Installation

Clone this repository: git clone https://github.com/zararashraf/ScrapyMoviesSubtitlesScraper.git
Install the required libraries: pip install scrapy pymongo
Configure the MongoDB connection string and the SQLite database settings in the pipelines.

Usage

Run the spider by executing the command: scrapy crawl transcripts
Check the output data in the configured MongoDB Atlas or SQLite database.

Project Structure

The project consists of the following key components:

spiders/transcripts.py: Contains the Scrapy spider for scraping movie data from subslikescript.com.
pipelines.py: Includes two pipelines for dumping the scraped data into a MongoDB Atlas database and a SQLite database.
requirements.txt: Lists all the required dependencies for the project.

Images

The Website in question.

Data in SQLite DB

Data in MongoDB Atlas

Libraries and Technologies Used

Python 3.x
Scrapy for web scraping
pymongo for interacting with MongoDB
sqlite3 for working with SQLite databases

Code Repository

The source code for this project can be found on GitHub.

Credits

Scrapy: https://scrapy.org/
pymongo: https://pypi.org/project/pymongo/
SQLite: https://www.sqlite.org/index.html

License

This project is open-source and available under the MIT License. Feel free to use, modify, and distribute the code as needed.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
subslikescript		subslikescript
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrapy Movie Subtitles Scraper

Description

Installation

Usage

Project Structure

Images

Libraries and Technologies Used

Code Repository

Credits

License

About

Releases

Packages

Languages

License

zararashraf/ScrapyMoviesSubtitlesScraper

Folders and files

Latest commit

History

Repository files navigation

Scrapy Movie Subtitles Scraper

Description

Installation

Usage

Project Structure

Images

Libraries and Technologies Used

Code Repository

Credits

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages