SCRiBa

A SCRIpt-BAsed recommender system for movies. SCRiBa computes the similarity between movies according to their scripts. Each script is approximated with the english subtitles available on OpenSubtitles and downloadable without restrictions with Tor.

Please refer to the report and to the notes for additional details on the algorithm, the pre-processing steps and the evaluation on Netflix data.

Information

Status: Completed

Type: Academic project

Course: Data Mining

Development year(s): 2015-2016

Author(s): gcorsi, ShadowTemplate

Getting Started

Each script is required to complete one pre-processing step. Please refer to the project report to get information about the pipeline.

Prerequisites

Clone the repository and install the required Python dependencies:

$ git clone https://github.com/ShadowTemplate/scriba.git
$ cd scriba/
$ pip install --user -r requirements.txt

Download the datasets.

Building tools

Python 3.4 - Programming language
Python 2.7 - Programming language
scikit-learn - TF-IDF features extraction, linear kernel
stem - Anonymous and parallel download with Tor
Beautiful Soup - Web page scraping

Contributing

This project is not actively maintained and issues or pull requests may be ignored.

License

This project is licensed under the GNU GPLv3 license. Please refer to the LICENSE.md file for details.

This README.md complies with this project template. Feel free to adopt it and reuse it.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
2.7		2.7
LICENSE.md		LICENSE.md
README.md		README.md
constants.py		constants.py
imdb.py		imdb.py
imdbtoos.py		imdbtoos.py
notes.txt		notes.txt
opensubtitles.py		opensubtitles.py
recsys.py		recsys.py
report.pdf		report.pdf
requirements.txt		requirements.txt
subs_cleaner.py		subs_cleaner.py
subs_downloader.py		subs_downloader.py
test.py		test.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SCRiBa

Information

Getting Started

Prerequisites

Building tools

Contributing

License

About

Releases

Packages

Contributors 3

Languages

License

ShadowTemplate/scriba

Folders and files

Latest commit

History

Repository files navigation

SCRiBa

Information

Getting Started

Prerequisites

Building tools

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages