ScriptSage

ScriptSage is a tool designed to parse and evaluate movie scripts. It provides functionalities to scrape screenplay data, structure it into a JSON format, and visualize character interactions and dialogue distributions.

Features

Screenplay Scraping: Extract screenplay content from the web.
Data Structuring: Convert screenplay content into structured JSON format.
Visualization: Generate visualizations for dialogue distribution and character interactions.

Example Output Conversation Heatmap

Installation

Ensure you have Python 3.11 installed. You can install the required dependencies using Poetry:

poetry install

Usage

Command Line Interface (CLI)

ScriptSage provides a command line interface to scrape, parse, and visualize movie scripts.

To use the CLI, run the following command:

python scriptsage_cli.py <script_url> [--metric]

The --metric flag is optional. When included, it will display additional metrics about the screenplay.

For example, to scrape the screenplay of "Reservoir Dogs" and save it as a structured JSON file:

python scriptsage_cli.py https://imsdb.com/scripts/Reservoir-Dogs.html

To scrape the screenplay and display additional metrics:

python scriptsage_cli.py https://imsdb.com/scripts/Reservoir-Dogs.html --metric

This will perform the following actions:

Scrape the screenplay content from the provided URL.
Parse the screenplay content to extract scenes, characters, and dialogue interactions.
Save the structured data as a JSON file in ~/.scriptsage/screenplays/.
Generate and save visualizations for dialogue distribution and character interactions in ~/.scriptsage/viz/.
If the --metric flag is used, display additional metrics such as total word count, scene count, character count, and top words used in the screenplay.

Scraping Screenplay

To scrape the screenplay of "Reservoir Dogs" and save it as a structured JSON file:

startLine: 1
endLine: 14

Parsing Screenplay

To parse the screenplay content and save it to a JSON file:

startLine: 76
endLine: 83

Generating Visualizations

To generate visualizations for dialogue distribution and character interactions:

startLine: 1
endLine: 52

Project Structure

scriptsage/helpers/scraper.py: Contains the code to scrape screenplay content from the web.
scriptsage/helpers/parse-dialogues.py: Contains the code to parse the screenplay content and save it as a structured JSON file.
scriptsage/helpers/generate-viz.py: Contains the code to generate visualizations for dialogue distribution and character interactions.
scriptsage/helpers/Reservoir-Dogs-structured.json: Example of a structured JSON file generated from the screenplay.
scriptsage/helpers/Reservoir-Dogs.html: Example of the raw HTML content of the screenplay.

Dependencies

The project uses the following dependencies:

requests: For making HTTP requests to fetch screenplay content.
beautifulsoup4: For parsing HTML content.
pandas: For data manipulation and analysis.
matplotlib: For creating visualizations.
seaborn: For creating statistical visualizations.
numpy: For numerical operations.
json: For handling JSON data.
poetry: For dependency management and packaging.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
scripts		scripts
scriptsage		scriptsage
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ScriptSage

Features

Example Output Conversation Heatmap

Installation

Usage

Command Line Interface (CLI)

Scraping Screenplay

Parsing Screenplay

Generating Visualizations

Project Structure

Dependencies

License

About

Uh oh!

Releases

Packages

Languages

james-see/scriptsage

Folders and files

Latest commit

History

Repository files navigation

ScriptSage

Features

Example Output Conversation Heatmap

Installation

Usage

Command Line Interface (CLI)

Scraping Screenplay

Parsing Screenplay

Generating Visualizations

Project Structure

Dependencies

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages