She-Ra and the Princesses of Power

Text analysis of scripts from the recent reboot of She-Ra and the Princesses of Power

Background Information

She-Ra and the Princesses of Power is a Netflix reboot of the 80's cartoon, She-Ra: Princess of Power. Over the past few years, 52 episodes have come out and the series' stories have concluded. The series is known for its variety of representation, both causual and central to the story, and themes of friendship, abandonment, colonialism, and queer relationships. As a fan of the show and data scientist, I wanted to look deeper into the dailogue of the show.

Data Source

I gathered transcripts from a She-Ra fan-made wiki page. Since this source is dependent solely on fan contributions, there may be some mistakes and many episodes do not have complete scripts. I will be updating this repo and my data analysis as more scripts become available to me. For now, we can consider the results a reflection of mainly seasons 1-3.

Link to Transcripts:https://she-raandtheprincessesofpower.fandom.com/wiki/Category:Transcripts

Below is a list of episodes for which I did not obtain a completed script:

Season 3: Episode 2
Season 4: Episode 2-5, 9, 10, 12, 13
Season 5: Episode 1-4, 7-12

Research Questions:

Who are the main or most active characters of this show? Does this change by season?
What are the most common words used by characters in this show?
What are the most "important" words spoken by certain characters on this show?
What are some relationships among words on this show?
Sentiment Analysis by Character and Season
Data Modeling: Can we predict which lines were spoken by Adora? or Catra?

Files in this Repo:

README.md: Markdown file discussing details of this project.
txt_scripts/: path to txt files of the show's scripts.
img_files/: path to image files I used.
tv_script_to_csv.py: Python file used to convert the txt files into one csv file.
she_ra_dialogue.csv: CSV file with each line of dialogue and who said it during which episode.
She_Ra_Project.ipynb: Jupyter Notebook holding my text analysis work and completmentary data visualization.
Entrapta_Wordclouds.ipynb: Jupyter Notebook containing code to create some wordclouds using the character Entrapta's lines. Kept separate from the She_Ra notebooks as I used this work for a fan competition and not for the purpose of answering my research questions.

While I will be uploading updates to my Jupyter Notebook on GitHub, the latest version will be on Google Colab.

References

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

She-Ra and the Princesses of Power

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
img_files		img_files
txt_scripts		txt_scripts
Bow_WordCloud.ipynb		Bow_WordCloud.ipynb
Entrapta_Wordclouds.ipynb		Entrapta_Wordclouds.ipynb
README.md		README.md
She_Ra_Project.ipynb		She_Ra_Project.ipynb
she_ra_dialogue.csv		she_ra_dialogue.csv
tv_script_to_csv.py		tv_script_to_csv.py

prishanmu/She-Ra

Folders and files

Latest commit

History

Repository files navigation

She-Ra and the Princesses of Power

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages