Corpus analysis using SpongeBob transcript to analyze each character.
-
Reference Transcript: http://spongebob.wikia.com/wiki/List_of_transcripts
-
Using AntConc in this project.
- ListCrawling.py: crawl the list of the files.
- SampleTranscriptCrawling.py: crawl the sample transcript. (s1e1. Help_Wanted)
- TranscriptCrawling.py: crawl the actual transcript.
- raw transcripts
- categrized into speakers.txt from raw_data
- keyword result from antconc
- ~script.txt is the script file for each character for testing the score_app.
- score_app.py: speaker recognizing program
- score_app_tester.py: testing the score_app.py
- using keyness formula, I calculated keyword for Noun, Pronoun, Verb, Adverb, Adjective
- .net: network files
- .txt-info.txt: information
- working server program ~ created with node.js server (nodejs.org)
- npm install: install packages
- npm start: for running the server