Toy example for STAT 545A Homework 6.
Shows how to create a small-but-automated analytical pipeline using a Makefile.
Demonstration data: the number of words spoken by various characters in the Lord of the Rings trilogy. Each observation gives total word count for a character in a specific scene of a single movie. Variables are character, race (hobbit vs. dwarf, etc.), film, chapter ("scene"), number of words spoken. JB will document general cleaning, analyses, etc. of that data here; these scripts are deliberately simple.
How to replicate my analysis
- (Clone the repo! Ha! OK I'm pretending the analyst doesn't use github.)
- Download into an empty directory:
- In a shell:
make all. Or just:
- New files you should see after running the pipeline:
stripplot_wordsByRace_FILM.png, where FILM is one of the 3 movies. Example:
- To remove the output and get a clean slate, in a shell:
Tip to learn from above: experiment with deleting various output files. Then run
make allto note which scripts are rerun.