GitHub - ttimbers/data-analysis-pipeline-make-practice

Building a Data Analysis pipeline tutorial

This example data analysis project counts how often each word occurs in four novels and presents the 10 most frequent words in each book in a report.
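As a rough illustration of the kind of word counting described above, here is a minimal sketch using only the Python standard library. The function name `top_words`, the tokenizing rule, and the sample text are assumptions made for this example; the scripts in this repo may count words differently.

```python
import re
from collections import Counter


def top_words(text, n=10):
    """Return the n most frequent words in text as (word, count) pairs."""
    # Assumption: a "word" is a run of letters or apostrophes, case-insensitive.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words).most_common(n)


# Hypothetical sample text standing in for a novel.
sample = "the cat and the hat and the bat"
print(top_words(sample, n=2))  # [('the', 3), ('and', 2)]
```

Calling `top_words(text)` with the default `n=10` would give the ten most frequent words for one book.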

Current usage:

Set-up (first time only)

Clone this repo and, using the command line, navigate to the root of this project.
Run the following command to create the conda environment:

conda env create -f environment.yml

Run the analysis

Activate the conda environment:

conda activate data-analysis-practice

Run the analysis:

bash runall.sh

Exercise:

Your task is to add a "smarter" data analysis pipeline using GNU Make! It should accomplish the same task as bash runall.sh when you type make all.

It should reset the analysis to the starting point (the state when you first copied this repo) when you type make clean.
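One way to read "smarter": unlike a script that reruns every step, Make reruns a step only when its output is missing or older than one of its inputs. The sketch below mimics that basic timestamp check in Python. It is a simplification for illustration, not how GNU Make is implemented, and the file names `book.txt` and `book.dat` are hypothetical.

```python
import os
import tempfile


def needs_rebuild(target, prerequisites):
    """Return True if target is missing or older than any prerequisite."""
    if not os.path.exists(target):
        return True
    target_mtime = os.path.getmtime(target)
    return any(os.path.getmtime(p) > target_mtime for p in prerequisites)


with tempfile.TemporaryDirectory() as tmp:
    source = os.path.join(tmp, "book.txt")  # hypothetical input file
    output = os.path.join(tmp, "book.dat")  # hypothetical result file
    open(source, "w").close()

    # The result does not exist yet, so the step must run.
    print(needs_rebuild(output, [source]))  # True

    # The result is newer than its input, so the step can be skipped.
    open(output, "w").close()
    os.utime(source, (100, 100))
    os.utime(output, (200, 200))
    print(needs_rebuild(output, [source]))  # False

    # The input changed after the result was made, so the step must rerun.
    os.utime(source, (300, 300))
    print(needs_rebuild(output, [source]))  # True
```

In a Makefile, each rule states a target, its prerequisites, and the command that builds it, and Make applies a check along these lines to every rule that make all depends on.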

Dependencies

GNU Make
Quarto
Python & Python libraries:
- click
- matplotlib
- pandas

