- Create a GitHub repo
- Clone the GitHub repo
- Run
dvc init
- Download the MNIST dataset from here
- Reduce the MNIST dataset to half its size (link)
- Place the MNIST dataset at
data/train.csv
- Run
dvc add data/train.csv
- Run
dvc remote add -d my_storage /tmp/dvc-test
- Run
git add .
- Run
git commit -m "initial commit"
- Run
dvc push
- Create
params.yaml
- Create
src/
- Create
src/split.py
- Run
dvc run -n split -p split.ratio -d src/split.py -d data/train.csv -o data/prepared python src/split.py data/train.csv
- Create
src/preprocessing.py
- Run
dvc run -n preprocess -d src/preprocessing.py -d data/prepared -o data/preprocessed python src/preprocessing.py data/prepared
- Create
src/train.py
- Run
dvc run -n train -d src/train.py -d data/preprocessed -o data/model python src/train.py data/preprocessed data/model
- Create
src/evaluate.py
- Run `dvc run -n evaluate -d src/evaluate.py -d data/model
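
The checklist above does not show how the dataset is halved or what `src/split.py` contains, so here is a minimal standard-library sketch of both steps. Everything in it is an assumption for illustration: the function names (`read_rows`, `write_rows`, `halve_csv`, `split_csv`) are hypothetical, "half" is taken to mean the first half of the rows, `ratio` is taken to be the fraction of rows held out for testing, and the output file names `train.csv` and `test.csv` are made up. The real script presumably reads `split.ratio` from `params.yaml`; that part is left out here to avoid a YAML dependency.

```python
import csv
import os
import random


def read_rows(path):
    """Return (header, rows) from a CSV file whose first line is a header."""
    with open(path, newline="") as f:
        reader = csv.reader(f)
        header = next(reader)
        rows = list(reader)
    return header, rows


def write_rows(path, header, rows):
    """Write a header and rows to a CSV file, creating parent directories."""
    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)


def halve_csv(src, dst):
    """Copy src to dst, keeping only the first half of the data rows.

    Assumption: "half" means the first len(rows) // 2 rows.
    """
    header, rows = read_rows(src)
    write_rows(dst, header, rows[: len(rows) // 2])


def split_csv(src, out_dir, ratio, seed=0):
    """Shuffle the rows of src and write train.csv / test.csv into out_dir.

    Assumption: `ratio` is the fraction of rows written to test.csv.
    Returns (number of train rows, number of test rows).
    """
    header, rows = read_rows(src)
    random.Random(seed).shuffle(rows)  # fixed seed keeps the split reproducible
    n_test = int(len(rows) * ratio)
    write_rows(os.path.join(out_dir, "test.csv"), header, rows[:n_test])
    write_rows(os.path.join(out_dir, "train.csv"), header, rows[n_test:])
    return len(rows) - n_test, n_test
```

In a script shaped like the `split` stage, `split_csv(sys.argv[1], "data/prepared", ratio)` would be called with `ratio` loaded from `params.yaml`. The fixed seed matters because DVC caches stage outputs: a non-deterministic split would produce different `data/prepared` contents on every re-run.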