DVC Pipeline for Pokémon type classifier

This DVC pipeline trains a CNN to classify images of Pokémon. It will predict whether a Pokémon is of a predetermined type (default: water).

Note: due to the limited size of the dataset, the evaluation dataset is the same data set as the train+test. Take the results of the model with a grain of salt.

From Notebook to pipeline

This project details the transformation from Notebook to DVC pipeline. In the different branches, you can find three stages in this process:

snapshot-jupyter: a prototype as you might build it in a Jupyter Notebook
papermill-dvc: a DVC pipeline with a single stage to run a parameterized notebook using Papermill
dvc-pipeline: pure DVC pipeline with Python modules

Requirements

How to run

Create a new virtual environment with virtualenv -p python3 .venv
Activate the virtual environment with source .venv/bin/activate
Install the dependencies with pip install -r requirements.txt

Download the datasets from Kaggle into the data/external/ directory.

$ wget https://www.kaggle.com/datasets/robdewit/pokemon-images -o data/external/pokemon-gen-1-8
$ wget https://www.kaggle.com/datasets/rounakbanik/pokemon -o data/external/stats/pokemon-gen-1-8.csv

Run the pipeline with dvc repro or run an experiment with dvc exp run

Notes on hardware

The requirements specify tensorflow-macos and tensorflow-metal, which are the appropriate requirements when you are using a Mac with an M1 CPU or later. In case you are using a different system, you will need to replace these with tensorflow.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.dvc		.dvc
data		data
outputs		outputs
src		src
.dvcignore		.dvcignore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dvc.lock		dvc.lock
dvc.yaml		dvc.yaml
params.yaml		params.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DVC Pipeline for Pokémon type classifier

From Notebook to pipeline

Requirements

How to run

Notes on hardware

About

Languages

License

iterative/example-pokemon-classifier

Folders and files

Latest commit

History

Repository files navigation

DVC Pipeline for Pokémon type classifier

From Notebook to pipeline

Requirements

How to run

Notes on hardware

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages