Model evaluation workflow designed for WildObs Image Management Platform https://wildobs.org.au/

Description

This script can be used for benchmarking an ai species recognition model with a local dataset to get an independent assessment of Recall, Precision and F1 Score for a given location
The purpose is to evaluate the most suitable or best performing model for a given location and inform an appropriate validation workflow to ensure accuracy requirements are met

Setup instructions:

Prepare the testing dataset. Start by organising camera trap images into folders by species on your local computer. The quality of the testing dataset will determine the accuracy of the report generated. Here are a few tips:

Use a representive number of images of each species e.g. at least 1000 if possible
Avoid using images that have been used as part of the model training dataset as these will create a biased result
If possible, select a random subset of testing images from a larger pool aiming to get a wide range of images over space and time
For the purpose of calculating Recall, include species that are relevant to your monitoring program
For the the purpose of calculating Precision, also include images of other species that are commonly detected on the cameras at that location even if they are not relevant to your monitoring. Also include some blank images. Below is an example breakdown:

Species	# images	Reason for inclusion
Feral Cat	1000	Target species in monitoring program
Red Fox	1000	Target species in monitoring program
European Rabbit	1000	Target species in monitoring program
Kangaroo	1000	Non-target species but abundant at this location. Impact on Precision.
Emu	1000	Non-target species but abundant at this location. Impact on Precision.
Blank	1000	Impact on Precision.

Establish new Project/s in the WildObs WIMP for benchmarking purposes:

You will need a separate Project for each model you are testing.
Name the project based on the model that will be tested e.g. "Model benchmark testing: WildObs National".
Set the Sequence cutoff to 0 seconds. This aims to prevent the software from creating sequences so that each image is assessed independently.
Define Tags in the project based on the scientific names of the species you are testing. Tags need to match with the species names used in the WIMP.
Configure the project to use the model you want to test

Create Deployments:

You will need to create a Deployment for each of the species you are testing.
Upload the relevant images into each deployment.
Use the tags created earlier to assign to the deployment so you know which species it is supposed to be. This will be used by the script to match the species to the model predictions
Repeat for each Project, uploading the same set of images to each

Run the uploaded images through the AI species recognition model
Once model processing is complete for all deployments, export the project data in Camtrap DP format
Download and extract (unzip) the exported data to a folder on your local computer
Use the folder path as input to this script

Prerequisites

Python 3.x
Required packages listed in requirements.txt

Install dependencies with:

pip install -r requirements.txt

Installation

Clone the repository:

git clone https://github.com/WildObs/model-benchmark-testing-workflow.git
cd model-benchmark-testing-workflow

Data Preparation

Log in to the WildObs image management platform
Export your dataset in Camtrap-DP format
Place the exported files in the root directory of this repository

Example structure:

model-benchmark-testing-workflow/
│
├── wildObs-CV-model-benchmark-testing-workflow.py
├── requirements.txt
├── README.md
├── data/                  # (optional) your exported dataset
│   ├── observations.csv
│   ├── media.csv
│   └── deployments.csv
│   └── ...

Note: Adjust paths in the script if your data is stored in a different location.

Usage

Run the workflow from the command line:

python wildObs-CV-model-benchmark-testing-workflow.py

OR

Run the workflow from the Python notebook WildObs-CV-model-benchmark-testing-workflow.ipynb

Outputs

Misclassified Images

misclassified_images.csv

Contains records of incorrectly classified images
Useful for error analysis and model comparison

Model Benchmark Reports

model_Benchmarking_Report_Exports/

Directory containing HTML reports for each model
Each report includes performance metrics and visual summaries

Example Workflow

Export dataset from WildObs (Camtrap-DP format)
Place data in project folder
Install dependencies
Run the script
Review outputs:

misclassified_images.csv
HTML reports in model_Benchmarking_Report_Exports/

Troubleshooting

Missing packages

Run:

pip install -r requirements.txt

File not found errors

Ensure your exported dataset is in the correct directory

No outputs generated

Export the data again from the WildObs platform and place the extracted data in the /data folder to make sure that you are using the most recent classification results acquired from WildObs models.

Contributing

Contributions are welcome:

Fork the repository
Create a new branch
Submit a pull request

License

Apache License 2.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Model evaluation workflow designed for WildObs Image Management Platform https://wildobs.org.au/

Description

Setup instructions:

Prerequisites

Installation

Data Preparation

Usage

Outputs

Misclassified Images

Model Benchmark Reports

Example Workflow

Troubleshooting

Missing packages

File not found errors

No outputs generated

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
Input_Data		Input_Data
Output_Reports		Output_Reports
LICENSE		LICENSE
README.md		README.md
WildObs-CV-model-benchmark-testing-workflow.ipynb		WildObs-CV-model-benchmark-testing-workflow.ipynb
requirements.txt		requirements.txt
wildObs-CV-model-benchmark-testing-workflow.py		wildObs-CV-model-benchmark-testing-workflow.py

Folders and files

Latest commit

History

Repository files navigation

Model evaluation workflow designed for WildObs Image Management Platform https://wildobs.org.au/

Description

Setup instructions:

Prerequisites

Installation

Data Preparation

Usage

Outputs

Misclassified Images

Model Benchmark Reports

Example Workflow

Troubleshooting

Missing packages

File not found errors

No outputs generated

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages