AgriTaxon

AgriTaxon: Can Large Multimodal Models Identify What They See in Agriculture?

AgriTaxon is the first benchmark for open-ended taxonomic naming in agriculture. It tests whether Large Multimodal Models (LMMs) can correctly name a species from its image—without predefined options.

Highlights

7,432 species across four domains: Crops (1,971), Livestock (178), Pests (3,485), Weeds (1,798)
Every label linked to FAO / EPPO authoritative databases via Wikidata
Two evaluation protocols: multiple-choice (with semantically hard negatives) and open-ended naming
LLM-as-a-Judge scoring with 98% expert agreement
14 LMMs benchmarked, revealing a striking seeing-without-naming gap

Dataset

The dataset is hosted on Hugging Face:

🔗 Xin1818/AgriTaxon

Potential Applications

AgriTaxon supports a broad range of research directions:

Open-ended visual recognition — developing models that produce free-form species names rather than selecting from a fixed label set
Long-tail and rare-entity understanding — with popularity metadata (Wikipedia pageviews) for every species, enabling controlled study of how accuracy degrades for rare organisms
Retrieval-augmented generation (RAG) — authority-grounded labels (FAO, EPPO, Wikidata QIDs) serve as natural retrieval anchors for augmenting LMMs with external knowledge
Agentic and tool-augmented reasoning — our agentic baseline shows that tool use (e.g., image cropping) significantly boosts accuracy on hard samples
Agricultural AI deployment — pest surveillance, quarantine enforcement, crop variety verification, and livestock breed identification
Fine-grained visual classification (FGVC) — semantically hard negatives and cross-domain coverage make AgriTaxon a challenging FGVC benchmark

Supplementary Material

Full prompt templates, API cost breakdown, resolution analysis, agentic baseline details, and error type examples are available on the Supplementary Material page.

All evaluation prompts are also provided as raw files in the prompts/ directory for reproducibility.

Getting Started

# Download the dataset from Hugging Face
pip install huggingface_hub
huggingface-cli download Xin1818/AgriTaxon --repo-type dataset --local-dir dataset

Licensing & Access

AgriTaxon is publicly available and free for academic and research use.

Source code, prompt templates & documentation (this repository): MIT License
Benchmark metadata & annotations (species labels, evaluation splits, distractor sets): CC BY 4.0
Images: sourced from Wikimedia Commons under their respective Creative Commons licenses (predominantly CC BY and CC BY-SA). Each image retains its original license.
Authority identifiers: Wikidata QIDs are available under CC0; FAO and EPPO identifiers are used for reference linking only.

The dataset is hosted on Hugging Face at Xin1818/AgriTaxon and can be downloaded freely without registration.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
prompts		prompts
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AgriTaxon

Highlights

Dataset

Potential Applications

Supplementary Material

Getting Started

Licensing & Access

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

AgriTaxon

Highlights

Dataset

Potential Applications

Supplementary Material

Getting Started

Licensing & Access

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages