Word for Person: Zero-shot Composed Person Retrieval (Word4Per)

Word4Per is an innovative framework for Zero-Shot Composed Person Retrieval (ZS-CPR), integrating visual and textual information for enhancing person identification. This repository includes the Word4Per code and the Image-Text Composed Person Retrieval (ITCPR) dataset, offering new tools for research in security and social applications.

News

[2023.11.16] Repo is created. Code and Dataset will come soon.
[2023.11.25] The ITCPR dataset is now publicly available for download.

ITCPR Dataset

Overview

The ITCPR dataset is a comprehensive collection specifically designed for the Zero-Shot Composed Person Retrieval (ZS-CPR) task. It consists of a total of 2,225 annotated triplets, derived from three distinct datasets: Celeb-reID, PRCC, and LAST. To access the ITCPR dataset, please use the following download link: ITCPR Dataset Download.

Dataset Scale

Total Annotated Triplets: 2,225
Unique Query Combinations: 2,202
Total Images: 1,151 from Celeb-reID, 146 from PRCC, 905 from LAST
Total Identities: 512 from Celeb-reID, 146 from PRCC, 541 from LAST
Target Gallery: 20,510 images with 2,225 corresponding ground truths

Image Sources

The images in the ITCPR dataset are sourced from the following datasets:

Celeb-reID
PRCC
LAST

These are utilized solely for testing purposes in the ZS-CPR task.

Annotation Files

The dataset includes two annotation files: query.json and gallery.json.

`query.json` Format

Each entry in the query.json file follows this structure:

{
    "file_path": "Celeb-reID/001/1_1_0.jpg",
    "datasets": "Celeb-reID",
    "person_id": 1,
    "instance_id": 1,
    "caption": "Wearing a brown plaid shirt, black leather shoes, another dark gray T-shirt, another blue jeans"
}

file_path: Reference image path relative to the data root directory.
datasets: Source dataset of the image.
person_id: Person ID in the original dataset.
instance_id: Unique identifier for gallery ground truth matching.
caption: Relative caption of the reference image.

`gallery.json` Format

Each entry in the gallery.json file follows this structure:

{
    "file_path": "Celeb-reID/001/1_2_0.jpg",
    "datasets": "Celeb-reID",
    "person_id": 1,
    "instance_id": 1
}

instance_id: Matches with query.json for target images; -1 for non-matching query instances.
Others: Correspond to target image path, original dataset, and person ID.

Data Directory Structure

data
|-- Celeb-reID
|   |-- 001
|   |-- 002
|   |-- 003
|   ...
|-- PRCC
|   |-- train
|   |-- val
|   |-- test
|-- LAST
|   |-- 000000
|   |-- 000001
|   |-- 000002
|   ...
|-- query.json
|-- gallery.json

Dataset Download and Preparation

Download and prepare the datasets as follows:

Celeb-reID: GitHub Repository
PRCC: Google Drive Link
LAST: GitHub Repository

After downloading, use the img_process.py script to process Celeb-reID and LAST datasets into the standard format. The PRCC (subfolder PRCC/rgb) dataset can be directly placed in the corresponding directory upon extraction.

Acknowledgments

We are deeply thankful to the creators of the Celebrities-ReID, PRCC, and LAST datasets for their significant contributions to the field of person re-identification. Their commitment to open-sourcing these valuable resources has greatly facilitated advancements in academic and practical research.

Celebrities-ReID: "Celebrities-ReID: A Benchmark for Clothes Variation in Long-Term Person Re-Identification" - View Paper
PRCC: "Person Re-identification by Contour Sketch under Moderate Clothing Change" - View Paper
LAST: "Large-Scale Spatio-Temporal Person Re-identification: Algorithms and Benchmark" - View Paper

Certainly! You can use the following Markdown paragraph for your GitHub repository to instruct users to cite your paper if they utilize your code and dataset. Here's how you can format it:

Citation

If you use our code or dataset in your research, please cite our paper as follows:

@misc{liu2023word,
  title={Word for Person: Zero-shot Composed Person Retrieval},
  author={Delong Liu and Haiwen Li and Zhicheng Zhao and Fei Su and Hongying Meng},
  year={2023},
  eprint={2311.16515},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
LICENSE		LICENSE
README.md		README.md
img_process.py		img_process.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

img_process.py

img_process.py

Repository files navigation

Word for Person: Zero-shot Composed Person Retrieval (Word4Per)

News

ITCPR Dataset

Overview

Dataset Scale

Image Sources

Annotation Files

`query.json` Format

`gallery.json` Format

Data Directory Structure

Dataset Download and Preparation

Acknowledgments

Citation

About

Releases

Packages

Languages

License

Delong-liu-bupt/Word4Per

Folders and files

Latest commit

History

Repository files navigation

Word for Person: Zero-shot Composed Person Retrieval (Word4Per)

News

ITCPR Dataset

Overview

Dataset Scale

Image Sources

Annotation Files

query.json Format

gallery.json Format

Data Directory Structure

Dataset Download and Preparation

Acknowledgments

Citation

About

Resources

License

Stars

Watchers

Forks

Languages

`query.json` Format

`gallery.json` Format