landmark-retrieval

In this project, we extend two representation learning algorithms to perform image retrieval on Google Landmark Dataset v2.

The first algorithm is inspired by Generalized End-to-End Loss for Speaker Verification (GE2E), which was proposed to perform speaker verification by leveraging the centroids of the embedding vectors for different speakers to maximize intra-class compactness and inter-class discrepancy.

The second algorithm, Additive Angular Margin Loss (ArcFace), adds an angular margin to the angle between the features and target weights in each dimension of class, which modifies the cross entropy loss to achieve more distinguishable embeddings.

To compare the two algorithms, we use the same ResNet-101 network pre-trained on ImageNet as the encoder before the representation learning stage. We also design a variant of U-Net as the baseline to learn low-dimensional embeddings during image reconstruction. As a result, ArcFace obtains a superior retrieval performance.

Please refer to the final report for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
ArcFace		ArcFace
GE2E		GE2E
baseline_cls		baseline_cls
baseline_unet		baseline_unet
data_utils		data_utils
results		results
.gitignore		.gitignore
ImageRetrieval.pdf		ImageRetrieval.pdf
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

landmark-retrieval

About

Releases

Packages

Contributors 3

Languages

License

rwang97/landmark-retrieval

Folders and files

Latest commit

History

Repository files navigation

landmark-retrieval

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages