Skip to content

Paranioar/Awesome_Image_Text_Retrieval_Benchmark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Image_Text_Retrieval_Benchmark

PyTorch implementation for Module Collection of Image-Text Retrieval for Further Exploration.

Importantly, the code (completed in September 2022) is not comprehensive and is not executable directly.
It functions as a compilation of popular modules, designed to ease adaptation for other domains.

Call for Contributors

We welcome any improvements and supplements by pulling requests to enhance the functionality of this code. Feel free to promote and share your papers during this collaborative process.

Structure and Location

Basic Aggregation, Sequential GRU, Global Attention, Generalized Pooling, etc.

Like-Cosine Attention, Focal Attention, Relation-wise Attention,
Recurrent Attention, Transformer Attention, Bilinear Attention, etc.

Scalar Representation: Inner-product Similarity, Order-embedding Similarity, etc.
Vector Representation: Block Similarity, Symmetric or Asymmetric Similarity, etc.
Graph-based Aggregation: Local Alignments Enhancement, Global Alignments Guidance, etc.
Attention-based Aggregation: Local Alignments Filtration, Guidance Alignments Aggregation, etc.

Birank Loss, CMPL Loss, Binary Cross-entropy Loss, Angular Loss, etc.

Reference

If this code is useful for your research, please cite the relative papers in Awesome_Matching_Pretraining_Transfering.

License

Apache License 2.0.

About

The Unified Code of Image-Text Retrieval for Further Exploration.

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages