PyTorch implementation of a Module Collection for Image-Text Retrieval, intended for further exploration.
Importantly, the code (completed in September 2022) is not comprehensive and cannot be run directly.
It serves as a compilation of popular modules, designed to ease adaptation to other domains.
We welcome improvements and additions via pull requests to extend the functionality of this code. Feel free to promote and share your papers during this collaborative process.
Aggregation Module: Basic Aggregation, Sequential GRU, Global Attention, Generalized Pooling, etc.
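As a rough illustration of how such aggregation modules reduce a set of local features to one global embedding, here is a minimal sketch of a bidirectional-GRU aggregator followed by mean pooling (the class name and dimensions are assumptions for illustration, not taken from this repository):

```python
import torch
import torch.nn as nn

class GRUAggregator(nn.Module):
    """Aggregate local features (e.g. word or region embeddings) into one global vector."""
    def __init__(self, feat_dim: int, hidden_dim: int):
        super().__init__()
        self.gru = nn.GRU(feat_dim, hidden_dim, batch_first=True, bidirectional=True)

    def forward(self, local_feats: torch.Tensor) -> torch.Tensor:
        # local_feats: (batch, num_locals, feat_dim)
        out, _ = self.gru(local_feats)          # (batch, num_locals, 2 * hidden_dim)
        fwd, bwd = out.chunk(2, dim=-1)         # split the two directions
        return ((fwd + bwd) / 2).mean(dim=1)    # average directions, then mean-pool

global_emb = GRUAggregator(feat_dim=300, hidden_dim=1024)(torch.randn(8, 20, 300))
print(global_emb.shape)  # torch.Size([8, 1024])
```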
Attention Module: Like-Cosine Attention, Focal Attention, Relation-wise Attention, Recurrent Attention, Transformer Attention, Bilinear Attention, etc.
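To give a flavour of the cross-modal attention these modules implement, below is a minimal sketch of cosine-similarity-based cross attention, where each word attends over image regions (the function name, smoothing factor, and shapes are assumptions for illustration):

```python
import torch
import torch.nn.functional as F

def cross_attention(query, context, smooth=9.0):
    """Every query feature (e.g. a word) attends over context features (e.g. regions).

    query: (batch, n_q, dim), context: (batch, n_c, dim) -> (batch, n_q, dim)
    """
    q = F.normalize(query, dim=-1)
    c = F.normalize(context, dim=-1)
    attn = torch.bmm(q, c.transpose(1, 2))       # cosine similarities, (batch, n_q, n_c)
    attn = F.softmax(attn * smooth, dim=-1)      # temperature-sharpened attention weights
    return torch.bmm(attn, context)              # attended context per query

attended = cross_attention(torch.randn(8, 12, 1024), torch.randn(8, 36, 1024))
print(attended.shape)  # torch.Size([8, 12, 1024])
```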
Scalar Representation: Inner-product Similarity, Order-embedding Similarity, etc.
Vector Representation: Block Similarity, Symmetric or Asymmetric Similarity, etc.
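The two representation styles differ in whether an image-text pair is summarized by a single number or by a similarity vector; a minimal sketch of both (module names and dimensions assumed) could look like this:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def scalar_similarity(img_emb, txt_emb):
    """Scalar representation: one cosine score per image-text pair."""
    return F.cosine_similarity(img_emb, txt_emb, dim=-1)

class VectorSimilarity(nn.Module):
    """Vector representation: map a pair to a similarity vector instead of one number."""
    def __init__(self, emb_dim: int, sim_dim: int):
        super().__init__()
        self.fc = nn.Linear(emb_dim, sim_dim)

    def forward(self, img_emb, txt_emb):
        diff = (img_emb - txt_emb).pow(2)                      # element-wise squared difference
        return torch.tanh(self.fc(F.normalize(diff, dim=-1)))  # (batch, sim_dim)

img, txt = torch.randn(8, 1024), torch.randn(8, 1024)
print(scalar_similarity(img, txt).shape)            # torch.Size([8])
print(VectorSimilarity(1024, 256)(img, txt).shape)  # torch.Size([8, 256])
```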
Graph-based Aggregation: Local Alignments Enhancement, Global Alignments Guidance, etc.
Attention-based Aggregation: Local Alignments Filtration, Guidance Alignments Aggregation, etc.
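As one example of attention-based aggregation, the sketch below (names assumed) weights local alignment vectors with learned attention before predicting a single matching score:

```python
import torch
import torch.nn as nn

class AttentionAggregation(nn.Module):
    """Pool local alignment vectors with learned attention, then predict one score."""
    def __init__(self, sim_dim: int):
        super().__init__()
        self.attn = nn.Linear(sim_dim, 1)    # scores the importance of each alignment
        self.score = nn.Linear(sim_dim, 1)   # maps the pooled alignment to a similarity

    def forward(self, local_sims: torch.Tensor) -> torch.Tensor:
        # local_sims: (batch, num_alignments, sim_dim)
        weights = torch.softmax(self.attn(local_sims), dim=1)  # (batch, num_alignments, 1)
        pooled = (weights * local_sims).sum(dim=1)             # aggregated global alignment
        return self.score(pooled).squeeze(-1)                  # (batch,) matching scores

scores = AttentionAggregation(sim_dim=256)(torch.randn(8, 36, 256))
print(scores.shape)  # torch.Size([8])
```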
Loss Function: Birank Loss, CMPL Loss, Binary Cross-entropy Loss, Angular Loss, etc.
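For reference, the bidirectional ranking (Birank) objective is typically a hinge loss applied over an image-text score matrix in both retrieval directions; a minimal sketch (margin value assumed) is:

```python
import torch

def birank_loss(scores: torch.Tensor, margin: float = 0.2) -> torch.Tensor:
    """Bidirectional triplet ranking loss over a (batch, batch) image-text score matrix.

    scores[i, j] is the similarity of image i and caption j; the diagonal holds matched pairs.
    """
    batch = scores.size(0)
    diag = scores.diag().view(batch, 1)
    cost_cap = (margin + scores - diag).clamp(min=0)      # image -> caption direction
    cost_img = (margin + scores - diag.t()).clamp(min=0)  # caption -> image direction
    mask = torch.eye(batch, dtype=torch.bool, device=scores.device)
    return cost_cap.masked_fill(mask, 0).sum() + cost_img.masked_fill(mask, 0).sum()

loss = birank_loss(torch.randn(8, 8))
print(loss.item())
```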
If this code is useful for your research, please cite the relevant papers listed in Awesome_Matching_Pretraining_Transfering.