Zhou, N.[Ning],
Fan, J.P.[Jian-Ping],
Automatic image-text alignment for large-scale web image indexing and
retrieval,
PR(48), No. 1, 2015, pp. 205-219.
Elsevier DOI
1410
Automatic image-text alignment
BibRef
Huang, F.R.[Fei-Ran],
Zhang, X.M.[Xiao-Ming],
Zhao, Z.H.[Zhong-Hua],
Li, Z.J.[Zhou-Jun],
Bi-Directional Spatial-Semantic Attention Networks for Image-Text
Matching,
IP(28), No. 4, April 2019, pp. 2008-2020.
IEEE DOI
1901
Word to region and visual object to words.
image matching, image retrieval,
learning (artificial intelligence), spatial-semantic
BibRef
Otto, C.[Christian],
Springstein, M.[Matthias],
Anand, A.[Avishek],
Ewerth, R.[Ralph],
Characterization and classification of semantic image-text relations,
MultInfoRetr(9), No. 1, March 2020, pp. 31-45.
Springer DOI
2003
Textual information with the imagery.
BibRef
Niu, K.[Kai],
Huang, Y.[Yan],
Wang, L.[Liang],
Re-ranking image-text matching by adaptive metric fusion,
PR(104), 2020, pp. 107351.
Elsevier DOI
2005
Image-text matching, Re-ranking method, Adaptive metric fusion
BibRef
Wen, K.Y.[Ke-Yu],
Gu, X.D.[Xiao-Dong],
Cheng, Q.R.[Qing-Rong],
Learning Dual Semantic Relations With Graph Attention for Image-Text
Matching,
CirSysVideo(31), No. 7, July 2021, pp. 2866-2879.
IEEE DOI
2107
Semantics, Visualization, Task analysis, Feature extraction, Birds,
Automobiles, Image retrieval, Cross-modal retrieval,
image text matching
BibRef
Yang, S.[Song],
Li, Q.[Qiang],
Li, W.H.[Wen-Hui],
Li, X.[Xuanya],
Liu, A.A.[An-An],
Dual-Level Representation Enhancement on Characteristic and Context
for Image-Text Retrieval,
CirSysVideo(32), No. 11, November 2022, pp. 8037-8050.
IEEE DOI
2211
Semantics, Visualization, Feature extraction, Correlation,
Learning systems, Task analysis, Filtration, image-text retrieval
BibRef
Jing, Y.[Ya],
Wang, W.[Wei],
Wang, L.[Liang],
Tan, T.N.[Tie-Niu],
Learning Aligned Image-Text Representations Using Graph Attentive
Relational Network,
IP(30), 2021, pp. 1840-1852.
IEEE DOI
2101
Graph neural networks, Visualization, Semantics, Task analysis,
Feature extraction, Annotations, Recurrent neural networks,
graph neural network
BibRef
Zhao, F.[Fang],
Huang, Y.Z.[Yong-Zhen],
Wang, L.[Liang],
Tan, T.N.[Tie-Niu],
Deep Semantic Ranking Based Hashing for Multi-Label Image Retrieval,
CVPR15(1556-1564)
IEEE DOI
1510
BibRef
Lan, H.[Hong],
Zhang, P.[Pufen],
Learning and Integrating Multi-Level Matching Features for Image-Text
Retrieval,
SPLetters(29), 2022, pp. 374-378.
IEEE DOI
2202
Pattern matching, Feature extraction, Visualization, Head,
Task analysis, Semantics, Recurrent neural networks,
attention mechanism
BibRef
Wu, J.[Jie],
Wu, C.L.[Chun-Lei],
Lu, J.[Jing],
Wang, L.Q.[Lei-Quan],
Cui, X.R.[Xue-Rong],
Region Reinforcement Network With Topic Constraint for Image-Text
Matching,
CirSysVideo(32), No. 1, January 2022, pp. 388-397.
IEEE DOI
2201
Semantics, Visualization, Petroleum,
Linear programming, Cameras, Training data, Image-text matching,
topic constraint
BibRef
Malali, N.[Noam],
Keller, Y.[Yosi],
Learning to Embed Semantic Similarity for Joint Image-Text Retrieval,
PAMI(44), No. 12, December 2022, pp. 10252-10260.
IEEE DOI
2212
Semantics, Training, Measurement, Fasteners, Visualization,
Correlation, Deep learning, Text and image fusion, deep learning,
joint embedding
BibRef
Tian, M.X.[Meng-Xiao],
Wu, X.X.[Xin-Xiao],
Jia, Y.D.[Yun-De],
Adaptive Latent Graph Representation Learning for Image-Text Matching,
IP(32), 2023, pp. 471-482.
IEEE DOI
2301
Visualization, Feature extraction, Representation learning,
Image edge detection, Adaptation models, Task analysis, Semantics,
graph variational autoencoder
BibRef
Li, K.P.[Kun-Peng],
Zhang, Y.L.[Yu-Lun],
Li, K.[Kai],
Li, Y.Y.[Yuan-Yuan],
Fu, Y.[Yun],
Image-Text Embedding Learning via Visual and Textual Semantic
Reasoning,
PAMI(45), No. 1, January 2023, pp. 641-656.
IEEE DOI
2212
BibRef
Earlier:
Visual Semantic Reasoning for Image-Text Matching,
ICCV19(4653-4661)
IEEE DOI
2004
Semantics, Cognition, Visualization, Correlation,
Image representation, Logic gates, Image-text retrieval, deep learning.
convolutional neural nets, graph theory,
image matching, image retrieval, Image edge detection
BibRef
Diao, H.W.[Hai-Wen],
Zhang, Y.[Ying],
Liu, W.[Wei],
Ruan, X.[Xiang],
Lu, H.C.[Hu-Chuan],
Plug-and-Play Regulators for Image-Text Matching,
IP(32), 2023, pp. 2322-2334.
IEEE DOI
2305
Regulators, Visualization, Semantics, Regulation, Pattern matching,
Feature extraction, Image-text matching,
plug-and-play operation
BibRef
Tian, Y.[Yumin],
Ding, A.[Aqiang],
Wang, D.[Di],
Luo, X.M.[Xue-Mei],
Wan, B.[Bo],
Wang, Y.F.[Yi-Feng],
Bi-Attention enhanced representation learning for image-text matching,
PR(140), 2023, pp. 109548.
Elsevier DOI
2305
Image-text matching, Bi-attention, Polynomial loss
BibRef
Zhang, K.[Kun],
Mao, Z.D.[Zhen-Dong],
Liu, A.A.[An-An],
Zhang, Y.D.[Yong-Dong],
Unified Adaptive Relevance Distinguishable Attention Network for
Image-Text Matching,
MultMed(25), 2023, pp. 1320-1332.
IEEE DOI
2305
Semantics, Optimization, Visualization, Training, Task analysis,
Representation learning, Correlation, Image-text matching,
unified adaptive relevance distinguishable learning
BibRef
Liu, Z.[Zejun],
Chen, F.L.[Fang-Lin],
Xu, J.[Jun],
Pei, W.J.[Wen-Jie],
Lu, G.M.[Guang-Ming],
Image-Text Retrieval With Cross-Modal Semantic Importance Consistency,
CirSysVideo(33), No. 5, May 2023, pp. 2465-2476.
IEEE DOI
2305
Semantics, Visualization, Task analysis, Representation learning,
Recurrent neural networks, Glass, Feature extraction, Cross-modal,
alignment
BibRef
Li, Z.[Zhao],
Nian, X.H.[Xiao-Hong],
Pan, C.[Chao],
Yang, D.[Dayin],
Xiong, H.Y.[Hong-Yun],
Wang, H.B.[Hai-Bo],
Relation Graph Reasoning for Image-Text Matching,
ICIVC22(319-324)
IEEE DOI
2301
Visualization, Semantics, Brightness, Feature extraction, Cognition,
Data mining, Task analysis, Image-Text Matching, Relation Graph,
Visual Reasoning
BibRef
Zhang, K.[Kun],
Mao, Z.D.[Zhen-Dong],
Wang, Q.[Quan],
Zhang, Y.D.[Yong-Dong],
Negative-Aware Attention Framework for Image-Text Matching,
CVPR22(15640-15649)
IEEE DOI
2210
Force measurement, Codes, Machine vision, Optimization methods,
Benchmark testing, Pattern recognition, Vision+language,
Vision applications and systems
BibRef
Long, S.[Siqu],
Han, S.C.[Soyeon Caren],
Wan, X.J.[Xiao-Jun],
Poon, J.[Josiah],
GraDual: Graph-based Dual-modal Representation for Image-Text
Matching,
WACV22(2463-2472)
IEEE DOI
2202
Visualization, Matched filters,
Computational modeling, Semantics, Image retrieval,
Vision and Languages
BibRef
Biten, A.F.[Ali Furkan],
Mafla, A.[Andrés],
Gómez, L.[Lluís],
Karatzas, D.[Dimosthenis],
Is An Image Worth Five Sentences? A New Look into Semantics for
Image-Text Matching,
WACV22(2483-2492)
IEEE DOI
2202
Measurement, Training, Integrated circuits, Annotations, Semantics,
Training data, Semisupervised learning, Vision and Languages
BibRef
Mithun, N.C.[Niluthpol C.],
Pasricha, R.[Ravdeep],
Papalexakis, E.[Evangelos],
Roy-Chowdhury, A.K.[Amit K.],
Webly Supervised Image-Text Embedding with Noisy Tag Refinement,
ICPR21(7454-7461)
IEEE DOI
2105
utilizing web images in training.
Training, Tensors, Correlation, Performance gain, Benchmark testing,
Data models
BibRef
Chen, J.A.[Jian-An],
Zhang, L.[Lu],
Wang, Q.[Qiong],
Bai, C.[Cong],
Kpalma, K.[Kidiyo],
Intra-Modal Constraint Loss for Image-Text Retrieval,
ICIP22(4023-4027)
IEEE DOI
2211
Convolutional codes, Recurrent neural networks,
Bidirectional control, Network architecture, Feature extraction,
similarity distance
BibRef
Liu, Y.[Yang],
Wang, H.Q.[Hua-Qiu],
Meng, F.Y.[Fan-Yang],
Liu, M.Y.[Meng-Yuan],
Liu, H.[Hong],
Attend, Correct and Focus: A Bidirectional Correct Attention Network
for Image-Text Matching,
ICIP21(2673-2677)
IEEE DOI
2201
Image processing, Semantics, Benchmark testing, Task analysis,
Image-text matching, cross modal retrieval, attention mechanism
BibRef
Yang, S.T.[Sean T.],
Huang, K.H.[Kuan-Hao],
Howe, B.[Bill],
JECL: Joint Embedding and Cluster Learning for Image-Text Pairs,
ICPR21(8344-8351)
IEEE DOI
2105
Training, Sensitivity, Computational modeling, Training data,
Benchmark testing, Robustness
BibRef
Mikriukov, G.[Georgii],
Ravanbakhsh, M.[Mahdyar],
Demir, B.[Begüm],
An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training
Image-Text Correspondences in Remote Sensing,
ICIP22(2556-2560)
IEEE DOI
2211
Training, Codes, Coherence, Feature extraction, Distortion, Sensors,
Noise robustness, cross-modal retrieval, caption-noise
BibRef
Anwaar, M.U.[Muhammad Umer],
Labintcev, E.[Egor],
Kleinsteuber, M.[Martin],
Compositional Learning of Image-Text Query for Image Retrieval,
WACV21(1139-1148)
IEEE DOI
2106
Training, Image color analysis, Databases,
Image retrieval, Benchmark testing
BibRef
Messina, N.[Nicola],
Falchi, F.[Fabrizio],
Esuli, A.[Andrea],
Amato, G.[Giuseppe],
Transformer Reasoning Network for Image-Text Matching and Retrieval,
ICPR21(5222-5229)
IEEE DOI
2105
Measurement, Weight measurement, Training, Visualization,
Feature extraction, Cognition
BibRef
Zhang, Q.[Qi],
Lei, Z.[Zhen],
Zhang, Z.X.[Zhao-Xiang],
Li, S.Z.[Stan Z.],
Context-Aware Attention Network for Image-Text Retrieval,
CVPR20(3533-3542)
IEEE DOI
2008
Semantics, Correlation, Visualization, Feature extraction,
Task analysis, Bidirectional control, Knowledge discovery
BibRef
Chen, Y.C.[Yen-Chun],
Li, L.J.[Lin-Jie],
Yu, L.C.[Li-Cheng],
El Kholy, A.[Ahmed],
Ahmed, F.[Faisal],
Gan, Z.[Zhe],
Cheng, Y.[Yu],
Liu, J.J.[Jing-Jing],
Uniter: Universal Image-Text Representation Learning,
ECCV20(XXX: 104-120).
Springer DOI
2010
joint visual and textual understanding.
BibRef
Wang, H.R.[Hao-Ran],
Zhang, Y.[Ying],
Ji, Z.[Zhong],
Pang, Y.W.[Yan-Wei],
Ma, L.[Lin],
Consensus-aware Visual-semantic Embedding for Image-Text Matching,
ECCV20(XXIV:18-34).
Springer DOI
2012
BibRef
Chen, T.L.[Tian-Lang],
Deng, J.J.[Jia-Jun],
Luo, J.B.[Jie-Bo],
Adaptive Offline Quintuplet Loss for Image-text Matching,
ECCV20(XIII:549-565).
Springer DOI
2011
Code, Retrieval.
WWW Link.
BibRef
Lee, K.H.[Kuang-Huei],
Chen, X.[Xi],
Hua, G.[Gang],
Hu, H.D.[Hou-Dong],
He, X.D.[Xiao-Dong],
Stacked Cross Attention for Image-Text Matching,
ECCV18(II: 212-228).
Springer DOI
1810
Match text descriptions to the image.
BibRef
Zhang, Y.[Ying],
Lu, H.C.[Hu-Chuan],
Deep Cross-Modal Projection Learning for Image-Text Matching,
ECCV18(I: 707-723).
Springer DOI
1810
Measure similarity between image and text descriptions.
BibRef
Plummer, B.A.[Bryan A.],
Kordas, P.[Paige],
Kiapour, M.H.[M. Hadi],
Zheng, S.[Shuai],
Piramuthu, R.[Robinson],
Lazebnik, S.[Svetlana],
Conditional Image-Text Embedding Networks,
ECCV18(XII: 258-274).
Springer DOI
1810
Code:
WWW Link.
BibRef
Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Visual Grounding, Grounding Expressions .