20.6.1 Cross-Modal Indexing, Cross-Modal Retrieval

Chapter Contents (Back)
Multi-Modal Retrieval. Cross-Modal Retrieval.

Costa Pereira, J., Coviello, E.[Emanuele], Doyle, G., Rasiwasia, N., Lanckriet, G.R.G.[Gert R.G.], Levy, R., Vasconcelos, N.M.,
On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval,
PAMI(36), No. 3, March 2014, pp. 521-535.
IEEE DOI 1403
image matching. E.g. use image to search for text. Correlation matching. Semantic matching. Semantic correlation matching. BibRef

Costa Pereira, J.[Jose], Vasconcelos, N.M.[Nuno M.],
Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems,
CVIU(124), No. 1, 2014, pp. 123-135.
Elsevier DOI 1406
Content-based image retrieval BibRef

Zhai, X.H.[Xiao-Hua], Peng, Y.X.[Yu-Xin], Xiao, J.G.[Jian-Guo],
Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization,
CirSysVideo(24), No. 6, June 2014, pp. 965-978.
IEEE DOI 1407
Correlation BibRef

Peng, Y.X.[Yu-Xin], Qi, J.W.[Jin-Wei],
Quintuple-Media Joint Correlation Learning With Deep Compression and Regularization,
CirSysVideo(30), No. 8, August 2020, pp. 2709-2722.
IEEE DOI 2008
Media, Correlation, Semantics, Solid modeling, Data models, Image coding, Cross-media retrieval, network regularization BibRef

Peng, Y., Zhai, X., Zhao, Y., Huang, X.,
Semi-Supervised Cross-Media Feature Learning With Unified Patch Graph Regularization,
CirSysVideo(26), No. 3, March 2016, pp. 583-596.
IEEE DOI 1603
Correlation BibRef

Bellini, P.[Pierfrancesco], Cenni, D.[Daniele], Nesi, P.[Paolo],
Optimization of information retrieval for cross media contents in a best practice network,
MultInfoRetr(3), No. 3, September 2014, pp. 147-159.
Springer DOI 1408
BibRef

Kang, C., Xiang, S., Liao, S., Xu, C., Pan, C.,
Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval,
MultMed(17), No. 3, March 2015, pp. 370-381.
IEEE DOI 1502
Algorithm design and analysis BibRef

He, Y., Xiang, S., Kang, C., Wang, J., Pan, C.,
Cross-Modal Retrieval via Deep and Bidirectional Representation Learning,
MultMed(18), No. 7, July 2016, pp. 1363-1377.
IEEE DOI 1608
backpropagation BibRef

Zhang, S., Wang, X., Lin, Y., Tian, Q.,
Cross Indexing With Grouplets,
MultMed(17), No. 11, November 2015, pp. 1969-1979.
IEEE DOI 1511
Feature extraction BibRef

Chu, L., Zhang, Y., Li, G., Wang, S., Zhang, W., Huang, Q.,
Effective Multimodality Fusion Framework for Cross-Media Topic Detection,
CirSysVideo(26), No. 3, March 2016, pp. 556-569.
IEEE DOI 1603
Complexity theory BibRef

Ding, K.[Kun], Fan, B.[Bin], Huo, C.L.[Chun-Lei], Xiang, S.M.[Shi-Ming], Pan, C.H.[Chun-Hong],
Cross-Modal Hashing via Rank-Order Preserving,
MultMed(19), No. 3, March 2017, pp. 571-585.
IEEE DOI 1702
Binary codes BibRef

Han, C.W.[Chao-Wei], Meng, G.F.[Gao-Feng], Huo, C.L.[Chun-Lei],
SFD: Similar Frame Dataset for Content-Based Video Retrieval,
ICIP24(2403-2409)
IEEE DOI Code:
WWW Link. 2411
Codes, Databases, Scalability, Large language models, Image retrieval, Contrastive learning, Object detection, Dataset, Contrastive learning BibRef

Jiang, B.[Bin], Yang, J.C.[Jia-Chen], Lv, Z.H.[Zhi-Han], Tian, K.[Kun], Meng, Q.G.[Qing-Gang], Yan, Y.[Yan],
Internet cross-media retrieval based on deep learning,
JVCIR(48), No. 1, 2017, pp. 356-366.
Elsevier DOI 1708
Cross-media, retrieval BibRef

Hu, Y., Zheng, L., Yang, Y., Huang, Y.,
Twitter100k: A Real-World Dataset for Weakly Supervised Cross-Media Retrieval,
MultMed(20), No. 4, April 2018, pp. 927-938.
IEEE DOI 1804
Electronic publishing, Encyclopedias, Internet, Optical character recognition software, Training, Visualization, weakly supervised method BibRef

Verma, Y.[Yashaswi], Jha, A.[Abhishek], Jawahar, C.V.,
Cross-specificity: modelling data semantics for cross-modal matching and retrieval,
MultInfoRetr(8), No. 2, June 2018, pp. 139-146.
Springer DOI 1805
BibRef

Dorfer, M.[Matthias], Schlüter, J.[Jan], Vall, A.[Andreu], Korzeniowski, F.[Filip], Widmer, G.[Gerhard],
End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss,
MultInfoRetr(8), No. 2, June 2018, pp. 117-128.
Springer DOI 1805
BibRef

Lu, X.[Xu], Zhang, H.X.[Hua-Xiang], Sun, J.D.[Jian-De], Wang, Z.H.[Zhen-Hua], Guo, P.L.[Pei-Lian], Wan, W.B.[Wen-Bo],
Discriminative correlation hashing for supervised cross-modal retrieval,
SP:IC(65), 2018, pp. 221-230.
Elsevier DOI 1805
Cross-modal retrieval, Hashing, Subspace learning, Discriminant analysis BibRef

Wang, L.[Li], Zhu, L.[Lei], Dong, X.[Xiao], Liu, L.[Li], Sun, J.D.[Jian-De], Zhang, H.X.[Hua-Xiang],
Joint Feature Selection and Graph Regularization for Modality-Dependent Cross-Modal Retrieval,
JVCIR(54), 2018, pp. 213-222.
Elsevier DOI 1806
Cross-modal retrieval, Feature selection, Subspace learning, Graph regularization BibRef

Zhong, F.M.[Fang-Ming], Chen, Z.K.[Zhi-Kui], Min, G.Y.[Ge-Yong],
Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval,
PR(83), 2018, pp. 64-77.
Elsevier DOI 1808
Cross-modal retrieval, deep learning, discrete hashing, alternative optimization BibRef

Yuan, X.[Xu], Wang, G.Z.[Guang-Ze], Chen, Z.K.[Zhi-Kui], Zhong, F.M.[Fang-Ming],
CHOP: An orthogonal hashing method for zero-shot cross-modal retrieval,
PRL(145), 2021, pp. 247-253.
Elsevier DOI 2104
Zero-shot, Cross-modal retrieval, Orthogonal projection BibRef

Vukotic, V., Raymond, C., Gravier, G.,
A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking,
MultMedMag(25), No. 2, April 2018, pp. 11-23.
IEEE DOI 1808
Task analysis, Neural networks, Visualization, Streaming media, Hypertext systems, Training, multimedia BibRef

Liu, R., Wei, S., Zhao, Y., Zhu, Z., Wang, J.,
Multiview Cross-Media Hashing with Semantic Consistency,
MultMedMag(25), No. 2, April 2018, pp. 71-86.
IEEE DOI 1808
Media, Semantics, Correlation, Multimedia communication, Optimization, Feature extraction, hashing, cross-media, multiview, searching BibRef

Wang, D.[Di], Wang, Q.[Quan], Gao, X.B.[Xin-Bo],
Robust and Flexible Discrete Hashing for Cross-Modal Similarity Search,
CirSysVideo(28), No. 10, October 2018, pp. 2703-2715.
IEEE DOI 1811
Robustness, Training, Binary codes, Quantization (signal), Linear programming, Matrix decomposition, Sparse matrices, Hashing, unsupervised learning BibRef

Wang, D.[Di], Gao, X.B.[Xin-Bo], Wang, X.M.[Xiu-Mei], He, L.H.[Li-Huo],
Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search,
PAMI(41), No. 10, October 2019, pp. 2466-2479.
IEEE DOI 1909
Semantics, Correlation, Training, Transforms, Binary codes, Image reconstruction, Sparse matrices, Hashing, multimodal, cross-modal BibRef

Wang, D.[Di], Gao, X.B.[Xin-Bo], Wang, X.M.[Xiu-Mei], He, L.[Lihuo], Yuan, B.[Bo],
Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval,
IP(25), No. 10, October 2016, pp. 4540-4554.
IEEE DOI 1610
Internet BibRef

Wang, D.[Di], Wang, Q.[Quan], He, L.[Lihuo], Gao, X.B.[Xin-Bo], Tian, Y.M.[Yu-Min],
Joint and individual matrix factorization hashing for large-scale cross-modal retrieval,
PR(107), 2020, pp. 107479.
Elsevier DOI 2008
Hashing, Multimodal, Retrieval, Cross-modal, Matrix factorization BibRef

Dong, F.[Fei], Nie, X.S.[Xiu-Shan], Liu, X.B.[Xing-Bo], Geng, L.L.[Lei-Lei], Wang, Q.[Qian],
Cross-Modal Hashing Based on Category Structure Preserving,
JVCIR(57), 2018, pp. 28-33.
Elsevier DOI 1812
Cross-modal retrieval, Supervised hashing, Category-specific structure preserving BibRef

Zhang, M.J.[Mei-Jia], Zhang, H.X.[Hua-Xiang], Li, J.Z.[Jun-Zheng], Wang, L.[Li], Fang, Y.X.[Yi-Xian], Sun, J.D.[Jian-De],
Supervised graph regularization based cross media retrieval with intra and inter-class correlation,
JVCIR(58), 2019, pp. 1-11.
Elsevier DOI 1901
Cross media retrieval, Subspace learning, Supervised graph regularization BibRef

Yao, T.[Tao], Wang, G.[Gang], Yan, L.S.[Lian-Shan], Kong, X.W.[Xiang-Wei], Su, Q.T.[Qing-Tang], Zhang, C.M.[Cai-Ming], Tian, Q.[Qi],
Online latent semantic hashing for cross-media retrieval,
PR(89), 2019, pp. 1-11.
Elsevier DOI 1902
Cross-media retrieval, Online learning, Hashing, Latent semantic concept BibRef

Yao, T.[Tao], Kong, X.W.[Xiang-Wei], Fu, H.Y.[Hai-Yan], Tian, Q.[Qi],
Discrete Semantic Alignment Hashing for Cross-Media Retrieval,
Cyber(50), No. 12, December 2020, pp. 4896-4907.
IEEE DOI 2012
Semantics, Hash functions, Correlation, Quantization (signal), Optimization, Task analysis, Internet, Attribute, hashing BibRef

Dutta, T.[Titir], Biswas, S.[Soma],
Cross-modal retrieval in challenging scenarios using attributes,
PRL(125), 2019, pp. 618-624.
Elsevier DOI 1909
Cross-modal retrieval, Attributes, Unseen query, Low-resolution data BibRef

Liu, H.P.[Hua-Ping], Wang, F.[Feng], Zhang, X.Y.[Xin-Yu], Sun, F.C.[Fu-Chun],
Weakly-paired deep dictionary learning for cross-modal retrieval,
PRL(130), 2020, pp. 199-206.
Elsevier DOI 2002
Deep dictionary learning, Cross-modal retrieval, Weak pairing BibRef

Zhang, H.[Hong], Wang, T.[Ting], Dai, G.[Gang],
Semi-supervised cross-modal common representation learning with vector-valued manifold regularization,
PRL(130), 2020, pp. 335-344.
Elsevier DOI 2002
Cross-media retrieval, Vector-valued RKHS, Manifold regularization, Semi-supervised, Kernel method BibRef

Chaudhuri, U.[Ushasi], Banerjee, B.[Biplab], Bhattacharya, A.[Avik], Datcu, M.[Mihai],
CMIR-NET: A deep learning based model for cross-modal retrieval in remote sensing,
PRL(131), 2020, pp. 456-462.
Elsevier DOI 2004
Remote sensing, Cross-modal retrieval, Deep learning, Panchromatic, Multispectral, Audio samples BibRef

Chi, J.Z.[Jing-Ze], Peng, Y.X.[Yu-Xin],
Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network,
CirSysVideo(30), No. 4, April 2020, pp. 1173-1187.
IEEE DOI 2004
Semantics, Media, Correlation, Training, Dogs, Measurement, Cross-media retrieval, zero-shot learning, maximum mean discrepancy BibRef

Wu, F.[Fei], Jing, X.Y.[Xiao-Yuan], Wu, Z.Y.[Zhi-Yong], Ji, Y.[Yimu], Dong, X.[Xiwei], Luo, X.K.[Xiao-Kai], Huang, Q.H.[Qing-Hua], Wang, R.[Ruchuan],
Modality-specific and shared generative adversarial network for cross-modal retrieval,
PR(104), 2020, pp. 107335.
Elsevier DOI 2005
Cross-modal retrieval, Generative adversarial networks (GAN), Modality-specific feature learning, Modality-shared feature learning BibRef

Wu, F.[Fei], Luo, X.K.[Xiao-Kai], Huang, Q.H.[Qing-Hua], Wei, P.F.[Peng-Fei], Sun, Y.[Ying], Dong, X.[Xiwei], Wu, Z.Y.[Zhi-Yong],
Semantic Preserving Generative Adversarial Network for Cross-Modal Hashing,
ICIP21(2743-2747)
IEEE DOI 2201
Measurement, Quantization (signal), Image processing, Semantics, Focusing, Network architecture, cross-modal hashing, semantic preserving BibRef

Zhong, F.M.[Fang-Ming], Chen, Z.K.[Zhi-Kui], Min, G.Y.[Ge-Yong], Xia, F.[Feng],
A novel strategy to balance the results of cross-modal hashing,
PR(107), 2020, pp. 107523.
Elsevier DOI 2008
Cross-modal hashing, Semantic gap, Semantic augmentation, Cross-modal retrieval BibRef

Peng, Y., Chi, J.,
Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph,
CirSysVideo(30), No. 11, November 2020, pp. 4368-4379.
IEEE DOI 2011
Media, Correlation, Visualization, Genomics, Bioinformatics, Training data, Training, Cross-media retrieval, domain adaptation, scene graph BibRef

Zhu, L.[Lei], Song, J.[Jiayu], Zhu, X.F.[Xiao-Feng], Zhang, C.Y.[Cheng-Yuan], Zhang, S.C.[Shi-Chao], Yuan, X.P.[Xin-Pan],
Adversarial Learning-Based Semantic Correlation Representation for Cross-Modal Retrieval,
MultMedMag(27), No. 4, October 2020, pp. 79-90.
IEEE DOI 2012
Correlation, Semantics, Computer science, Internet, Streaming media BibRef

Zhu, L.[Lei], Zhang, C.Y.[Cheng-Yuan], Song, J.[Jiayu], Zhang, S.C.[Shi-Chao], Tian, C.[Chunwei], Zhu, X.[Xinghui],
Deep Multigraph Hierarchical Enhanced Semantic Representation for Cross-Modal Retrieval,
MultMedMag(29), No. 3, July 2022, pp. 17-26.
IEEE DOI 2209
Semantics, Adversarial machine learning, Correlation, Visualization, Generators, Generative adversarial networks, Computer science BibRef

Chaudhuri, U.[Ushasi], Banerjee, B.[Biplab], Bhattacharya, A.[Avik], Datcu, M.[Mihai],
CrossATNet: A novel cross-attention based framework for sketch-based image retrieval,
IVC(104), 2020, pp. 104003.
Elsevier DOI 2012
Neural networks, Sketch-based image retrieval, Cross-modal retrieval, Deep-learning, Cross-attention network, Cross-triplets BibRef

Zhang, Y., Zhou, W., Wang, M., Tian, Q., Li, H.,
Deep Relation Embedding for Cross-Modal Retrieval,
IP(30), 2021, pp. 617-627.
IEEE DOI 2012
Semantics, Feature extraction, Visualization, Computational modeling, Task analysis, Training, Optimization, relation BibRef

Zhang, L.[Lei], Chen, L.T.[Lei-Ting], Ou, W.H.[Wei-Hua], Zhou, C.[Chuan],
Semi-supervised cross-modal representation learning with GAN-based Asymmetric Transfer Network,
JVCIR(73), 2020, pp. 102899.
Elsevier DOI 2012
Cross-modal retrieval, Modality gap, Generative adversarial network BibRef

Wang, L.[Lu], Yang, J.[Jie], Zareapoor, M.[Masoumeh], Zheng, Z.L.[Zhong-Long],
Cluster-wise unsupervised hashing for cross-modal similarity search,
PR(111), 2021, pp. 107732.
Elsevier DOI 2012
Cross-modal similarity retrieval, Multi-view clustering, The cluster-wise code-prototypes, Cross-modal hashing, BibRef

Meng, M., Wang, H., Yu, J., Chen, H., Wu, J.,
Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal Retrieval,
IP(30), 2021, pp. 986-1000.
IEEE DOI 2012
Semantics, Optimization, Quantization (signal), Correlation, Symmetric matrices, Image coding, Sparse matrices, multimedia BibRef

Matsubara, T.[Takashi],
Target-Oriented Deformation of Visual-Semantic Embedding Space,
IEICE(E104-D), No. 1, January 2021, pp. 24-33.
WWW Link. 2101
BibRef

Nie, X., Wang, B., Li, J., Hao, F., Jian, M., Yin, Y.,
Deep Multiscale Fusion Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 1, January 2021, pp. 401-410.
IEEE DOI 2101
Semantics, Machine learning, Training data, Media, Correlation, Retrieval, hashing, deep learning, cross-modal BibRef

Liu, X.[Xin], Hu, Z.K.[Zhi-Kai], Ling, H.B.[Hai-Bin], Cheung, Y.M.[Yiu-Ming],
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval,
PAMI(43), No. 3, March 2021, pp. 964-981.
IEEE DOI 2102
Lips, Semantics, Adaptation models, Task analysis, Encoding, Correlation, Cross-modal retrieval, semantic correlation matrix BibRef

Wu, Y., Wang, S., Song, G., Huang, Q.,
Augmented Adversarial Training for Cross-Modal Retrieval,
MultMed(23), 2021, pp. 559-571.
IEEE DOI 2102
image representation, image retrieval, neural nets, text analysis, adversarial training process, adversa-rial training BibRef

Lin, Q., Cao, W., He, Z., He, Z.,
Mask Cross-Modal Hashing Networks,
MultMed(23), 2021, pp. 550-558.
IEEE DOI 2102
deep learning (artificial intelligence), feature extraction, file organisation, image retrieval, text analysis, cross-modal retrieval BibRef

Qi, M., Qin, J., Yang, Y., Wang, Y., Luo, J.,
Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval,
IP(30), 2021, pp. 2989-3004.
IEEE DOI 2102
Semantics, Binary codes, Feature extraction, Visualization, Task analysis, Natural languages, Stochastic processes, natural language BibRef

Wu, J.L.[Jian-Long], Xie, X.X.[Xing-Xu], Nie, L.Q.[Li-Qiang], Lin, Z.C.[Zhou-Chen], Zha, H.B.[Hong-Bin],
Reconstruction regularized low-rank subspace learning for cross-modal retrieval,
PR(113), 2021, pp. 107813.
Elsevier DOI 2103
Cross-modal retrieval, Low-rank subspace learning, Reconstruction regularization BibRef

Zou, X.T.[Xi-Tao], Wang, X.Z.[Xin-Zhi], Bakker, E.M.[Erwin M.], Wu, S.[Song],
Multi-label semantics preserving based deep cross-modal hashing,
SP:IC(93), 2021, pp. 116131.
Elsevier DOI 2103
Multi-modal retrieval, Deep cross-modal hashing, Multi-label semantic learning BibRef

Shu, X.[Xin], Zhao, G.Y.[Guo-Ying],
Scalable multi-label canonical correlation analysis for cross-modal retrieval,
PR(115), 2021, pp. 107905.
Elsevier DOI 2104
Canonical correlation analysis, Semantic transformation, Cross-modal retrieval, Singular value decomposition BibRef

Song, G.[Ge], Tan, X.Y.[Xiao-Yang],
Real-world Cross-modal Retrieval via Sequential Learning,
MultMed(23), 2021, pp. 1708-1721.
IEEE DOI 2106
BibRef
Earlier:
Sequential Learning for Cross-Modal Retrieval,
CroMoL19(4531-4539)
IEEE DOI 2004
Plugs, Task analysis, Data models, Learning systems, Brain modeling, Adaptation models, Technological innovation, meta learning. information retrieval, learning (artificial intelligence), multimodal data, meta learning BibRef

Chen, W.[Wei], Liu, Y.[Yu], Bakker, E.M.[Erwin M.], Lew, M.S.[Michael S.],
Integrating information theory and adversarial learning for cross-modal retrieval,
PR(117), 2021, pp. 107983.
Elsevier DOI 2106
Cross-modal retrieval, Shannon information theory, Adversarial learning, Modality uncertainty, Data imbalance BibRef

Huang, Z.Y.[Zhen-Yu], Zhou, J.T.Y.[Joey Tian-Yi], Zhu, H.Y.[Hong-Yuan], Zhang, C.Q.[Chang-Qing], Lv, J.C.[Jian-Cheng], Peng, X.[Xi],
Deep Spectral Representation Learning from Multi-View Data,
IP(30), 2021, pp. 5352-5362.
IEEE DOI 2106
Deep learning, Laplace equations, Neural networks, Collaboration, Data models, Task analysis, cross-modal retrieval BibRef

Wen, X.[Xin], Han, Z.Z.[Zhi-Zhong], Liu, Y.S.[Yu-Shen],
CMPD: Using Cross Memory Network With Pair Discrimination for Image-Text Retrieval,
CirSysVideo(31), No. 6, June 2021, pp. 2427-2437.
IEEE DOI 2106
Semantics, Task analysis, Training, Generators, Optimization, Marine vehicles, Retrieval, cross-modal retrieval, adversarial learning BibRef

Liu, J.H.[Jun-Hao], Yang, M.[Min], Li, C.M.[Cheng-Ming], Xu, R.F.[Rui-Feng],
Improving Cross-Modal Image-Text Retrieval With Teacher-Student Learning,
CirSysVideo(31), No. 8, August 2021, pp. 3242-3253.
IEEE DOI 2108
Semantics, Task analysis, Data models, Neural networks, Correlation, Binary codes, Feature extraction, teacher-student learning BibRef

Song, G.[Ge], Tan, X.Y.[Xiao-Yang], Zhao, J.[Jun], Yang, M.[Ming],
Deep robust multilevel semantic hashing for multi-label cross-modal retrieval,
PR(120), 2021, pp. 108084.
Elsevier DOI 2109
Hashing, Multi-label, Cross-modal retrieval, Deep learning BibRef

Song, G.[Ge], Huang, K.[Kai], Su, H.W.[Han-Wen], Song, F.Y.[Feng-Yi], Yang, M.[Ming],
Deep Ranking Distribution Preserving Hashing for Robust Multi-Label Cross-Modal Retrieval,
MultMed(26), 2024, pp. 7027-7042.
IEEE DOI 2405
Codes, Semantics, Training, Correlation, Task analysis, Robustness, Hamming distances, Cross-modal retrieval, deep hashing, multi-label BibRef

Song, G.[Ge], Su, H.W.[Han-Wen], Huang, K.[Kai], Song, F.Y.[Feng-Yi], Yang, M.[Ming],
Deep self-enhancement hashing for robust multi-label cross-modal retrieval,
PR(147), 2024, pp. 110079.
Elsevier DOI 2312
Cross-modal retrieval, Deep hashing, Out-of-distribution, Multi-label BibRef

Fang, Y.Z.[Yu-Zhi],
Robust multimodal discrete hashing for cross-modal similarity search,
JVCIR(79), 2021, pp. 103256.
Elsevier DOI 2109
Hashing, Robust, Cross-modal retrieval, Unsupervised learning BibRef

Nie, X.S.[Xiu-Shan], Liu, X.B.[Xing-Bo], Xi, X.M.[Xiao-Ming], Li, C.L.[Cheng-Long], Yin, Y.L.[Yi-Long],
Fast Unmediated Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 9, September 2021, pp. 3669-3678.
IEEE DOI 2109
Semantics, Training, Optimization, Training data, Binary codes, Correlation, Videos, Cross-modal retrieval, hashing, unmediated, double supervision BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun], Yin, H.F.[He-Feng], Kittler, J.V.[Josef V.],
MOON: Multi-hash codes joint learning for cross-media retrieval,
PRL(151), 2021, pp. 19-25.
Elsevier DOI 2110
Cross-media retrieval, Hashing, Discrete optimization, Joint learning BibRef

Hu, P.[Peng], Peng, X.[Xi], Zhu, H.Y.[Hong-Yuan], Lin, J.[Jie], Zhen, L.L.[Liang-Li], Peng, D.Z.[De-Zhong],
Joint Versus Independent Multiview Hashing for Cross-View Retrieval,
Cyber(51), No. 10, October 2021, pp. 4982-4993.
IEEE DOI 2110
Semantics, Decoding, Training, Computer science, Kernel, Logistics, Cybernetics, Common hamming space, cross-view retrieval, multiview representation learning BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun],
Robust and discrete matrix factorization hashing for cross-modal retrieval,
PR(122), 2022, pp. 108343.
Elsevier DOI 2112
Cross-modal retrieval, Hashing, Autoencoder, Discrete optimization, BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun], Xu, T.Y.[Tian-Yang], Kittler, J.V.[Josef V.],
Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval,
SMCS(52), No. 11, November 2022, pp. 7014-7026.
IEEE DOI 2210
Semantics, Binary codes, Hash functions, Optimization, Quantization (signal), Task analysis, Costs, Cross-modal retrieval, hashing BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun], Liu, Z.[Zhen], Yu, J.[Jun], Kittler, J.V.[Josef V.],
Fast Discrete Cross-Modal Hashing Based on Label Relaxation and Matrix Factorization,
ICPR21(4845-4850)
IEEE DOI 2105
Technological innovation, Quantization (signal), Databases, Instruments, Semantics, Binary codes, Media BibRef

Zhang, L.[Li], Wu, X.Q.[Xiang-Qian],
Multi-task framework based on feature separation and reconstruction for cross-modal retrieval,
PR(122), 2022, pp. 108217.
Elsevier DOI 2112
Cross-modal retrieval, Feature separation, Image reconstruction, Text reconstruction BibRef

Liu, F.[Fangcen], Gao, C.Q.[Chen-Qiang], Sun, Y.Q.[Yong-Qing], Zhao, Y.[Yue], Yang, F.[Feng], Qin, A.[Anyong], Meng, D.Y.[De-Yu],
Infrared and Visible Cross-Modal Image Retrieval Through Shared Features,
CirSysVideo(31), No. 11, November 2021, pp. 4485-4496.
IEEE DOI 2112
Image retrieval, Feature extraction, Task analysis, Imaging, Semantics, Image color analysis, Cameras, maximum mean discrepancy BibRef

Wang, C.Y.[Chao-Yi], Li, L.[Liang], Yan, C.G.[Cheng-Gang], Wang, Z.[Zhan], Sun, Y.Q.[Yao-Qi], Zhang, J.Y.[Ji-Yong],
Cross-modal semantic correlation learning by Bi-CNN network,
IET-IPR(15), No. 14, 2021, pp. 3674-3684.
DOI Link 2112
BibRef

Chakraborty, B.[Bela], Wang, P.[Peng], Wang, L.[Lei],
Inter-Modality Fusion Based Attention for Zero-Shot Cross-Modal Retrieval,
ICIP21(2648-2652)
IEEE DOI 2201
Training, Heating systems, Image processing, Semantics, Pipelines, MIMICs, Zero-shot Learning, Inter-Modality Fusion, Cross-modal Retrieval BibRef

Zhang, P.F.[Peng-Fei], Li, Y.[Yang], Huang, Z.[Zi], Xu, X.S.[Xin-Shun],
Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval,
MultMed(24), 2022, pp. 466-479.
IEEE DOI 2202
Semantics, Convolutional codes, Binary codes, Convolution, Measurement, Feature extraction, Sparse matrices, Multimodal, graph convolutional networks BibRef

Shin, A.[Andrew], Ishii, M.[Masato], Narihira, T.[Takuya],
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision,
IJCV(130), No. 2, February 2022, pp. 435-454.
Springer DOI 2202
BibRef

Ji, Z.[Zhong], Wang, H.R.[Hao-Ran], Han, J.G.[Jun-Gong], Pang, Y.W.[Yan-Wei],
SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text Retrieval,
Cyber(52), No. 2, February 2022, pp. 1086-1097.
IEEE DOI 2202
Visualization, Semantics, Feature extraction, Correlation, Task analysis, Extraterrestrial measurements, Deep learning, vision and language BibRef

Ma, J.J.[Jing-Jing], Shi, D.[Duanpeng], Tang, X.[Xu], Zhang, X.R.[Xiang-Rong], Jiao, L.C.[Li-Cheng],
Dual Modality Collaborative Learning for Cross-Source Remote Sensing Retrieval,
RS(14), No. 6, 2022, pp. xx-yy.
DOI Link 2204
BibRef

Huang, Y.[Yan], Wang, J.D.[Jing-Dong], Wang, L.[Liang],
Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory,
PAMI(44), No. 6, June 2022, pp. 2968-2983.
IEEE DOI 2205
BibRef
Earlier: A1, A3, Only:
ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching,
ICCV19(5773-5782)
IEEE DOI 2004
Adaptation models, Task analysis, Pattern matching, Logic gates, Visualization, Image color analysis, Data models, similarity gated fusion. image matching, learning (artificial intelligence), storage management, few-shot content, sentence matching tasks, Micromechanical devices BibRef

Xu, X.[Xing], Lin, K.Y.[Kai-Yi], Yang, Y.[Yang], Hanjalic, A.[Alan], Shen, H.T.[Heng Tao],
Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited,
PAMI(44), No. 6, June 2022, pp. 3030-3047.
IEEE DOI 2205
Art, Generative adversarial networks, Training, Correlation, Visualization, Standards, Cross-modal retrieval, knowledge transfer BibRef

Li, S.S.[Shen-Shen], Xu, X.[Xing], Jiang, X.[Xun], Shen, F.M.[Fu-Min], Liu, X.[Xin], Shen, H.T.[Heng Tao],
Multi-Grained Attention Network With Mutual Exclusion for Composed Query-Based Image Retrieval,
CirSysVideo(34), No. 4, April 2024, pp. 2959-2972.
IEEE DOI 2404
Semantics, Image retrieval, Task analysis, Feature extraction, Visualization, Fuses, preserved and modified attentions BibRef

Duan, Y.X.[You-Xiang], Chen, N.[Ning], Zhang, P.Y.[Pei-Ying], Kumar, N.[Neeraj], Chang, L.[Lunjie], Wen, W.[Wu],
MS2GAH: Multi-label semantic supervised graph attention hashing for robust cross-modal retrieval,
PR(128), 2022, pp. 108676.
Elsevier DOI 2205
Cross-modal retrieval, Deep hashing, Graph attention network BibRef

Hamroun, M.[Mohamed], Tamine, K.[Karim], Crespin, B.[Benoît],
Multimodal Video Indexing (MVI): A New Method Based on Machine Learning and Semi-Automatic Annotation on Large Video Collections,
IJIG(22), No. 2, April 2022, pp. 2250022.
DOI Link 2205
BibRef

Parida, K.K.[Kranti Kumar], Sharma, G.[Gaurav],
Discriminative semantic transitive consistency for cross-modal learning,
CVIU(219), 2022, pp. 103404.
Elsevier DOI 2205
Cross-modal retrieval, Distributional matching BibRef

Xu, L.M.[Li-Ming], Zeng, X.H.[Xian-Hua], Zheng, B.[Bochuan], Li, W.S.[Wei-Sheng],
Multi-Manifold Deep Discriminative Cross-Modal Hashing for Medical Image Retrieval,
IP(31), 2022, pp. 3371-3385.
IEEE DOI 2205
Codes, Manifolds, Semantics, Correlation, Image retrieval, Medical diagnostic imaging, Data models, Cross-modal hashing, weak discriminability BibRef

Song, X.[Xue], Chen, J.J.[Jing-Jing], Wu, Z.[Zuxuan], Jiang, Y.G.[Yu-Gang],
Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval,
MultMed(24), 2022, pp. 2914-2923.
IEEE DOI 2206
Visualization, Semantics, Bit error rate, Encoding, Task analysis, Feature extraction, Microphones, Cross-modal retrieval, cross-modal learning BibRef

Ma, X.H.[Xin-Hong], Yang, X.S.[Xiao-Shan], Gao, J.Y.[Jun-Yu], Xu, C.S.[Chang-Sheng],
The Model May Fit You: User-Generalized Cross-Modal Retrieval,
MultMed(24), 2022, pp. 2998-3012.
IEEE DOI 2206
Data models, Task analysis, Adaptation models, Training, Benchmark testing, Pediatrics, Bridges, cross-modal retrieval, meta-learning BibRef

Yang, F.[Fan], Liu, Y.F.[Yu-Feng], Ding, X.J.[Xiao-Jian], Ma, F.M.[Fu-Min], Cao, J.[Jie],
Asymmetric cross-modal hashing with high-level semantic similarity,
PR(130), 2022, pp. 108823.
Elsevier DOI 2206
Cross-modal retrieval, Hashing, Similarity search, Supervised, Optimization BibRef

Shan, W.[Wei], Huang, D.[Dan], Wang, J.T.[Jiang-Tao], Zou, F.[Feng], Li, S.[Suwen],
Self-Attention based fine-grained cross-media hybrid network,
PR(130), 2022, pp. 108748.
Elsevier DOI 2206
Fine-Grained, Cross-Media, Retrieval, Attention BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun],
Scalable Discrete Matrix Factorization and Semantic Autoencoder for Cross-Media Retrieval,
Cyber(52), No. 7, July 2022, pp. 5947-5960.
IEEE DOI 2207
Semantics, Hash functions, Binary codes, Quantization (signal), Training data, Training, Task analysis, Autoencoder, hashing BibRef

Qian, S.S.[Sheng-Sheng], Xue, D.Z.[Di-Zhan], Fang, Q.[Quan], Xu, C.S.[Chang-Sheng],
Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal Retrieval,
MultMed(24), 2022, pp. 3520-3532.
IEEE DOI 2207
Correlation, Semantics, Task analysis, Adaptation models, Adaptive systems, Birds, Oceans, Cross-modal retrieval, Graph convolutional networks BibRef

Wang, Y.[Yunbo], Peng, Y.X.[Yu-Xin],
MARS: Learning Modality-Agnostic Representation for Scalable Cross-Media Retrieval,
CirSysVideo(32), No. 7, July 2022, pp. 4765-4777.
IEEE DOI 2207
Semantics, Correlation, Training, Cats, Automobiles, Transforms, Media, Multi-modality learning, cross-media retrieval, similarity retrieval BibRef

Wang, L.[Lu], Zareapoor, M.[Masoumeh], Yang, J.[Jie], Zheng, Z.L.[Zhong-Long],
Asymmetric Correlation Quantization Hashing for Cross-Modal Retrieval,
MultMed(24), 2022, pp. 3665-3678.
IEEE DOI 2208
Semantics, Quantization (signal), Correlation, Binary codes, Databases, Optimization, Hash functions, Compositional quantization BibRef

Qin, J.Y.[Jian-Yang], Fei, L.[Lunke], Zhang, Z.[Zheng], Wen, J.[Jie], Xu, Y.[Yong], Zhang, D.[David],
Joint Specifics and Consistency Hash Learning for Large-Scale Cross-Modal Retrieval,
IP(31), 2022, pp. 5343-5358.
IEEE DOI 2208
Binary codes, Semantics, Hash functions, Feature extraction, Collaboration, Training, Optimization, Learning to hash, large-scale similarity searching BibRef

Liu, G.H.[Guang-Hai], Li, Z.Y.[Zuo-Yong], Yang, J.Y.[Jing-Yu], Zhang, D.[David],
Exploiting sublimated deep features for image retrieval,
PR(147), 2024, pp. 110076.
Elsevier DOI 2312
Image retrieval, Deep feature, Orientation-selective mechanism, Sublimated deep feature histogram, Gain whitening learning BibRef

Shi, Y.F.[Yu-Feng], Zhao, Y.[Yue], Liu, X.[Xin], Zheng, F.[Feng], Ou, W.H.[Wei-Hua], You, X.G.[Xin-Ge], Peng, Q.[Qinmu],
Deep Adaptively-Enhanced Hashing With Discriminative Similarity Guidance for Unsupervised Cross-Modal Retrieval,
CirSysVideo(32), No. 10, October 2022, pp. 7255-7268.
IEEE DOI 2210
Hash functions, Optimization, Codes, Semantics, Estimation, Computer science, Annotations, Cross-modal retrieval, optimization strategy BibRef

Liu, Z.[Zhi], Zhao, F.Y.[Fang-Yuan], Zhang, M.M.[Meng-Meng],
An Efficient Multimodal Aggregation Network for Video-Text Retrieval,
IEICE(E105-D), No. 10, October 2022, pp. 1825-1828.
WWW Link. 2210
BibRef

Guo, D.J.[Dong-Jin], Su, X.M.[Xiao-Ming], Lian, Y.[Yahong], Liu, L.M.[Li-Min], Wang, H.B.[Hai-Bo],
Two-stage partial image-text clustering (TPIT-C),
IET-CV(16), No. 8, 2022, pp. 694-708.
DOI Link 2210
BibRef

Wang, S.[Song], Zhao, H.[Huan], Li, K.Q.[Ke-Qin],
Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search,
CirSysVideo(32), No. 11, November 2022, pp. 8022-8036.
IEEE DOI 2211
Semantics, Codes, Optimization, Training, Task analysis, Matrix converters, Hash functions, Cross-modal image-text search, supervised hashing BibRef

Liu, X.H.[Xing-Hua], Cao, G.T.[Gui-Tao], Lin, Q.B.[Qiu-Bin], Cao, W.M.[Wen-Ming],
Adaptive weight multi-channel center similar deep hashing,
JVCIR(89), 2022, pp. 103642.
Elsevier DOI 2212
Multi-channel, Center similar, Multimodal retrieval, Deep cross-modal hashing BibRef

Lan, R.[Rushi], Tan, Y.[Yu], Wang, X.Q.[Xiao-Qin], Liu, Z.B.[Zhen-Bing], Luo, X.N.[Xiao-Nan],
Label Guided Discrete Hashing for Cross-Modal Retrieval,
ITS(23), No. 12, December 2022, pp. 25236-25248.
IEEE DOI 2212
Codes, Manifolds, Semantics, Training, Binary codes, Task analysis, Sparse matrices, Cross-modal retrieval, manifold embedding, balanced matrix BibRef

Wang, Y.X.[Yong-Xin], Chen, Z.D.[Zhen-Duo], Luo, X.[Xin], Xu, X.S.[Xin-Shun],
A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval,
CirSysVideo(32), No. 12, December 2022, pp. 8822-8836.
IEEE DOI 2212
Codes, Semantics, Encoding, Task analysis, Optimization, Streaming media, Sparse matrices, Sparse hashing, fine-grained similarity BibRef

Jin, M.[Ming], Zhang, H.X.[Hua-Xiang], Zhu, L.[Lei], Sun, J.D.[Jian-De], Liu, L.[Li],
Video Sampled Frame Category Aggregation and Consistent Representation for Cross-Modal Retrieval,
CirSysVideo(33), No. 2, February 2023, pp. 909-919.
IEEE DOI 2302
Feature extraction, Semantics, Training, Convolution, Dogs, Network architecture, Video and text cross-modal retrieval, video internal frame aggregation loss module BibRef

Liao, L.[Lei], Yang, M.[Meng], Zhang, B.[Bob],
Deep Supervised Dual Cycle Adversarial Network for Cross-Modal Retrieval,
CirSysVideo(33), No. 2, February 2023, pp. 920-934.
IEEE DOI 2302
Semantics, Generative adversarial networks, Feature extraction, Task analysis, Media, Deep learning, Neural networks, deep supervised learning BibRef

Su, M.Y.[Ming-Yue], Gu, G.H.[Guang-Hua], Ren, X.[Xianlong], Fu, H.[Hao], Zhao, Y.[Yao],
Semi-Supervised Knowledge Distillation for Cross-Modal Hashing,
MultMed(25), 2023, pp. 662-675.
IEEE DOI 2302
Semantics, Knowledge engineering, Codes, Predictive models, Data models, Cows, Bridges, Cross-modal retrieval, triplet ranking loss BibRef

Gong, Y.[Yan], Cosma, G.[Georgina],
Improving visual-semantic embeddings by learning semantically-enhanced hard negatives for cross-modal information retrieval,
PR(137), 2023, pp. 109272.
Elsevier DOI 2302
Visual semantic embedding network, Cross-modal, Information retrieval, Hard negatives BibRef

Li, W.H.[Wen-Hui], Wang, Y.[Yan], Su, Y.T.[Yu-Ting], Li, X.Y.[Xuan-Ya], Liu, A.A.[An-An], Zhang, Y.D.[Yong-Dong],
Multi-Scale Fine-Grained Alignments for Image and Sentence Matching,
MultMed(25), 2023, pp. 543-556.
IEEE DOI 2302
Semantics, Visualization, Dogs, Mouth, Task analysis, Feature extraction, Bridges, Bi-directional aggregations, multi-scale alignments BibRef

Ou, W.H.[Wei-Hua], Deng, J.X.[Jia-Xin], Zhang, L.[Lei], Gou, J.P.[Jian-Ping], Zhou, Q.[Quan],
Cross-Modal Generation and Pair Correlation Alignment Hashing,
ITS(24), No. 3, March 2023, pp. 3018-3026.
IEEE DOI 2303
Semantics, Feature extraction, Correlation, Codes, Transformers, Generative adversarial networks, Data mining, cross-modal interaction BibRef

Wang, D.[Di], Zhang, C.P.[Cai-Ping], Wang, Q.[Quan], Tian, Y.M.[Yu-Min], He, L.[Lihuo], Zhao, L.[Lin],
Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval,
MultMed(25), 2023, pp. 1217-1229.
IEEE DOI 2305
Semantics, Codes, Binary codes, Representation learning, Correlation, Hash functions, Feature extraction, Cross-modal retrieval, hierarchical learning BibRef

Hu, P.[Peng], Huang, Z.Y.[Zhen-Yu], Peng, D.Z.[De-Zhong], Wang, X.[Xu], Peng, X.[Xi],
Cross-Modal Retrieval With Partially Mismatched Pairs,
PAMI(45), No. 8, August 2023, pp. 9595-9610.
IEEE DOI 2307
Semantics, Force, Cognition, Visualization, Upper bound, Stability analysis, Robustness, mismatched pairs BibRef

Liu, Y.X.[Ya-Xin], Wu, J.L.[Jian-Long], Qu, L.[Leigang], Gan, T.[Tian], Yin, J.H.[Jian-Hua], Nie, L.Q.[Li-Qiang],
Self-Supervised Correlation Learning for Cross-Modal Retrieval,
MultMed(25), 2023, pp. 2851-2863.
IEEE DOI 2307
Correlation, Semantics, Mutual information, Kernel, Unsupervised learning, Supervised learning, mutual information estimation BibRef

Wang, B.H.[Ben-Hui], Zhang, H.X.[Hua-Xiang], Zhu, L.[Lei], Nie, L.Q.[Li-Qiang], Liu, L.[Li],
Multi-level adversarial attention cross-modal hashing,
SP:IC(117), 2023, pp. 117017.
Elsevier DOI 2308
Cross-modal retrieval, Adversarial Learning, Attentional mechanism, Hashing BibRef

Sun, C.[Chunpu], Zhang, H.X.[Hua-Xiang], Liu, L.[Li], Liu, D.M.[Dong-Mei], Wang, L.[Lin],
Multi-Label Adversarial Fine-Grained Cross-Modal Retrieval,
SP:IC(117), 2023, pp. 117018.
Elsevier DOI 2308
Common representation, Transformer, Adversarial learning, Cross-modal retrieval BibRef

Guo, S.T.[Sheng-Tang], Zhang, H.X.[Hua-Xiang], Liu, L.[Li], Liu, D.M.[Dong-Mei], Lu, X.[Xu], Li, L.J.[Liu-Jian],
Hypergraph clustering based multi-label cross-modal retrieval,
JVCIR(103), 2024, pp. 104258.
Elsevier DOI 2409
Cross-modal retrieval, Hypergraph, Clustering, Alignment BibRef

Huo, Y.D.[Ya-Dong], Qin, Q.[Qibing], Dai, J.Y.[Jiang-Yan], Wang, L.[Lei], Zhang, W.F.[Wen-Feng], Huang, L.[Lei], Wang, C.[Chengduan],
Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal Retrieval,
CirSysVideo(34), No. 1, January 2024, pp. 576-589.
IEEE DOI Code:
WWW Link. 2401
BibRef

Song, D.[Dan], Ling, Y.T.[Yu-Ting], Li, T.[Tianbao], Wang, T.[Teng], Li, X.[Xuanya],
Hierarchical deep semantic alignment for cross-domain 3D model retrieval,
JVCIR(95), 2023, pp. 103895.
Elsevier DOI 2309
3D model retrieval, Unsupervised domain adaptation, Representation learning BibRef

Li, T.B.[Tian-Bao], Liu, A.A.[An-An], Song, D.[Dan], Li, W.H.[Wen-Hui], Li, X.Y.[Xuan-Ya], Su, Y.T.[Yu-Ting],
Focus on Hard Samples: Hierarchical Unbiased Constraints for Cross-Domain 3D Model Retrieval,
CirSysVideo(33), No. 11, November 2023, pp. 7036-7049.
IEEE DOI 2311
BibRef

Dong, X.[Xiao], Zhan, X.L.[Xun-Lin], Wei, Y.C.[Yun-Chao], Wei, X.Y.[Xiao-Yong], Wang, Y.[Yaowei], Lu, M.L.[Min-Long], Cao, X.C.[Xiao-Chun], Liang, X.D.[Xiao-Dan],
Entity-Graph Enhanced Cross-Modal Pretraining for Instance-Level Product Retrieval,
PAMI(45), No. 11, November 2023, pp. 13117-13133.
IEEE DOI 2310
BibRef

Zhan, X.L.[Xun-Lin], Wu, Y.X.[Yang-Xin], Dong, X.[Xiao], Wei, Y.C.[Yun-Chao], Lu, M.L.[Min-Long], Zhang, Y.C.[Yi-Chi], Xu, H.[Hang], Liang, X.D.[Xiao-Dan],
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining,
ICCV21(11762-11771)
IEEE DOI 2203
Industries, Measurement, Codes, Transformers, Solids, Electronic commerce, Image and video retrieval, Vision + language BibRef

Zhang, X.[Xiong], Li, W.P.[Wei-Peng], Wang, X.[Xu], Wang, L.[Luyao], Zheng, F.Z.[Fu-Zhong], Wang, L.[Long], Zhang, H.[Haisu],
A Fusion Encoder with Multi-Task Guidance for Cross-Modal Text-Image Retrieval in Remote Sensing,
RS(15), No. 18, 2023, pp. 4637.
DOI Link 2310
BibRef

Tu, R.C.[Rong-Cheng], Jiang, J.[Jie], Lin, Q.H.[Qing-Hong], Cai, C.F.[Cheng-Fei], Tian, S.X.[Shang-Xuan], Wang, H.F.[Hong-Fa], Liu, W.[Wei],
Unsupervised Cross-Modal Hashing With Modality-Interaction,
CirSysVideo(33), No. 9, September 2023, pp. 5296-5308.
IEEE DOI 2310
BibRef

Liu, X.[Xin], Yi, J.H.[Jin-Han], Cheung, Y.M.[Yiu-Ming], Xu, X.[Xing], Cui, Z.[Zhen],
OMGH: Online Manifold-Guided Hashing for Flexible Cross-Modal Retrieval,
MultMed(25), 2023, pp. 3811-3824.
IEEE DOI 2310
BibRef

Peng, S.J.[Shu-Juan], Yi, J.H.[Jin-Han], Liu, X.[Xin], Cheung, Y.M.[Yiu-Ming], Cui, Z.[Zhen], Li, T.H.[Tai-Hao],
OLCH: Online Label Consistent Hashing for streaming cross-modal retrieval,
PR(150), 2024, pp. 110335.
Elsevier DOI 2403
Cross-modal hashing, Online label consistent hashing, Mini-batch online gradient descent, Forward-backward splitting BibRef

Tan, W.T.[Wen-Tao], Zhu, L.[Lei], Li, J.J.[Jing-Jing], Zhang, H.X.[Hua-Xiang], Han, J.W.[Jun-Wei],
Teacher-Student Learning: Efficient Hierarchical Message Aggregation Hashing for Cross-Modal Retrieval,
MultMed(25), 2023, pp. 4520-4532.
IEEE DOI 2310
BibRef

Song, L.Y.[Ling-Yun], Shang, X.[Xuequn], Yang, C.[Chen], Sun, M.X.[Ming-Xuan],
Attribute-Guided Multiple Instance Hashing Network for Cross-Modal Zero-Shot Hashing,
MultMed(25), 2023, pp. 5305-5318.
IEEE DOI 2311
BibRef

Li, L.[Li], Shu, Z.Q.[Zhen-Qiu], Yu, Z.T.[Zheng-Tao], Wu, X.J.[Xiao-Jun],
Robust online hashing with label semantic enhancement for cross-modal retrieval,
PR(145), 2024, pp. 109972.
Elsevier DOI 2311
Robust, Noise, Low-rank, Sparse, Multi-label semantic correlations, Similarity, Online hashing, Cross-modal retrieval BibRef

Ye, Z.[Zesheng], Yao, L.[Lina], Zhang, Y.[Yu], Gustin, S.[Sylvia],
Self-supervised cross-modal visual retrieval from brain activities,
PR(145), 2024, pp. 109915.
Elsevier DOI 2311
Visual stimuli recovery, Cross-modal retrieval, Self-supervised learning, Brain-Computer Interface BibRef

Chen, Z.J.[Zheng-Jie], Zhang, Y.[Yu], Mi, S.[Siya],
Assisting Multimodal Named Entity Recognition by cross-modal auxiliary tasks,
PRL(175), 2023, pp. 52-58.
Elsevier DOI 2311
Multimodal named entity recognition, Multi-task learning, Cross-modal learning BibRef

Liu, X.Q.[Xiao-Qing], Zeng, H.Q.[Huan-Qiang], Shi, Y.F.[Yi-Fan], Zhu, J.Q.[Jian-Qing], Hsia, C.H.[Chih-Hsien], Ma, K.K.[Kai-Kuang],
Deep Cross-Modal Hashing Based on Semantic Consistent Ranking,
MultMed(25), 2023, pp. 9530-9542.
IEEE DOI 2312
BibRef

Luo, K.Y.[Kai-Yi], Zhang, C.[Chao], Li, H.X.[Hua-Xiong], Jia, X.[Xiuyi], Chen, C.L.[Chun-Lin],
Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval,
MultMed(25), 2023, pp. 9082-9095.
IEEE DOI 2312
BibRef

Li, Z.X.[Zheng-Xin], Zhao, W.Z.[Wen-Zhe], Du, X.Y.[Xuan-Yi], Zhou, G.Y.[Guang-Yao], Zhang, S.[Songlin],
Cross-Modal Retrieval and Semantic Refinement for Remote Sensing Image Captioning,
RS(16), No. 1, 2024, pp. xx-yy.
DOI Link 2401
BibRef

Xu, R.Q.[Rui-Qing], Mayer, W.[Wolfgang], Chu, H.L.[Hai-Long], Zhang, Y.[Yitao], Zhang, H.Y.[Hong-Yu], Wang, Y.L.[Yu-Long], Liu, Y.[Youfa], Feng, Z.[Zaiwen],
Automatic semantic modeling of structured data sources with cross-modal retrieval,
PRL(177), 2024, pp. 7-14.
Elsevier DOI 2401
Semantic model, Ontology, Cross-modal retrieval, Attention mechanism, Graph representation learning BibRef

Okamura, D.[Daiki], Harakawa, R.[Ryosuke], Iwahashi, M.[Masahiro],
LCNME: Label Correction Using Network Prediction Based on Memorization Effects for Cross-Modal Retrieval With Noisy Labels,
CirSysVideo(34), No. 1, January 2024, pp. 590-602.
IEEE DOI 2401
BibRef

Yang, F.[Fan], Han, M.[Meng], Ma, F.M.[Fu-Min], Liu, Y.F.[Yu-Feng], Ding, X.J.[Xiao-Jian], Tong, D.Y.[De-Yu],
Disperse Asymmetric Subspace Relation Hashing for Cross-Modal Retrieval,
CirSysVideo(34), No. 1, January 2024, pp. 603-617.
IEEE DOI 2401
BibRef

Zhang, G.J.[Gang-Jian], Li, S.K.[Shi-Kun], Wei, S.K.[Shi-Kui], Ge, S.M.[Shi-Ming], Cai, N.[Na], Zhao, Y.[Yao],
Multimodal Composition Example Mining for Composed Query Image Retrieval,
IP(33), 2024, pp. 1149-1161.
IEEE DOI 2402
Image retrieval, Training, Task analysis, Extraterrestrial measurements, Training data, Force, Semantics, hard example mining BibRef

Sun, Y.[Yuan], Ren, Z.W.[Zhen-Wen], Hu, P.[Peng], Peng, D.Z.[De-Zhong], Wang, X.[Xu],
Hierarchical Consensus Hashing for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 824-836.
IEEE DOI 2402
Codes, Semantics, Hash functions, Correlation, Kernel, Feature extraction, Eigenvalues and eigenfunctions, learning to hash BibRef

Zhang, L.[Lei], Chen, L.[Leiting], Zhou, C.[Chuan], Li, X.[Xin], Yang, F.[Fan], Yi, Z.[Zhang],
Weighted Graph-Structured Semantics Constraint Network for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 1551-1564.
IEEE DOI 2402
Semantics, Training, Feature extraction, Representation learning, Data models, Correlation, Games, Cross-modal retrieval, graph neural network BibRef

Wang, Y.B.[Ya-Bing], Wang, S.H.[Shu-Hui], Luo, H.[Hao], Dong, J.F.[Jian-Feng], Wang, F.[Fan], Han, M.[Meng], Wang, X.[Xun], Wang, M.[Meng],
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval,
IP(33), 2024, pp. 1522-1533.
IEEE DOI 2403
Visualization, Noise measurement, Estimation, Costs, Transportation, Training, Task analysis, Cross-modal retrieval, machine translation BibRef

Meng, M.[Min], Sun, J.X.[Jia-Xuan], Liu, J.G.[Ji-Gang], Yu, J.[Jun], Wu, J.G.[Ji-Gang],
Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval,
CirSysVideo(34), No. 3, March 2024, pp. 1914-1926.
IEEE DOI 2403
Semantics, Representation learning, Task analysis, Feature extraction, Shape, Robustness, disentangled representation BibRef

Zhang, H.[Han], Li, Y.D.[Yi-Ding], Li, X.L.[Xue-Long],
Constrained Bipartite Graph Learning for Imbalanced Multi-Modal Retrieval,
MultMed(26), 2024, pp. 4502-4514.
IEEE DOI 2403
Correlation, Bipartite graph, Semantics, Task analysis, Optimization, Visualization, Annotations, Constrained bipartite graph, query graph BibRef

Wang, Z.[Zheng], Xu, X.[Xing], Wei, J.[Jiwei], Xie, N.[Ning], Yang, Y.[Yang], Shen, H.T.[Heng Tao],
Semantics Disentangling for Cross-Modal Retrieval,
IP(33), 2024, pp. 2226-2237.
IEEE DOI 2404
Semantics, Correlation, Feature extraction, Representation learning, Interference, Task analysis, Shape, subspace learning BibRef

Ma, X.R.[Xin-Ran], Yang, M.X.[Mou-Xing], Li, Y.F.[Yun-Fan], Hu, P.[Peng], Lv, J.C.[Jian-Cheng], Peng, X.[Xi],
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining,
IP(33), 2024, pp. 2587-2598.
IEEE DOI Code:
WWW Link. 2404
Noise measurement, Refining, Self-supervised learning, Task analysis, Robustness, Data mining, Annotations, graph matching BibRef

Feng, Y.L.[Yang-Lin], Zhu, H.Y.[Hong-Yuan], Peng, D.Z.[De-Zhong], Peng, X.[Xi], Hu, P.[Peng],
RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval,
CVPR23(11610-11619)
IEEE DOI 2309
BibRef

Hu, P.[Peng], Peng, X.[Xi], Zhu, H.Y.[Hong-Yuan], Zhen, L.L.[Liang-Li], Lin, J.[Jie],
Learning Cross-Modal Retrieval with Noisy Labels,
CVPR21(5399-5409)
IEEE DOI 2111
Costs, Annotations, Interference, Noise measurement, Labeling BibRef

Wen, H.[Haokun], Song, X.[Xuemeng], Yin, J.H.[Jian-Hua], Wu, J.L.[Jian-Long], Guan, W.[Weili], Nie, L.Q.[Li-Qiang],
Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval,
PAMI(46), No. 5, May 2024, pp. 3665-3678.
IEEE DOI 2404
Iterative methods, Task analysis, Image retrieval, Training, Benchmark testing, Image color analysis, multimodal retrieval BibRef

Ji, Z.[Zhong], Lin, Z.G.[Zhi-Gang], Wang, H.R.[Hao-Ran], Pang, Y.W.[Yan-Wei], Li, X.L.[Xue-Long],
Multi-task hierarchical convolutional network for visual-semantic cross-modal retrieval,
PR(151), 2024, pp. 110398.
Elsevier DOI 2404
Vision and language, Cross-modal retrieval, Multi-task learning, Metric learning BibRef

Hu, Z.K.[Zhi-Kai], Cheung, Y.M.[Yiu-Ming], Li, M.K.[Meng-Ke], Lan, W.C.[Wei-Chao], Zhang, D.L.[Dong-Lin], Liu, Q.[Qiang],
Joint Semantic Preserving Sparse Hashing for Cross-Modal Retrieval,
CirSysVideo(34), No. 4, April 2024, pp. 2989-3002.
IEEE DOI 2404
Codes, Semantics, Sparse matrices, Hash functions, Encoding, Task analysis, Quantization (signal), Cross-modal retrieval, discrete optimization BibRef

Qin, Q.B.[Qi-Bing], Huo, Y.D.[Ya-Dong], Huang, L.[Lei], Dai, J.Y.[Jiang-Yan], Zhang, H.H.[Hui-Hui], Zhang, W.F.[Wen-Feng],
Deep Neighborhood-Preserving Hashing With Quadratic Spherical Mutual Information for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 6361-6374.
IEEE DOI 2404
Semantics, Mutual information, Transformers, Feature extraction, Clamps, Binary codes, Tuning, Cross-modal retrieval, deep hashing, transformer encoder BibRef

Liang, X.[Xiao], Yang, E.[Erkun], Yang, Y.H.[Yan-Hua], Deng, C.[Cheng],
Multi-Relational Deep Hashing for Cross-Modal Search,
IP(33), 2024, pp. 3009-3020.
IEEE DOI 2405
Codes, Semantics, Loss measurement, Training, Hash functions, Data models, Correlation, Cross-modal retrieval, metric learning BibRef

Pang, S.[Shanmin], Zeng, Y.[Yueyang], Zhao, J.W.[Jia-Wei], Xue, J.R.[Jian-Ru],
A Mutually Textual and Visual Refinement Network for Image-Text Matching,
MultMed(26), 2024, pp. 7555-7566.
IEEE DOI 2405
Semantics, Visualization, Vectors, Cameras, Image segmentation, Feature extraction, Image coding, Cross-modal retrieval, semantic alignment enhancement BibRef

Teng, S.H.[Shao-Hua], Li, J.B.[Jiang-Bo], Teng, L.[Luyao], Fei, L.[Lunke], Wu, N.Q.[Nai-Qi], Zhang, W.[Wei],
Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 7917-7932.
IEEE DOI 2405
Codes, Semantics, Encoding, Optimization, Linear matrix inequalities, Costs, Hash functions, Unequal length encoding, dual semantic embedding learning BibRef

Yang, D.K.[Ding-Kang], Kuang, H.P.[Hao-Peng], Yang, K.[Kun], Li, M.C.[Ming-Cheng], Zhang, L.H.[Li-Hua],
Towards Asynchronous Multimodal Signal Interaction and Fusion via Tailored Transformers,
SPLetters(31), 2024, pp. 1550-1554.
IEEE DOI 2406
Transformers, Matrix decomposition, Kernel, Complexity theory, Benchmark testing, Visualization, Feature extraction, sentiment analysis BibRef

Wang, Y.X.[Yong-Xin], Zhan, Y.W.[Yu-Wei], Chen, Z.D.[Zhen-Duo], Luo, X.[Xin], Xu, X.S.[Xin-Shun],
Multiple Information Embedded Hashing for Large-Scale Cross-Modal Retrieval,
CirSysVideo(34), No. 6, June 2024, pp. 5118-5131.
IEEE DOI Code:
WWW Link. 2406
Codes, Semantics, Hash functions, Optimization, Noise measurement, Data mining, Linear regression, Cross-modal retrieval, hashing, robustness BibRef

Hou, Y.L.[Yi-Lin], Zhong, X.J.[Xian-Jing], Cao, H.[Hui], Zhu, Z.[Zheng], Zhou, Y.F.[Yun-Feng], Zhang, J.[Jie],
A shared-private sentiment analysis approach based on cross-modal information interaction,
PRL(183), 2024, pp. 140-146.
Elsevier DOI 2406
Sentiment analysis, Multimodal data, Improved transformer, Self-attention mechanism, Multi-head attention BibRef

Chen, S.W.[Shao-Wei], Liu, S.[Shuaipeng], Liu, J.[Jie],
Type-Specific Modality Alignment for Multi-Modal Information Extraction,
SPLetters(31), 2024, pp. 1525-1529.
IEEE DOI 2406
Visualization, Semantics, Task analysis, Information retrieval, Training, Measurement, Image coding, global modality integration BibRef

Zheng, Z.Q.[Zi-Qiang], Ren, H.[Hao], Wu, Y.[Yang], Zhang, W.C.[Wei-Chuan], Lu, H.[Hong], Yang, Y.[Yang], Shen, H.T.[Heng Tao],
Fully Unsupervised Domain-Agnostic Image Retrieval,
CirSysVideo(34), No. 6, June 2024, pp. 5077-5090.
IEEE DOI 2406
Image retrieval, Task analysis, Training, Feature extraction, Annotations, Visualization, Data models, domain adaptation BibRef

Zhang, J.Z.[Jin-Zhi], Wang, L.[Luyao], Zheng, F.Z.[Fu-Zhong], Wang, X.[Xu], Zhang, H.[Haisu],
An Enhanced Feature Extraction Framework for Cross-Modal Image-Text Retrieval,
RS(16), No. 12, 2024, pp. 2201.
DOI Link 2406
BibRef

Cheng, Q.R.[Qing-Rong], Tan, Z.S.[Zhen-Shan], Wen, K.Y.[Ke-Yu], Chen, C.[Cheng], Gu, X.D.[Xiao-Dong],
Semantic Pre-Alignment and Ranking Learning With Unified Framework for Cross-Modal Retrieval,
CirSysVideo(34), No. 7, July 2024, pp. 6503-6516.
IEEE DOI 2407
Semantics, Visualization, Optimization, Feature extraction, Uniform resource locators, Task analysis, Correlation, Retrieval, average precision BibRef

Kang, X.[Xiao], Liu, X.B.[Xing-Bo], Zhang, X.N.[Xue-Ning], Nie, X.S.[Xiu-Shan], Yin, Y.L.[Yi-Long],
Online Discriminative Cross-Modal Hashing,
CirSysVideo(34), No. 7, July 2024, pp. 5242-5254.
IEEE DOI 2407
Codes, Semantics, Training, Hash functions, Data models, Weight measurement, Correlation, Cross-modal retrieval, adaptive bit-wise weighting BibRef

Zhang, X.N.[Xue-Ning], Liu, X.B.[Xing-Bo], Nie, X.[Xiushan], Kang, X.[Xiao], Yin, Y.L.[Yi-Long],
Semi-Supervised Semi-Paired Cross-Modal Hashing,
CirSysVideo(34), No. 7, July 2024, pp. 6517-6529.
IEEE DOI 2407
Semantics, Correlation, Codes, Labeling, Hash functions, Costs, Training data, Cross-modal retrieval, semi-supervised learning, label-enhanced strategy BibRef

Li, J.X.[Jia-Xing], Wong, W.K.[Wai Keung], Jiang, L.[Lin], Fang, X.Z.[Xiao-Zhao], Xie, S.L.[Sheng-Li], Xu, Y.[Yong],
CKDH: CLIP-Based Knowledge Distillation Hashing for Cross-Modal Retrieval,
CirSysVideo(34), No. 7, July 2024, pp. 6530-6541.
IEEE DOI 2407
Feature extraction, Codes, Training, Semantics, Data models, Data mining, Cross-modal retrieval, deep hashing BibRef

Yong, K.L.[Kai-Ling], Shu, Z.Q.[Zhen-Qiu], Wang, H.B.[Hong-Bin], Yu, Z.T.[Zheng-Tao],
Two-stage zero-shot sparse hashing with missing labels for cross-modal retrieval,
PR(155), 2024, pp. 110717.
Elsevier DOI Code:
WWW Link. 2408
Missing labels, Zero-shot sparse hashing, Cross-modal retrieval, Joint semantic similarity, Clustering-wise similarity BibRef

Xue, P.[Peng], Niu, S.[Sijie],
A novel active contour model based on features for image segmentation,
PR(155), 2024, pp. 110673.
Elsevier DOI Code:
WWW Link. 2408
Active contour model, Energy functional, Feature energy function, Complex natural image BibRef

Mao, Y.Q.[Yi-Qiao], Yan, X.Q.[Xiao-Qiang], Hu, S.Z.[Shi-Zhe], Ye, Y.D.[Yang-Dong],
Contrastive cross-modal clustering with twin network,
PR(155), 2024, pp. 110645.
Elsevier DOI 2408
Cross-modal clustering, Correlation information, Contrastive learning, Twin network BibRef

Yan, J.[Jiexi], Deng, C.[Cheng], Huang, H.[Heng], Liu, W.[Wei],
Causality-Invariant Interactive Mining for Cross-Modal Similarity Learning,
PAMI(46), No. 9, September 2024, pp. 6216-6230.
IEEE DOI 2408
Data mining, Correlation, Semantics, Task analysis, Extraterrestrial measurements, Training, Image retrieval, similarity learning BibRef

Wang, J.P.[Jin-Peng], Zeng, Z.Y.[Zi-Yun], Chen, B.[Bin], Wang, Y.T.[Yu-Ting], Liao, D.L.[Dong-Liang], Li, G.F.[Gong-Fu], Wang, Y.R.[Yi-Ru], Xia, S.T.[Shu-Tao],
Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned Transformers,
IJCV(132), No. 8, August 2024, pp. 2765-2797.
Springer DOI 2408
BibRef

Kang, X.[Xiao], Liu, X.[Xingbo], Xue, W.[Wen], Zhang, X.[Xuening], Nie, X.[Xiushan], Yin, Y.L.[Yi-Long],
Discrete online cross-modal hashing with consistency preservation,
PR(155), 2024, pp. 110688.
Elsevier DOI 2408
Cross-modal retrieval, Supervised online hashing, Continuous semantic embedding, Modality deviation calibration BibRef

Tu, J.F.[Jun-Feng], Liu, X.L.[Xue-Liang], Hao, Y.B.[Yan-Bin], Hong, R.C.[Ri-Chang], Wang, M.[Meng],
Two-Step Discrete Hashing for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 8730-8741.
IEEE DOI 2408
Codes, Feature extraction, Quantization (signal), Transformers, Semantics, Transforms, Binary codes, Cross-modal hashing, hashing BibRef

Cai, H.M.[Hong-Min], Zhang, B.[Bin], Li, J.Y.[Jun-Yu], Hu, B.[Bin], Chen, J.Z.[Jia-Zhou],
Unsupervised Dual Hashing Coding (UDC) on Semantic Tagging and Sample Content for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 9109-9120.
IEEE DOI 2409
Codes, Semantics, Tail, Encoding, Transforms, Tagging, Matrix decomposition, Cross-modal retrieval, multimodal BibRef

Wu, W.J.[Wei-Jia], Zhao, Y.Z.[Yu-Zhong], Li, Z.[Zhuang], Li, J.H.[Jia-Hong], Zhou, H.[Hong], Shou, M.Z.[Mike Zheng], Bai, X.[Xiang],
A large cross-modal video retrieval dataset with reading comprehension,
PR(157), 2025, pp. 110818.
Elsevier DOI 2409
Cross-modal, Retrieval, Text reading, Contrastive learning BibRef

Jiang, L.[Lin], Wu, J.G.[Ji-Gang], Zhao, S.P.[Shu-Ping], Li, J.X.[Jia-Xing],
Coding self-representative and label-relaxed hashing for cross-modal retrieval,
PRL(185), 2024, pp. 264-270.
Elsevier DOI 2410
Cross-modal retrieval, Hashing learning, Label-relaxed regression, Similarity preservation BibRef

Yuan, Z.[Zhe], Wu, D.[Dan], Zhou, L.[Liang],
Achieving the Optimum Rate for Cross-Modal Source Coding,
MultMed(26), 2024, pp. 9722-9735.
IEEE DOI 2410
Semantics, Source coding, Haptic interfaces, Reliability, Streams, Redundancy, Decoding, Cross-modal, source coding, semantic relevance, video and haptic coding BibRef

Chen, R.[Ruihan], Tan, J.P.[Jun-Peng], Yang, Z.J.[Zhi-Jing], Yang, X.J.[Xiao-Jun], Dai, Q.Y.[Qing-Yun], Cheng, Y.Q.[Yong-Qiang], Lin, L.[Liang],
DPHANet: Discriminative Parallel and Hierarchical Attention Network for Natural Language Video Localization,
MultMed(26), 2024, pp. 9575-9590.
IEEE DOI 2410
Location awareness, Semantics, TV, Natural languages, Correlation, Glass, Cross-modal retrieval, video understanding BibRef

Zheng, A.[Aihua], Yuan, F.[Fan], Zhang, H.[Haichuan], Wang, J.X.[Jia-Xiang], Tang, C.[Chao], Li, C.L.[Cheng-Long],
Public-Private Attributes-Based Variational Adversarial Network for Audio-Visual Cross-Modal Matching,
CirSysVideo(34), No. 9, September 2024, pp. 8698-8709.
IEEE DOI 2410
Visualization, Semantics, Feature extraction, Face recognition, Adversarial machine learning, Task analysis, Decoding, metric learning BibRef

Li, D.[Dongyue], Du, S.[Songlin],
ContextMatcher: Detector-Free Feature Matching With Cross-Modality Context,
CirSysVideo(34), No. 9, September 2024, pp. 7922-7934.
IEEE DOI 2410
Feature extraction, Transformers, Visualization, Task analysis, Detectors, Correlation, Reliability, Local feature matching, neighborhood consensus BibRef

Li, F.L.[Feng-Ling], Wang, B.[Bowen], Zhu, L.[Lei], Li, J.J.[Jing-Jing], Zhang, Z.[Zheng], Chang, X.J.[Xiao-Jun],
Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval,
CirSysVideo(34), No. 10, October 2024, pp. 9664-9677.
IEEE DOI Code:
WWW Link. 2411
Semantics, Correlation, Training, Adaptation models, Codes, Circuits and systems, Optimization, Cross-modal hashing, weakly-supervised BibRef

Zhang, F.[Fan], Zhou, H.[Hang], Hua, X.S.[Xian-Sheng], Chen, C.[Chong], Luo, X.[Xiao],
HOPE: A Hierarchical Perspective for Semi-Supervised 2D-3D Cross-Modal Retrieval,
PAMI(46), No. 12, December 2024, pp. 8976-8993.
IEEE DOI 2411
Semantics, Neural networks, Optimization, Semisupervised learning, Feature extraction, Solid modeling, 3D multimedia, semi-supervised learning BibRef

Zhu, Y.[Ye], Wu, Y.[Yu], Sebe, N.[Nicu], Yan, Y.[Yan],
Vision + X: A Survey on Multimodal Learning in the Light of Data,
PAMI(46), No. 12, December 2024, pp. 9102-9122.
IEEE DOI 2411
Visualization, Task analysis, Music, Feature extraction, Surveys, Representation learning, Multimodal representation learning BibRef


Chen, S.J.[Si-Jin], Chen, X.[Xin], Zhang, C.[Chi], Li, M.S.[Ming-Sheng], Yu, G.[Gang], Fei, H.[Hao], Zhu, H.Y.[Hong-Yuan], Fan, J.Y.[Jia-Yuan], Chen, T.[Tao],
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning,
CVPR24(26418-26428)
IEEE DOI 2410
Point cloud compression, Training, Visualization, Solid modeling, Computational modeling, Cognition, Multi-modal learning, vision and language BibRef

Xu, H.R.[Hao-Ran], Peng, P.X.[Pei-Xi], Tan, G.[Guang], Li, Y.[Yuan], Xu, X.H.[Xin-Hai], Tian, Y.H.[Yong-Hong],
DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning,
CVPR24(26498-26508)
IEEE DOI 2410
Visualization, Noise, Reinforcement learning, Vision sensors, Feature extraction, Data mining, Multi-Modality, DVS, Representation Learning BibRef

You, C.Y.[Chen-Yu], Mint, Y.F.[Yi-Fei], Dai, W.C.[Wei-Cheng], Sekhon, J.S.[Jasjeet S.], Staib, L.[Lawrence], Duncan, J.S.[James S.],
Calibrating Multi-modal Representations: A Pursuit of Group Robustness without Annotations,
CVPR24(26140-26150)
IEEE DOI 2410
Visualization, Annotations, Computational modeling, Refining, Training data, Contrastive learning, Benchmark testing, BibRef

Zhang, Z.H.[Zhi-Hao], Cao, S.C.[Sheng-Cao], Wang, Y.X.[Yu-Xiong],
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding,
CVPR24(21413-21423)
IEEE DOI Code:
WWW Link. 2410
Representation learning, Visualization, Solid modeling, Accuracy, Shape, 3D vision, multi-modal learning, 3D shape classification BibRef

Zhao, Z.[Zihua], Chen, M.X.[Meng-Xi], Dai, T.J.[Tian-Jie], Yao, J.C.[Jiang-Chao], Han, B.[Bo], Zhang, Y.[Ya], Wang, Y.F.[Yan-Feng],
Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning,
CVPR24(27371-27380)
IEEE DOI Code:
WWW Link. 2410
Accuracy, Filtering, Source coding, Benchmark testing, Robustness, Multi-modal learning, Noisy correspondence BibRef

Tuzcuoglu, Ö.[Önder], Köksal, A.[Aybora], Sofu, B.[Bugra], Kalkan, S.[Sinan], Alatan, A.A.[A. Aydin],
XoFTR: Cross-modal Feature Matching Transformer,
IMW24(4275-4286)
IEEE DOI Code:
WWW Link. 2410
Learning systems, Image matching, Pipelines, Lighting, Benchmark testing, Transformers, Image augmentation, thermal infrared BibRef

Wu, J.L.[Jia-Lin], Hu, X.[Xia], Wang, Y.Q.[Ya-Qing], Pang, B.[Bo], Soricut, R.[Radu],
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-Rank Experts,
CVPR24(14205-14215)
IEEE DOI 2410
Degradation, Training, Costs, Computational modeling, Computer architecture, Boosting, MoE, LoRA, generalist model, multimodal BibRef

Sun, Q.[Quan], Cui, Y.F.[Yu-Feng], Zhang, X.S.[Xiao-Song], Zhang, F.[Fan], Yu, Q.[Qiying], Wang, Y.[Yueze], Rao, Y.M.[Yong-Ming], Liu, J.J.[Jing-Jing], Huang, T.J.[Tie-Jun], Wang, X.L.[Xin-Long],
Generative Multimodal Models are In-Context Learners,
CVPR24(14398-14409)
IEEE DOI 2410
Visualization, Adaptation models, Codes, Reviews, Computational modeling, Benchmark testing BibRef

Zhao, S.T.[Shi-Tian], Li, Z.W.[Zhuo-Wan], Lu, Y.D.[Ya-Dong], Yuille, A.L.[Alan L.], Wang, Y.[Yan],
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-Modal Language Models,
CVPR24(13342-13351)
IEEE DOI 2410
Visualization, Cause effect analysis, Benchmark testing, Information filters, Boosting, Causality BibRef

Li, Z.[Zhang], Yang, B.[Biao], Liu, Q.[Qiang], Ma, Z.Y.[Zhi-Yin], Zhang, S.[Shuo], Yang, J.X.[Jing-Xu], Sun, Y.[Yabo], Liu, Y.L.[Yu-Liang], Bai, X.[Xiang],
Monkey: Image Resolution and Text Label are Important Things for Large Multi-Modal Models,
CVPR24(26753-26763)
IEEE DOI Code:
WWW Link. 2410
Training, Visualization, Image resolution, Codes, Computational modeling, Benchmark testing, Large Multimodal Model BibRef

Han, H.C.[Hao-Chen], Zheng, Q.H.[Qing-Hua], Dai, G.[Guang], Luo, M.[Minnan], Wang, J.D.[Jing-Dong],
Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval,
CVPR24(26669-26678)
IEEE DOI Code:
WWW Link. 2410
Training, Codes, Computational modeling, Semantics, Excavation, Cost function, Cross-modal retrieval, Optimal transport, noisy correspondence learning BibRef

Yuan, J.L.[Jia-Lin], Yu, Y.[Ye], Mittal, G.[Gaurav], Hall, M.[Matthew], Sajeev, S.[Sandra], Chen, M.[Mei],
Rethinking Multimodal Content Moderation from an Asymmetric Angle with Mixed-modality,
WACV24(8517-8527)
IEEE DOI 2404
Art, Fuses, Social networking (online), Semantics, Computer architecture, Benchmark testing, Applications, Vision + language and/or other modalities BibRef

Liu, Z.Y.[Zhe-Yuan], Sun, W.X.[Wei-Xuan], Hong, Y.C.[Yi-Cong], Teney, D.[Damien], Gould, S.[Stephen],
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning,
WACV24(5741-5750)
IEEE DOI Code:
WWW Link. 2404
Training, Costs, Computational modeling, Image retrieval, Semantics, Bidirectional control, Algorithms, Vision + language and/or other modalities BibRef

Shoshan, A.[Alon], Linial, O.[Ori], Bhonker, N.[Nadav], Hirsch, E.[Elad], Zamir, L.[Lior], Kviatkovsky, I.[Igor], Medioni, G.[Gérard],
Asymmetric Image Retrieval with Cross Model Compatible Ensembles,
WACV24(1-11)
IEEE DOI 2404
Training, Uncertainty, Computational modeling, Face recognition, Image retrieval, Diversity reception, Algorithms, body pose BibRef

Hönig, R.[Robert], Ackermann, J.[Jan], Chi, M.Y.[Ming-Yuan],
Bi-Encoder Cascades for Efficient Image Search,
REDLCV23(1350-1355)
IEEE DOI 2401
BibRef

Cao, Y.C.[Yi-Chao], Tang, Q.[Qingfei], Yang, F.[Feng], Su, X.[Xiu], You, S.[Shan], Lu, X.B.[Xiao-Bo], Xu, C.[Chang],
Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection,
ICCV23(23435-23446)
IEEE DOI 2401
BibRef

Trinci, T.[Tomaso], Bianconcini, T.[Tommaso], Sarti, L.[Leonardo], Taccari, L.[Leonardo], Sambo, F.[Francesco],
Cross-model temporal cooperation via saliency maps for efficient frame classification,
REDLCV23(1156-1160)
IEEE DOI 2401
BibRef

Long, T.[Teng], van Noord, N.[Nanne],
Cross-modal Scalable Hyperbolic Hierarchical Clustering,
ICCV23(16609-16618)
IEEE DOI 2401
BibRef

Li, H.[Hong], Li, X.Y.[Xing-Yu], Hu, P.[Pengbo], Lei, Y.[Yinuo], Li, C.X.[Chun-Xiao], Zhou, Y.[Yi],
Boosting Multi-modal Model Performance with Adaptive Gradient Modulation,
ICCV23(22157-22167)
IEEE DOI Code:
WWW Link. 2401
BibRef

Li, W.[Wenyun], Pun, C.M.[Chi-Man],
Asymmetric Scalable Cross-Modal Hashing,
ICIP23(316-320)
IEEE DOI 2312
BibRef

Zhao, L.J.[Long-Jiao], Wang, Y.[Yu], Kato, J.[Jien],
Using Classifier Discrepancy for Cross-Domain Image Retrieval,
ICIP23(3314-3318)
IEEE DOI 2312
BibRef

Era, Y.[Yuki], Togo, R.[Ren], Maeda, K.[Keisuke], Ogawa, T.[Takahiro], Haseyama, M.[Miki],
Video-Music Retrieval with Fine-Grained Cross-Modal Alignment,
ICIP23(2005-2009)
IEEE DOI 2312
BibRef

Yu, Y.[Youngjae], Chung, J.[Jiwan], Yun, H.[Heeseung], Hessel, J.[Jack], Park, J.S.[Jae Sung], Lu, X.[Ximing], Zellers, R.[Rowan], Ammanabrolu, P.[Prithviraj], Le Bras, R.[Ronan], Kim, G.[Gunhee], Choi, Y.[Yejin],
Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning,
CVPR23(10845-10856)
IEEE DOI 2309
BibRef

Huang, S.[Siteng], Gong, B.[Biao], Pan, Y.L.[Yu-Lin], Jiang, J.W.[Jian-Wen], Lv, Y.L.[Yi-Liang], Li, Y.Y.[Yu-Yuan], Wang, D.L.[Dong-Lin],
VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval,
CVPR23(6565-6574)
IEEE DOI 2309
BibRef

Chen, M.X.[Meng-Xi], Xing, L.Y.[Lin-Yu], Wang, Y.[Yu], Zhang, X.[Xa],
Enhanced Multimodal Representation Learning with Cross-Modal KD,
CVPR23(11766-11775)
IEEE DOI 2309
BibRef

Yang, S.[Shuo], Xu, Z.[Zhaopan], Wang, K.[Kai], You, Y.[Yang], Yao, H.X.[Hong-Xun], Liu, T.L.[Tong-Liang], Xu, M.[Min],
BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency,
CVPR23(19883-19892)
IEEE DOI 2309
BibRef

Kim, D.[Dongwon], Kim, N.[Namyup], Kwak, S.[Suha],
Improving Cross-Modal Retrieval with Set of Diverse Embeddings,
CVPR23(23422-23431)
IEEE DOI 2309
BibRef

Kim, J.M.[Jae Myung], Koepke, A.S.[A. Sophia], Schmid, C.[Cordelia], Akata, Z.[Zeynep],
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval,
MULA23(2585-2595)
IEEE DOI 2309
BibRef

Tran, V.[Vinh], Balasubramanian, N.[Niranjan], Hoai, M.[Minh],
From Within to Between: Knowledge Distillation for Cross Modality Retrieval,
ACCV22(IV:605-622).
Springer DOI 2307
BibRef

Zhao, Y.[Yang], Zhu, Y.Z.[Ya-Zhou], Liao, S.B.[Sheng-Bin], Ye, Q.L.[Qiao-Lin], Zhang, H.F.[Hao-Feng],
Class Concentration with Twin Variational Autoencoders for Unsupervised Cross-modal Hashing,
ACCV22(VI:235-251).
Springer DOI 2307
BibRef

Fragomeni, A.[Adriano], Wray, M.[Michael], Damen, D.[Dima],
Contra: (con)text (tra)nsformer for Cross-modal Video Retrieval,
ACCV22(IV:451-468).
Springer DOI 2307
BibRef

Zheng, Y.C.[Yuan-Chao], Zhang, X.W.[Xiao-Wei],
Heterogeneous Interactive Learning Network for Unsupervised Cross-modal Retrieval,
ACCV22(IV:692-707).
Springer DOI 2307
BibRef

Zhao, Y.[Yang], Yu, J.G.[Jia-Guo], Liao, S.[Shengbin], Zhang, Z.[Zheng], Zhang, H.F.[Hao-Feng],
From Sparse to Dense: Semantic Graph Evolutionary Hashing for Unsupervised Cross-Modal Retrieval,
ACCV22(IV:521-536).
Springer DOI 2307
BibRef

Arnold, R.[Rahel], Sauter, L.[Loris], Schuldt, H.[Heiko],
Free-Form Multi-Modal Multimedia Retrieval (4MR),
MMMod23(I: 678-683).
Springer DOI 2304
BibRef

Xuan, H.[Hong], Chen, X.S.[Xi Stephen],
Dissecting Deep Metric Learning Losses for Image-Text Retrieval,
WACV23(2163-2172)
IEEE DOI 2302
Measurement, Training, Analytical models, Semantics, Space exploration, Task analysis, visual reasoning BibRef

Ge, X.[Xuri], Chen, F.[Fuhai], Xu, S.[Songpei], Tao, F.[Fuxiang], Jose, J.M.[Joemon M.],
Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval,
WACV23(1022-1031)
IEEE DOI 2302
Measurement, Representation learning, Visualization, Correlation, Computational modeling, Semantics, Algorithms: Vision + language and/or other modalities BibRef

Jawade, B.[Bhavin], Mohan, D.D.[Deen Dayal], Ali, N.M.[Naji Mohamed], Setlur, S.[Srirangaraj], Govindaraju, V.[Venu],
NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings,
WACV23(1135-1144)
IEEE DOI 2302
Training, Measurement, Visualization, Codes, Databases, Semantics, Algorithms: Vision + language and/or other modalities BibRef

Nakatsuka, T.[Takayuki], Hamasaki, M.[Masahiro], Goto, M.[Masataka],
Content-Based Music-Image Retrieval Using Self- and Cross-Modal Feature Embedding Memory,
WACV23(2173-2183)
IEEE DOI 2302
Training, Measurement, Art, Multiple signal classification, Task analysis BibRef

Chen, Y.X.[Yu-Xiao], Yuan, J.B.[Jian-Bo], Zhao, L.[Long], Chen, T.L.[Tian-Lang], Luo, R.[Rui], Davis, L.[Larry], Metaxas, D.N.[Dimitris N.],
More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching,
WACV23(4421-4429)
IEEE DOI 2302
Training, Measurement, Visualization, Annotations, Computational modeling, Algorithms: Vision + language and/or other modalities BibRef

Agarwal, A.[Aishwarya], Karanam, S.[Srikrishna], Srinivasan, B.V.[Balaji Vasan], Banerjee, B.[Biplab],
Contrastive Learning of Semantic Concepts for Open-set Cross-domain Retrieval,
WACV23(4104-4113)
IEEE DOI 2302
Training, Technological innovation, Semantics, Natural languages, Image retrieval, Feature extraction BibRef

Yang, Y.[Yulou], Shen, H.[Hao], Yang, M.[Ming],
Relation-Guided Network for Image-Text Retrieval,
ICIP22(1856-1860)
IEEE DOI 2211
Transformers, Feature extraction, Cognition, Data mining, Image-text retrieval, asymmetric structure, relation-guided BibRef

Sumbul, G.[Gencer], Müller, M.[Markus], Demir, B.[Begüm],
A Novel Self-Supervised Cross-Modal Image Retrieval Method in Remote Sensing,
ICIP22(2426-2430)
IEEE DOI 2211
Training, Codes, Image retrieval, Search problems, Sensors, Reliability, Cross-modal image retrieval, deep learning, remote sensing BibRef

Wang, H.[Hu], Zhang, J.P.[Jian-Peng], Chen, Y.H.[Yuan-Hong], Ma, C.B.[Cong-Bo], Avery, J.[Jodie], Hull, L.[Louise], Carneiro, G.[Gustavo],
Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction,
ECCV22(XXXVII:200-217).
Springer DOI 2211
BibRef

de Almeida, L.B.[Lucas Barbosa], Valem, L.P.[Lucas Pascotti], Pedronette, D.C.G.[Daniel Carlos Guimarães],
Graph Convolutional Networks and Manifold Ranking for Multimodal Video Retrieval,
ICIP22(2811-2815)
IEEE DOI 2211
Training, Manifolds, Deep learning, Transfer learning, Feature extraction, Content-based retrieval, Manifold learning, rank aggregation BibRef

Liang, T.[Tao], Lin, G.S.[Guo-Sheng], Wan, M.Y.[Ming-Yang], Li, T.R.[Tian-Rui], Ma, G.J.[Guo-Jun], Lv, F.M.[Feng-Mao],
Expanding Large Pre-trained Unimodal Models with Multimodal Information Injection for Image-Text Multimodal Classification,
CVPR22(15471-15480)
IEEE DOI 2210
Deep learning, Visualization, Image recognition, Correlation, Bit error rate, Vision+language BibRef

Yang, J.H.[Jin-Hui], Chen, X.Y.[Xian-Yu], Jiang, M.[Ming], Chen, S.[Shi], Wang, L.[Louis], Zhao, Q.[Qi],
VisualHow: Multimodal Problem Solving,
CVPR22(15606-15616)
IEEE DOI 2210
Training, Visualization, Technological innovation, Annotations, Natural language processing, Datasets and evaluation BibRef

Girdhar, R.[Rohit], Singh, M.[Mannat], Ravi, N.[Nikhila], van der Maaten, L.[Laurens], Joulin, A.[Armand], Misra, I.[Ishan],
Omnivore: A Single Model for Many Visual Modalities,
CVPR22(16081-16091)
IEEE DOI 2210
Visualization, Solid modeling, Computational modeling, Transformers, Data models, Action and event recognition BibRef

Ma, M.M.[Meng-Meng], Ren, J.[Jian], Zhao, L.[Long], Testuggine, D.[Davide], Peng, X.[Xi],
Are Multimodal Transformers Robust to Missing Modality?,
CVPR22(18156-18165)
IEEE DOI 2210
Training, Benchmark testing, Transformers, Multitasking, Search problems, Data models, Vision+language, Machine learning BibRef

Han, Z.B.[Zong-Bo], Yang, F.[Fan], Huang, J.Z.[Jun-Zhou], Zhang, C.Q.[Chang-Qing], Yao, J.H.[Jian-Hua],
Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification,
CVPR22(20675-20685)
IEEE DOI 2210
Heuristic algorithms, Estimation, Classification algorithms, Medical diagnosis, Machine learning BibRef

Gupta, V.[Vikram], Mittal, T.[Trisha], Mathur, P.[Puneet], Mishra, V.[Vaibhav], Maheshwari, M.[Mayank], Bera, A.[Aniket], Mukherjee, D.[Debdoot], Manocha, D.[Dinesh],
3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos,
CVPR22(21032-21043)
IEEE DOI 2210
Social networking (online), Semantics, Media, Task analysis, Datasets and evaluation, Video analysis and understanding BibRef

Bogolin, S.V.[Simion-Vlad], Croitoru, I.[Ioana], Jin, H.L.[Hai-Lin], Liu, Y.[Yang], Albanie, S.[Samuel],
Cross Modal Retrieval with Querybank Normalisation,
CVPR22(5184-5195)
IEEE DOI 2210
Training, Codes, Computational modeling, Benchmark testing, Vision + language, retrieval BibRef

Yang, E.[Erkun], Yao, D.R.[Dong-Ren], Liu, T.L.[Tong-Liang], Deng, C.[Cheng],
Mutual Quantization for Cross-Modal Search with Noisy Labels,
CVPR22(7541-7550)
IEEE DOI 2210
Training, Representation learning, Quantization (signal), Codes, Training data, Benchmark testing, Recognition: detection, Representation learning BibRef

Neculai, A.[Andrei], Chen, Y.B.[Yan-Bei], Akata, Z.[Zeynep],
Probabilistic Compositional Embeddings for Multimodal Image Retrieval,
MULA22(4546-4556)
IEEE DOI 2210
Visualization, Codes, Computational modeling, Image retrieval, Semantics BibRef

Couairon, G.[Guillaume], Douze, M.[Matthijs], Cord, M.[Matthieu], Schwenk, H.[Holger],
Embedding Arithmetic of Multimodal Queries for Image Retrieval,
ODRUM22(4946-4954)
IEEE DOI 2210
Conferences, Semantics, Image retrieval, Lasers, Transforms, Image representation BibRef

Sun, C.C.[Chang-Chang], Latapie, H.[Hugo], Liu, G.[Gaowen], Yan, Y.[Yan],
Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning,
ODRUM22(4937-4945)
IEEE DOI 2210
Codes, Computational modeling, Semantics, Bidirectional control, Benchmark testing BibRef

Li, Y.H.[Yi-Hao], Yu, J.[Jun], Cai, Z.[Zhongpeng], Pan, Y.[Yuwen],
Cross-modal Target Retrieval for Tracking by Natural Language,
ODRUM22(4927-4936)
IEEE DOI 2210
Visualization, Target tracking, Natural languages, Semantics, Switches, Benchmark testing BibRef

Thomas, C.[Christopher], Kovashka, A.[Adriana],
Emphasizing Complementary Samples for Non-literal Cross-modal Retrieval,
MULA22(4631-4640)
IEEE DOI 2210
Spatial diversity, Semantics, Channel estimation, Performance gain, Benchmark testing BibRef

Xu, B.[Bocheng], Xiong, Y.H.[Yi-Hua], Zhang, R.[Rui], Feng, Y.[Yanyi], Wu, H.F.[Hai-Feng],
Natural Language-Based Vehicle Retrieval with Explicit Cross-Modal Representation Learning,
AICity22(3141-3148)
IEEE DOI 2210
Representation learning, Visualization, Semantics, Urban areas, Feature extraction, Robustness BibRef

Shvetsova, N.[Nina], Chen, B.[Brian], Rouditchenko, A.[Andrew], Thomas, S.[Samuel], Kingsbury, B.[Brian], Feris, R.S.[Rogerio S.], Harwath, D.[David], Glass, J.[James], Kuehne, H.[Hilde],
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval,
CVPR22(19988-19997)
IEEE DOI 2210
Location awareness, Training, Codes, Fuses, Benchmark testing, Transformers, Action and event recognition, Video analysis and understanding BibRef

Andonian, A.[Alex], Chen, S.X.[Shi-Xing], Hamid, R.[Raffay],
Robust Cross-Modal Representation Learning with Progressive Self-Distillation,
CVPR22(16409-16420)
IEEE DOI 2210
Training, Representation learning, Computational modeling, Redundancy, Benchmark testing, Robustness, Noise measurement, Representation learning BibRef

Lu, H.Y.[Hao-Yu], Fei, N.[Nanyi], Huo, Y.Q.[Yu-Qi], Gao, Y.Z.[Yi-Zhao], Lu, Z.W.[Zhi-Wu], Wen, J.R.[Ji-Rong],
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval,
CVPR22(15671-15680)
IEEE DOI 2210
Visualization, Collaboration, Streaming media, Probability distribution, Task analysis, Video analysis and understanding BibRef

Abdelnabi, S.[Sahar], Hasan, R.[Rakibul], Fritz, M.[Mario],
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources,
CVPR22(14920-14929)
IEEE DOI 2210
Visualization, Machine vision, MIMICs, Manuals, Cognition, retrieval, Vision + language, Recognition: detection BibRef

Wang, Y.[Yun], Zhang, T.[Tong], Zhang, X.[Xueya], Cui, Z.[Zhen], Huang, Y.[Yuge], Shen, P.C.[Peng-Cheng], Li, S.X.[Shao-Xin], Yang, J.[Jian],
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval,
ICCV21(1793-1802)
IEEE DOI 2203
Training, Representation learning, Analytical models, Dictionaries, Correlation, Computational modeling, Vision + language, BibRef

Cai, G.[Guanyu], Zhang, J.[Jun], Jiang, X.Y.[Xin-Yang], Gong, Y.F.[Yi-Fei], He, L.[Lianghua], Yu, F.[Fufu], Peng, P.[Pai], Guo, X.W.[Xiao-Wei], Huang, F.Y.[Fei-Yue], Sun, X.[Xing],
Ask amp;Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query,
ICCV21(1815-1824)
IEEE DOI 2203
Training, Codes, Computational modeling, Image retrieval, Search problems, Robustness, Vision + language, Image and video retrieval BibRef

Wen, K.Y.[Ke-Yu], Xia, J.[Jin], Huang, Y.Y.[Yuan-Yuan], Li, L.Y.[Lin-Yang], Xu, J.Y.[Jia-Yan], Shao, J.[Jie],
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation,
ICCV21(2188-2197)
IEEE DOI 2203
Visualization, Codes, Computational modeling, Image retrieval, Semantics, Transformers, Vision + language, Representation learning BibRef

Patrick, M.[Mandela], Huang, P.Y.[Po-Yao], Misra, I.[Ishan], Metze, F.[Florian], Vedaldi, A.[Andrea], Asano, Y.M.[Yuki M.], Henriques, J.[João],
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning,
ICCV21(10540-10552)
IEEE DOI 2203
Representation learning, Costs, Codes, Computational modeling, Crops, Image representation, Representation learning, Vision + other modalities BibRef

Lin, M.X.[Ming-Xian], Yang, J.[Jie], Wang, H.[He], Lai, Y.K.[Yu-Kun], Jia, R.[Rongfei], Zhao, B.Q.[Bin-Qiang], Gao, L.[Lin],
Single Image 3D Shape Retrieval via Cross-Modal Instance and Category Contrastive Learning,
ICCV21(11385-11395)
IEEE DOI 2203
Representation learning, Deep learning, Shape, Image color analysis, Pipelines, Gray-scale, 3D from a single image and shape-from-x BibRef

Changpinyo, S.[Soravit], Pont-Tuset, J.[Jordi], Ferrari, V.[Vittorio], Soricut, R.[Radu],
Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval,
ICCV21(12116-12126)
IEEE DOI 2203
Location awareness, Error analysis, Computational modeling, Image retrieval, Natural languages, Mice, Vision + other modalities BibRef

Gabeur, V.[Valentin], Nagrani, A.[Arsha], Sun, C.[Chen], Alahari, K.[Karteek], Schmid, C.[Cordelia],
Masking Modalities for Cross-modal Video Retrieval,
WACV22(2111-2120)
IEEE DOI 2202
Manuals, Benchmark testing, Motion pictures, Natural language processing, Proposals, Speech processing, Scene Understanding BibRef

Galanopoulos, D.[Damianos], Mezaris, V.[Vasileios],
Hard-Negatives or Non-Negatives? A Hard-Negative Selection Strategy for Cross-Modal Retrieval Using the Improved Marginal Ranking Loss,
ViRaL21(2312-2316)
IEEE DOI 2112
Training, Computational modeling, Network architecture BibRef

Jing, L.L.[Long-Long], Vahdani, E.[Elahe], Tan, J.X.[Jia-Xing], Tian, Y.L.[Ying-Li],
Cross-Modal Center Loss for 3D Cross-Modal Retrieval,
CVPR21(3141-3150)
IEEE DOI 2111
Solid modeling, Computational modeling, Metadata, Feature extraction BibRef

Almazán, J.[Jon], Ko, B.[Byungsoo], Gu, G.[Geonmo], Larlus, D.[Diane], Kalantidis, Y.[Yannis],
Granularity-Aware Adaptation for Image Retrieval Over Multiple Tasks,
ECCV22(XIV:389-406).
Springer DOI 2211
BibRef

Chun, S.[Sanghyuk], Oh, S.J.[Seong Joon], Sampaio de Rezende, R.[Rafael], Kalantidis, Y.[Yannis], Larlus, D.[Diane],
Probabilistic Embeddings for Cross-Modal Retrieval,
CVPR21(8411-8420)
IEEE DOI 2111
Uncertainty, Codes, Databases, Annotations, Tools, Benchmark testing BibRef

Croitoru, I.[Ioana], Bogolin, S.V.[Simion-Vlad], Leordeanu, M.[Marius], Jin, H.L.[Hai-Lin], Zisserman, A.[Andrew], Albanie, S.[Samuel], Liu, Y.[Yang],
TeachText: CrossModal Generalized Distillation for Text-Video Retrieval,
ICCV21(11563-11573)
IEEE DOI 2203
Visualization, Codes, Computational modeling, Noise reduction, Benchmark testing, Vision + language BibRef

Liu, Y.[Yang], Chen, Q.C.[Qing-Chao], Albanie, S.[Samuel],
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval,
CVPR21(14949-14959)
IEEE DOI 2111
Visualization, Prototypes, Task analysis, Mutual information, Videos BibRef

Salvador, A.[Amaia], Gundogdu, E.[Erhan], Bazzani, L.[Loris], Donoser, M.[Michael],
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning,
CVPR21(15470-15479)
IEEE DOI 2111
Training, Codes, Computational modeling, Semantics, Machine learning, Transformers BibRef

Dzabraev, M.[Maksim], Kalashnikov, M.[Maksim], Komkov, S.[Stepan], Petiushko, A.[Aleksandr],
MDMMT: Multidomain Multimodal Transformer for Video Retrieval,
HVU21(3349-3358)
IEEE DOI 2109
Training, Benchmark testing, Task analysis BibRef

Wang, K.[Kai], Herranz, L.[Luis], van de Weijer, J.[Joost],
Continual learning in cross-modal retrieval,
OmniCV21(3623-3633)
IEEE DOI 2109
Training, Visualization, Human intelligence, Focusing, Interference, Tools BibRef

Mafla, A.[Andrés], Rezende, R.S.[Rafael S.], Gómez, L.[Lluís], Larlus, D.[Diane], Karatzas, D.[Dimosthenis],
StacMR: Scene-Text Aware Cross-Modal Retrieval,
WACV21(2219-2229)
IEEE DOI 2106
Visualization, Annotations, Computational modeling, Semantics, Task analysis BibRef

Feng, C.T.[Chang-Ting], Li, D.G.[Da-Gang], Zheng, J.W.[Jing-Wei],
Improving Supervised Cross-modal Retrieval with Semantic Graph Embedding,
MMMod21(I:187-199).
Springer DOI 2106
BibRef

Wen, Z.Y.[Zhen-Yu], Feng, A.[Aimin],
Deep Centralized Cross-modal Retrieval,
MMMod21(I:443-455).
Springer DOI 2106
BibRef

Li, Z.X.[Zhi-Xin], Ling, F.[Feng], Xu, C.S.[Chuan-Sheng], Zhang, C.L.[Can-Long], Ma, H.F.[Hui-Fang],
Cross-Media Hash Retrieval Using Multi-Head Attention Network,
ICPR21(1290-1297)
IEEE DOI 2105
Correlation, Semantics, Neural networks, Media, Extraterrestrial measurements, cross-media retrieval BibRef

Jin, C.[Cong], Zhang, T.[Tian], Liu, S.X.[Shou-Xun], Tie, Y.[Yun], Lv, X.[Xin], Li, J.G.[Jian-Guang], Yan, W.C.[Wen-Cai], Yan, M.[Ming], Xu, Q.[Qian], Guan, Y.C.[Yi-Cong], Yang, Z.G.[Zheng-Gougou],
Cross-modal Deep Learning Applications: Audio-visual Retrieval,
MMDLCA20(301-313).
Springer DOI 2103
BibRef

Thomas, C.[Christopher], Kovashka, A.[Adriana],
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval,
ECCV20(XVIII:317-335).
Springer DOI 2012
BibRef

Wang, Z., Liu, X., Li, H., Sheng, L., Yan, J., Wang, X., Shao, J.,
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval,
ICCV19(5763-5772)
IEEE DOI 2004
entropy, feature extraction, image matching, image retrieval, message passing, natural language processing, text analysis, Task analysis BibRef

Nawaz, S., Janjua, M.K., Gallo, I., Mahmood, A., Calefati, A., Shafait, F.,
Do Cross Modal Systems Leverage Semantic Relationships?,
CroMoL19(4501-4510)
IEEE DOI 2004
image representation, image retrieval, image segmentation, learning (artificial intelligence), Text to Image BibRef

Su, S., Zhong, Z., Zhang, C.,
Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval,
ICCV19(3027-3035)
IEEE DOI 2004
binary codes, image coding, image retrieval, multimedia computing, neural nets, binary codes, reconstructing framework, DJSRH, Correlation BibRef

Ning, X.C.[Xue-Cheng], Yang, X.S.[Xiao-Shan], Xu, C.S.[Chang-Sheng],
Multi-Hop Interactive Cross-modal Retrieval,
MMMod20(II:681-693).
Springer DOI 2003
BibRef

Cornia, M.[Marcella], Baraldi, L.[Lorenzo], Tavakoli, H.R.[Hamed R.], Cucchiara, R.[Rita],
Towards Cycle-Consistent Models for Text and Image Retrieval,
WiCV-E18(IV:687-691).
Springer DOI 1905
BibRef

Surís, D.[Didac], Duarte, A.[Amanda], Salvador, A.[Amaia], Torres, J.[Jordi], Giró-i-Nieto, X.[Xavier],
Cross-modal Embeddings for Video and Audio Retrieval,
WiCV-E18(IV:711-716).
Springer DOI 1905
BibRef

Liu, C.L.[Chen-Lu], Xu, X.[Xing], Yang, Y.[Yang], Lu, H.M.[Hui-Min], Shen, F.M.[Fu-Min], Ji, Y.L.[Yan-Li],
Domain Invariant Subspace Learning for Cross-Modal Retrieval,
MMMod18(II:94-105).
Springer DOI 1802
BibRef

Yuan, Y.X.[Yu-Xin], Peng, Y.X.[Yu-Xin],
Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval,
MMMod18(I:405-416).
Springer DOI 1802
BibRef

Jia, Y.H.[Yu-Hua], Bai, L.[Liang], Wang, P.[Peng], Guo, J.L.[Jin-Lin], Xie, Y.X.[Yu-Xiang], Yu, T.Y.[Tian-Yuan],
Utilizing Locality-Sensitive Hash Learning for Cross-Media Retrieval,
MMMod17(I: 550-561).
Springer DOI 1701
BibRef

Shang, X.[Xindi], Zhang, H.W.[Han-Wang], Chua, T.S.[Tat-Seng],
Deep Learning Generic Features for Cross-Media Retrieval,
MMMod16(I: 264-275).
Springer DOI 1601
BibRef

Huang, L.[Lei], Peng, Y.X.[Yu-Xin],
Cross-Media Retrieval via Semantic Entity Projection,
MMMod16(I: 276-288).
Springer DOI 1601
BibRef

Gu, Y.[Yun], Xue, H.Y.[Hao-Yang], Yang, J.[Jie], Shi, P.F.[Peng-Fei],
Cross-modality hashing with partial correspondence,
ICIP15(1925-1929)
IEEE DOI 1512
Cross-modality; Hashing; Multimedia Search; Partial Correspondence BibRef

Zhang, H.[Hong], Chen, L.[Li],
Learning optimal data representation for cross-media retrieval,
ICIP12(1925-1928).
IEEE DOI 1302
BibRef

Lin, W.X.[Wan-Xia], Lu, T.[Tong], Su, F.[Feng],
A Novel Multi-modal Integration and Propagation Model for Cross-Media Information Retrieval,
MMMod12(740-749).
Springer DOI 1201
BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Video Delivery, Video-on-Demand, Indexing, Techniques, Systems .


Last update:Nov 26, 2024 at 16:40:19