Costa Pereira, J.,
Coviello, E.[Emanuele],
Doyle, G.,
Rasiwasia, N.,
Lanckriet, G.R.G.[Gert R.G.],
Levy, R.,
Vasconcelos, N.M.,
On the Role of Correlation and Abstraction in Cross-Modal Multimedia
Retrieval,
PAMI(36), No. 3, March 2014, pp. 521-535.
IEEE DOI
1403
image matching. E.g. use image to search for text.
Correlation matching. Semantic matching. Semantic correlation matching.
BibRef
Costa Pereira, J.[Jose],
Vasconcelos, N.M.[Nuno M.],
Cross-modal domain adaptation for text-based regularization of image
semantics in image retrieval systems,
CVIU(124), No. 1, 2014, pp. 123-135.
Elsevier DOI
1406
Content-based image retrieval
BibRef
Zhai, X.H.[Xiao-Hua],
Peng, Y.X.[Yu-Xin],
Xiao, J.G.[Jian-Guo],
Learning Cross-Media Joint Representation With Sparse and
Semisupervised Regularization,
CirSysVideo(24), No. 6, June 2014, pp. 965-978.
IEEE DOI
1407
Correlation
BibRef
Peng, Y.X.[Yu-Xin],
Qi, J.W.[Jin-Wei],
Quintuple-Media Joint Correlation Learning With Deep Compression and
Regularization,
CirSysVideo(30), No. 8, August 2020, pp. 2709-2722.
IEEE DOI
2008
Media, Correlation, Semantics,
Solid modeling, Data models, Image coding, Cross-media retrieval,
network regularization
BibRef
Peng, Y.,
Zhai, X.,
Zhao, Y.,
Huang, X.,
Semi-Supervised Cross-Media Feature Learning With Unified Patch Graph
Regularization,
CirSysVideo(26), No. 3, March 2016, pp. 583-596.
IEEE DOI
1603
Correlation
BibRef
Bellini, P.[Pierfrancesco],
Cenni, D.[Daniele],
Nesi, P.[Paolo],
Optimization of information retrieval for cross media contents in a
best practice network,
MultInfoRetr(3), No. 3, September 2014, pp. 147-159.
Springer DOI
1408
BibRef
Kang, C.,
Xiang, S.,
Liao, S.,
Xu, C.,
Pan, C.,
Learning Consistent Feature Representation for Cross-Modal Multimedia
Retrieval,
MultMed(17), No. 3, March 2015, pp. 370-381.
IEEE DOI
1502
Algorithm design and analysis
BibRef
He, Y.,
Xiang, S.,
Kang, C.,
Wang, J.,
Pan, C.,
Cross-Modal Retrieval via Deep and Bidirectional Representation
Learning,
MultMed(18), No. 7, July 2016, pp. 1363-1377.
IEEE DOI
1608
backpropagation
BibRef
Zhang, S.,
Wang, X.,
Lin, Y.,
Tian, Q.,
Cross Indexing With Grouplets,
MultMed(17), No. 11, November 2015, pp. 1969-1979.
IEEE DOI
1511
Feature extraction
BibRef
Chu, L.,
Zhang, Y.,
Li, G.,
Wang, S.,
Zhang, W.,
Huang, Q.,
Effective Multimodality Fusion Framework for Cross-Media Topic
Detection,
CirSysVideo(26), No. 3, March 2016, pp. 556-569.
IEEE DOI
1603
Complexity theory
BibRef
Ding, K.[Kun],
Fan, B.[Bin],
Huo, C.L.[Chun-Lei],
Xiang, S.M.[Shi-Ming],
Pan, C.H.[Chun-Hong],
Cross-Modal Hashing via Rank-Order Preserving,
MultMed(19), No. 3, March 2017, pp. 571-585.
IEEE DOI
1702
Binary codes
BibRef
Han, C.W.[Chao-Wei],
Meng, G.F.[Gao-Feng],
Huo, C.L.[Chun-Lei],
SFD: Similar Frame Dataset for Content-Based Video Retrieval,
ICIP24(2403-2409)
IEEE DOI Code:
WWW Link.
2411
Codes, Databases, Scalability, Large language models,
Image retrieval, Contrastive learning, Object detection, Dataset,
Contrastive learning
BibRef
Jiang, B.[Bin],
Yang, J.C.[Jia-Chen],
Lv, Z.H.[Zhi-Han],
Tian, K.[Kun],
Meng, Q.G.[Qing-Gang],
Yan, Y.[Yan],
Internet cross-media retrieval based on deep learning,
JVCIR(48), No. 1, 2017, pp. 356-366.
Elsevier DOI
1708
Cross-media, retrieval
BibRef
Hu, Y.,
Zheng, L.,
Yang, Y.,
Huang, Y.,
Twitter100k: A Real-World Dataset for Weakly Supervised Cross-Media
Retrieval,
MultMed(20), No. 4, April 2018, pp. 927-938.
IEEE DOI
1804
Electronic publishing, Encyclopedias, Internet,
Optical character recognition software, Training, Visualization,
weakly supervised method
BibRef
Verma, Y.[Yashaswi],
Jha, A.[Abhishek],
Jawahar, C.V.,
Cross-specificity: modelling data semantics for cross-modal matching
and retrieval,
MultInfoRetr(8), No. 2, June 2018, pp. 139-146.
Springer DOI
1805
BibRef
Dorfer, M.[Matthias],
Schlüter, J.[Jan],
Vall, A.[Andreu],
Korzeniowski, F.[Filip],
Widmer, G.[Gerhard],
End-to-end cross-modality retrieval with CCA projections and pairwise
ranking loss,
MultInfoRetr(8), No. 2, June 2018, pp. 117-128.
Springer DOI
1805
BibRef
Lu, X.[Xu],
Zhang, H.X.[Hua-Xiang],
Sun, J.D.[Jian-De],
Wang, Z.H.[Zhen-Hua],
Guo, P.L.[Pei-Lian],
Wan, W.B.[Wen-Bo],
Discriminative correlation hashing for supervised cross-modal
retrieval,
SP:IC(65), 2018, pp. 221-230.
Elsevier DOI
1805
Cross-modal retrieval, Hashing, Subspace learning, Discriminant analysis
BibRef
Wang, L.[Li],
Zhu, L.[Lei],
Dong, X.[Xiao],
Liu, L.[Li],
Sun, J.D.[Jian-De],
Zhang, H.X.[Hua-Xiang],
Joint Feature Selection and Graph Regularization for
Modality-Dependent Cross-Modal Retrieval,
JVCIR(54), 2018, pp. 213-222.
Elsevier DOI
1806
Cross-modal retrieval, Feature selection, Subspace learning,
Graph regularization
BibRef
Zhong, F.M.[Fang-Ming],
Chen, Z.K.[Zhi-Kui],
Min, G.Y.[Ge-Yong],
Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval,
PR(83), 2018, pp. 64-77.
Elsevier DOI
1808
Cross-modal retrieval, deep learning, discrete hashing,
alternative optimization
BibRef
Yuan, X.[Xu],
Wang, G.Z.[Guang-Ze],
Chen, Z.K.[Zhi-Kui],
Zhong, F.M.[Fang-Ming],
CHOP: An orthogonal hashing method for zero-shot cross-modal
retrieval,
PRL(145), 2021, pp. 247-253.
Elsevier DOI
2104
Zero-shot, Cross-modal retrieval, Orthogonal projection
BibRef
Vukotic, V.,
Raymond, C.,
Gravier, G.,
A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking,
MultMedMag(25), No. 2, April 2018, pp. 11-23.
IEEE DOI
1808
Task analysis, Neural networks,
Visualization, Streaming media, Hypertext systems, Training,
multimedia
BibRef
Liu, R.,
Wei, S.,
Zhao, Y.,
Zhu, Z.,
Wang, J.,
Multiview Cross-Media Hashing with Semantic Consistency,
MultMedMag(25), No. 2, April 2018, pp. 71-86.
IEEE DOI
1808
Media, Semantics, Correlation, Multimedia communication,
Optimization, Feature extraction, hashing, cross-media, multiview,
searching
BibRef
Wang, D.[Di],
Wang, Q.[Quan],
Gao, X.B.[Xin-Bo],
Robust and Flexible Discrete Hashing for Cross-Modal Similarity
Search,
CirSysVideo(28), No. 10, October 2018, pp. 2703-2715.
IEEE DOI
1811
Robustness, Training, Binary codes, Quantization (signal),
Linear programming, Matrix decomposition, Sparse matrices, Hashing,
unsupervised learning
BibRef
Wang, D.[Di],
Gao, X.B.[Xin-Bo],
Wang, X.M.[Xiu-Mei],
He, L.H.[Li-Huo],
Label Consistent Matrix Factorization Hashing for Large-Scale
Cross-Modal Similarity Search,
PAMI(41), No. 10, October 2019, pp. 2466-2479.
IEEE DOI
1909
Semantics, Correlation, Training, Transforms, Binary codes,
Image reconstruction, Sparse matrices, Hashing, multimodal,
cross-modal
BibRef
Wang, D.[Di],
Gao, X.B.[Xin-Bo],
Wang, X.M.[Xiu-Mei],
He, L.[Lihuo],
Yuan, B.[Bo],
Multimodal Discriminative Binary Embedding for Large-Scale
Cross-Modal Retrieval,
IP(25), No. 10, October 2016, pp. 4540-4554.
IEEE DOI
1610
Internet
BibRef
Wang, D.[Di],
Wang, Q.[Quan],
He, L.[Lihuo],
Gao, X.B.[Xin-Bo],
Tian, Y.M.[Yu-Min],
Joint and individual matrix factorization hashing for large-scale
cross-modal retrieval,
PR(107), 2020, pp. 107479.
Elsevier DOI
2008
Hashing, Multimodal, Retrieval, Cross-modal, Matrix factorization
BibRef
Dong, F.[Fei],
Nie, X.S.[Xiu-Shan],
Liu, X.B.[Xing-Bo],
Geng, L.L.[Lei-Lei],
Wang, Q.[Qian],
Cross-Modal Hashing Based on Category Structure Preserving,
JVCIR(57), 2018, pp. 28-33.
Elsevier DOI
1812
Cross-modal retrieval, Supervised hashing,
Category-specific structure preserving
BibRef
Zhang, M.J.[Mei-Jia],
Zhang, H.X.[Hua-Xiang],
Li, J.Z.[Jun-Zheng],
Wang, L.[Li],
Fang, Y.X.[Yi-Xian],
Sun, J.D.[Jian-De],
Supervised graph regularization based cross media retrieval with
intra and inter-class correlation,
JVCIR(58), 2019, pp. 1-11.
Elsevier DOI
1901
Cross media retrieval, Subspace learning, Supervised graph regularization
BibRef
Yao, T.[Tao],
Wang, G.[Gang],
Yan, L.S.[Lian-Shan],
Kong, X.W.[Xiang-Wei],
Su, Q.T.[Qing-Tang],
Zhang, C.M.[Cai-Ming],
Tian, Q.[Qi],
Online latent semantic hashing for cross-media retrieval,
PR(89), 2019, pp. 1-11.
Elsevier DOI
1902
Cross-media retrieval, Online learning, Hashing, Latent semantic concept
BibRef
Yao, T.[Tao],
Kong, X.W.[Xiang-Wei],
Fu, H.Y.[Hai-Yan],
Tian, Q.[Qi],
Discrete Semantic Alignment Hashing for Cross-Media Retrieval,
Cyber(50), No. 12, December 2020, pp. 4896-4907.
IEEE DOI
2012
Semantics, Hash functions, Correlation, Quantization (signal),
Optimization, Task analysis, Internet, Attribute,
hashing
BibRef
Dutta, T.[Titir],
Biswas, S.[Soma],
Cross-modal retrieval in challenging scenarios using attributes,
PRL(125), 2019, pp. 618-624.
Elsevier DOI
1909
Cross-modal retrieval, Attributes, Unseen query, Low-resolution data
BibRef
Liu, H.P.[Hua-Ping],
Wang, F.[Feng],
Zhang, X.Y.[Xin-Yu],
Sun, F.C.[Fu-Chun],
Weakly-paired deep dictionary learning for cross-modal retrieval,
PRL(130), 2020, pp. 199-206.
Elsevier DOI
2002
Deep dictionary learning, Cross-modal retrieval, Weak pairing
BibRef
Zhang, H.[Hong],
Wang, T.[Ting],
Dai, G.[Gang],
Semi-supervised cross-modal common representation learning with
vector-valued manifold regularization,
PRL(130), 2020, pp. 335-344.
Elsevier DOI
2002
Cross-media retrieval, Vector-valued RKHS,
Manifold regularization, Semi-supervised, Kernel method
BibRef
Chaudhuri, U.[Ushasi],
Banerjee, B.[Biplab],
Bhattacharya, A.[Avik],
Datcu, M.[Mihai],
CMIR-NET: A deep learning based model for cross-modal retrieval in
remote sensing,
PRL(131), 2020, pp. 456-462.
Elsevier DOI
2004
Remote sensing, Cross-modal retrieval, Deep learning,
Panchromatic, Multispectral, Audio samples
BibRef
Chi, J.Z.[Jing-Ze],
Peng, Y.X.[Yu-Xin],
Zero-Shot Cross-Media Embedding Learning With Dual Adversarial
Distribution Network,
CirSysVideo(30), No. 4, April 2020, pp. 1173-1187.
IEEE DOI
2004
Semantics, Media, Correlation, Training, Dogs,
Measurement, Cross-media retrieval, zero-shot learning,
maximum mean discrepancy
BibRef
Wu, F.[Fei],
Jing, X.Y.[Xiao-Yuan],
Wu, Z.Y.[Zhi-Yong],
Ji, Y.[Yimu],
Dong, X.[Xiwei],
Luo, X.K.[Xiao-Kai],
Huang, Q.H.[Qing-Hua],
Wang, R.[Ruchuan],
Modality-specific and shared generative adversarial network for
cross-modal retrieval,
PR(104), 2020, pp. 107335.
Elsevier DOI
2005
Cross-modal retrieval, Generative adversarial networks (GAN),
Modality-specific feature learning, Modality-shared feature learning
BibRef
Wu, F.[Fei],
Luo, X.K.[Xiao-Kai],
Huang, Q.H.[Qing-Hua],
Wei, P.F.[Peng-Fei],
Sun, Y.[Ying],
Dong, X.[Xiwei],
Wu, Z.Y.[Zhi-Yong],
Semantic Preserving Generative Adversarial Network for Cross-Modal
Hashing,
ICIP21(2743-2747)
IEEE DOI
2201
Measurement, Quantization (signal), Image processing, Semantics,
Focusing, Network architecture, cross-modal hashing,
semantic preserving
BibRef
Zhong, F.M.[Fang-Ming],
Chen, Z.K.[Zhi-Kui],
Min, G.Y.[Ge-Yong],
Xia, F.[Feng],
A novel strategy to balance the results of cross-modal hashing,
PR(107), 2020, pp. 107523.
Elsevier DOI
2008
Cross-modal hashing, Semantic gap, Semantic augmentation, Cross-modal retrieval
BibRef
Peng, Y.,
Chi, J.,
Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene
Graph,
CirSysVideo(30), No. 11, November 2020, pp. 4368-4379.
IEEE DOI
2011
Media, Correlation, Visualization, Genomics, Bioinformatics,
Training data, Training, Cross-media retrieval, domain adaptation,
scene graph
BibRef
Zhu, L.[Lei],
Song, J.[Jiayu],
Zhu, X.F.[Xiao-Feng],
Zhang, C.Y.[Cheng-Yuan],
Zhang, S.C.[Shi-Chao],
Yuan, X.P.[Xin-Pan],
Adversarial Learning-Based Semantic Correlation Representation for
Cross-Modal Retrieval,
MultMedMag(27), No. 4, October 2020, pp. 79-90.
IEEE DOI
2012
Correlation, Semantics, Computer science, Internet, Streaming media
BibRef
Zhu, L.[Lei],
Zhang, C.Y.[Cheng-Yuan],
Song, J.[Jiayu],
Zhang, S.C.[Shi-Chao],
Tian, C.[Chunwei],
Zhu, X.[Xinghui],
Deep Multigraph Hierarchical Enhanced Semantic Representation for
Cross-Modal Retrieval,
MultMedMag(29), No. 3, July 2022, pp. 17-26.
IEEE DOI
2209
Semantics, Adversarial machine learning, Correlation,
Visualization, Generators, Generative adversarial networks, Computer science
BibRef
Chaudhuri, U.[Ushasi],
Banerjee, B.[Biplab],
Bhattacharya, A.[Avik],
Datcu, M.[Mihai],
CrossATNet: A novel cross-attention based framework for sketch-based
image retrieval,
IVC(104), 2020, pp. 104003.
Elsevier DOI
2012
Neural networks, Sketch-based image retrieval,
Cross-modal retrieval, Deep-learning, Cross-attention network, Cross-triplets
BibRef
Zhang, Y.,
Zhou, W.,
Wang, M.,
Tian, Q.,
Li, H.,
Deep Relation Embedding for Cross-Modal Retrieval,
IP(30), 2021, pp. 617-627.
IEEE DOI
2012
Semantics, Feature extraction, Visualization,
Computational modeling, Task analysis, Training, Optimization,
relation
BibRef
Zhang, L.[Lei],
Chen, L.T.[Lei-Ting],
Ou, W.H.[Wei-Hua],
Zhou, C.[Chuan],
Semi-supervised cross-modal representation learning with GAN-based
Asymmetric Transfer Network,
JVCIR(73), 2020, pp. 102899.
Elsevier DOI
2012
Cross-modal retrieval, Modality gap, Generative adversarial network
BibRef
Wang, L.[Lu],
Yang, J.[Jie],
Zareapoor, M.[Masoumeh],
Zheng, Z.L.[Zhong-Long],
Cluster-wise unsupervised hashing for cross-modal similarity search,
PR(111), 2021, pp. 107732.
Elsevier DOI
2012
Cross-modal similarity retrieval, Multi-view clustering,
The cluster-wise code-prototypes, Cross-modal hashing,
BibRef
Meng, M.,
Wang, H.,
Yu, J.,
Chen, H.,
Wu, J.,
Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal
Retrieval,
IP(30), 2021, pp. 986-1000.
IEEE DOI
2012
Semantics, Optimization, Quantization (signal), Correlation,
Symmetric matrices, Image coding, Sparse matrices, multimedia
BibRef
Matsubara, T.[Takashi],
Target-Oriented Deformation of Visual-Semantic Embedding Space,
IEICE(E104-D), No. 1, January 2021, pp. 24-33.
WWW Link.
2101
BibRef
Nie, X.,
Wang, B.,
Li, J.,
Hao, F.,
Jian, M.,
Yin, Y.,
Deep Multiscale Fusion Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 1, January 2021, pp. 401-410.
IEEE DOI
2101
Semantics, Machine learning, Training data, Media,
Correlation, Retrieval, hashing, deep learning, cross-modal
BibRef
Liu, X.[Xin],
Hu, Z.K.[Zhi-Kai],
Ling, H.B.[Hai-Bin],
Cheung, Y.M.[Yiu-Ming],
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient
Cross-Modal Retrieval,
PAMI(43), No. 3, March 2021, pp. 964-981.
IEEE DOI
2102
Lips, Semantics, Adaptation models, Task analysis,
Encoding, Correlation, Cross-modal retrieval,
semantic correlation matrix
BibRef
Wu, Y.,
Wang, S.,
Song, G.,
Huang, Q.,
Augmented Adversarial Training for Cross-Modal Retrieval,
MultMed(23), 2021, pp. 559-571.
IEEE DOI
2102
image representation, image retrieval, neural nets, text analysis,
adversarial training process,
adversa-rial training
BibRef
Lin, Q.,
Cao, W.,
He, Z.,
He, Z.,
Mask Cross-Modal Hashing Networks,
MultMed(23), 2021, pp. 550-558.
IEEE DOI
2102
deep learning (artificial intelligence), feature extraction,
file organisation, image retrieval, text analysis,
cross-modal retrieval
BibRef
Qi, M.,
Qin, J.,
Yang, Y.,
Wang, Y.,
Luo, J.,
Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video
Retrieval,
IP(30), 2021, pp. 2989-3004.
IEEE DOI
2102
Semantics, Binary codes, Feature extraction, Visualization,
Task analysis, Natural languages, Stochastic processes,
natural language
BibRef
Wu, J.L.[Jian-Long],
Xie, X.X.[Xing-Xu],
Nie, L.Q.[Li-Qiang],
Lin, Z.C.[Zhou-Chen],
Zha, H.B.[Hong-Bin],
Reconstruction regularized low-rank subspace learning for cross-modal
retrieval,
PR(113), 2021, pp. 107813.
Elsevier DOI
2103
Cross-modal retrieval, Low-rank subspace learning,
Reconstruction regularization
BibRef
Zou, X.T.[Xi-Tao],
Wang, X.Z.[Xin-Zhi],
Bakker, E.M.[Erwin M.],
Wu, S.[Song],
Multi-label semantics preserving based deep cross-modal hashing,
SP:IC(93), 2021, pp. 116131.
Elsevier DOI
2103
Multi-modal retrieval, Deep cross-modal hashing, Multi-label semantic learning
BibRef
Shu, X.[Xin],
Zhao, G.Y.[Guo-Ying],
Scalable multi-label canonical correlation analysis for cross-modal
retrieval,
PR(115), 2021, pp. 107905.
Elsevier DOI
2104
Canonical correlation analysis, Semantic transformation,
Cross-modal retrieval, Singular value decomposition
BibRef
Song, G.[Ge],
Tan, X.Y.[Xiao-Yang],
Real-world Cross-modal Retrieval via Sequential Learning,
MultMed(23), 2021, pp. 1708-1721.
IEEE DOI
2106
BibRef
Earlier:
Sequential Learning for Cross-Modal Retrieval,
CroMoL19(4531-4539)
IEEE DOI
2004
Plugs, Task analysis, Data models, Learning systems, Brain modeling,
Adaptation models, Technological innovation,
meta learning.
information retrieval,
learning (artificial intelligence), multimodal data, meta learning
BibRef
Chen, W.[Wei],
Liu, Y.[Yu],
Bakker, E.M.[Erwin M.],
Lew, M.S.[Michael S.],
Integrating information theory and adversarial learning for
cross-modal retrieval,
PR(117), 2021, pp. 107983.
Elsevier DOI
2106
Cross-modal retrieval, Shannon information theory,
Adversarial learning, Modality uncertainty, Data imbalance
BibRef
Huang, Z.Y.[Zhen-Yu],
Zhou, J.T.Y.[Joey Tian-Yi],
Zhu, H.Y.[Hong-Yuan],
Zhang, C.Q.[Chang-Qing],
Lv, J.C.[Jian-Cheng],
Peng, X.[Xi],
Deep Spectral Representation Learning from Multi-View Data,
IP(30), 2021, pp. 5352-5362.
IEEE DOI
2106
Deep learning, Laplace equations, Neural networks, Collaboration,
Data models, Task analysis,
cross-modal retrieval
BibRef
Wen, X.[Xin],
Han, Z.Z.[Zhi-Zhong],
Liu, Y.S.[Yu-Shen],
CMPD: Using Cross Memory Network With Pair Discrimination for
Image-Text Retrieval,
CirSysVideo(31), No. 6, June 2021, pp. 2427-2437.
IEEE DOI
2106
Semantics, Task analysis, Training, Generators, Optimization,
Marine vehicles, Retrieval, cross-modal retrieval, adversarial learning
BibRef
Liu, J.H.[Jun-Hao],
Yang, M.[Min],
Li, C.M.[Cheng-Ming],
Xu, R.F.[Rui-Feng],
Improving Cross-Modal Image-Text Retrieval With Teacher-Student
Learning,
CirSysVideo(31), No. 8, August 2021, pp. 3242-3253.
IEEE DOI
2108
Semantics, Task analysis, Data models, Neural networks, Correlation,
Binary codes, Feature extraction,
teacher-student learning
BibRef
Song, G.[Ge],
Tan, X.Y.[Xiao-Yang],
Zhao, J.[Jun],
Yang, M.[Ming],
Deep robust multilevel semantic hashing for multi-label cross-modal
retrieval,
PR(120), 2021, pp. 108084.
Elsevier DOI
2109
Hashing, Multi-label, Cross-modal retrieval, Deep learning
BibRef
Song, G.[Ge],
Huang, K.[Kai],
Su, H.W.[Han-Wen],
Song, F.Y.[Feng-Yi],
Yang, M.[Ming],
Deep Ranking Distribution Preserving Hashing for Robust Multi-Label
Cross-Modal Retrieval,
MultMed(26), 2024, pp. 7027-7042.
IEEE DOI
2405
Codes, Semantics, Training, Correlation, Task analysis, Robustness,
Hamming distances, Cross-modal retrieval, deep hashing,
multi-label
BibRef
Song, G.[Ge],
Su, H.W.[Han-Wen],
Huang, K.[Kai],
Song, F.Y.[Feng-Yi],
Yang, M.[Ming],
Deep self-enhancement hashing for robust multi-label cross-modal
retrieval,
PR(147), 2024, pp. 110079.
Elsevier DOI
2312
Cross-modal retrieval, Deep hashing, Out-of-distribution, Multi-label
BibRef
Fang, Y.Z.[Yu-Zhi],
Robust multimodal discrete hashing for cross-modal similarity search,
JVCIR(79), 2021, pp. 103256.
Elsevier DOI
2109
Hashing, Robust, Cross-modal retrieval, Unsupervised learning
BibRef
Nie, X.S.[Xiu-Shan],
Liu, X.B.[Xing-Bo],
Xi, X.M.[Xiao-Ming],
Li, C.L.[Cheng-Long],
Yin, Y.L.[Yi-Long],
Fast Unmediated Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 9, September 2021, pp. 3669-3678.
IEEE DOI
2109
Semantics, Training, Optimization, Training data, Binary codes,
Correlation, Videos, Cross-modal retrieval, hashing, unmediated,
double supervision
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Yin, H.F.[He-Feng],
Kittler, J.V.[Josef V.],
MOON: Multi-hash codes joint learning for cross-media retrieval,
PRL(151), 2021, pp. 19-25.
Elsevier DOI
2110
Cross-media retrieval, Hashing, Discrete optimization, Joint learning
BibRef
Hu, P.[Peng],
Peng, X.[Xi],
Zhu, H.Y.[Hong-Yuan],
Lin, J.[Jie],
Zhen, L.L.[Liang-Li],
Peng, D.Z.[De-Zhong],
Joint Versus Independent Multiview Hashing for Cross-View Retrieval,
Cyber(51), No. 10, October 2021, pp. 4982-4993.
IEEE DOI
2110
Semantics, Decoding, Training, Computer science, Kernel, Logistics,
Cybernetics, Common hamming space, cross-view retrieval,
multiview representation learning
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Robust and discrete matrix factorization hashing for cross-modal
retrieval,
PR(122), 2022, pp. 108343.
Elsevier DOI
2112
Cross-modal retrieval, Hashing, Autoencoder,
Discrete optimization,
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Xu, T.Y.[Tian-Yang],
Kittler, J.V.[Josef V.],
Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval,
SMCS(52), No. 11, November 2022, pp. 7014-7026.
IEEE DOI
2210
Semantics, Binary codes, Hash functions, Optimization,
Quantization (signal), Task analysis, Costs, Cross-modal retrieval,
hashing
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Liu, Z.[Zhen],
Yu, J.[Jun],
Kittler, J.V.[Josef V.],
Fast Discrete Cross-Modal Hashing Based on Label Relaxation and
Matrix Factorization,
ICPR21(4845-4850)
IEEE DOI
2105
Technological innovation, Quantization (signal), Databases,
Instruments, Semantics, Binary codes, Media
BibRef
Zhang, L.[Li],
Wu, X.Q.[Xiang-Qian],
Multi-task framework based on feature separation and reconstruction
for cross-modal retrieval,
PR(122), 2022, pp. 108217.
Elsevier DOI
2112
Cross-modal retrieval, Feature separation,
Image reconstruction, Text reconstruction
BibRef
Liu, F.[Fangcen],
Gao, C.Q.[Chen-Qiang],
Sun, Y.Q.[Yong-Qing],
Zhao, Y.[Yue],
Yang, F.[Feng],
Qin, A.[Anyong],
Meng, D.Y.[De-Yu],
Infrared and Visible Cross-Modal Image Retrieval Through Shared
Features,
CirSysVideo(31), No. 11, November 2021, pp. 4485-4496.
IEEE DOI
2112
Image retrieval, Feature extraction, Task analysis, Imaging,
Semantics, Image color analysis, Cameras,
maximum mean discrepancy
BibRef
Wang, C.Y.[Chao-Yi],
Li, L.[Liang],
Yan, C.G.[Cheng-Gang],
Wang, Z.[Zhan],
Sun, Y.Q.[Yao-Qi],
Zhang, J.Y.[Ji-Yong],
Cross-modal semantic correlation learning by Bi-CNN network,
IET-IPR(15), No. 14, 2021, pp. 3674-3684.
DOI Link
2112
BibRef
Chakraborty, B.[Bela],
Wang, P.[Peng],
Wang, L.[Lei],
Inter-Modality Fusion Based Attention for Zero-Shot Cross-Modal
Retrieval,
ICIP21(2648-2652)
IEEE DOI
2201
Training, Heating systems, Image processing, Semantics, Pipelines,
MIMICs, Zero-shot Learning, Inter-Modality Fusion,
Cross-modal Retrieval
BibRef
Zhang, P.F.[Peng-Fei],
Li, Y.[Yang],
Huang, Z.[Zi],
Xu, X.S.[Xin-Shun],
Aggregation-Based Graph Convolutional Hashing for Unsupervised
Cross-Modal Retrieval,
MultMed(24), 2022, pp. 466-479.
IEEE DOI
2202
Semantics, Convolutional codes, Binary codes, Convolution,
Measurement, Feature extraction, Sparse matrices, Multimodal,
graph convolutional networks
BibRef
Shin, A.[Andrew],
Ishii, M.[Masato],
Narihira, T.[Takuya],
Perspectives and Prospects on Transformer Architecture for Cross-Modal
Tasks with Language and Vision,
IJCV(130), No. 2, February 2022, pp. 435-454.
Springer DOI
2202
BibRef
Ji, Z.[Zhong],
Wang, H.R.[Hao-Ran],
Han, J.G.[Jun-Gong],
Pang, Y.W.[Yan-Wei],
SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text
Retrieval,
Cyber(52), No. 2, February 2022, pp. 1086-1097.
IEEE DOI
2202
Visualization, Semantics, Feature extraction, Correlation,
Task analysis, Extraterrestrial measurements, Deep learning,
vision and language
BibRef
Ma, J.J.[Jing-Jing],
Shi, D.[Duanpeng],
Tang, X.[Xu],
Zhang, X.R.[Xiang-Rong],
Jiao, L.C.[Li-Cheng],
Dual Modality Collaborative Learning for Cross-Source Remote Sensing
Retrieval,
RS(14), No. 6, 2022, pp. xx-yy.
DOI Link
2204
BibRef
Huang, Y.[Yan],
Wang, J.D.[Jing-Dong],
Wang, L.[Liang],
Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory,
PAMI(44), No. 6, June 2022, pp. 2968-2983.
IEEE DOI
2205
BibRef
Earlier: A1, A3, Only:
ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence
Matching,
ICCV19(5773-5782)
IEEE DOI
2004
Adaptation models, Task analysis, Pattern matching, Logic gates,
Visualization, Image color analysis, Data models,
similarity gated fusion.
image matching, learning (artificial intelligence),
storage management, few-shot content, sentence matching tasks,
Micromechanical devices
BibRef
Xu, X.[Xing],
Lin, K.Y.[Kai-Yi],
Yang, Y.[Yang],
Hanjalic, A.[Alan],
Shen, H.T.[Heng Tao],
Joint Feature Synthesis and Embedding:
Adversarial Cross-Modal Retrieval Revisited,
PAMI(44), No. 6, June 2022, pp. 3030-3047.
IEEE DOI
2205
Art, Generative adversarial networks, Training,
Correlation, Visualization, Standards, Cross-modal retrieval,
knowledge transfer
BibRef
Li, S.S.[Shen-Shen],
Xu, X.[Xing],
Jiang, X.[Xun],
Shen, F.M.[Fu-Min],
Liu, X.[Xin],
Shen, H.T.[Heng Tao],
Multi-Grained Attention Network With Mutual Exclusion for Composed
Query-Based Image Retrieval,
CirSysVideo(34), No. 4, April 2024, pp. 2959-2972.
IEEE DOI
2404
Semantics, Image retrieval, Task analysis, Feature extraction,
Visualization, Fuses, preserved and modified attentions
BibRef
Duan, Y.X.[You-Xiang],
Chen, N.[Ning],
Zhang, P.Y.[Pei-Ying],
Kumar, N.[Neeraj],
Chang, L.[Lunjie],
Wen, W.[Wu],
MS2GAH: Multi-label semantic supervised graph attention hashing for
robust cross-modal retrieval,
PR(128), 2022, pp. 108676.
Elsevier DOI
2205
Cross-modal retrieval, Deep hashing, Graph attention network
BibRef
Hamroun, M.[Mohamed],
Tamine, K.[Karim],
Crespin, B.[Benoît],
Multimodal Video Indexing (MVI): A New Method Based on Machine Learning
and Semi-Automatic Annotation on Large Video Collections,
IJIG(22), No. 2, April 2022, pp. 2250022.
DOI Link
2205
BibRef
Parida, K.K.[Kranti Kumar],
Sharma, G.[Gaurav],
Discriminative semantic transitive consistency for cross-modal
learning,
CVIU(219), 2022, pp. 103404.
Elsevier DOI
2205
Cross-modal retrieval, Distributional matching
BibRef
Xu, L.M.[Li-Ming],
Zeng, X.H.[Xian-Hua],
Zheng, B.[Bochuan],
Li, W.S.[Wei-Sheng],
Multi-Manifold Deep Discriminative Cross-Modal Hashing for Medical
Image Retrieval,
IP(31), 2022, pp. 3371-3385.
IEEE DOI
2205
Codes, Manifolds, Semantics, Correlation, Image retrieval,
Medical diagnostic imaging, Data models, Cross-modal hashing,
weak discriminability
BibRef
Song, X.[Xue],
Chen, J.J.[Jing-Jing],
Wu, Z.[Zuxuan],
Jiang, Y.G.[Yu-Gang],
Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval,
MultMed(24), 2022, pp. 2914-2923.
IEEE DOI
2206
Visualization, Semantics, Bit error rate, Encoding, Task analysis,
Feature extraction, Microphones, Cross-modal retrieval,
cross-modal learning
BibRef
Ma, X.H.[Xin-Hong],
Yang, X.S.[Xiao-Shan],
Gao, J.Y.[Jun-Yu],
Xu, C.S.[Chang-Sheng],
The Model May Fit You: User-Generalized Cross-Modal Retrieval,
MultMed(24), 2022, pp. 2998-3012.
IEEE DOI
2206
Data models, Task analysis, Adaptation models, Training,
Benchmark testing, Pediatrics, Bridges, cross-modal retrieval,
meta-learning
BibRef
Yang, F.[Fan],
Liu, Y.F.[Yu-Feng],
Ding, X.J.[Xiao-Jian],
Ma, F.M.[Fu-Min],
Cao, J.[Jie],
Asymmetric cross-modal hashing with high-level semantic similarity,
PR(130), 2022, pp. 108823.
Elsevier DOI
2206
Cross-modal retrieval, Hashing, Similarity search, Supervised, Optimization
BibRef
Shan, W.[Wei],
Huang, D.[Dan],
Wang, J.T.[Jiang-Tao],
Zou, F.[Feng],
Li, S.[Suwen],
Self-Attention based fine-grained cross-media hybrid network,
PR(130), 2022, pp. 108748.
Elsevier DOI
2206
Fine-Grained, Cross-Media, Retrieval, Attention
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Scalable Discrete Matrix Factorization and Semantic Autoencoder for
Cross-Media Retrieval,
Cyber(52), No. 7, July 2022, pp. 5947-5960.
IEEE DOI
2207
Semantics, Hash functions, Binary codes, Quantization (signal),
Training data, Training, Task analysis, Autoencoder, hashing
BibRef
Qian, S.S.[Sheng-Sheng],
Xue, D.Z.[Di-Zhan],
Fang, Q.[Quan],
Xu, C.S.[Chang-Sheng],
Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal
Retrieval,
MultMed(24), 2022, pp. 3520-3532.
IEEE DOI
2207
Correlation, Semantics, Task analysis, Adaptation models,
Adaptive systems, Birds, Oceans, Cross-modal retrieval,
Graph convolutional networks
BibRef
Wang, Y.[Yunbo],
Peng, Y.X.[Yu-Xin],
MARS: Learning Modality-Agnostic Representation for Scalable
Cross-Media Retrieval,
CirSysVideo(32), No. 7, July 2022, pp. 4765-4777.
IEEE DOI
2207
Semantics, Correlation, Training, Cats, Automobiles, Transforms, Media,
Multi-modality learning, cross-media retrieval,
similarity retrieval
BibRef
Wang, L.[Lu],
Zareapoor, M.[Masoumeh],
Yang, J.[Jie],
Zheng, Z.L.[Zhong-Long],
Asymmetric Correlation Quantization Hashing for Cross-Modal Retrieval,
MultMed(24), 2022, pp. 3665-3678.
IEEE DOI
2208
Semantics, Quantization (signal), Correlation, Binary codes,
Databases, Optimization, Hash functions,
Compositional quantization
BibRef
Qin, J.Y.[Jian-Yang],
Fei, L.[Lunke],
Zhang, Z.[Zheng],
Wen, J.[Jie],
Xu, Y.[Yong],
Zhang, D.[David],
Joint Specifics and Consistency Hash Learning for Large-Scale
Cross-Modal Retrieval,
IP(31), 2022, pp. 5343-5358.
IEEE DOI
2208
Binary codes, Semantics, Hash functions, Feature extraction,
Collaboration, Training, Optimization, Learning to hash,
large-scale similarity searching
BibRef
Liu, G.H.[Guang-Hai],
Li, Z.Y.[Zuo-Yong],
Yang, J.Y.[Jing-Yu],
Zhang, D.[David],
Exploiting sublimated deep features for image retrieval,
PR(147), 2024, pp. 110076.
Elsevier DOI
2312
Image retrieval, Deep feature, Orientation-selective mechanism,
Sublimated deep feature histogram, Gain whitening learning
BibRef
Shi, Y.F.[Yu-Feng],
Zhao, Y.[Yue],
Liu, X.[Xin],
Zheng, F.[Feng],
Ou, W.H.[Wei-Hua],
You, X.G.[Xin-Ge],
Peng, Q.[Qinmu],
Deep Adaptively-Enhanced Hashing With Discriminative Similarity
Guidance for Unsupervised Cross-Modal Retrieval,
CirSysVideo(32), No. 10, October 2022, pp. 7255-7268.
IEEE DOI
2210
Hash functions, Optimization, Codes, Semantics, Estimation,
Computer science, Annotations, Cross-modal retrieval,
optimization strategy
BibRef
Liu, Z.[Zhi],
Zhao, F.Y.[Fang-Yuan],
Zhang, M.M.[Meng-Meng],
An Efficient Multimodal Aggregation Network for Video-Text Retrieval,
IEICE(E105-D), No. 10, October 2022, pp. 1825-1828.
WWW Link.
2210
BibRef
Guo, D.J.[Dong-Jin],
Su, X.M.[Xiao-Ming],
Lian, Y.[Yahong],
Liu, L.M.[Li-Min],
Wang, H.B.[Hai-Bo],
Two-stage partial image-text clustering (TPIT-C),
IET-CV(16), No. 8, 2022, pp. 694-708.
DOI Link
2210
BibRef
Wang, S.[Song],
Zhao, H.[Huan],
Li, K.Q.[Ke-Qin],
Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text
Search,
CirSysVideo(32), No. 11, November 2022, pp. 8022-8036.
IEEE DOI
2211
Semantics, Codes, Optimization, Training, Task analysis,
Matrix converters, Hash functions, Cross-modal image-text search,
supervised hashing
BibRef
Liu, X.H.[Xing-Hua],
Cao, G.T.[Gui-Tao],
Lin, Q.B.[Qiu-Bin],
Cao, W.M.[Wen-Ming],
Adaptive weight multi-channel center similar deep hashing,
JVCIR(89), 2022, pp. 103642.
Elsevier DOI
2212
Multi-channel, Center similar, Multimodal retrieval, Deep cross-modal hashing
BibRef
Lan, R.[Rushi],
Tan, Y.[Yu],
Wang, X.Q.[Xiao-Qin],
Liu, Z.B.[Zhen-Bing],
Luo, X.N.[Xiao-Nan],
Label Guided Discrete Hashing for Cross-Modal Retrieval,
ITS(23), No. 12, December 2022, pp. 25236-25248.
IEEE DOI
2212
Codes, Manifolds, Semantics, Training, Binary codes, Task analysis,
Sparse matrices, Cross-modal retrieval, manifold embedding, balanced matrix
BibRef
Wang, Y.X.[Yong-Xin],
Chen, Z.D.[Zhen-Duo],
Luo, X.[Xin],
Xu, X.S.[Xin-Shun],
A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval,
CirSysVideo(32), No. 12, December 2022, pp. 8822-8836.
IEEE DOI
2212
Codes, Semantics, Encoding, Task analysis, Optimization,
Streaming media, Sparse matrices, Sparse hashing, fine-grained similarity
BibRef
Jin, M.[Ming],
Zhang, H.X.[Hua-Xiang],
Zhu, L.[Lei],
Sun, J.D.[Jian-De],
Liu, L.[Li],
Video Sampled Frame Category Aggregation and Consistent
Representation for Cross-Modal Retrieval,
CirSysVideo(33), No. 2, February 2023, pp. 909-919.
IEEE DOI
2302
Feature extraction, Semantics, Training, Convolution, Dogs,
Network architecture, Video and text cross-modal retrieval,
video internal frame aggregation loss module
BibRef
Liao, L.[Lei],
Yang, M.[Meng],
Zhang, B.[Bob],
Deep Supervised Dual Cycle Adversarial Network for Cross-Modal
Retrieval,
CirSysVideo(33), No. 2, February 2023, pp. 920-934.
IEEE DOI
2302
Semantics, Generative adversarial networks, Feature extraction,
Task analysis, Media, Deep learning, Neural networks,
deep supervised learning
BibRef
Su, M.Y.[Ming-Yue],
Gu, G.H.[Guang-Hua],
Ren, X.[Xianlong],
Fu, H.[Hao],
Zhao, Y.[Yao],
Semi-Supervised Knowledge Distillation for Cross-Modal Hashing,
MultMed(25), 2023, pp. 662-675.
IEEE DOI
2302
Semantics, Knowledge engineering, Codes, Predictive models,
Data models, Cows, Bridges, Cross-modal retrieval, triplet ranking loss
BibRef
Gong, Y.[Yan],
Cosma, G.[Georgina],
Improving visual-semantic embeddings by learning
semantically-enhanced hard negatives for cross-modal information
retrieval,
PR(137), 2023, pp. 109272.
Elsevier DOI
2302
Visual semantic embedding network, Cross-modal,
Information retrieval, Hard negatives
BibRef
Li, W.H.[Wen-Hui],
Wang, Y.[Yan],
Su, Y.T.[Yu-Ting],
Li, X.Y.[Xuan-Ya],
Liu, A.A.[An-An],
Zhang, Y.D.[Yong-Dong],
Multi-Scale Fine-Grained Alignments for Image and Sentence Matching,
MultMed(25), 2023, pp. 543-556.
IEEE DOI
2302
Semantics, Visualization, Dogs, Mouth, Task analysis, Feature extraction,
Bridges, Bi-directional aggregations, multi-scale alignments
BibRef
Ou, W.H.[Wei-Hua],
Deng, J.X.[Jia-Xin],
Zhang, L.[Lei],
Gou, J.P.[Jian-Ping],
Zhou, Q.[Quan],
Cross-Modal Generation and Pair Correlation Alignment Hashing,
ITS(24), No. 3, March 2023, pp. 3018-3026.
IEEE DOI
2303
Semantics, Feature extraction, Correlation, Codes, Transformers,
Generative adversarial networks, Data mining,
cross-modal interaction
BibRef
Wang, D.[Di],
Zhang, C.P.[Cai-Ping],
Wang, Q.[Quan],
Tian, Y.M.[Yu-Min],
He, L.[Lihuo],
Zhao, L.[Lin],
Hierarchical Semantic Structure Preserving Hashing for Cross-Modal
Retrieval,
MultMed(25), 2023, pp. 1217-1229.
IEEE DOI
2305
Semantics, Codes, Binary codes, Representation learning, Correlation,
Hash functions, Feature extraction, Cross-modal retrieval,
hierarchical learning
BibRef
Hu, P.[Peng],
Huang, Z.Y.[Zhen-Yu],
Peng, D.Z.[De-Zhong],
Wang, X.[Xu],
Peng, X.[Xi],
Cross-Modal Retrieval With Partially Mismatched Pairs,
PAMI(45), No. 8, August 2023, pp. 9595-9610.
IEEE DOI
2307
Semantics, Force, Cognition, Visualization, Upper bound,
Stability analysis, Robustness,
mismatched pairs
BibRef
Liu, Y.X.[Ya-Xin],
Wu, J.L.[Jian-Long],
Qu, L.[Leigang],
Gan, T.[Tian],
Yin, J.H.[Jian-Hua],
Nie, L.Q.[Li-Qiang],
Self-Supervised Correlation Learning for Cross-Modal Retrieval,
MultMed(25), 2023, pp. 2851-2863.
IEEE DOI
2307
Correlation, Semantics, Mutual information, Kernel, Unsupervised learning,
Supervised learning, mutual information estimation
BibRef
Wang, B.H.[Ben-Hui],
Zhang, H.X.[Hua-Xiang],
Zhu, L.[Lei],
Nie, L.Q.[Li-Qiang],
Liu, L.[Li],
Multi-level adversarial attention cross-modal hashing,
SP:IC(117), 2023, pp. 117017.
Elsevier DOI
2308
Cross-modal retrieval, Adversarial Learning, Attentional mechanism, Hashing
BibRef
Sun, C.[Chunpu],
Zhang, H.X.[Hua-Xiang],
Liu, L.[Li],
Liu, D.M.[Dong-Mei],
Wang, L.[Lin],
Multi-Label Adversarial Fine-Grained Cross-Modal Retrieval,
SP:IC(117), 2023, pp. 117018.
Elsevier DOI
2308
Common representation, Transformer, Adversarial learning, Cross-modal retrieval
BibRef
Guo, S.T.[Sheng-Tang],
Zhang, H.X.[Hua-Xiang],
Liu, L.[Li],
Liu, D.M.[Dong-Mei],
Lu, X.[Xu],
Li, L.J.[Liu-Jian],
Hypergraph clustering based multi-label cross-modal retrieval,
JVCIR(103), 2024, pp. 104258.
Elsevier DOI
2409
Cross-modal retrieval, Hypergraph, Clustering, Alignment
BibRef
Huo, Y.D.[Ya-Dong],
Qin, Q.[Qibing],
Dai, J.Y.[Jiang-Yan],
Wang, L.[Lei],
Zhang, W.F.[Wen-Feng],
Huang, L.[Lei],
Wang, C.[Chengduan],
Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal
Retrieval,
CirSysVideo(34), No. 1, January 2024, pp. 576-589.
IEEE DOI Code:
WWW Link.
2401
BibRef
Song, D.[Dan],
Ling, Y.T.[Yu-Ting],
Li, T.[Tianbao],
Wang, T.[Teng],
Li, X.[Xuanya],
Hierarchical deep semantic alignment for cross-domain 3D model
retrieval,
JVCIR(95), 2023, pp. 103895.
Elsevier DOI
2309
3D model retrieval, Unsupervised domain adaptation, Representation learning
BibRef
Li, T.B.[Tian-Bao],
Liu, A.A.[An-An],
Song, D.[Dan],
Li, W.H.[Wen-Hui],
Li, X.Y.[Xuan-Ya],
Su, Y.T.[Yu-Ting],
Focus on Hard Samples: Hierarchical Unbiased Constraints for
Cross-Domain 3D Model Retrieval,
CirSysVideo(33), No. 11, November 2023, pp. 7036-7049.
IEEE DOI
2311
BibRef
Dong, X.[Xiao],
Zhan, X.L.[Xun-Lin],
Wei, Y.C.[Yun-Chao],
Wei, X.Y.[Xiao-Yong],
Wang, Y.[Yaowei],
Lu, M.L.[Min-Long],
Cao, X.C.[Xiao-Chun],
Liang, X.D.[Xiao-Dan],
Entity-Graph Enhanced Cross-Modal Pretraining for Instance-Level
Product Retrieval,
PAMI(45), No. 11, November 2023, pp. 13117-13133.
IEEE DOI
2310
BibRef
Zhan, X.L.[Xun-Lin],
Wu, Y.X.[Yang-Xin],
Dong, X.[Xiao],
Wei, Y.C.[Yun-Chao],
Lu, M.L.[Min-Long],
Zhang, Y.C.[Yi-Chi],
Xu, H.[Hang],
Liang, X.D.[Xiao-Dan],
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval
via Cross-Modal Pretraining,
ICCV21(11762-11771)
IEEE DOI
2203
Industries, Measurement, Codes, Transformers, Solids,
Electronic commerce, Image and video retrieval, Vision + language
BibRef
Zhang, X.[Xiong],
Li, W.P.[Wei-Peng],
Wang, X.[Xu],
Wang, L.[Luyao],
Zheng, F.Z.[Fu-Zhong],
Wang, L.[Long],
Zhang, H.[Haisu],
A Fusion Encoder with Multi-Task Guidance for Cross-Modal Text-Image
Retrieval in Remote Sensing,
RS(15), No. 18, 2023, pp. 4637.
DOI Link
2310
BibRef
Tu, R.C.[Rong-Cheng],
Jiang, J.[Jie],
Lin, Q.H.[Qing-Hong],
Cai, C.F.[Cheng-Fei],
Tian, S.X.[Shang-Xuan],
Wang, H.F.[Hong-Fa],
Liu, W.[Wei],
Unsupervised Cross-Modal Hashing With Modality-Interaction,
CirSysVideo(33), No. 9, September 2023, pp. 5296-5308.
IEEE DOI
2310
BibRef
Liu, X.[Xin],
Yi, J.H.[Jin-Han],
Cheung, Y.M.[Yiu-Ming],
Xu, X.[Xing],
Cui, Z.[Zhen],
OMGH: Online Manifold-Guided Hashing for Flexible Cross-Modal
Retrieval,
MultMed(25), 2023, pp. 3811-3824.
IEEE DOI
2310
BibRef
Peng, S.J.[Shu-Juan],
Yi, J.H.[Jin-Han],
Liu, X.[Xin],
Cheung, Y.M.[Yiu-Ming],
Cui, Z.[Zhen],
Li, T.H.[Tai-Hao],
OLCH: Online Label Consistent Hashing for streaming cross-modal
retrieval,
PR(150), 2024, pp. 110335.
Elsevier DOI
2403
Cross-modal hashing, Online label consistent hashing,
Mini-batch online gradient descent, Forward-backward splitting
BibRef
Tan, W.T.[Wen-Tao],
Zhu, L.[Lei],
Li, J.J.[Jing-Jing],
Zhang, H.X.[Hua-Xiang],
Han, J.W.[Jun-Wei],
Teacher-Student Learning: Efficient Hierarchical Message Aggregation
Hashing for Cross-Modal Retrieval,
MultMed(25), 2023, pp. 4520-4532.
IEEE DOI
2310
BibRef
Song, L.Y.[Ling-Yun],
Shang, X.[Xuequn],
Yang, C.[Chen],
Sun, M.X.[Ming-Xuan],
Attribute-Guided Multiple Instance Hashing Network for Cross-Modal
Zero-Shot Hashing,
MultMed(25), 2023, pp. 5305-5318.
IEEE DOI
2311
BibRef
Li, L.[Li],
Shu, Z.Q.[Zhen-Qiu],
Yu, Z.T.[Zheng-Tao],
Wu, X.J.[Xiao-Jun],
Robust online hashing with label semantic enhancement for cross-modal
retrieval,
PR(145), 2024, pp. 109972.
Elsevier DOI
2311
Robust, Noise, Low-rank, Sparse, Multi-label semantic correlations,
Similarity, Online hashing, Cross-modal retrieval
BibRef
Ye, Z.[Zesheng],
Yao, L.[Lina],
Zhang, Y.[Yu],
Gustin, S.[Sylvia],
Self-supervised cross-modal visual retrieval from brain activities,
PR(145), 2024, pp. 109915.
Elsevier DOI
2311
Visual stimuli recovery, Cross-modal retrieval,
Self-supervised learning, Brain-Computer Interface
BibRef
Chen, Z.J.[Zheng-Jie],
Zhang, Y.[Yu],
Mi, S.[Siya],
Assisting Multimodal Named Entity Recognition by cross-modal
auxiliary tasks,
PRL(175), 2023, pp. 52-58.
Elsevier DOI
2311
Multimodal named entity recognition, Multi-task learning, Cross-modal learning
BibRef
Liu, X.Q.[Xiao-Qing],
Zeng, H.Q.[Huan-Qiang],
Shi, Y.F.[Yi-Fan],
Zhu, J.Q.[Jian-Qing],
Hsia, C.H.[Chih-Hsien],
Ma, K.K.[Kai-Kuang],
Deep Cross-Modal Hashing Based on Semantic Consistent Ranking,
MultMed(25), 2023, pp. 9530-9542.
IEEE DOI
2312
BibRef
Luo, K.Y.[Kai-Yi],
Zhang, C.[Chao],
Li, H.X.[Hua-Xiong],
Jia, X.[Xiuyi],
Chen, C.L.[Chun-Lin],
Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal
Retrieval,
MultMed(25), 2023, pp. 9082-9095.
IEEE DOI
2312
BibRef
Li, Z.X.[Zheng-Xin],
Zhao, W.Z.[Wen-Zhe],
Du, X.Y.[Xuan-Yi],
Zhou, G.Y.[Guang-Yao],
Zhang, S.[Songlin],
Cross-Modal Retrieval and Semantic Refinement for Remote Sensing
Image Captioning,
RS(16), No. 1, 2024, pp. xx-yy.
DOI Link
2401
BibRef
Xu, R.Q.[Rui-Qing],
Mayer, W.[Wolfgang],
Chu, H.L.[Hai-Long],
Zhang, Y.[Yitao],
Zhang, H.Y.[Hong-Yu],
Wang, Y.L.[Yu-Long],
Liu, Y.[Youfa],
Feng, Z.[Zaiwen],
Automatic semantic modeling of structured data sources with
cross-modal retrieval,
PRL(177), 2024, pp. 7-14.
Elsevier DOI
2401
Semantic model, Ontology, Cross-modal retrieval,
Attention mechanism, Graph representation learning
BibRef
Okamura, D.[Daiki],
Harakawa, R.[Ryosuke],
Iwahashi, M.[Masahiro],
LCNME: Label Correction Using Network Prediction Based on
Memorization Effects for Cross-Modal Retrieval With Noisy Labels,
CirSysVideo(34), No. 1, January 2024, pp. 590-602.
IEEE DOI
2401
BibRef
Yang, F.[Fan],
Han, M.[Meng],
Ma, F.M.[Fu-Min],
Liu, Y.F.[Yu-Feng],
Ding, X.J.[Xiao-Jian],
Tong, D.Y.[De-Yu],
Disperse Asymmetric Subspace Relation Hashing for Cross-Modal
Retrieval,
CirSysVideo(34), No. 1, January 2024, pp. 603-617.
IEEE DOI
2401
BibRef
Zhang, G.J.[Gang-Jian],
Li, S.K.[Shi-Kun],
Wei, S.K.[Shi-Kui],
Ge, S.M.[Shi-Ming],
Cai, N.[Na],
Zhao, Y.[Yao],
Multimodal Composition Example Mining for Composed Query Image
Retrieval,
IP(33), 2024, pp. 1149-1161.
IEEE DOI
2402
Image retrieval, Training, Task analysis,
Extraterrestrial measurements, Training data, Force, Semantics,
hard example mining
BibRef
Sun, Y.[Yuan],
Ren, Z.W.[Zhen-Wen],
Hu, P.[Peng],
Peng, D.Z.[De-Zhong],
Wang, X.[Xu],
Hierarchical Consensus Hashing for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 824-836.
IEEE DOI
2402
Codes, Semantics, Hash functions, Correlation, Kernel,
Feature extraction, Eigenvalues and eigenfunctions,
learning to hash
BibRef
Zhang, L.[Lei],
Chen, L.[Leiting],
Zhou, C.[Chuan],
Li, X.[Xin],
Yang, F.[Fan],
Yi, Z.[Zhang],
Weighted Graph-Structured Semantics Constraint Network for
Cross-Modal Retrieval,
MultMed(26), 2024, pp. 1551-1564.
IEEE DOI
2402
Semantics, Training, Feature extraction, Representation learning,
Data models, Correlation, Games, Cross-modal retrieval, graph neural network
BibRef
Wang, Y.B.[Ya-Bing],
Wang, S.H.[Shu-Hui],
Luo, H.[Hao],
Dong, J.F.[Jian-Feng],
Wang, F.[Fan],
Han, M.[Meng],
Wang, X.[Xun],
Wang, M.[Meng],
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal
Retrieval,
IP(33), 2024, pp. 1522-1533.
IEEE DOI
2403
Visualization, Noise measurement, Estimation, Costs, Transportation,
Training, Task analysis, Cross-modal retrieval, machine translation
BibRef
Meng, M.[Min],
Sun, J.X.[Jia-Xuan],
Liu, J.G.[Ji-Gang],
Yu, J.[Jun],
Wu, J.G.[Ji-Gang],
Semantic Disentanglement Adversarial Hashing for Cross-Modal
Retrieval,
CirSysVideo(34), No. 3, March 2024, pp. 1914-1926.
IEEE DOI
2403
Semantics, Representation learning, Task analysis,
Feature extraction, Shape, Robustness,
disentangled representation
BibRef
Zhang, H.[Han],
Li, Y.D.[Yi-Ding],
Li, X.L.[Xue-Long],
Constrained Bipartite Graph Learning for Imbalanced Multi-Modal
Retrieval,
MultMed(26), 2024, pp. 4502-4514.
IEEE DOI
2403
Correlation, Bipartite graph, Semantics, Task analysis, Optimization,
Visualization, Annotations, Constrained bipartite graph, query graph
BibRef
Wang, Z.[Zheng],
Xu, X.[Xing],
Wei, J.[Jiwei],
Xie, N.[Ning],
Yang, Y.[Yang],
Shen, H.T.[Heng Tao],
Semantics Disentangling for Cross-Modal Retrieval,
IP(33), 2024, pp. 2226-2237.
IEEE DOI
2404
Semantics, Correlation, Feature extraction, Representation learning,
Interference, Task analysis, Shape, subspace learning
BibRef
Ma, X.R.[Xin-Ran],
Yang, M.X.[Mou-Xing],
Li, Y.F.[Yun-Fan],
Hu, P.[Peng],
Lv, J.C.[Jian-Cheng],
Peng, X.[Xi],
Cross-Modal Retrieval With Noisy Correspondence via Consistency
Refining and Mining,
IP(33), 2024, pp. 2587-2598.
IEEE DOI Code:
WWW Link.
2404
Noise measurement, Refining, Self-supervised learning,
Task analysis, Robustness, Data mining, Annotations,
graph matching
BibRef
Feng, Y.L.[Yang-Lin],
Zhu, H.Y.[Hong-Yuan],
Peng, D.Z.[De-Zhong],
Peng, X.[Xi],
Hu, P.[Peng],
RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D
Cross-Modal Retrieval,
CVPR23(11610-11619)
IEEE DOI
2309
BibRef
Hu, P.[Peng],
Peng, X.[Xi],
Zhu, H.Y.[Hong-Yuan],
Zhen, L.L.[Liang-Li],
Lin, J.[Jie],
Learning Cross-Modal Retrieval with Noisy Labels,
CVPR21(5399-5409)
IEEE DOI
2111
Costs, Annotations, Interference,
Noise measurement, Labeling
BibRef
Wen, H.[Haokun],
Song, X.[Xuemeng],
Yin, J.H.[Jian-Hua],
Wu, J.L.[Jian-Long],
Guan, W.[Weili],
Nie, L.Q.[Li-Qiang],
Self-Training Boosted Multi-Factor Matching Network for Composed
Image Retrieval,
PAMI(46), No. 5, May 2024, pp. 3665-3678.
IEEE DOI
2404
Iterative methods, Task analysis, Image retrieval, Training,
Benchmark testing, Image color analysis,
multimodal retrieval
BibRef
Ji, Z.[Zhong],
Lin, Z.G.[Zhi-Gang],
Wang, H.R.[Hao-Ran],
Pang, Y.W.[Yan-Wei],
Li, X.L.[Xue-Long],
Multi-task hierarchical convolutional network for visual-semantic
cross-modal retrieval,
PR(151), 2024, pp. 110398.
Elsevier DOI
2404
Vision and language, Cross-modal retrieval,
Multi-task learning, Metric learning
BibRef
Hu, Z.K.[Zhi-Kai],
Cheung, Y.M.[Yiu-Ming],
Li, M.K.[Meng-Ke],
Lan, W.C.[Wei-Chao],
Zhang, D.L.[Dong-Lin],
Liu, Q.[Qiang],
Joint Semantic Preserving Sparse Hashing for Cross-Modal Retrieval,
CirSysVideo(34), No. 4, April 2024, pp. 2989-3002.
IEEE DOI
2404
Codes, Semantics, Sparse matrices, Hash functions, Encoding,
Task analysis, Quantization (signal), Cross-modal retrieval,
discrete optimization
BibRef
Qin, Q.B.[Qi-Bing],
Huo, Y.D.[Ya-Dong],
Huang, L.[Lei],
Dai, J.Y.[Jiang-Yan],
Zhang, H.H.[Hui-Hui],
Zhang, W.F.[Wen-Feng],
Deep Neighborhood-Preserving Hashing With Quadratic Spherical Mutual
Information for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 6361-6374.
IEEE DOI
2404
Semantics, Mutual information, Transformers, Feature extraction,
Clamps, Binary codes, Tuning, Cross-modal retrieval, deep hashing,
transformer encoder
BibRef
Liang, X.[Xiao],
Yang, E.[Erkun],
Yang, Y.H.[Yan-Hua],
Deng, C.[Cheng],
Multi-Relational Deep Hashing for Cross-Modal Search,
IP(33), 2024, pp. 3009-3020.
IEEE DOI
2405
Codes, Semantics, Loss measurement, Training, Hash functions,
Data models, Correlation, Cross-modal retrieval, metric learning
BibRef
Pang, S.[Shanmin],
Zeng, Y.[Yueyang],
Zhao, J.W.[Jia-Wei],
Xue, J.R.[Jian-Ru],
A Mutually Textual and Visual Refinement Network for Image-Text
Matching,
MultMed(26), 2024, pp. 7555-7566.
IEEE DOI
2405
Semantics, Visualization, Vectors, Cameras, Image segmentation,
Feature extraction, Image coding, Cross-modal retrieval,
semantic alignment enhancement
BibRef
Teng, S.H.[Shao-Hua],
Li, J.B.[Jiang-Bo],
Teng, L.[Luyao],
Fei, L.[Lunke],
Wu, N.Q.[Nai-Qi],
Zhang, W.[Wei],
Scalable Discrete and Asymmetric Unequal Length Hashing Learning for
Cross-Modal Retrieval,
MultMed(26), 2024, pp. 7917-7932.
IEEE DOI
2405
Codes, Semantics, Encoding, Optimization, Linear matrix inequalities,
Costs, Hash functions, Unequal length encoding,
dual semantic embedding learning
BibRef
Yang, D.K.[Ding-Kang],
Kuang, H.P.[Hao-Peng],
Yang, K.[Kun],
Li, M.C.[Ming-Cheng],
Zhang, L.H.[Li-Hua],
Towards Asynchronous Multimodal Signal Interaction and Fusion via
Tailored Transformers,
SPLetters(31), 2024, pp. 1550-1554.
IEEE DOI
2406
Transformers, Matrix decomposition, Kernel, Complexity theory,
Benchmark testing, Visualization, Feature extraction, sentiment analysis
BibRef
Wang, Y.X.[Yong-Xin],
Zhan, Y.W.[Yu-Wei],
Chen, Z.D.[Zhen-Duo],
Luo, X.[Xin],
Xu, X.S.[Xin-Shun],
Multiple Information Embedded Hashing for Large-Scale Cross-Modal
Retrieval,
CirSysVideo(34), No. 6, June 2024, pp. 5118-5131.
IEEE DOI Code:
WWW Link.
2406
Codes, Semantics, Hash functions, Optimization, Noise measurement,
Data mining, Linear regression, Cross-modal retrieval, hashing, robustness
BibRef
Hou, Y.L.[Yi-Lin],
Zhong, X.J.[Xian-Jing],
Cao, H.[Hui],
Zhu, Z.[Zheng],
Zhou, Y.F.[Yun-Feng],
Zhang, J.[Jie],
A shared-private sentiment analysis approach based on cross-modal
information interaction,
PRL(183), 2024, pp. 140-146.
Elsevier DOI
2406
Sentiment analysis, Multimodal data, Improved transformer,
Self-attention mechanism, Multi-head attention
BibRef
Chen, S.W.[Shao-Wei],
Liu, S.[Shuaipeng],
Liu, J.[Jie],
Type-Specific Modality Alignment for Multi-Modal Information
Extraction,
SPLetters(31), 2024, pp. 1525-1529.
IEEE DOI
2406
Visualization, Semantics, Task analysis, Information retrieval,
Training, Measurement, Image coding,
global modality integration
BibRef
Zheng, Z.Q.[Zi-Qiang],
Ren, H.[Hao],
Wu, Y.[Yang],
Zhang, W.C.[Wei-Chuan],
Lu, H.[Hong],
Yang, Y.[Yang],
Shen, H.T.[Heng Tao],
Fully Unsupervised Domain-Agnostic Image Retrieval,
CirSysVideo(34), No. 6, June 2024, pp. 5077-5090.
IEEE DOI
2406
Image retrieval, Task analysis, Training, Feature extraction,
Annotations, Visualization, Data models, domain adaptation
BibRef
Zhang, J.Z.[Jin-Zhi],
Wang, L.[Luyao],
Zheng, F.Z.[Fu-Zhong],
Wang, X.[Xu],
Zhang, H.[Haisu],
An Enhanced Feature Extraction Framework for Cross-Modal Image-Text
Retrieval,
RS(16), No. 12, 2024, pp. 2201.
DOI Link
2406
BibRef
Cheng, Q.R.[Qing-Rong],
Tan, Z.S.[Zhen-Shan],
Wen, K.Y.[Ke-Yu],
Chen, C.[Cheng],
Gu, X.D.[Xiao-Dong],
Semantic Pre-Alignment and Ranking Learning With Unified Framework
for Cross-Modal Retrieval,
CirSysVideo(34), No. 7, July 2024, pp. 6503-6516.
IEEE DOI
2407
Semantics, Visualization, Optimization, Feature extraction,
Uniform resource locators, Task analysis, Correlation, Retrieval,
average precision
BibRef
Kang, X.[Xiao],
Liu, X.B.[Xing-Bo],
Zhang, X.N.[Xue-Ning],
Nie, X.S.[Xiu-Shan],
Yin, Y.L.[Yi-Long],
Online Discriminative Cross-Modal Hashing,
CirSysVideo(34), No. 7, July 2024, pp. 5242-5254.
IEEE DOI
2407
Codes, Semantics, Training, Hash functions, Data models,
Weight measurement, Correlation, Cross-modal retrieval,
adaptive bit-wise weighting
BibRef
Zhang, X.N.[Xue-Ning],
Liu, X.B.[Xing-Bo],
Nie, X.[Xiushan],
Kang, X.[Xiao],
Yin, Y.L.[Yi-Long],
Semi-Supervised Semi-Paired Cross-Modal Hashing,
CirSysVideo(34), No. 7, July 2024, pp. 6517-6529.
IEEE DOI
2407
Semantics, Correlation, Codes, Labeling, Hash functions, Costs,
Training data, Cross-modal retrieval, semi-supervised learning,
label-enhanced strategy
BibRef
Li, J.X.[Jia-Xing],
Wong, W.K.[Wai Keung],
Jiang, L.[Lin],
Fang, X.Z.[Xiao-Zhao],
Xie, S.L.[Sheng-Li],
Xu, Y.[Yong],
CKDH: CLIP-Based Knowledge Distillation Hashing for Cross-Modal
Retrieval,
CirSysVideo(34), No. 7, July 2024, pp. 6530-6541.
IEEE DOI
2407
Feature extraction, Codes, Training, Semantics, Data models,
Data mining, Cross-modal retrieval, deep hashing
BibRef
Yong, K.L.[Kai-Ling],
Shu, Z.Q.[Zhen-Qiu],
Wang, H.B.[Hong-Bin],
Yu, Z.T.[Zheng-Tao],
Two-stage zero-shot sparse hashing with missing labels for
cross-modal retrieval,
PR(155), 2024, pp. 110717.
Elsevier DOI Code:
WWW Link.
2408
Missing labels, Zero-shot sparse hashing, Cross-modal retrieval,
Joint semantic similarity, Clustering-wise similarity
BibRef
Xue, P.[Peng],
Niu, S.[Sijie],
A novel active contour model based on features for image segmentation,
PR(155), 2024, pp. 110673.
Elsevier DOI Code:
WWW Link.
2408
Active contour model, Energy functional,
Feature energy function, Complex natural image
BibRef
Mao, Y.Q.[Yi-Qiao],
Yan, X.Q.[Xiao-Qiang],
Hu, S.Z.[Shi-Zhe],
Ye, Y.D.[Yang-Dong],
Contrastive cross-modal clustering with twin network,
PR(155), 2024, pp. 110645.
Elsevier DOI
2408
Cross-modal clustering, Correlation information,
Contrastive learning, Twin network
BibRef
Yan, J.[Jiexi],
Deng, C.[Cheng],
Huang, H.[Heng],
Liu, W.[Wei],
Causality-Invariant Interactive Mining for Cross-Modal Similarity
Learning,
PAMI(46), No. 9, September 2024, pp. 6216-6230.
IEEE DOI
2408
Data mining, Correlation, Semantics, Task analysis,
Extraterrestrial measurements, Training, Image retrieval,
similarity learning
BibRef
Wang, J.P.[Jin-Peng],
Zeng, Z.Y.[Zi-Yun],
Chen, B.[Bin],
Wang, Y.T.[Yu-Ting],
Liao, D.L.[Dong-Liang],
Li, G.F.[Gong-Fu],
Wang, Y.R.[Yi-Ru],
Xia, S.T.[Shu-Tao],
Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with
Multi-granularity Aligned Transformers,
IJCV(132), No. 8, August 2024, pp. 2765-2797.
Springer DOI
2408
BibRef
Kang, X.[Xiao],
Liu, X.[Xingbo],
Xue, W.[Wen],
Zhang, X.[Xuening],
Nie, X.[Xiushan],
Yin, Y.L.[Yi-Long],
Discrete online cross-modal hashing with consistency preservation,
PR(155), 2024, pp. 110688.
Elsevier DOI
2408
Cross-modal retrieval, Supervised online hashing,
Continuous semantic embedding, Modality deviation calibration
BibRef
Tu, J.F.[Jun-Feng],
Liu, X.L.[Xue-Liang],
Hao, Y.B.[Yan-Bin],
Hong, R.C.[Ri-Chang],
Wang, M.[Meng],
Two-Step Discrete Hashing for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 8730-8741.
IEEE DOI
2408
Codes, Feature extraction, Quantization (signal), Transformers,
Semantics, Transforms, Binary codes, Cross-modal hashing, hashing
BibRef
Cai, H.M.[Hong-Min],
Zhang, B.[Bin],
Li, J.Y.[Jun-Yu],
Hu, B.[Bin],
Chen, J.Z.[Jia-Zhou],
Unsupervised Dual Hashing Coding (UDC) on Semantic Tagging and Sample
Content for Cross-Modal Retrieval,
MultMed(26), 2024, pp. 9109-9120.
IEEE DOI
2409
Codes, Semantics, Tail, Encoding, Transforms, Tagging,
Matrix decomposition, Cross-modal retrieval, multimodal
BibRef
Wu, W.J.[Wei-Jia],
Zhao, Y.Z.[Yu-Zhong],
Li, Z.[Zhuang],
Li, J.H.[Jia-Hong],
Zhou, H.[Hong],
Shou, M.Z.[Mike Zheng],
Bai, X.[Xiang],
A large cross-modal video retrieval dataset with reading
comprehension,
PR(157), 2025, pp. 110818.
Elsevier DOI
2409
Cross-modal, Retrieval, Text reading, Contrastive learning
BibRef
Jiang, L.[Lin],
Wu, J.G.[Ji-Gang],
Zhao, S.P.[Shu-Ping],
Li, J.X.[Jia-Xing],
Coding self-representative and label-relaxed hashing for cross-modal
retrieval,
PRL(185), 2024, pp. 264-270.
Elsevier DOI
2410
Cross-modal retrieval, Hashing learning,
Label-relaxed regression, Similarity preservation
BibRef
Yuan, Z.[Zhe],
Wu, D.[Dan],
Zhou, L.[Liang],
Achieving the Optimum Rate for Cross-Modal Source Coding,
MultMed(26), 2024, pp. 9722-9735.
IEEE DOI
2410
Semantics, Source coding, Haptic interfaces, Reliability, Streams,
Redundancy, Decoding, Cross-modal, source coding, semantic relevance,
video and haptic coding
BibRef
Chen, R.[Ruihan],
Tan, J.P.[Jun-Peng],
Yang, Z.J.[Zhi-Jing],
Yang, X.J.[Xiao-Jun],
Dai, Q.Y.[Qing-Yun],
Cheng, Y.Q.[Yong-Qiang],
Lin, L.[Liang],
DPHANet: Discriminative Parallel and Hierarchical Attention Network
for Natural Language Video Localization,
MultMed(26), 2024, pp. 9575-9590.
IEEE DOI
2410
Location awareness, Semantics, TV, Natural languages, Correlation,
Glass, Cross-modal retrieval,
video understanding
BibRef
Zheng, A.[Aihua],
Yuan, F.[Fan],
Zhang, H.[Haichuan],
Wang, J.X.[Jia-Xiang],
Tang, C.[Chao],
Li, C.L.[Cheng-Long],
Public-Private Attributes-Based Variational Adversarial Network for
Audio-Visual Cross-Modal Matching,
CirSysVideo(34), No. 9, September 2024, pp. 8698-8709.
IEEE DOI
2410
Visualization, Semantics, Feature extraction, Face recognition,
Adversarial machine learning, Task analysis, Decoding, metric learning
BibRef
Li, D.[Dongyue],
Du, S.[Songlin],
ContextMatcher: Detector-Free Feature Matching With Cross-Modality
Context,
CirSysVideo(34), No. 9, September 2024, pp. 7922-7934.
IEEE DOI
2410
Feature extraction, Transformers, Visualization, Task analysis,
Detectors, Correlation, Reliability, Local feature matching,
neighborhood consensus
BibRef
Li, F.L.[Feng-Ling],
Wang, B.[Bowen],
Zhu, L.[Lei],
Li, J.J.[Jing-Jing],
Zhang, Z.[Zheng],
Chang, X.J.[Xiao-Jun],
Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval,
CirSysVideo(34), No. 10, October 2024, pp. 9664-9677.
IEEE DOI Code:
WWW Link.
2411
Semantics, Correlation, Training, Adaptation models, Codes,
Circuits and systems, Optimization, Cross-modal hashing,
weakly-supervised
BibRef
Zhang, F.[Fan],
Zhou, H.[Hang],
Hua, X.S.[Xian-Sheng],
Chen, C.[Chong],
Luo, X.[Xiao],
HOPE: A Hierarchical Perspective for Semi-Supervised 2D-3D
Cross-Modal Retrieval,
PAMI(46), No. 12, December 2024, pp. 8976-8993.
IEEE DOI
2411
Semantics, Neural networks, Optimization, Semisupervised learning,
Feature extraction, Solid modeling, 3D multimedia,
semi-supervised learning
BibRef
Zhu, Y.[Ye],
Wu, Y.[Yu],
Sebe, N.[Nicu],
Yan, Y.[Yan],
Vision + X: A Survey on Multimodal Learning in the Light of Data,
PAMI(46), No. 12, December 2024, pp. 9102-9122.
IEEE DOI
2411
Visualization, Task analysis, Music, Feature extraction, Surveys,
Representation learning, Multimodal representation learning
BibRef
Xu, H.R.[Hao-Ran],
Peng, P.X.[Pei-Xi],
Tan, G.[Guang],
Li, Y.[Yuan],
Xu, X.H.[Xin-Hai],
Tian, Y.H.[Yong-Hong],
DMR: Decomposed Multi-Modality Representations for Frames and Events
Fusion in Visual Reinforcement Learning,
CVPR24(26498-26508)
IEEE DOI
2410
Visualization, Noise, Reinforcement learning, Vision sensors,
Feature extraction, Data mining, Multi-Modality, DVS, Representation Learning
BibRef
You, C.Y.[Chen-Yu],
Mint, Y.F.[Yi-Fei],
Dai, W.C.[Wei-Cheng],
Sekhon, J.S.[Jasjeet S.],
Staib, L.[Lawrence],
Duncan, J.S.[James S.],
Calibrating Multi-modal Representations:
A Pursuit of Group Robustness without Annotations,
CVPR24(26140-26150)
IEEE DOI
2410
Visualization, Annotations, Computational modeling, Refining,
Training data, Contrastive learning, Benchmark testing,
BibRef
Zhang, Z.H.[Zhi-Hao],
Cao, S.C.[Sheng-Cao],
Wang, Y.X.[Yu-Xiong],
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding,
CVPR24(21413-21423)
IEEE DOI Code:
WWW Link.
2410
Representation learning, Visualization, Solid modeling, Accuracy,
Shape, 3D vision, multi-modal learning, 3D shape classification
BibRef
Zhao, Z.[Zihua],
Chen, M.X.[Meng-Xi],
Dai, T.J.[Tian-Jie],
Yao, J.C.[Jiang-Chao],
Han, B.[Bo],
Zhang, Y.[Ya],
Wang, Y.F.[Yan-Feng],
Mitigating Noisy Correspondence by Geometrical Structure Consistency
Learning,
CVPR24(27371-27380)
IEEE DOI Code:
WWW Link.
2410
Accuracy, Filtering, Source coding, Benchmark testing, Robustness,
Multi-modal learning, Noisy correspondence
BibRef
Tuzcuoglu, Ö.[Önder],
Köksal, A.[Aybora],
Sofu, B.[Bugra],
Kalkan, S.[Sinan],
Alatan, A.A.[A. Aydin],
XoFTR: Cross-modal Feature Matching Transformer,
IMW24(4275-4286)
IEEE DOI Code:
WWW Link.
2410
Learning systems, Image matching, Pipelines, Lighting,
Benchmark testing, Transformers, Image augmentation, thermal infrared
BibRef
Wu, J.L.[Jia-Lin],
Hu, X.[Xia],
Wang, Y.Q.[Ya-Qing],
Pang, B.[Bo],
Soricut, R.[Radu],
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture
of Low-Rank Experts,
CVPR24(14205-14215)
IEEE DOI
2410
Degradation, Training, Costs, Computational modeling,
Computer architecture, Boosting, MoE, LoRA, generalist model, multimodal
BibRef
Sun, Q.[Quan],
Cui, Y.F.[Yu-Feng],
Zhang, X.S.[Xiao-Song],
Zhang, F.[Fan],
Yu, Q.[Qiying],
Wang, Y.[Yueze],
Rao, Y.M.[Yong-Ming],
Liu, J.J.[Jing-Jing],
Huang, T.J.[Tie-Jun],
Wang, X.L.[Xin-Long],
Generative Multimodal Models are In-Context Learners,
CVPR24(14398-14409)
IEEE DOI
2410
Visualization, Adaptation models, Codes, Reviews,
Computational modeling, Benchmark testing
BibRef
Zhao, S.T.[Shi-Tian],
Li, Z.W.[Zhuo-Wan],
Lu, Y.D.[Ya-Dong],
Yuille, A.L.[Alan L.],
Wang, Y.[Yan],
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting
Multi-Modal Language Models,
CVPR24(13342-13351)
IEEE DOI
2410
Visualization, Cause effect analysis, Benchmark testing,
Information filters, Boosting,
Causality
BibRef
Li, Z.[Zhang],
Yang, B.[Biao],
Liu, Q.[Qiang],
Ma, Z.Y.[Zhi-Yin],
Zhang, S.[Shuo],
Yang, J.X.[Jing-Xu],
Sun, Y.[Yabo],
Liu, Y.L.[Yu-Liang],
Bai, X.[Xiang],
Monkey: Image Resolution and Text Label are Important Things for
Large Multi-Modal Models,
CVPR24(26753-26763)
IEEE DOI Code:
WWW Link.
2410
Training, Visualization, Image resolution, Codes,
Computational modeling, Benchmark testing, Large Multimodal Model
BibRef
Han, H.C.[Hao-Chen],
Zheng, Q.H.[Qing-Hua],
Dai, G.[Guang],
Luo, M.[Minnan],
Wang, J.D.[Jing-Dong],
Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval,
CVPR24(26669-26678)
IEEE DOI Code:
WWW Link.
2410
Training, Codes, Computational modeling, Semantics, Excavation, Cost function,
Cross-modal retrieval, Optimal transport, noisy correspondence learning
BibRef
Yuan, J.L.[Jia-Lin],
Yu, Y.[Ye],
Mittal, G.[Gaurav],
Hall, M.[Matthew],
Sajeev, S.[Sandra],
Chen, M.[Mei],
Rethinking Multimodal Content Moderation from an Asymmetric Angle
with Mixed-modality,
WACV24(8517-8527)
IEEE DOI
2404
Art, Fuses, Social networking (online), Semantics,
Computer architecture, Benchmark testing, Applications,
Vision + language and/or other modalities
BibRef
Liu, Z.Y.[Zhe-Yuan],
Sun, W.X.[Wei-Xuan],
Hong, Y.C.[Yi-Cong],
Teney, D.[Damien],
Gould, S.[Stephen],
Bi-directional Training for Composed Image Retrieval via Text Prompt
Learning,
WACV24(5741-5750)
IEEE DOI Code:
WWW Link.
2404
Training, Costs, Computational modeling, Image retrieval, Semantics,
Bidirectional control, Algorithms, Vision + language and/or other modalities
BibRef
Shoshan, A.[Alon],
Linial, O.[Ori],
Bhonker, N.[Nadav],
Hirsch, E.[Elad],
Zamir, L.[Lior],
Kviatkovsky, I.[Igor],
Medioni, G.[Gérard],
Asymmetric Image Retrieval with Cross Model Compatible Ensembles,
WACV24(1-11)
IEEE DOI
2404
Training, Uncertainty, Computational modeling, Face recognition,
Image retrieval, Diversity reception, Algorithms, body pose
BibRef
Hönig, R.[Robert],
Ackermann, J.[Jan],
Chi, M.Y.[Ming-Yuan],
Bi-Encoder Cascades for Efficient Image Search,
REDLCV23(1350-1355)
IEEE DOI
2401
BibRef
Cao, Y.C.[Yi-Chao],
Tang, Q.[Qingfei],
Yang, F.[Feng],
Su, X.[Xiu],
You, S.[Shan],
Lu, X.B.[Xiao-Bo],
Xu, C.[Chang],
Re-mine, Learn and Reason: Exploring the Cross-modal Semantic
Correlations for Language-guided HOI detection,
ICCV23(23435-23446)
IEEE DOI
2401
BibRef
Trinci, T.[Tomaso],
Bianconcini, T.[Tommaso],
Sarti, L.[Leonardo],
Taccari, L.[Leonardo],
Sambo, F.[Francesco],
Cross-model temporal cooperation via saliency maps for efficient
frame classification,
REDLCV23(1156-1160)
IEEE DOI
2401
BibRef
Long, T.[Teng],
van Noord, N.[Nanne],
Cross-modal Scalable Hyperbolic Hierarchical Clustering,
ICCV23(16609-16618)
IEEE DOI
2401
BibRef
Li, H.[Hong],
Li, X.Y.[Xing-Yu],
Hu, P.[Pengbo],
Lei, Y.[Yinuo],
Li, C.X.[Chun-Xiao],
Zhou, Y.[Yi],
Boosting Multi-modal Model Performance with Adaptive Gradient
Modulation,
ICCV23(22157-22167)
IEEE DOI Code:
WWW Link.
2401
BibRef
Li, W.[Wenyun],
Pun, C.M.[Chi-Man],
Asymmetric Scalable Cross-Modal Hashing,
ICIP23(316-320)
IEEE DOI
2312
BibRef
Zhao, L.J.[Long-Jiao],
Wang, Y.[Yu],
Kato, J.[Jien],
Using Classifier Discrepancy for Cross-Domain Image Retrieval,
ICIP23(3314-3318)
IEEE DOI
2312
BibRef
Era, Y.[Yuki],
Togo, R.[Ren],
Maeda, K.[Keisuke],
Ogawa, T.[Takahiro],
Haseyama, M.[Miki],
Video-Music Retrieval with Fine-Grained Cross-Modal Alignment,
ICIP23(2005-2009)
IEEE DOI
2312
BibRef
Yu, Y.[Youngjae],
Chung, J.[Jiwan],
Yun, H.[Heeseung],
Hessel, J.[Jack],
Park, J.S.[Jae Sung],
Lu, X.[Ximing],
Zellers, R.[Rowan],
Ammanabrolu, P.[Prithviraj],
Le Bras, R.[Ronan],
Kim, G.[Gunhee],
Choi, Y.[Yejin],
Fusing Pre-Trained Language Models with Multimodal Prompts through
Reinforcement Learning,
CVPR23(10845-10856)
IEEE DOI
2309
BibRef
Huang, S.[Siteng],
Gong, B.[Biao],
Pan, Y.L.[Yu-Lin],
Jiang, J.W.[Jian-Wen],
Lv, Y.L.[Yi-Liang],
Li, Y.Y.[Yu-Yuan],
Wang, D.L.[Dong-Lin],
VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval,
CVPR23(6565-6574)
IEEE DOI
2309
BibRef
Chen, M.X.[Meng-Xi],
Xing, L.Y.[Lin-Yu],
Wang, Y.[Yu],
Zhang, X.[Xa],
Enhanced Multimodal Representation Learning with Cross-Modal KD,
CVPR23(11766-11775)
IEEE DOI
2309
BibRef
Yang, S.[Shuo],
Xu, Z.[Zhaopan],
Wang, K.[Kai],
You, Y.[Yang],
Yao, H.X.[Hong-Xun],
Liu, T.L.[Tong-Liang],
Xu, M.[Min],
BiCro: Noisy Correspondence Rectification for Multi-modality Data via
Bi-directional Cross-modal Similarity Consistency,
CVPR23(19883-19892)
IEEE DOI
2309
BibRef
Kim, D.[Dongwon],
Kim, N.[Namyup],
Kwak, S.[Suha],
Improving Cross-Modal Retrieval with Set of Diverse Embeddings,
CVPR23(23422-23431)
IEEE DOI
2309
BibRef
Kim, J.M.[Jae Myung],
Koepke, A.S.[A. Sophia],
Schmid, C.[Cordelia],
Akata, Z.[Zeynep],
Exposing and Mitigating Spurious Correlations for Cross-Modal
Retrieval,
MULA23(2585-2595)
IEEE DOI
2309
BibRef
Tran, V.[Vinh],
Balasubramanian, N.[Niranjan],
Hoai, M.[Minh],
From Within to Between: Knowledge Distillation for Cross Modality
Retrieval,
ACCV22(IV:605-622).
Springer DOI
2307
BibRef
Zhao, Y.[Yang],
Zhu, Y.Z.[Ya-Zhou],
Liao, S.B.[Sheng-Bin],
Ye, Q.L.[Qiao-Lin],
Zhang, H.F.[Hao-Feng],
Class Concentration with Twin Variational Autoencoders for Unsupervised
Cross-modal Hashing,
ACCV22(VI:235-251).
Springer DOI
2307
BibRef
Fragomeni, A.[Adriano],
Wray, M.[Michael],
Damen, D.[Dima],
Contra: (con)text (tra)nsformer for Cross-modal Video Retrieval,
ACCV22(IV:451-468).
Springer DOI
2307
BibRef
Zheng, Y.C.[Yuan-Chao],
Zhang, X.W.[Xiao-Wei],
Heterogeneous Interactive Learning Network for Unsupervised Cross-modal
Retrieval,
ACCV22(IV:692-707).
Springer DOI
2307
BibRef
Zhao, Y.[Yang],
Yu, J.G.[Jia-Guo],
Liao, S.[Shengbin],
Zhang, Z.[Zheng],
Zhang, H.F.[Hao-Feng],
From Sparse to Dense: Semantic Graph Evolutionary Hashing for
Unsupervised Cross-Modal Retrieval,
ACCV22(IV:521-536).
Springer DOI
2307
BibRef
Arnold, R.[Rahel],
Sauter, L.[Loris],
Schuldt, H.[Heiko],
Free-Form Multi-Modal Multimedia Retrieval (4MR),
MMMod23(I: 678-683).
Springer DOI
2304
BibRef
Xuan, H.[Hong],
Chen, X.S.[Xi Stephen],
Dissecting Deep Metric Learning Losses for Image-Text Retrieval,
WACV23(2163-2172)
IEEE DOI
2302
Measurement, Training, Analytical models, Semantics,
Space exploration, Task analysis, visual reasoning
BibRef
Ge, X.[Xuri],
Chen, F.[Fuhai],
Xu, S.[Songpei],
Tao, F.[Fuxiang],
Jose, J.M.[Joemon M.],
Cross-modal Semantic Enhanced Interaction for Image-Sentence
Retrieval,
WACV23(1022-1031)
IEEE DOI
2302
Measurement, Representation learning, Visualization, Correlation,
Computational modeling, Semantics,
Algorithms: Vision + language and/or other modalities
BibRef
Jawade, B.[Bhavin],
Mohan, D.D.[Deen Dayal],
Ali, N.M.[Naji Mohamed],
Setlur, S.[Srirangaraj],
Govindaraju, V.[Venu],
NAPReg: Nouns As Proxies Regularization for Semantically Aware
Cross-Modal Embeddings,
WACV23(1135-1144)
IEEE DOI
2302
Training, Measurement, Visualization, Codes, Databases, Semantics,
Algorithms: Vision + language and/or other modalities
BibRef
Nakatsuka, T.[Takayuki],
Hamasaki, M.[Masahiro],
Goto, M.[Masataka],
Content-Based Music-Image Retrieval Using Self- and Cross-Modal
Feature Embedding Memory,
WACV23(2173-2183)
IEEE DOI
2302
Training, Measurement, Art, Multiple signal classification,
Task analysis
BibRef
Chen, Y.X.[Yu-Xiao],
Yuan, J.B.[Jian-Bo],
Zhao, L.[Long],
Chen, T.L.[Tian-Lang],
Luo, R.[Rui],
Davis, L.[Larry],
Metaxas, D.N.[Dimitris N.],
More Than Just Attention: Improving Cross-Modal Attentions with
Contrastive Constraints for Image-Text Matching,
WACV23(4421-4429)
IEEE DOI
2302
Training, Measurement, Visualization, Annotations,
Computational modeling,
Algorithms: Vision + language and/or other modalities
BibRef
Agarwal, A.[Aishwarya],
Karanam, S.[Srikrishna],
Srinivasan, B.V.[Balaji Vasan],
Banerjee, B.[Biplab],
Contrastive Learning of Semantic Concepts for Open-set Cross-domain
Retrieval,
WACV23(4104-4113)
IEEE DOI
2302
Training, Technological innovation, Semantics, Natural languages,
Image retrieval, Feature extraction
BibRef
Yang, Y.[Yulou],
Shen, H.[Hao],
Yang, M.[Ming],
Relation-Guided Network for Image-Text Retrieval,
ICIP22(1856-1860)
IEEE DOI
2211
Transformers, Feature extraction, Cognition, Data mining,
Image-text retrieval, asymmetric structure, relation-guided
BibRef
Sumbul, G.[Gencer],
Müller, M.[Markus],
Demir, B.[Begüm],
A Novel Self-Supervised Cross-Modal Image Retrieval Method in Remote
Sensing,
ICIP22(2426-2430)
IEEE DOI
2211
Training, Codes, Image retrieval, Search problems, Sensors,
Reliability, Cross-modal image retrieval, deep learning, remote sensing
BibRef
Wang, H.[Hu],
Zhang, J.P.[Jian-Peng],
Chen, Y.H.[Yuan-Hong],
Ma, C.B.[Cong-Bo],
Avery, J.[Jodie],
Hull, L.[Louise],
Carneiro, G.[Gustavo],
Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network
Prediction,
ECCV22(XXXVII:200-217).
Springer DOI
2211
BibRef
de Almeida, L.B.[Lucas Barbosa],
Valem, L.P.[Lucas Pascotti],
Pedronette, D.C.G.[Daniel Carlos Guimarães],
Graph Convolutional Networks and Manifold Ranking for Multimodal
Video Retrieval,
ICIP22(2811-2815)
IEEE DOI
2211
Training, Manifolds, Deep learning, Transfer learning,
Feature extraction, Content-based retrieval, Manifold learning,
rank aggregation
BibRef
Liang, T.[Tao],
Lin, G.S.[Guo-Sheng],
Wan, M.Y.[Ming-Yang],
Li, T.R.[Tian-Rui],
Ma, G.J.[Guo-Jun],
Lv, F.M.[Feng-Mao],
Expanding Large Pre-trained Unimodal Models with Multimodal
Information Injection for Image-Text Multimodal Classification,
CVPR22(15471-15480)
IEEE DOI
2210
Deep learning, Visualization, Image recognition, Correlation,
Bit error rate, Vision+language
BibRef
Yang, J.H.[Jin-Hui],
Chen, X.Y.[Xian-Yu],
Jiang, M.[Ming],
Chen, S.[Shi],
Wang, L.[Louis],
Zhao, Q.[Qi],
VisualHow: Multimodal Problem Solving,
CVPR22(15606-15616)
IEEE DOI
2210
Training, Visualization, Technological innovation, Annotations,
Natural language processing,
Datasets and evaluation
BibRef
Girdhar, R.[Rohit],
Singh, M.[Mannat],
Ravi, N.[Nikhila],
van der Maaten, L.[Laurens],
Joulin, A.[Armand],
Misra, I.[Ishan],
Omnivore: A Single Model for Many Visual Modalities,
CVPR22(16081-16091)
IEEE DOI
2210
Visualization, Solid modeling, Computational modeling,
Transformers, Data models, Action and event recognition
BibRef
Ma, M.M.[Meng-Meng],
Ren, J.[Jian],
Zhao, L.[Long],
Testuggine, D.[Davide],
Peng, X.[Xi],
Are Multimodal Transformers Robust to Missing Modality?,
CVPR22(18156-18165)
IEEE DOI
2210
Training, Benchmark testing, Transformers, Multitasking,
Search problems, Data models, Vision+language, Machine learning
BibRef
Han, Z.B.[Zong-Bo],
Yang, F.[Fan],
Huang, J.Z.[Jun-Zhou],
Zhang, C.Q.[Chang-Qing],
Yao, J.H.[Jian-Hua],
Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal
Classification,
CVPR22(20675-20685)
IEEE DOI
2210
Heuristic algorithms, Estimation, Classification algorithms,
Medical diagnosis, Machine learning
BibRef
Gupta, V.[Vikram],
Mittal, T.[Trisha],
Mathur, P.[Puneet],
Mishra, V.[Vaibhav],
Maheshwari, M.[Mayank],
Bera, A.[Aniket],
Mukherjee, D.[Debdoot],
Manocha, D.[Dinesh],
3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social
Media Short Videos,
CVPR22(21032-21043)
IEEE DOI
2210
Social networking (online), Semantics, Media,
Task analysis, Datasets and evaluation,
Video analysis and understanding
BibRef
Bogolin, S.V.[Simion-Vlad],
Croitoru, I.[Ioana],
Jin, H.L.[Hai-Lin],
Liu, Y.[Yang],
Albanie, S.[Samuel],
Cross Modal Retrieval with Querybank Normalisation,
CVPR22(5184-5195)
IEEE DOI
2210
Training, Codes, Computational modeling,
Benchmark testing, Vision + language, retrieval
BibRef
Yang, E.[Erkun],
Yao, D.R.[Dong-Ren],
Liu, T.L.[Tong-Liang],
Deng, C.[Cheng],
Mutual Quantization for Cross-Modal Search with Noisy Labels,
CVPR22(7541-7550)
IEEE DOI
2210
Training, Representation learning, Quantization (signal), Codes,
Training data, Benchmark testing, Recognition: detection,
Representation learning
BibRef
Neculai, A.[Andrei],
Chen, Y.B.[Yan-Bei],
Akata, Z.[Zeynep],
Probabilistic Compositional Embeddings for Multimodal Image Retrieval,
MULA22(4546-4556)
IEEE DOI
2210
Visualization, Codes, Computational modeling,
Image retrieval, Semantics
BibRef
Couairon, G.[Guillaume],
Douze, M.[Matthijs],
Cord, M.[Matthieu],
Schwenk, H.[Holger],
Embedding Arithmetic of Multimodal Queries for Image Retrieval,
ODRUM22(4946-4954)
IEEE DOI
2210
Conferences, Semantics, Image retrieval, Lasers, Transforms,
Image representation
BibRef
Sun, C.C.[Chang-Chang],
Latapie, H.[Hugo],
Liu, G.[Gaowen],
Yan, Y.[Yan],
Deep Normalized Cross-Modal Hashing with Bi-Direction Relation
Reasoning,
ODRUM22(4937-4945)
IEEE DOI
2210
Codes, Computational modeling, Semantics,
Bidirectional control, Benchmark testing
BibRef
Li, Y.H.[Yi-Hao],
Yu, J.[Jun],
Cai, Z.[Zhongpeng],
Pan, Y.[Yuwen],
Cross-modal Target Retrieval for Tracking by Natural Language,
ODRUM22(4927-4936)
IEEE DOI
2210
Visualization, Target tracking, Natural languages, Semantics,
Switches, Benchmark testing
BibRef
Thomas, C.[Christopher],
Kovashka, A.[Adriana],
Emphasizing Complementary Samples for Non-literal Cross-modal
Retrieval,
MULA22(4631-4640)
IEEE DOI
2210
Spatial diversity, Semantics, Channel estimation,
Performance gain, Benchmark testing
BibRef
Xu, B.[Bocheng],
Xiong, Y.H.[Yi-Hua],
Zhang, R.[Rui],
Feng, Y.[Yanyi],
Wu, H.F.[Hai-Feng],
Natural Language-Based Vehicle Retrieval with Explicit Cross-Modal
Representation Learning,
AICity22(3141-3148)
IEEE DOI
2210
Representation learning, Visualization, Semantics, Urban areas,
Feature extraction, Robustness
BibRef
Shvetsova, N.[Nina],
Chen, B.[Brian],
Rouditchenko, A.[Andrew],
Thomas, S.[Samuel],
Kingsbury, B.[Brian],
Feris, R.S.[Rogerio S.],
Harwath, D.[David],
Glass, J.[James],
Kuehne, H.[Hilde],
Everything at Once - Multi-modal Fusion Transformer for Video
Retrieval,
CVPR22(19988-19997)
IEEE DOI
2210
Location awareness, Training, Codes, Fuses, Benchmark testing,
Transformers, Action and event recognition, Video analysis and understanding
BibRef
Andonian, A.[Alex],
Chen, S.X.[Shi-Xing],
Hamid, R.[Raffay],
Robust Cross-Modal Representation Learning with Progressive
Self-Distillation,
CVPR22(16409-16420)
IEEE DOI
2210
Training, Representation learning, Computational modeling,
Redundancy, Benchmark testing, Robustness, Noise measurement,
Representation learning
BibRef
Lu, H.Y.[Hao-Yu],
Fei, N.[Nanyi],
Huo, Y.Q.[Yu-Qi],
Gao, Y.Z.[Yi-Zhao],
Lu, Z.W.[Zhi-Wu],
Wen, J.R.[Ji-Rong],
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for
Cross-Modal Retrieval,
CVPR22(15671-15680)
IEEE DOI
2210
Visualization, Collaboration, Streaming media,
Probability distribution, Task analysis,
Video analysis and understanding
BibRef
Abdelnabi, S.[Sahar],
Hasan, R.[Rakibul],
Fritz, M.[Mario],
Open-Domain, Content-based, Multi-modal Fact-checking of
Out-of-Context Images via Online Resources,
CVPR22(14920-14929)
IEEE DOI
2210
Visualization, Machine vision, MIMICs, Manuals,
Cognition, retrieval, Vision + language,
Recognition: detection
BibRef
Wang, Y.[Yun],
Zhang, T.[Tong],
Zhang, X.[Xueya],
Cui, Z.[Zhen],
Huang, Y.[Yuge],
Shen, P.C.[Peng-Cheng],
Li, S.X.[Shao-Xin],
Yang, J.[Jian],
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval,
ICCV21(1793-1802)
IEEE DOI
2203
Training, Representation learning, Analytical models, Dictionaries,
Correlation, Computational modeling, Vision + language,
BibRef
Cai, G.[Guanyu],
Zhang, J.[Jun],
Jiang, X.Y.[Xin-Yang],
Gong, Y.F.[Yi-Fei],
He, L.[Lianghua],
Yu, F.[Fufu],
Peng, P.[Pai],
Guo, X.W.[Xiao-Wei],
Huang, F.Y.[Fei-Yue],
Sun, X.[Xing],
Ask amp;Confirm: Active Detail Enriching for Cross-Modal Retrieval
with Partial Query,
ICCV21(1815-1824)
IEEE DOI
2203
Training, Codes, Computational modeling, Image retrieval,
Search problems, Robustness, Vision + language, Image and video retrieval
BibRef
Wen, K.Y.[Ke-Yu],
Xia, J.[Jin],
Huang, Y.Y.[Yuan-Yuan],
Li, L.Y.[Lin-Yang],
Xu, J.Y.[Jia-Yan],
Shao, J.[Jie],
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for
Vision-Language Representation,
ICCV21(2188-2197)
IEEE DOI
2203
Visualization, Codes, Computational modeling, Image retrieval,
Semantics, Transformers, Vision + language,
Representation learning
BibRef
Patrick, M.[Mandela],
Huang, P.Y.[Po-Yao],
Misra, I.[Ishan],
Metze, F.[Florian],
Vedaldi, A.[Andrea],
Asano, Y.M.[Yuki M.],
Henriques, J.[João],
Space-Time Crop & Attend:
Improving Cross-modal Video Representation Learning,
ICCV21(10540-10552)
IEEE DOI
2203
Representation learning, Costs, Codes, Computational modeling, Crops,
Image representation, Representation learning, Vision + other modalities
BibRef
Lin, M.X.[Ming-Xian],
Yang, J.[Jie],
Wang, H.[He],
Lai, Y.K.[Yu-Kun],
Jia, R.[Rongfei],
Zhao, B.Q.[Bin-Qiang],
Gao, L.[Lin],
Single Image 3D Shape Retrieval via Cross-Modal Instance and Category
Contrastive Learning,
ICCV21(11385-11395)
IEEE DOI
2203
Representation learning, Deep learning, Shape,
Image color analysis, Pipelines, Gray-scale,
3D from a single image and shape-from-x
BibRef
Changpinyo, S.[Soravit],
Pont-Tuset, J.[Jordi],
Ferrari, V.[Vittorio],
Soricut, R.[Radu],
Telling the What while Pointing to the Where:
Multimodal Queries for Image Retrieval,
ICCV21(12116-12126)
IEEE DOI
2203
Location awareness, Error analysis, Computational modeling,
Image retrieval, Natural languages, Mice,
Vision + other modalities
BibRef
Gabeur, V.[Valentin],
Nagrani, A.[Arsha],
Sun, C.[Chen],
Alahari, K.[Karteek],
Schmid, C.[Cordelia],
Masking Modalities for Cross-modal Video Retrieval,
WACV22(2111-2120)
IEEE DOI
2202
Manuals, Benchmark testing, Motion pictures,
Natural language processing, Proposals, Speech processing, Scene Understanding
BibRef
Galanopoulos, D.[Damianos],
Mezaris, V.[Vasileios],
Hard-Negatives or Non-Negatives? A Hard-Negative Selection Strategy
for Cross-Modal Retrieval Using the Improved Marginal Ranking Loss,
ViRaL21(2312-2316)
IEEE DOI
2112
Training, Computational modeling, Network architecture
BibRef
Jing, L.L.[Long-Long],
Vahdani, E.[Elahe],
Tan, J.X.[Jia-Xing],
Tian, Y.L.[Ying-Li],
Cross-Modal Center Loss for 3D Cross-Modal Retrieval,
CVPR21(3141-3150)
IEEE DOI
2111
Solid modeling,
Computational modeling, Metadata, Feature extraction
BibRef
Almazán, J.[Jon],
Ko, B.[Byungsoo],
Gu, G.[Geonmo],
Larlus, D.[Diane],
Kalantidis, Y.[Yannis],
Granularity-Aware Adaptation for Image Retrieval Over Multiple Tasks,
ECCV22(XIV:389-406).
Springer DOI
2211
BibRef
Chun, S.[Sanghyuk],
Oh, S.J.[Seong Joon],
Sampaio de Rezende, R.[Rafael],
Kalantidis, Y.[Yannis],
Larlus, D.[Diane],
Probabilistic Embeddings for Cross-Modal Retrieval,
CVPR21(8411-8420)
IEEE DOI
2111
Uncertainty, Codes, Databases, Annotations, Tools, Benchmark testing
BibRef
Croitoru, I.[Ioana],
Bogolin, S.V.[Simion-Vlad],
Leordeanu, M.[Marius],
Jin, H.L.[Hai-Lin],
Zisserman, A.[Andrew],
Albanie, S.[Samuel],
Liu, Y.[Yang],
TeachText:
CrossModal Generalized Distillation for Text-Video Retrieval,
ICCV21(11563-11573)
IEEE DOI
2203
Visualization, Codes, Computational modeling, Noise reduction,
Benchmark testing,
Vision + language
BibRef
Liu, Y.[Yang],
Chen, Q.C.[Qing-Chao],
Albanie, S.[Samuel],
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language
Retrieval,
CVPR21(14949-14959)
IEEE DOI
2111
Visualization, Prototypes,
Task analysis, Mutual information, Videos
BibRef
Salvador, A.[Amaia],
Gundogdu, E.[Erhan],
Bazzani, L.[Loris],
Donoser, M.[Michael],
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers
and Self-supervised Learning,
CVPR21(15470-15479)
IEEE DOI
2111
Training, Codes, Computational modeling, Semantics,
Machine learning, Transformers
BibRef
Dzabraev, M.[Maksim],
Kalashnikov, M.[Maksim],
Komkov, S.[Stepan],
Petiushko, A.[Aleksandr],
MDMMT: Multidomain Multimodal Transformer for Video Retrieval,
HVU21(3349-3358)
IEEE DOI
2109
Training, Benchmark testing,
Task analysis
BibRef
Wang, K.[Kai],
Herranz, L.[Luis],
van de Weijer, J.[Joost],
Continual learning in cross-modal retrieval,
OmniCV21(3623-3633)
IEEE DOI
2109
Training, Visualization, Human intelligence, Focusing, Interference,
Tools
BibRef
Mafla, A.[Andrés],
Rezende, R.S.[Rafael S.],
Gómez, L.[Lluís],
Larlus, D.[Diane],
Karatzas, D.[Dimosthenis],
StacMR: Scene-Text Aware Cross-Modal Retrieval,
WACV21(2219-2229)
IEEE DOI
2106
Visualization, Annotations,
Computational modeling, Semantics, Task analysis
BibRef
Feng, C.T.[Chang-Ting],
Li, D.G.[Da-Gang],
Zheng, J.W.[Jing-Wei],
Improving Supervised Cross-modal Retrieval with Semantic Graph
Embedding,
MMMod21(I:187-199).
Springer DOI
2106
BibRef
Wen, Z.Y.[Zhen-Yu],
Feng, A.[Aimin],
Deep Centralized Cross-modal Retrieval,
MMMod21(I:443-455).
Springer DOI
2106
BibRef
Li, Z.X.[Zhi-Xin],
Ling, F.[Feng],
Xu, C.S.[Chuan-Sheng],
Zhang, C.L.[Can-Long],
Ma, H.F.[Hui-Fang],
Cross-Media Hash Retrieval Using Multi-Head Attention Network,
ICPR21(1290-1297)
IEEE DOI
2105
Correlation, Semantics, Neural networks, Media,
Extraterrestrial measurements, cross-media retrieval
BibRef
Jin, C.[Cong],
Zhang, T.[Tian],
Liu, S.X.[Shou-Xun],
Tie, Y.[Yun],
Lv, X.[Xin],
Li, J.G.[Jian-Guang],
Yan, W.C.[Wen-Cai],
Yan, M.[Ming],
Xu, Q.[Qian],
Guan, Y.C.[Yi-Cong],
Yang, Z.G.[Zheng-Gougou],
Cross-modal Deep Learning Applications: Audio-visual Retrieval,
MMDLCA20(301-313).
Springer DOI
2103
BibRef
Thomas, C.[Christopher],
Kovashka, A.[Adriana],
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval,
ECCV20(XVIII:317-335).
Springer DOI
2012
BibRef
Wang, Z.,
Liu, X.,
Li, H.,
Sheng, L.,
Yan, J.,
Wang, X.,
Shao, J.,
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval,
ICCV19(5763-5772)
IEEE DOI
2004
entropy, feature extraction, image matching, image retrieval,
message passing, natural language processing, text analysis,
Task analysis
BibRef
Nawaz, S.,
Janjua, M.K.,
Gallo, I.,
Mahmood, A.,
Calefati, A.,
Shafait, F.,
Do Cross Modal Systems Leverage Semantic Relationships?,
CroMoL19(4501-4510)
IEEE DOI
2004
image representation, image retrieval, image segmentation,
learning (artificial intelligence),
Text to Image
BibRef
Su, S.,
Zhong, Z.,
Zhang, C.,
Deep Joint-Semantics Reconstructing Hashing for Large-Scale
Unsupervised Cross-Modal Retrieval,
ICCV19(3027-3035)
IEEE DOI
2004
binary codes, image coding, image retrieval, multimedia computing,
neural nets, binary codes, reconstructing framework, DJSRH, Correlation
BibRef
Ning, X.C.[Xue-Cheng],
Yang, X.S.[Xiao-Shan],
Xu, C.S.[Chang-Sheng],
Multi-Hop Interactive Cross-modal Retrieval,
MMMod20(II:681-693).
Springer DOI
2003
BibRef
Cornia, M.[Marcella],
Baraldi, L.[Lorenzo],
Tavakoli, H.R.[Hamed R.],
Cucchiara, R.[Rita],
Towards Cycle-Consistent Models for Text and Image Retrieval,
WiCV-E18(IV:687-691).
Springer DOI
1905
BibRef
Surís, D.[Didac],
Duarte, A.[Amanda],
Salvador, A.[Amaia],
Torres, J.[Jordi],
Giró-i-Nieto, X.[Xavier],
Cross-modal Embeddings for Video and Audio Retrieval,
WiCV-E18(IV:711-716).
Springer DOI
1905
BibRef
Liu, C.L.[Chen-Lu],
Xu, X.[Xing],
Yang, Y.[Yang],
Lu, H.M.[Hui-Min],
Shen, F.M.[Fu-Min],
Ji, Y.L.[Yan-Li],
Domain Invariant Subspace Learning for Cross-Modal Retrieval,
MMMod18(II:94-105).
Springer DOI
1802
BibRef
Yuan, Y.X.[Yu-Xin],
Peng, Y.X.[Yu-Xin],
Recursive Pyramid Network with Joint Attention for Cross-Media
Retrieval,
MMMod18(I:405-416).
Springer DOI
1802
BibRef
Jia, Y.H.[Yu-Hua],
Bai, L.[Liang],
Wang, P.[Peng],
Guo, J.L.[Jin-Lin],
Xie, Y.X.[Yu-Xiang],
Yu, T.Y.[Tian-Yuan],
Utilizing Locality-Sensitive Hash Learning for Cross-Media Retrieval,
MMMod17(I: 550-561).
Springer DOI
1701
BibRef
Shang, X.[Xindi],
Zhang, H.W.[Han-Wang],
Chua, T.S.[Tat-Seng],
Deep Learning Generic Features for Cross-Media Retrieval,
MMMod16(I: 264-275).
Springer DOI
1601
BibRef
Huang, L.[Lei],
Peng, Y.X.[Yu-Xin],
Cross-Media Retrieval via Semantic Entity Projection,
MMMod16(I: 276-288).
Springer DOI
1601
BibRef
Gu, Y.[Yun],
Xue, H.Y.[Hao-Yang],
Yang, J.[Jie],
Shi, P.F.[Peng-Fei],
Cross-modality hashing with partial correspondence,
ICIP15(1925-1929)
IEEE DOI
1512
Cross-modality; Hashing; Multimedia Search; Partial Correspondence
BibRef
Zhang, H.[Hong],
Chen, L.[Li],
Learning optimal data representation for cross-media retrieval,
ICIP12(1925-1928).
IEEE DOI
1302
BibRef
Lin, W.X.[Wan-Xia],
Lu, T.[Tong],
Su, F.[Feng],
A Novel Multi-modal Integration and Propagation Model for Cross-Media
Information Retrieval,
MMMod12(740-749).
Springer DOI
1201
BibRef
Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Video Delivery, Video-on-Demand, Indexing, Techniques, Systems .