Costa Pereira, J.,
Coviello, E.[Emanuele],
Doyle, G.,
Rasiwasia, N.,
Lanckriet, G.R.G.[Gert R.G.],
Levy, R.,
Vasconcelos, N.M.,
On the Role of Correlation and Abstraction in Cross-Modal Multimedia
Retrieval,
PAMI(36), No. 3, March 2014, pp. 521-535.
IEEE DOI
1403
image matching. E.g. use image to search for text.
Correlation matching. Semantic matching. Semantic correlation matching.
BibRef
Costa Pereira, J.[Jose],
Vasconcelos, N.M.[Nuno M.],
Cross-modal domain adaptation for text-based regularization of image
semantics in image retrieval systems,
CVIU(124), No. 1, 2014, pp. 123-135.
Elsevier DOI
1406
Content-based image retrieval
BibRef
Zhai, X.H.[Xiao-Hua],
Peng, Y.X.[Yu-Xin],
Xiao, J.G.[Jian-Guo],
Learning Cross-Media Joint Representation With Sparse and
Semisupervised Regularization,
CirSysVideo(24), No. 6, June 2014, pp. 965-978.
IEEE DOI
1407
Correlation
BibRef
Peng, Y.X.[Yu-Xin],
Qi, J.W.[Jin-Wei],
Quintuple-Media Joint Correlation Learning With Deep Compression and
Regularization,
CirSysVideo(30), No. 8, August 2020, pp. 2709-2722.
IEEE DOI
2008
Media, Correlation, Semantics,
Solid modeling, Data models, Image coding, Cross-media retrieval,
network regularization
BibRef
Peng, Y.,
Zhai, X.,
Zhao, Y.,
Huang, X.,
Semi-Supervised Cross-Media Feature Learning With Unified Patch Graph
Regularization,
CirSysVideo(26), No. 3, March 2016, pp. 583-596.
IEEE DOI
1603
Correlation
BibRef
Bellini, P.[Pierfrancesco],
Cenni, D.[Daniele],
Nesi, P.[Paolo],
Optimization of information retrieval for cross media contents in a
best practice network,
MultInfoRetr(3), No. 3, September 2014, pp. 147-159.
Springer DOI
1408
BibRef
Kang, C.,
Xiang, S.,
Liao, S.,
Xu, C.,
Pan, C.,
Learning Consistent Feature Representation for Cross-Modal Multimedia
Retrieval,
MultMed(17), No. 3, March 2015, pp. 370-381.
IEEE DOI
1502
Algorithm design and analysis
BibRef
He, Y.,
Xiang, S.,
Kang, C.,
Wang, J.,
Pan, C.,
Cross-Modal Retrieval via Deep and Bidirectional Representation
Learning,
MultMed(18), No. 7, July 2016, pp. 1363-1377.
IEEE DOI
1608
backpropagation
BibRef
Zhang, S.,
Wang, X.,
Lin, Y.,
Tian, Q.,
Cross Indexing With Grouplets,
MultMed(17), No. 11, November 2015, pp. 1969-1979.
IEEE DOI
1511
Feature extraction
BibRef
Chu, L.,
Zhang, Y.,
Li, G.,
Wang, S.,
Zhang, W.,
Huang, Q.,
Effective Multimodality Fusion Framework for Cross-Media Topic
Detection,
CirSysVideo(26), No. 3, March 2016, pp. 556-569.
IEEE DOI
1603
Complexity theory
BibRef
Ding, K.[Kun],
Fan, B.[Bin],
Huo, C.L.[Chun-Lei],
Xiang, S.M.[Shi-Ming],
Pan, C.H.[Chun-Hong],
Cross-Modal Hashing via Rank-Order Preserving,
MultMed(19), No. 3, March 2017, pp. 571-585.
IEEE DOI
1702
Binary codes
BibRef
Jiang, B.[Bin],
Yang, J.C.[Jia-Chen],
Lv, Z.H.[Zhi-Han],
Tian, K.[Kun],
Meng, Q.G.[Qing-Gang],
Yan, Y.[Yan],
Internet cross-media retrieval based on deep learning,
JVCIR(48), No. 1, 2017, pp. 356-366.
Elsevier DOI
1708
Cross-media, retrieval
BibRef
Hu, Y.,
Zheng, L.,
Yang, Y.,
Huang, Y.,
Twitter100k: A Real-World Dataset for Weakly Supervised Cross-Media
Retrieval,
MultMed(20), No. 4, April 2018, pp. 927-938.
IEEE DOI
1804
Electronic publishing, Encyclopedias, Internet,
Optical character recognition software, Training, Visualization,
weakly supervised method
BibRef
Verma, Y.[Yashaswi],
Jha, A.[Abhishek],
Jawahar, C.V.,
Cross-specificity: modelling data semantics for cross-modal matching
and retrieval,
MultInfoRetr(8), No. 2, June 2018, pp. 139-146.
Springer DOI
1805
BibRef
Dorfer, M.[Matthias],
Schlüter, J.[Jan],
Vall, A.[Andreu],
Korzeniowski, F.[Filip],
Widmer, G.[Gerhard],
End-to-end cross-modality retrieval with CCA projections and pairwise
ranking loss,
MultInfoRetr(8), No. 2, June 2018, pp. 117-128.
Springer DOI
1805
BibRef
Lu, X.[Xu],
Zhang, H.X.[Hua-Xiang],
Sun, J.[Jiande],
Wang, Z.H.[Zhen-Hua],
Guo, P.[Peilian],
Wan, W.[Wenbo],
Discriminative correlation hashing for supervised cross-modal
retrieval,
SP:IC(65), 2018, pp. 221-230.
Elsevier DOI
1805
Cross-modal retrieval, Hashing, Subspace learning, Discriminant analysis
BibRef
Wang, L.[Li],
Zhu, L.[Lei],
Dong, X.[Xiao],
Liu, L.[Li],
Sun, J.[Jiande],
Zhang, H.X.[Hua-Xiang],
Joint Feature Selection and Graph Regularization for
Modality-Dependent Cross-Modal Retrieval,
JVCIR(54), 2018, pp. 213-222.
Elsevier DOI
1806
Cross-modal retrieval, Feature selection, Subspace learning,
Graph regularization
BibRef
Zhong, F.M.[Fang-Ming],
Chen, Z.K.[Zhi-Kui],
Min, G.Y.[Ge-Yong],
Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval,
PR(83), 2018, pp. 64-77.
Elsevier DOI
1808
Cross-modal retrieval, deep learning, discrete hashing,
alternative optimization
BibRef
Yuan, X.[Xu],
Wang, G.Z.[Guang-Ze],
Chen, Z.K.[Zhi-Kui],
Zhong, F.M.[Fang-Ming],
CHOP: An orthogonal hashing method for zero-shot cross-modal
retrieval,
PRL(145), 2021, pp. 247-253.
Elsevier DOI
2104
Zero-shot, Cross-modal retrieval, Orthogonal projection
BibRef
Vukotic, V.,
Raymond, C.,
Gravier, G.,
A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking,
MultMedMag(25), No. 2, April 2018, pp. 11-23.
IEEE DOI
1808
Task analysis, Neural networks,
Visualization, Streaming media, Hypertext systems, Training,
multimedia
BibRef
Liu, R.,
Wei, S.,
Zhao, Y.,
Zhu, Z.,
Wang, J.,
Multiview Cross-Media Hashing with Semantic Consistency,
MultMedMag(25), No. 2, April 2018, pp. 71-86.
IEEE DOI
1808
Media, Semantics, Correlation, Multimedia communication,
Optimization, Feature extraction, hashing, cross-media, multiview,
searching
BibRef
Wang, D.[Di],
Wang, Q.[Quan],
Gao, X.B.[Xin-Bo],
Robust and Flexible Discrete Hashing for Cross-Modal Similarity
Search,
CirSysVideo(28), No. 10, October 2018, pp. 2703-2715.
IEEE DOI
1811
Robustness, Training, Binary codes, Quantization (signal),
Linear programming, Matrix decomposition, Sparse matrices, Hashing,
unsupervised learning
BibRef
Wang, D.[Di],
Gao, X.B.[Xin-Bo],
Wang, X.M.[Xiu-Mei],
He, L.H.[Li-Huo],
Label Consistent Matrix Factorization Hashing for Large-Scale
Cross-Modal Similarity Search,
PAMI(41), No. 10, October 2019, pp. 2466-2479.
IEEE DOI
1909
Semantics, Correlation, Training, Transforms, Binary codes,
Image reconstruction, Sparse matrices, Hashing, multimodal,
cross-modal
BibRef
Wang, D.[Di],
Gao, X.B.[Xin-Bo],
Wang, X.M.[Xiu-Mei],
He, L.[Lihuo],
Yuan, B.[Bo],
Multimodal Discriminative Binary Embedding for Large-Scale
Cross-Modal Retrieval,
IP(25), No. 10, October 2016, pp. 4540-4554.
IEEE DOI
1610
Internet
BibRef
Wang, D.[Di],
Wang, Q.[Quan],
He, L.[Lihuo],
Gao, X.B.[Xin-Bo],
Tian, Y.[Yumin],
Joint and individual matrix factorization hashing for large-scale
cross-modal retrieval,
PR(107), 2020, pp. 107479.
Elsevier DOI
2008
Hashing, Multimodal, Retrieval, Cross-modal, Matrix factorization
BibRef
Dong, F.[Fei],
Nie, X.S.[Xiu-Shan],
Liu, X.B.[Xing-Bo],
Geng, L.L.[Lei-Lei],
Wang, Q.[Qian],
Cross-Modal Hashing Based on Category Structure Preserving,
JVCIR(57), 2018, pp. 28-33.
Elsevier DOI
1812
Cross-modal retrieval, Supervised hashing,
Category-specific structure preserving
BibRef
Zhang, M.J.[Mei-Jia],
Zhang, H.X.[Hua-Xiang],
Li, J.Z.[Jun-Zheng],
Wang, L.[Li],
Fang, Y.X.[Yi-Xian],
Sun, J.[Jiande],
Supervised graph regularization based cross media retrieval with
intra and inter-class correlation,
JVCIR(58), 2019, pp. 1-11.
Elsevier DOI
1901
Cross media retrieval, Subspace learning, Supervised graph regularization
BibRef
Yao, T.[Tao],
Wang, G.[Gang],
Yan, L.S.[Lian-Shan],
Kong, X.W.[Xiang-Wei],
Su, Q.T.[Qing-Tang],
Zhang, C.M.[Cai-Ming],
Tian, Q.[Qi],
Online latent semantic hashing for cross-media retrieval,
PR(89), 2019, pp. 1-11.
Elsevier DOI
1902
Cross-media retrieval, Online learning, Hashing, Latent semantic concept
BibRef
Yao, T.[Tao],
Kong, X.W.[Xiang-Wei],
Fu, H.Y.[Hai-Yan],
Tian, Q.[Qi],
Discrete Semantic Alignment Hashing for Cross-Media Retrieval,
Cyber(50), No. 12, December 2020, pp. 4896-4907.
IEEE DOI
2012
Semantics, Hash functions, Correlation, Quantization (signal),
Optimization, Task analysis, Internet, Attribute,
hashing
BibRef
Dutta, T.[Titir],
Biswas, S.[Soma],
Cross-modal retrieval in challenging scenarios using attributes,
PRL(125), 2019, pp. 618-624.
Elsevier DOI
1909
Cross-modal retrieval, Attributes, Unseen query, Low-resolution data
BibRef
Liu, H.P.[Hua-Ping],
Wang, F.[Feng],
Zhang, X.Y.[Xin-Yu],
Sun, F.C.[Fu-Chun],
Weakly-paired deep dictionary learning for cross-modal retrieval,
PRL(130), 2020, pp. 199-206.
Elsevier DOI
2002
Deep dictionary learning, Cross-modal retrieval, Weak pairing
BibRef
Zhang, H.[Hong],
Wang, T.[Ting],
Dai, G.[Gang],
Semi-supervised cross-modal common representation learning with
vector-valued manifold regularization,
PRL(130), 2020, pp. 335-344.
Elsevier DOI
2002
Cross-media retrieval, Vector-valued RKHS,
Manifold regularization, Semi-supervised, Kernel method
BibRef
Chaudhuri, U.[Ushasi],
Banerjee, B.[Biplab],
Bhattacharya, A.[Avik],
Datcu, M.[Mihai],
CMIR-NET: A deep learning based model for cross-modal retrieval in
remote sensing,
PRL(131), 2020, pp. 456-462.
Elsevier DOI
2004
Remote sensing, Cross-modal retrieval, Deep learning,
Panchromatic, Multispectral, Audio samples
BibRef
Chi, J.Z.[Jing-Ze],
Peng, Y.X.[Yu-Xin],
Zero-Shot Cross-Media Embedding Learning With Dual Adversarial
Distribution Network,
CirSysVideo(30), No. 4, April 2020, pp. 1173-1187.
IEEE DOI
2004
Semantics, Media, Correlation, Training, Dogs,
Measurement, Cross-media retrieval, zero-shot learning,
maximum mean discrepancy
BibRef
Wu, F.[Fei],
Jing, X.Y.[Xiao-Yuan],
Wu, Z.Y.[Zhi-Yong],
Ji, Y.[Yimu],
Dong, X.[Xiwei],
Luo, X.K.[Xiao-Kai],
Huang, Q.H.[Qing-Hua],
Wang, R.[Ruchuan],
Modality-specific and shared generative adversarial network for
cross-modal retrieval,
PR(104), 2020, pp. 107335.
Elsevier DOI
2005
Cross-modal retrieval, Generative adversarial networks (GAN),
Modality-specific feature learning, Modality-shared feature learning
BibRef
Wu, F.[Fei],
Luo, X.K.[Xiao-Kai],
Huang, Q.H.[Qing-Hua],
Wei, P.F.[Peng-Fei],
Sun, Y.[Ying],
Dong, X.[Xiwei],
Wu, Z.Y.[Zhi-Yong],
Semantic Preserving Generative Adversarial Network for Cross-Modal
Hashing,
ICIP21(2743-2747)
IEEE DOI
2201
Measurement, Quantization (signal), Image processing, Semantics,
Focusing, Network architecture, cross-modal hashing,
semantic preserving
BibRef
Zhong, F.M.[Fang-Ming],
Chen, Z.K.[Zhi-Kui],
Min, G.Y.[Ge-Yong],
Xia, F.[Feng],
A novel strategy to balance the results of cross-modal hashing,
PR(107), 2020, pp. 107523.
Elsevier DOI
2008
Cross-modal hashing, Semantic gap, Semantic augmentation, Cross-modal retrieval
BibRef
Peng, Y.,
Chi, J.,
Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene
Graph,
CirSysVideo(30), No. 11, November 2020, pp. 4368-4379.
IEEE DOI
2011
Media, Correlation, Visualization, Genomics, Bioinformatics,
Training data, Training, Cross-media retrieval, domain adaptation,
scene graph
BibRef
Zhu, L.[Lei],
Song, J.[Jiayu],
Zhu, X.F.[Xiao-Feng],
Zhang, C.Y.[Cheng-Yuan],
Zhang, S.C.[Shi-Chao],
Yuan, X.P.[Xin-Pan],
Adversarial Learning-Based Semantic Correlation Representation for
Cross-Modal Retrieval,
MultMedMag(27), No. 4, October 2020, pp. 79-90.
IEEE DOI
2012
Correlation, Semantics, Computer science, Internet, Streaming media
BibRef
Zhu, L.[Lei],
Zhang, C.Y.[Cheng-Yuan],
Song, J.[Jiayu],
Zhang, S.C.[Shi-Chao],
Tian, C.[Chunwei],
Zhu, X.[Xinghui],
Deep Multigraph Hierarchical Enhanced Semantic Representation for
Cross-Modal Retrieval,
MultMedMag(29), No. 3, July 2022, pp. 17-26.
IEEE DOI
2209
Semantics, Adversarial machine learning, Correlation,
Visualization, Generators, Generative adversarial networks, Computer science
BibRef
Chaudhuri, U.[Ushasi],
Banerjee, B.[Biplab],
Bhattacharya, A.[Avik],
Datcu, M.[Mihai],
CrossATNet: A novel cross-attention based framework for sketch-based
image retrieval,
IVC(104), 2020, pp. 104003.
Elsevier DOI
2012
Neural networks, Sketch-based image retrieval,
Cross-modal retrieval, Deep-learning, Cross-attention network, Cross-triplets
BibRef
Zhang, Y.,
Zhou, W.,
Wang, M.,
Tian, Q.,
Li, H.,
Deep Relation Embedding for Cross-Modal Retrieval,
IP(30), 2021, pp. 617-627.
IEEE DOI
2012
Semantics, Feature extraction, Visualization,
Computational modeling, Task analysis, Training, Optimization,
relation
BibRef
Zhang, L.[Lei],
Chen, L.T.[Lei-Ting],
Ou, W.H.[Wei-Hua],
Zhou, C.[Chuan],
Semi-supervised cross-modal representation learning with GAN-based
Asymmetric Transfer Network,
JVCIR(73), 2020, pp. 102899.
Elsevier DOI
2012
Cross-modal retrieval, Modality gap, Generative adversarial network
BibRef
Wang, L.[Lu],
Yang, J.[Jie],
Zareapoor, M.[Masoumeh],
Zheng, Z.L.[Zhong-Long],
Cluster-wise unsupervised hashing for cross-modal similarity search,
PR(111), 2021, pp. 107732.
Elsevier DOI
2012
Cross-modal similarity retrieval, Multi-view clustering,
The cluster-wise code-prototypes, Cross-modal hashing,
BibRef
Meng, M.,
Wang, H.,
Yu, J.,
Chen, H.,
Wu, J.,
Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal
Retrieval,
IP(30), 2021, pp. 986-1000.
IEEE DOI
2012
Semantics, Optimization, Quantization (signal), Correlation,
Symmetric matrices, Image coding, Sparse matrices, multimedia
BibRef
Matsubara, T.[Takashi],
Target-Oriented Deformation of Visual-Semantic Embedding Space,
IEICE(E104-D), No. 1, January 2021, pp. 24-33.
WWW Link.
2101
BibRef
Nie, X.,
Wang, B.,
Li, J.,
Hao, F.,
Jian, M.,
Yin, Y.,
Deep Multiscale Fusion Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 1, January 2021, pp. 401-410.
IEEE DOI
2101
Semantics, Machine learning, Training data, Media,
Correlation, Retrieval, hashing, deep learning, cross-modal
BibRef
Liu, X.[Xin],
Hu, Z.K.[Zhi-Kai],
Ling, H.B.[Hai-Bin],
Cheung, Y.M.[Yiu-Ming],
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient
Cross-Modal Retrieval,
PAMI(43), No. 3, March 2021, pp. 964-981.
IEEE DOI
2102
Lips, Semantics, Adaptation models, Task analysis,
Encoding, Correlation, Cross-modal retrieval,
semantic correlation matrix
BibRef
Wu, Y.,
Wang, S.,
Song, G.,
Huang, Q.,
Augmented Adversarial Training for Cross-Modal Retrieval,
MultMed(23), 2021, pp. 559-571.
IEEE DOI
2102
image representation, image retrieval, neural nets, text analysis,
adversarial training process,
adversa-rial training
BibRef
Lin, Q.,
Cao, W.,
He, Z.,
He, Z.,
Mask Cross-Modal Hashing Networks,
MultMed(23), 2021, pp. 550-558.
IEEE DOI
2102
deep learning (artificial intelligence), feature extraction,
file organisation, image retrieval, text analysis,
cross-modal retrieval
BibRef
Qi, M.,
Qin, J.,
Yang, Y.,
Wang, Y.,
Luo, J.,
Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video
Retrieval,
IP(30), 2021, pp. 2989-3004.
IEEE DOI
2102
Semantics, Binary codes, Feature extraction, Visualization,
Task analysis, Natural languages, Stochastic processes,
natural language
BibRef
Wu, J.L.[Jian-Long],
Xie, X.X.[Xing-Xu],
Nie, L.Q.[Li-Qiang],
Lin, Z.C.[Zhou-Chen],
Zha, H.B.[Hong-Bin],
Reconstruction regularized low-rank subspace learning for cross-modal
retrieval,
PR(113), 2021, pp. 107813.
Elsevier DOI
2103
Cross-modal retrieval, Low-rank subspace learning,
Reconstruction regularization
BibRef
Zou, X.T.[Xi-Tao],
Wang, X.Z.[Xin-Zhi],
Bakker, E.M.[Erwin M.],
Wu, S.[Song],
Multi-label semantics preserving based deep cross-modal hashing,
SP:IC(93), 2021, pp. 116131.
Elsevier DOI
2103
Multi-modal retrieval, Deep cross-modal hashing, Multi-label semantic learning
BibRef
Shu, X.[Xin],
Zhao, G.Y.[Guo-Ying],
Scalable multi-label canonical correlation analysis for cross-modal
retrieval,
PR(115), 2021, pp. 107905.
Elsevier DOI
2104
Canonical correlation analysis, Semantic transformation,
Cross-modal retrieval, Singular value decomposition
BibRef
Song, G.[Ge],
Tan, X.Y.[Xiao-Yang],
Real-world Cross-modal Retrieval via Sequential Learning,
MultMed(23), 2021, pp. 1708-1721.
IEEE DOI
2106
BibRef
Earlier:
Sequential Learning for Cross-Modal Retrieval,
CroMoL19(4531-4539)
IEEE DOI
2004
Plugs, Task analysis, Data models, Learning systems, Brain modeling,
Adaptation models, Technological innovation,
meta learning.
information retrieval,
learning (artificial intelligence), multimodal data, meta learning
BibRef
Chen, W.[Wei],
Liu, Y.[Yu],
Bakker, E.M.[Erwin M.],
Lew, M.S.[Michael S.],
Integrating information theory and adversarial learning for
cross-modal retrieval,
PR(117), 2021, pp. 107983.
Elsevier DOI
2106
Cross-modal retrieval, Shannon information theory,
Adversarial learning, Modality uncertainty, Data imbalance
BibRef
Huang, Z.Y.[Zhen-Yu],
Zhou, J.T.Y.[Joey Tian-Yi],
Zhu, H.Y.[Hong-Yuan],
Zhang, C.Q.[Chang-Qing],
Lv, J.C.[Jian-Cheng],
Peng, X.[Xi],
Deep Spectral Representation Learning From Multi-View Data,
IP(30), 2021, pp. 5352-5362.
IEEE DOI
2106
Deep learning, Laplace equations, Neural networks, Collaboration,
Data models, Task analysis,
cross-modal retrieval
BibRef
Wen, X.[Xin],
Han, Z.Z.[Zhi-Zhong],
Liu, Y.S.[Yu-Shen],
CMPD: Using Cross Memory Network With Pair Discrimination for
Image-Text Retrieval,
CirSysVideo(31), No. 6, June 2021, pp. 2427-2437.
IEEE DOI
2106
Semantics, Task analysis, Training, Generators, Optimization,
Marine vehicles, Retrieval, cross-modal retrieval, adversarial learning
BibRef
Liu, J.H.[Jun-Hao],
Yang, M.[Min],
Li, C.M.[Cheng-Ming],
Xu, R.F.[Rui-Feng],
Improving Cross-Modal Image-Text Retrieval With Teacher-Student
Learning,
CirSysVideo(31), No. 8, August 2021, pp. 3242-3253.
IEEE DOI
2108
Semantics, Task analysis, Data models, Neural networks, Correlation,
Binary codes, Feature extraction,
teacher-student learning
BibRef
Song, G.[Ge],
Tan, X.Y.[Xiao-Yang],
Zhao, J.[Jun],
Yang, M.[Ming],
Deep robust multilevel semantic hashing for multi-label cross-modal
retrieval,
PR(120), 2021, pp. 108084.
Elsevier DOI
2109
Hashing, Multi-label, Cross-modal retrieval, Deep learning
BibRef
Fang, Y.Z.[Yu-Zhi],
Robust multimodal discrete hashing for cross-modal similarity search,
JVCIR(79), 2021, pp. 103256.
Elsevier DOI
2109
Hashing, Robust, Cross-modal retrieval, Unsupervised learning
BibRef
Nie, X.S.[Xiu-Shan],
Liu, X.B.[Xing-Bo],
Xi, X.M.[Xiao-Ming],
Li, C.L.[Cheng-Long],
Yin, Y.L.[Yi-Long],
Fast Unmediated Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 9, September 2021, pp. 3669-3678.
IEEE DOI
2109
Semantics, Training, Optimization, Training data, Binary codes,
Correlation, Videos, Cross-modal retrieval, hashing, unmediated,
double supervision
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Yin, H.F.[He-Feng],
Kittler, J.V.[Josef V.],
MOON: Multi-hash codes joint learning for cross-media retrieval,
PRL(151), 2021, pp. 19-25.
Elsevier DOI
2110
Cross-media retrieval, Hashing, Discrete optimization, Joint learning
BibRef
Hu, P.[Peng],
Peng, X.[Xi],
Zhu, H.Y.[Hong-Yuan],
Lin, J.[Jie],
Zhen, L.L.[Liang-Li],
Peng, D.Z.[De-Zhong],
Joint Versus Independent Multiview Hashing for Cross-View Retrieval,
Cyber(51), No. 10, October 2021, pp. 4982-4993.
IEEE DOI
2110
Semantics, Decoding, Training, Computer science, Kernel, Logistics,
Cybernetics, Common hamming space, cross-view retrieval,
multiview representation learning
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Robust and discrete matrix factorization hashing for cross-modal
retrieval,
PR(122), 2022, pp. 108343.
Elsevier DOI
2112
Cross-modal retrieval, Hashing, Autoencoder,
Discrete optimization,
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Xu, T.Y.[Tian-Yang],
Kittler, J.V.[Josef V.],
Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval,
SMCS(52), No. 11, November 2022, pp. 7014-7026.
IEEE DOI
2210
Semantics, Binary codes, Hash functions, Optimization,
Quantization (signal), Task analysis, Costs, Cross-modal retrieval,
hashing
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Liu, Z.[Zhen],
Yu, J.[Jun],
Kittler, J.V.[Josef V.],
Fast Discrete Cross-Modal Hashing Based on Label Relaxation and
Matrix Factorization,
ICPR21(4845-4850)
IEEE DOI
2105
Technological innovation, Quantization (signal), Databases,
Instruments, Semantics, Binary codes, Media
BibRef
Zhang, L.[Li],
Wu, X.Q.[Xiang-Qian],
Multi-task framework based on feature separation and reconstruction
for cross-modal retrieval,
PR(122), 2022, pp. 108217.
Elsevier DOI
2112
Cross-modal retrieval, Feature separation,
Image reconstruction, Text reconstruction
BibRef
Liu, F.[Fangcen],
Gao, C.Q.[Chen-Qiang],
Sun, Y.Q.[Yong-Qing],
Zhao, Y.[Yue],
Yang, F.[Feng],
Qin, A.[Anyong],
Meng, D.Y.[De-Yu],
Infrared and Visible Cross-Modal Image Retrieval Through Shared
Features,
CirSysVideo(31), No. 11, November 2021, pp. 4485-4496.
IEEE DOI
2112
Image retrieval, Feature extraction, Task analysis, Imaging,
Semantics, Image color analysis, Cameras,
maximum mean discrepancy
BibRef
Wang, C.Y.[Chao-Yi],
Li, L.[Liang],
Yan, C.G.[Cheng-Gang],
Wang, Z.[Zhan],
Sun, Y.Q.[Yao-Qi],
Zhang, J.Y.[Ji-Yong],
Cross-modal semantic correlation learning by Bi-CNN network,
IET-IPR(15), No. 14, 2021, pp. 3674-3684.
DOI Link
2112
BibRef
Chakraborty, B.[Bela],
Wang, P.[Peng],
Wang, L.[Lei],
Inter-Modality Fusion Based Attention for Zero-Shot Cross-Modal
Retrieval,
ICIP21(2648-2652)
IEEE DOI
2201
Training, Heating systems, Image processing, Semantics, Pipelines,
MIMICs, Zero-shot Learning, Inter-Modality Fusion,
Cross-modal Retrieval
BibRef
Zhang, P.F.[Peng-Fei],
Li, Y.[Yang],
Huang, Z.[Zi],
Xu, X.S.[Xin-Shun],
Aggregation-Based Graph Convolutional Hashing for Unsupervised
Cross-Modal Retrieval,
MultMed(24), 2022, pp. 466-479.
IEEE DOI
2202
Semantics, Convolutional codes, Binary codes, Convolution,
Measurement, Feature extraction, Sparse matrices, Multimodal,
graph convolutional networks
BibRef
Shin, A.[Andrew],
Ishii, M.[Masato],
Narihira, T.[Takuya],
Perspectives and Prospects on Transformer Architecture for Cross-Modal
Tasks with Language and Vision,
IJCV(130), No. 2, February 2022, pp. 435-454.
Springer DOI
2202
BibRef
Ji, Z.[Zhong],
Wang, H.R.[Hao-Ran],
Han, J.G.[Jun-Gong],
Pang, Y.W.[Yan-Wei],
SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text
Retrieval,
Cyber(52), No. 2, February 2022, pp. 1086-1097.
IEEE DOI
2202
Visualization, Semantics, Feature extraction, Correlation,
Task analysis, Extraterrestrial measurements, Deep learning,
vision and language
BibRef
Ma, J.J.[Jing-Jing],
Shi, D.[Duanpeng],
Tang, X.[Xu],
Zhang, X.R.[Xiang-Rong],
Jiao, L.C.[Li-Cheng],
Dual Modality Collaborative Learning for Cross-Source Remote Sensing
Retrieval,
RS(14), No. 6, 2022, pp. xx-yy.
DOI Link
2204
BibRef
Huang, Y.[Yan],
Wang, J.D.[Jing-Dong],
Wang, L.[Liang],
Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory,
PAMI(44), No. 6, June 2022, pp. 2968-2983.
IEEE DOI
2205
BibRef
Earlier: A1, A3, Only:
ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence
Matching,
ICCV19(5773-5782)
IEEE DOI
2004
Adaptation models, Task analysis, Pattern matching, Logic gates,
Visualization, Image color analysis, Data models,
similarity gated fusion.
image matching, learning (artificial intelligence),
storage management, few-shot content, sentence matching tasks,
Micromechanical devices
BibRef
Xu, X.[Xing],
Lin, K.Y.[Kai-Yi],
Yang, Y.[Yang],
Hanjalic, A.[Alan],
Shen, H.T.[Heng Tao],
Joint Feature Synthesis and Embedding:
Adversarial Cross-Modal Retrieval Revisited,
PAMI(44), No. 6, June 2022, pp. 3030-3047.
IEEE DOI
2205
Art, Generative adversarial networks, Training,
Correlation, Visualization, Standards, Cross-modal retrieval,
knowledge transfer
BibRef
Duan, Y.X.[You-Xiang],
Chen, N.[Ning],
Zhang, P.Y.[Pei-Ying],
Kumar, N.[Neeraj],
Chang, L.[Lunjie],
Wen, W.[Wu],
MS2GAH: Multi-label semantic supervised graph attention hashing for
robust cross-modal retrieval,
PR(128), 2022, pp. 108676.
Elsevier DOI
2205
Cross-modal retrieval, Deep hashing, Graph attention network
BibRef
Hamroun, M.[Mohamed],
Tamine, K.[Karim],
Crespin, B.[Benoît],
Multimodal Video Indexing (MVI): A New Method Based on Machine Learning
and Semi-Automatic Annotation on Large Video Collections,
IJIG(22), No. 2, April 2022, pp. 2250022.
DOI Link
2205
BibRef
Parida, K.K.[Kranti Kumar],
Sharma, G.[Gaurav],
Discriminative semantic transitive consistency for cross-modal
learning,
CVIU(219), 2022, pp. 103404.
Elsevier DOI
2205
Cross-modal retrieval, Distributional matching
BibRef
Xu, L.M.[Li-Ming],
Zeng, X.H.[Xian-Hua],
Zheng, B.[Bochuan],
Li, W.S.[Wei-Sheng],
Multi-Manifold Deep Discriminative Cross-Modal Hashing for Medical
Image Retrieval,
IP(31), 2022, pp. 3371-3385.
IEEE DOI
2205
Codes, Manifolds, Semantics, Correlation, Image retrieval,
Medical diagnostic imaging, Data models, Cross-modal hashing,
weak discriminability
BibRef
Song, X.[Xue],
Chen, J.J.[Jing-Jing],
Wu, Z.[Zuxuan],
Jiang, Y.G.[Yu-Gang],
Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval,
MultMed(24), 2022, pp. 2914-2923.
IEEE DOI
2206
Visualization, Semantics, Bit error rate, Encoding, Task analysis,
Feature extraction, Microphones, Cross-modal retrieval,
cross-modal learning
BibRef
Ma, X.H.[Xin-Hong],
Yang, X.S.[Xiao-Shan],
Gao, J.Y.[Jun-Yu],
Xu, C.S.[Chang-Sheng],
The Model May Fit You: User-Generalized Cross-Modal Retrieval,
MultMed(24), 2022, pp. 2998-3012.
IEEE DOI
2206
Data models, Task analysis, Adaptation models, Training,
Benchmark testing, Pediatrics, Bridges, cross-modal retrieval,
meta-learning
BibRef
Yang, F.[Fan],
Liu, Y.F.[Yu-Feng],
Ding, X.J.[Xiao-Jian],
Ma, F.M.[Fu-Min],
Cao, J.[Jie],
Asymmetric cross-modal hashing with high-level semantic similarity,
PR(130), 2022, pp. 108823.
Elsevier DOI
2206
Cross-modal retrieval, Hashing, Similarity search, Supervised, Optimization
BibRef
Shan, W.[Wei],
Huang, D.[Dan],
Wang, J.T.[Jiang-Tao],
Zou, F.[Feng],
Li, S.[Suwen],
Self-Attention based fine-grained cross-media hybrid network,
PR(130), 2022, pp. 108748.
Elsevier DOI
2206
Fine-Grained, Cross-Media, Retrieval, Attention
BibRef
Zhang, D.L.[Dong-Lin],
Wu, X.J.[Xiao-Jun],
Scalable Discrete Matrix Factorization and Semantic Autoencoder for
Cross-Media Retrieval,
Cyber(52), No. 7, July 2022, pp. 5947-5960.
IEEE DOI
2207
Semantics, Hash functions, Binary codes, Quantization (signal),
Training data, Training, Task analysis, Autoencoder, hashing
BibRef
Qian, S.S.[Sheng-Sheng],
Xue, D.Z.[Di-Zhan],
Fang, Q.[Quan],
Xu, C.S.[Chang-Sheng],
Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal
Retrieval,
MultMed(24), 2022, pp. 3520-3532.
IEEE DOI
2207
Correlation, Semantics, Task analysis, Adaptation models,
Adaptive systems, Birds, Oceans, Cross-modal retrieval,
Graph convolutional networks
BibRef
Wang, Y.[Yunbo],
Peng, Y.X.[Yu-Xin],
MARS: Learning Modality-Agnostic Representation for Scalable
Cross-Media Retrieval,
CirSysVideo(32), No. 7, July 2022, pp. 4765-4777.
IEEE DOI
2207
Semantics, Correlation, Training, Cats, Automobiles, Transforms, Media,
Multi-modality learning, cross-media retrieval,
similarity retrieval
BibRef
Wang, L.[Lu],
Zareapoor, M.[Masoumeh],
Yang, J.[Jie],
Zheng, Z.L.[Zhong-Long],
Asymmetric Correlation Quantization Hashing for Cross-Modal Retrieval,
MultMed(24), 2022, pp. 3665-3678.
IEEE DOI
2208
Semantics, Quantization (signal), Correlation, Binary codes,
Databases, Optimization, Hash functions,
Compositional quantization
BibRef
Qin, J.Y.[Jian-Yang],
Fei, L.[Lunke],
Zhang, Z.[Zheng],
Wen, J.[Jie],
Xu, Y.[Yong],
Zhang, D.[David],
Joint Specifics and Consistency Hash Learning for Large-Scale
Cross-Modal Retrieval,
IP(31), 2022, pp. 5343-5358.
IEEE DOI
2208
Binary codes, Semantics, Hash functions, Feature extraction,
Collaboration, Training, Optimization, Learning to hash,
large-scale similarity searching
BibRef
Shi, Y.F.[Yu-Feng],
Zhao, Y.[Yue],
Liu, X.[Xin],
Zheng, F.[Feng],
Ou, W.H.[Wei-Hua],
You, X.G.[Xin-Ge],
Peng, Q.[Qinmu],
Deep Adaptively-Enhanced Hashing With Discriminative Similarity
Guidance for Unsupervised Cross-Modal Retrieval,
CirSysVideo(32), No. 10, October 2022, pp. 7255-7268.
IEEE DOI
2210
Hash functions, Optimization, Codes, Semantics, Estimation,
Computer science, Annotations, Cross-modal retrieval,
optimization strategy
BibRef
Liu, Z.[Zhi],
Zhao, F.Y.[Fang-Yuan],
Zhang, M.M.[Meng-Meng],
An Efficient Multimodal Aggregation Network for Video-Text Retrieval,
IEICE(E105-D), No. 10, October 2022, pp. 1825-1828.
WWW Link.
2210
BibRef
Guo, D.J.[Dong-Jin],
Su, X.M.[Xiao-Ming],
Lian, Y.[Yahong],
Liu, L.M.[Li-Min],
Wang, H.B.[Hai-Bo],
Two-stage partial image-text clustering (TPIT-C),
IET-CV(16), No. 8, 2022, pp. 694-708.
DOI Link
2210
BibRef
Wang, S.[Song],
Zhao, H.[Huan],
Li, K.Q.[Ke-Qin],
Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text
Search,
CirSysVideo(32), No. 11, November 2022, pp. 8022-8036.
IEEE DOI
2211
Semantics, Codes, Optimization, Training, Task analysis,
Matrix converters, Hash functions, Cross-modal image-text search,
supervised hashing
BibRef
Liu, X.H.[Xing-Hua],
Cao, G.T.[Gui-Tao],
Lin, Q.B.[Qiu-Bin],
Cao, W.M.[Wen-Ming],
Adaptive weight multi-channel center similar deep hashing,
JVCIR(89), 2022, pp. 103642.
Elsevier DOI
2212
Multi-channel, Center similar, Multimodal retrieval, Deep cross-modal hashing
BibRef
Lan, R.[Rushi],
Tan, Y.[Yu],
Wang, X.Q.[Xiao-Qin],
Liu, Z.B.[Zhen-Bing],
Luo, X.N.[Xiao-Nan],
Label Guided Discrete Hashing for Cross-Modal Retrieval,
ITS(23), No. 12, December 2022, pp. 25236-25248.
IEEE DOI
2212
Codes, Manifolds, Semantics, Training, Binary codes, Task analysis,
Sparse matrices, Cross-modal retrieval, manifold embedding, balanced matrix
BibRef
Wang, Y.X.[Yong-Xin],
Chen, Z.D.[Zhen-Duo],
Luo, X.[Xin],
Xu, X.S.[Xin-Shun],
A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval,
CirSysVideo(32), No. 12, December 2022, pp. 8822-8836.
IEEE DOI
2212
Codes, Semantics, Encoding, Task analysis, Optimization,
Streaming media, Sparse matrices, Sparse hashing, fine-grained similarity
BibRef
Jin, M.[Ming],
Zhang, H.X.[Hua-Xiang],
Zhu, L.[Lei],
Sun, J.[Jiande],
Liu, L.[Li],
Video Sampled Frame Category Aggregation and Consistent
Representation for Cross-Modal Retrieval,
CirSysVideo(33), No. 2, February 2023, pp. 909-919.
IEEE DOI
2302
Feature extraction, Semantics, Training, Convolution, Dogs,
Network architecture, Video and text cross-modal retrieval,
video internal frame aggregation loss module
BibRef
Liao, L.[Lei],
Yang, M.[Meng],
Zhang, B.[Bob],
Deep Supervised Dual Cycle Adversarial Network for Cross-Modal
Retrieval,
CirSysVideo(33), No. 2, February 2023, pp. 920-934.
IEEE DOI
2302
Semantics, Generative adversarial networks, Feature extraction,
Task analysis, Media, Deep learning, Neural networks,
deep supervised learning
BibRef
Su, M.Y.[Ming-Yue],
Gu, G.H.[Guang-Hua],
Ren, X.[Xianlong],
Fu, H.[Hao],
Zhao, Y.[Yao],
Semi-Supervised Knowledge Distillation for Cross-Modal Hashing,
MultMed(25), 2023, pp. 662-675.
IEEE DOI
2302
Semantics, Knowledge engineering, Codes, Predictive models,
Data models, Cows, Bridges, Cross-modal retrieval, triplet ranking loss
BibRef
Gong, Y.[Yan],
Cosma, G.[Georgina],
Improving visual-semantic embeddings by learning
semantically-enhanced hard negatives for cross-modal information
retrieval,
PR(137), 2023, pp. 109272.
Elsevier DOI
2302
Visual semantic embedding network, Cross-modal,
Information retrieval, Hard negatives
BibRef
Li, W.H.[Wen-Hui],
Wang, Y.[Yan],
Su, Y.T.[Yu-Ting],
Li, X.Y.[Xuan-Ya],
Liu, A.A.[An-An],
Zhang, Y.D.[Yong-Dong],
Multi-Scale Fine-Grained Alignments for Image and Sentence Matching,
MultMed(25), 2023, pp. 543-556.
IEEE DOI
2302
Semantics, Visualization, Dogs, Mouth, Task analysis, Feature extraction,
Bridges, Bi-directional aggregations, multi-scale alignments
BibRef
Ou, W.H.[Wei-Hua],
Deng, J.X.[Jia-Xin],
Zhang, L.[Lei],
Gou, J.P.[Jian-Ping],
Zhou, Q.[Quan],
Cross-Modal Generation and Pair Correlation Alignment Hashing,
ITS(24), No. 3, March 2023, pp. 3018-3026.
IEEE DOI
2303
Semantics, Feature extraction, Correlation, Codes, Transformers,
Generative adversarial networks, Data mining,
cross-modal interaction
BibRef
Wang, D.[Di],
Zhang, C.P.[Cai-Ping],
Wang, Q.[Quan],
Tian, Y.[Yumin],
He, L.[Lihuo],
Zhao, L.[Lin],
Hierarchical Semantic Structure Preserving Hashing for Cross-Modal
Retrieval,
MultMed(25), 2023, pp. 1217-1229.
IEEE DOI
2305
Semantics, Codes, Binary codes, Representation learning, Correlation,
Hash functions, Feature extraction, Cross-modal retrieval,
hierarchical learning
BibRef
Hu, P.[Peng],
Huang, Z.Y.[Zhen-Yu],
Peng, D.Z.[De-Zhong],
Wang, X.[Xu],
Peng, X.[Xi],
Cross-Modal Retrieval With Partially Mismatched Pairs,
PAMI(45), No. 8, August 2023, pp. 9595-9610.
IEEE DOI
2307
Semantics, Force, Cognition, Visualization, Upper bound,
Stability analysis, Robustness,
mismatched pairs
BibRef
Liu, Y.X.[Ya-Xin],
Wu, J.L.[Jian-Long],
Qu, L.[Leigang],
Gan, T.[Tian],
Yin, J.H.[Jian-Hua],
Nie, L.Q.[Li-Qiang],
Self-Supervised Correlation Learning for Cross-Modal Retrieval,
MultMed(25), 2023, pp. 2851-2863.
IEEE DOI
2307
Correlation, Semantics, Mutual information, Kernel, Unsupervised learning,
Supervised learning, mutual information estimation
BibRef
Wang, B.H.[Ben-Hui],
Zhang, H.X.[Hua-Xiang],
Zhu, L.[Lei],
Nie, L.Q.[Li-Qiang],
Liu, L.[Li],
Multi-level adversarial attention cross-modal hashing,
SP:IC(117), 2023, pp. 117017.
Elsevier DOI
2308
Cross-modal retrieval, Adversarial Learning, Attentional mechanism, Hashing
BibRef
Sun, C.[Chunpu],
Zhang, H.X.[Hua-Xiang],
Liu, L.[Li],
Liu, D.M.[Dong-Mei],
Wang, L.[Lin],
Multi-label adversarial fine-grained cross-modal retrieval,
SP:IC(117), 2023, pp. 117018.
Elsevier DOI
2308
Common representation, Transformer, Adversarial learning, Cross-modal retrieval
BibRef
Song, D.[Dan],
Ling, Y.T.[Yu-Ting],
Li, T.[Tianbao],
Wang, T.[Teng],
Li, X.[Xuanya],
Hierarchical deep semantic alignment for cross-domain 3D model
retrieval,
JVCIR(95), 2023, pp. 103895.
Elsevier DOI
2309
3D model retrieval, Unsupervised domain adaptation, Representation learning
BibRef
Zhao, Y.[Yang],
Zhu, Y.Z.[Ya-Zhou],
Liao, S.[Shengbin],
Ye, Q.[Qiaolin],
Zhang, H.F.[Hao-Feng],
Class Concentration with Twin Variational Autoencoders for Unsupervised
Cross-modal Hashing,
ACCV22(VI:235-251).
Springer DOI
2307
BibRef
Fragomeni, A.[Adriano],
Wray, M.[Michael],
Damen, D.[Dima],
Contra: (con)text (tra)nsformer for Cross-modal Video Retrieval,
ACCV22(IV:451-468).
Springer DOI
2307
BibRef
Zheng, Y.C.[Yuan-Chao],
Zhang, X.W.[Xiao-Wei],
Heterogeneous Interactive Learning Network for Unsupervised Cross-modal
Retrieval,
ACCV22(IV:692-707).
Springer DOI
2307
BibRef
Zhao, Y.[Yang],
Yu, J.G.[Jia-Guo],
Liao, S.[Shengbin],
Zhang, Z.[Zheng],
Zhang, H.F.[Hao-Feng],
From Sparse to Dense: Semantic Graph Evolutionary Hashing for
Unsupervised Cross-Modal Retrieval,
ACCV22(IV:521-536).
Springer DOI
2307
BibRef
Arnold, R.[Rahel],
Sauter, L.[Loris],
Schuldt, H.[Heiko],
Free-Form Multi-Modal Multimedia Retrieval (4MR),
MMMod23(I: 678-683).
Springer DOI
2304
BibRef
Xuan, H.[Hong],
Chen, X.S.[Xi Stephen],
Dissecting Deep Metric Learning Losses for Image-Text Retrieval,
WACV23(2163-2172)
IEEE DOI
2302
Measurement, Training, Analytical models, Semantics,
Space exploration, Task analysis, visual reasoning
BibRef
Ge, X.[Xuri],
Chen, F.[Fuhai],
Xu, S.[Songpei],
Tao, F.[Fuxiang],
Jose, J.M.[Joemon M.],
Cross-modal Semantic Enhanced Interaction for Image-Sentence
Retrieval,
WACV23(1022-1031)
IEEE DOI
2302
Measurement, Representation learning, Visualization, Correlation,
Computational modeling, Semantics,
Algorithms: Vision + language and/or other modalities
BibRef
Jawade, B.[Bhavin],
Mohan, D.D.[Deen Dayal],
Ali, N.M.[Naji Mohamed],
Setlur, S.[Srirangaraj],
Govindaraju, V.[Venu],
NAPReg: Nouns As Proxies Regularization for Semantically Aware
Cross-Modal Embeddings,
WACV23(1135-1144)
IEEE DOI
2302
Training, Measurement, Visualization, Codes, Databases, Semantics,
Algorithms: Vision + language and/or other modalities
BibRef
Nakatsuka, T.[Takayuki],
Hamasaki, M.[Masahiro],
Goto, M.[Masataka],
Content-Based Music-Image Retrieval Using Self- and Cross-Modal
Feature Embedding Memory,
WACV23(2173-2183)
IEEE DOI
2302
Training, Measurement, Art, Multiple signal classification,
Task analysis
BibRef
Chen, Y.X.[Yu-Xiao],
Yuan, J.B.[Jian-Bo],
Zhao, L.[Long],
Chen, T.L.[Tian-Lang],
Luo, R.[Rui],
Davis, L.[Larry],
Metaxas, D.N.[Dimitris N.],
More Than Just Attention: Improving Cross-Modal Attentions with
Contrastive Constraints for Image-Text Matching,
WACV23(4421-4429)
IEEE DOI
2302
Training, Measurement, Visualization, Annotations,
Computational modeling,
Algorithms: Vision + language and/or other modalities
BibRef
Agarwal, A.[Aishwarya],
Karanam, S.[Srikrishna],
Srinivasan, B.V.[Balaji Vasan],
Banerjee, B.[Biplab],
Contrastive Learning of Semantic Concepts for Open-set Cross-domain
Retrieval,
WACV23(4104-4113)
IEEE DOI
2302
Training, Technological innovation, Semantics, Natural languages,
Image retrieval, Feature extraction
BibRef
Yang, Y.[Yulou],
Shen, H.[Hao],
Yang, M.[Ming],
Relation-Guided Network for Image-Text Retrieval,
ICIP22(1856-1860)
IEEE DOI
2211
Transformers, Feature extraction, Cognition, Data mining,
Image-text retrieval, asymmetric structure, relation-guided
BibRef
Sumbul, G.[Gencer],
Müller, M.[Markus],
Demir, B.[Begüm],
A Novel Self-Supervised Cross-Modal Image Retrieval Method in Remote
Sensing,
ICIP22(2426-2430)
IEEE DOI
2211
Training, Codes, Image retrieval, Search problems, Sensors,
Reliability, Cross-modal image retrieval, deep learning, remote sensing
BibRef
Wang, H.[Hu],
Zhang, J.P.[Jian-Peng],
Chen, Y.H.[Yuan-Hong],
Ma, C.B.[Cong-Bo],
Avery, J.[Jodie],
Hull, L.[Louise],
Carneiro, G.[Gustavo],
Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network
Prediction,
ECCV22(XXXVII:200-217).
Springer DOI
2211
BibRef
de Almeida, L.B.[Lucas Barbosa],
Valem, L.P.[Lucas Pascotti],
Pedronette, D.C.G.[Daniel Carlos Guimarães],
Graph Convolutional Networks and Manifold Ranking for Multimodal
Video Retrieval,
ICIP22(2811-2815)
IEEE DOI
2211
Training, Manifolds, Deep learning, Transfer learning,
Feature extraction, Content-based retrieval, Manifold learning,
rank aggregation
BibRef
Liang, T.[Tao],
Lin, G.S.[Guo-Sheng],
Wan, M.Y.[Ming-Yang],
Li, T.R.[Tian-Rui],
Ma, G.J.[Guo-Jun],
Lv, F.M.[Feng-Mao],
Expanding Large Pre-trained Unimodal Models with Multimodal
Information Injection for Image-Text Multimodal Classification,
CVPR22(15471-15480)
IEEE DOI
2210
Deep learning, Visualization, Image recognition, Correlation,
Bit error rate, Market research, Vision+language
BibRef
Yang, J.H.[Jin-Hui],
Chen, X.Y.[Xian-Yu],
Jiang, M.[Ming],
Chen, S.[Shi],
Wang, L.[Louis],
Zhao, Q.[Qi],
VisualHow: Multimodal Problem Solving,
CVPR22(15606-15616)
IEEE DOI
2210
Training, Visualization, Technological innovation, Annotations,
Natural language processing, Pattern recognition,
Datasets and evaluation
BibRef
Girdhar, R.[Rohit],
Singh, M.[Mannat],
Ravi, N.[Nikhila],
van der Maaten, L.[Laurens],
Joulin, A.[Armand],
Misra, I.[Ishan],
Omnivore: A Single Model for Many Visual Modalities,
CVPR22(16081-16091)
IEEE DOI
2210
Visualization, Solid modeling, Computational modeling,
Transformers, Data models, Action and event recognition
BibRef
Ma, M.M.[Meng-Meng],
Ren, J.[Jian],
Zhao, L.[Long],
Testuggine, D.[Davide],
Peng, X.[Xi],
Are Multimodal Transformers Robust to Missing Modality?,
CVPR22(18156-18165)
IEEE DOI
2210
Training, Benchmark testing, Transformers, Multitasking,
Search problems, Data models, Vision+language, Machine learning
BibRef
Han, Z.B.[Zong-Bo],
Yang, F.[Fan],
Huang, J.Z.[Jun-Zhou],
Zhang, C.Q.[Chang-Qing],
Yao, J.H.[Jian-Hua],
Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal
Classification,
CVPR22(20675-20685)
IEEE DOI
2210
Heuristic algorithms, Estimation, Classification algorithms,
Pattern recognition, Medical diagnosis, Machine learning
BibRef
Gupta, V.[Vikram],
Mittal, T.[Trisha],
Mathur, P.[Puneet],
Mishra, V.[Vaibhav],
Maheshwari, M.[Mayank],
Bera, A.[Aniket],
Mukherjee, D.[Debdoot],
Manocha, D.[Dinesh],
3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social
Media Short Videos,
CVPR22(21032-21043)
IEEE DOI
2210
Social networking (online), Semantics, Media, Market research,
Pattern recognition, Task analysis, Datasets and evaluation,
Video analysis and understanding
BibRef
Bogolin, S.V.[Simion-Vlad],
Croitoru, I.[Ioana],
Jin, H.L.[Hai-Lin],
Liu, Y.[Yang],
Albanie, S.[Samuel],
Cross Modal Retrieval with Querybank Normalisation,
CVPR22(5184-5195)
IEEE DOI
2210
Training, Codes, Computational modeling,
Benchmark testing, Pattern recognition, Vision + language, retrieval
BibRef
Yang, E.[Erkun],
Yao, D.R.[Dong-Ren],
Liu, T.L.[Tong-Liang],
Deng, C.[Cheng],
Mutual Quantization for Cross-Modal Search with Noisy Labels,
CVPR22(7541-7550)
IEEE DOI
2210
Training, Representation learning, Quantization (signal), Codes,
Training data, Benchmark testing, Recognition: detection,
Representation learning
BibRef
Neculai, A.[Andrei],
Chen, Y.[Yanbei],
Akata, Z.[Zeynep],
Probabilistic Compositional Embeddings for Multimodal Image Retrieval,
MULA22(4546-4556)
IEEE DOI
2210
Visualization, Codes, Computational modeling,
Image retrieval, Semantics
BibRef
Couairon, G.[Guillaume],
Douze, M.[Matthijs],
Cord, M.[Matthieu],
Schwenk, H.[Holger],
Embedding Arithmetic of Multimodal Queries for Image Retrieval,
ODRUM22(4946-4954)
IEEE DOI
2210
Conferences, Semantics, Image retrieval, Lasers, Transforms,
Image representation
BibRef
Sun, C.C.[Chang-Chang],
Latapie, H.[Hugo],
Liu, G.[Gaowen],
Yan, Y.[Yan],
Deep Normalized Cross-Modal Hashing with Bi-Direction Relation
Reasoning,
ODRUM22(4937-4945)
IEEE DOI
2210
Codes, Computational modeling, Semantics,
Bidirectional control, Benchmark testing
BibRef
Li, Y.H.[Yi-Hao],
Yu, J.[Jun],
Cai, Z.[Zhongpeng],
Pan, Y.[Yuwen],
Cross-modal Target Retrieval for Tracking by Natural Language,
ODRUM22(4927-4936)
IEEE DOI
2210
Visualization, Target tracking, Natural languages, Semantics,
Switches, Benchmark testing
BibRef
Thomas, C.[Christopher],
Kovashka, A.[Adriana],
Emphasizing Complementary Samples for Non-literal Cross-modal
Retrieval,
MULA22(4631-4640)
IEEE DOI
2210
Spatial diversity, Semantics, Channel estimation,
Performance gain, Benchmark testing
BibRef
Xu, B.[Bocheng],
Xiong, Y.H.[Yi-Hua],
Zhang, R.[Rui],
Feng, Y.[Yanyi],
Wu, H.F.[Hai-Feng],
Natural Language-Based Vehicle Retrieval with Explicit Cross-Modal
Representation Learning,
AICity22(3141-3148)
IEEE DOI
2210
Representation learning, Visualization, Semantics, Urban areas,
Feature extraction, Robustness
BibRef
Shvetsova, N.[Nina],
Chen, B.[Brian],
Rouditchenko, A.[Andrew],
Thomas, S.[Samuel],
Kingsbury, B.[Brian],
Feris, R.[Rogerio],
Harwath, D.[David],
Glass, J.[James],
Kuehne, H.[Hilde],
Everything at Once - Multi-modal Fusion Transformer for Video
Retrieval,
CVPR22(19988-19997)
IEEE DOI
2210
Location awareness, Training, Codes, Fuses, Benchmark testing,
Transformers, Action and event recognition, Video analysis and understanding
BibRef
Andonian, A.[Alex],
Chen, S.X.[Shi-Xing],
Hamid, R.[Raffay],
Robust Cross-Modal Representation Learning with Progressive
Self-Distillation,
CVPR22(16409-16420)
IEEE DOI
2210
Training, Representation learning, Computational modeling,
Redundancy, Benchmark testing, Robustness, Noise measurement,
Representation learning
BibRef
Lu, H.Y.[Hao-Yu],
Fei, N.[Nanyi],
Huo, Y.Q.[Yu-Qi],
Gao, Y.Z.[Yi-Zhao],
Lu, Z.W.[Zhi-Wu],
Wen, J.R.[Ji-Rong],
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for
Cross-Modal Retrieval,
CVPR22(15671-15680)
IEEE DOI
2210
Visualization, Collaboration, Streaming media,
Probability distribution, Pattern recognition, Task analysis,
Video analysis and understanding
BibRef
Abdelnabi, S.[Sahar],
Hasan, R.[Rakibul],
Fritz, M.[Mario],
Open-Domain, Content-based, Multi-modal Fact-checking of
Out-of-Context Images via Online Resources,
CVPR22(14920-14929)
IEEE DOI
2210
Visualization, Machine vision, MIMICs, Manuals,
Cognition, retrieval, Vision + language,
Recognition: detection
BibRef
Wang, Y.[Yun],
Zhang, T.[Tong],
Zhang, X.[Xueya],
Cui, Z.[Zhen],
Huang, Y.[Yuge],
Shen, P.C.[Peng-Cheng],
Li, S.X.[Shao-Xin],
Yang, J.[Jian],
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval,
ICCV21(1793-1802)
IEEE DOI
2203
Training, Representation learning, Analytical models, Dictionaries,
Correlation, Computational modeling, Vision + language,
BibRef
Cai, G.[Guanyu],
Zhang, J.[Jun],
Jiang, X.Y.[Xin-Yang],
Gong, Y.F.[Yi-Fei],
He, L.[Lianghua],
Yu, F.[Fufu],
Peng, P.[Pai],
Guo, X.W.[Xiao-Wei],
Huang, F.Y.[Fei-Yue],
Sun, X.[Xing],
Ask amp;Confirm: Active Detail Enriching for Cross-Modal Retrieval
with Partial Query,
ICCV21(1815-1824)
IEEE DOI
2203
Training, Codes, Computational modeling, Image retrieval,
Search problems, Robustness, Vision + language, Image and video retrieval
BibRef
Wen, K.Y.[Ke-Yu],
Xia, J.[Jin],
Huang, Y.Y.[Yuan-Yuan],
Li, L.Y.[Lin-Yang],
Xu, J.Y.[Jia-Yan],
Shao, J.[Jie],
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for
Vision-Language Representation,
ICCV21(2188-2197)
IEEE DOI
2203
Visualization, Codes, Computational modeling, Image retrieval,
Semantics, Transformers, Vision + language,
Representation learning
BibRef
Patrick, M.[Mandela],
Huang, P.Y.[Po-Yao],
Misra, I.[Ishan],
Metze, F.[Florian],
Vedaldi, A.[Andrea],
Asano, Y.M.[Yuki M.],
Henriques, J.[João],
Space-Time Crop & Attend:
Improving Cross-modal Video Representation Learning,
ICCV21(10540-10552)
IEEE DOI
2203
Representation learning, Costs, Codes, Computational modeling, Crops,
Image representation, Representation learning, Vision + other modalities
BibRef
Lin, M.X.[Ming-Xian],
Yang, J.[Jie],
Wang, H.[He],
Lai, Y.K.[Yu-Kun],
Jia, R.[Rongfei],
Zhao, B.Q.[Bin-Qiang],
Gao, L.[Lin],
Single Image 3D Shape Retrieval via Cross-Modal Instance and Category
Contrastive Learning,
ICCV21(11385-11395)
IEEE DOI
2203
Representation learning, Deep learning, Shape,
Image color analysis, Pipelines, Gray-scale,
3D from a single image and shape-from-x
BibRef
Zhan, X.L.[Xun-Lin],
Wu, Y.X.[Yang-Xin],
Dong, X.[Xiao],
Wei, Y.C.[Yun-Chao],
Lu, M.L.[Min-Long],
Zhang, Y.C.[Yi-Chi],
Xu, H.[Hang],
Liang, X.D.[Xiao-Dan],
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval
via Cross-Modal Pretraining,
ICCV21(11762-11771)
IEEE DOI
2203
Industries, Measurement, Codes, Transformers, Solids,
Electronic commerce, Image and video retrieval, Vision + language
BibRef
Changpinyo, S.[Soravit],
Pont-Tuset, J.[Jordi],
Ferrari, V.[Vittorio],
Soricut, R.[Radu],
Telling the What while Pointing to the Where:
Multimodal Queries for Image Retrieval,
ICCV21(12116-12126)
IEEE DOI
2203
Location awareness, Error analysis, Computational modeling,
Image retrieval, Natural languages, Mice,
Vision + other modalities
BibRef
Gabeur, V.[Valentin],
Nagrani, A.[Arsha],
Sun, C.[Chen],
Alahari, K.[Karteek],
Schmid, C.[Cordelia],
Masking Modalities for Cross-modal Video Retrieval,
WACV22(2111-2120)
IEEE DOI
2202
Manuals, Benchmark testing, Motion pictures,
Natural language processing, Proposals, Speech processing, Scene Understanding
BibRef
Galanopoulos, D.[Damianos],
Mezaris, V.[Vasileios],
Hard-Negatives or Non-Negatives? A Hard-Negative Selection Strategy
for Cross-Modal Retrieval Using the Improved Marginal Ranking Loss,
ViRaL21(2312-2316)
IEEE DOI
2112
Training, Computational modeling, Network architecture
BibRef
Jing, L.L.[Long-Long],
Vahdani, E.[Elahe],
Tan, J.X.[Jia-Xing],
Tian, Y.L.[Ying-Li],
Cross-Modal Center Loss for 3D Cross-Modal Retrieval,
CVPR21(3141-3150)
IEEE DOI
2111
Solid modeling,
Computational modeling, Metadata, Feature extraction, Pattern recognition
BibRef
Hu, P.[Peng],
Peng, X.[Xi],
Zhu, H.Y.[Hong-Yuan],
Zhen, L.[Liangli],
Lin, J.[Jie],
Learning Cross-Modal Retrieval with Noisy Labels,
CVPR21(5399-5409)
IEEE DOI
2111
Costs, Annotations, Interference,
Pattern recognition, Noise measurement, Labeling
BibRef
Almazán, J.[Jon],
Ko, B.[Byungsoo],
Gu, G.[Geonmo],
Larlus, D.[Diane],
Kalantidis, Y.[Yannis],
Granularity-Aware Adaptation for Image Retrieval Over Multiple Tasks,
ECCV22(XIV:389-406).
Springer DOI
2211
BibRef
Chun, S.[Sanghyuk],
Oh, S.J.[Seong Joon],
Sampaio de Rezende, R.[Rafael],
Kalantidis, Y.[Yannis],
Larlus, D.[Diane],
Probabilistic Embeddings for Cross-Modal Retrieval,
CVPR21(8411-8420)
IEEE DOI
2111
Uncertainty, Codes, Databases, Annotations, Tools, Benchmark testing
BibRef
Croitoru, I.[Ioana],
Bogolin, S.V.[Simion-Vlad],
Leordeanu, M.[Marius],
Jin, H.L.[Hai-Lin],
Zisserman, A.[Andrew],
Albanie, S.[Samuel],
Liu, Y.[Yang],
TeachText:
CrossModal Generalized Distillation for Text-Video Retrieval,
ICCV21(11563-11573)
IEEE DOI
2203
Visualization, Codes, Computational modeling, Noise reduction,
Benchmark testing,
Vision + language
BibRef
Liu, Y.[Yang],
Chen, Q.C.[Qing-Chao],
Albanie, S.[Samuel],
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language
Retrieval,
CVPR21(14949-14959)
IEEE DOI
2111
Visualization, Prototypes, Pattern recognition,
Task analysis, Mutual information, Videos
BibRef
Salvador, A.[Amaia],
Gundogdu, E.[Erhan],
Bazzani, L.[Loris],
Donoser, M.[Michael],
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers
and Self-supervised Learning,
CVPR21(15470-15479)
IEEE DOI
2111
Training, Codes, Computational modeling, Semantics,
Machine learning, Transformers
BibRef
Dzabraev, M.[Maksim],
Kalashnikov, M.[Maksim],
Komkov, S.[Stepan],
Petiushko, A.[Aleksandr],
MDMMT: Multidomain Multimodal Transformer for Video Retrieval,
HVU21(3349-3358)
IEEE DOI
2109
Training, Benchmark testing,
Pattern recognition, Task analysis
BibRef
Wang, K.[Kai],
Herranz, L.[Luis],
van de Weijer, J.[Joost],
Continual learning in cross-modal retrieval,
OmniCV21(3623-3633)
IEEE DOI
2109
Training, Visualization, Human intelligence, Focusing, Interference,
Tools, Pattern recognition
BibRef
Mafla, A.[Andrés],
Rezende, R.S.[Rafael S.],
Gómez, L.[Lluís],
Larlus, D.[Diane],
Karatzas, D.[Dimosthenis],
StacMR: Scene-Text Aware Cross-Modal Retrieval,
WACV21(2219-2229)
IEEE DOI
2106
Visualization, Annotations,
Computational modeling, Semantics, Task analysis
BibRef
Feng, C.T.[Chang-Ting],
Li, D.G.[Da-Gang],
Zheng, J.W.[Jing-Wei],
Improving Supervised Cross-modal Retrieval with Semantic Graph
Embedding,
MMMod21(I:187-199).
Springer DOI
2106
BibRef
Wen, Z.Y.[Zhen-Yu],
Feng, A.[Aimin],
Deep Centralized Cross-modal Retrieval,
MMMod21(I:443-455).
Springer DOI
2106
BibRef
Li, Z.X.[Zhi-Xin],
Ling, F.[Feng],
Xu, C.S.[Chuan-Sheng],
Zhang, C.L.[Can-Long],
Ma, H.F.[Hui-Fang],
Cross-Media Hash Retrieval Using Multi-Head Attention Network,
ICPR21(1290-1297)
IEEE DOI
2105
Correlation, Semantics, Neural networks, Media,
Extraterrestrial measurements, cross-media retrieval
BibRef
Jin, C.[Cong],
Zhang, T.[Tian],
Liu, S.X.[Shou-Xun],
Tie, Y.[Yun],
Lv, X.[Xin],
Li, J.G.[Jian-Guang],
Yan, W.C.[Wen-Cai],
Yan, M.[Ming],
Xu, Q.[Qian],
Guan, Y.C.[Yi-Cong],
Yang, Z.G.[Zheng-Gougou],
Cross-modal Deep Learning Applications: Audio-visual Retrieval,
MMDLCA20(301-313).
Springer DOI
2103
BibRef
Thomas, C.[Christopher],
Kovashka, A.[Adriana],
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval,
ECCV20(XVIII:317-335).
Springer DOI
2012
BibRef
Wang, Z.,
Liu, X.,
Li, H.,
Sheng, L.,
Yan, J.,
Wang, X.,
Shao, J.,
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval,
ICCV19(5763-5772)
IEEE DOI
2004
entropy, feature extraction, image matching, image retrieval,
message passing, natural language processing, text analysis,
Task analysis
BibRef
Nawaz, S.,
Janjua, M.K.,
Gallo, I.,
Mahmood, A.,
Calefati, A.,
Shafait, F.,
Do Cross Modal Systems Leverage Semantic Relationships?,
CroMoL19(4501-4510)
IEEE DOI
2004
image representation, image retrieval, image segmentation,
learning (artificial intelligence),
Text to Image
BibRef
Su, S.,
Zhong, Z.,
Zhang, C.,
Deep Joint-Semantics Reconstructing Hashing for Large-Scale
Unsupervised Cross-Modal Retrieval,
ICCV19(3027-3035)
IEEE DOI
2004
binary codes, image coding, image retrieval, multimedia computing,
neural nets, binary codes, reconstructing framework, DJSRH, Correlation
BibRef
Ning, X.C.[Xue-Cheng],
Yang, X.S.[Xiao-Shan],
Xu, C.S.[Chang-Sheng],
Multi-Hop Interactive Cross-modal Retrieval,
MMMod20(II:681-693).
Springer DOI
2003
BibRef
Cornia, M.[Marcella],
Baraldi, L.[Lorenzo],
Tavakoli, H.R.[Hamed R.],
Cucchiara, R.[Rita],
Towards Cycle-Consistent Models for Text and Image Retrieval,
WiCV-E18(IV:687-691).
Springer DOI
1905
BibRef
Surís, D.[Didac],
Duarte, A.[Amanda],
Salvador, A.[Amaia],
Torres, J.[Jordi],
Giró-i-Nieto, X.[Xavier],
Cross-modal Embeddings for Video and Audio Retrieval,
WiCV-E18(IV:711-716).
Springer DOI
1905
BibRef
Liu, C.L.[Chen-Lu],
Xu, X.[Xing],
Yang, Y.[Yang],
Lu, H.M.[Hui-Min],
Shen, F.M.[Fu-Min],
Ji, Y.L.[Yan-Li],
Domain Invariant Subspace Learning for Cross-Modal Retrieval,
MMMod18(II:94-105).
Springer DOI
1802
BibRef
Yuan, Y.X.[Yu-Xin],
Peng, Y.X.[Yu-Xin],
Recursive Pyramid Network with Joint Attention for Cross-Media
Retrieval,
MMMod18(I:405-416).
Springer DOI
1802
BibRef
Jia, Y.H.[Yu-Hua],
Bai, L.[Liang],
Wang, P.[Peng],
Guo, J.L.[Jin-Lin],
Xie, Y.X.[Yu-Xiang],
Yu, T.Y.[Tian-Yuan],
Utilizing Locality-Sensitive Hash Learning for Cross-Media Retrieval,
MMMod17(I: 550-561).
Springer DOI
1701
BibRef
Shang, X.[Xindi],
Zhang, H.[Hanwang],
Chua, T.S.[Tat-Seng],
Deep Learning Generic Features for Cross-Media Retrieval,
MMMod16(I: 264-275).
Springer DOI
1601
BibRef
Huang, L.[Lei],
Peng, Y.X.[Yu-Xin],
Cross-Media Retrieval via Semantic Entity Projection,
MMMod16(I: 276-288).
Springer DOI
1601
BibRef
Gu, Y.[Yun],
Xue, H.Y.[Hao-Yang],
Yang, J.[Jie],
Shi, P.F.[Peng-Fei],
Cross-modality hashing with partial correspondence,
ICIP15(1925-1929)
IEEE DOI
1512
Cross-modality; Hashing; Multimedia Search; Partial Correspondence
BibRef
Zhang, H.[Hong],
Chen, L.[Li],
Learning optimal data representation for cross-media retrieval,
ICIP12(1925-1928).
IEEE DOI
1302
BibRef
Lin, W.X.[Wan-Xia],
Lu, T.[Tong],
Su, F.[Feng],
A Novel Multi-modal Integration and Propagation Model for Cross-Media
Information Retrieval,
MMMod12(740-749).
Springer DOI
1201
BibRef
Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Video Delivery, Video-on-Demand, Indexing, Techniques, Systems .