20.4.3.3.8 Referring Expression Comprehension

Chapter Contents (Back)
Referring Expression.
See also CLIP, Contrastive Language-Image Pre-training.

Li, X., Jiang, S.,
Bundled Object Context for Referring Expressions,
MultMed(20), No. 10, October 2018, pp. 2749-2760.
IEEE DOI 1810
image processing, learning (artificial intelligence), natural language processing, probability, recurrent neural nets, vision-language BibRef

Wang, J.M.[Jian-Ming], Cui, E.[Enjie], Liu, K.L.[Kun-Liang], Sun, Y.K.[Yu-Kuan], Liang, J.Y.[Jia-Yu], Yuan, C.M.[Chun-Miao], Duan, X.J.[Xiao-Jie], Jin, G.H.[Guang-Hao], Chung, T.S.[Tae-Sun],
Referring expression comprehension model with matching detection and linguistic feedback,
IET-CV(14), No. 8, December 2020, pp. 625-633.
DOI Link 2012
BibRef

Qiao, Y.Y.[Yan-Yuan], Deng, C.R.[Chao-Rui], Wu, Q.[Qi],
Referring Expression Comprehension: A Survey of Methods and Datasets,
MultMed(23), 2021, pp. 4426-4440.
IEEE DOI 2112
Task analysis, Visualization, Feature extraction, Context modeling, Training, Image segmentation, Survey BibRef

Niu, Y.L.[Yu-Lei], Zhang, H.W.[Han-Wang], Lu, Z.W.[Zhi-Wu], Chang, S.F.[Shih-Fu],
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions,
PAMI(43), No. 1, January 2021, pp. 347-359.
IEEE DOI 2012
Grounding, Context modeling, Visualization, Task analysis, Pediatrics, Bayes methods, Annotations, referring expression generation BibRef

Yang, S.[Sibei], Li, G.B.[Guan-Bin], Yu, Y.Z.[Yi-Zhou],
Relationship-Embedded Representation Learning for Grounding Referring Expressions,
PAMI(43), No. 8, August 2021, pp. 2765-2779.
IEEE DOI 2107
BibRef
Earlier:
Cross-Modal Relationship Inference for Grounding Referring Expressions,
CVPR19(4140-4149).
IEEE DOI 2002
Locate the object instance in an image described by a referring expression. Visualization, Semantics, Grounding, Proposals, Data mining, Logic gates, Feature extraction, Referring expressions, gated graph convolutional network. Locate target object based on natural language descriptions. BibRef

Sun, M.J.[Ming-Jie], Xiao, J.[Jimin], Lim, E.G.[Eng Gee], Liu, S.[Si], Goulermas, J.Y.[John Y.],
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding,
PAMI(43), No. 11, November 2021, pp. 4189-4195.
IEEE DOI 2110
Image reconstruction, Training, Proposals, Visualization, Task analysis, Linguistics, Grounding, discriminative triad matching BibRef

Lin, L.[Liang], Yan, P.X.[Peng-Xiang], Xu, X.Q.[Xiao-Qian], Yang, S.[Sibei], Zeng, K.[Kun], Li, G.B.[Guan-Bin],
Structured Attention Network for Referring Image Segmentation,
MultMed(24), No. 2022, pp. 1922-1932.
IEEE DOI 2204
Visualization, Linguistics, Image segmentation, Cognition, Feature extraction, Semantics, Task analysis, cross-modal reasoning BibRef

Yang, X.[Xu], Wang, H.[Hao], Xie, D.[De], Deng, C.[Cheng], Tao, D.C.[Da-Cheng],
Object-Agnostic Transformers for Video Referring Segmentation,
IP(31), No. 2022, pp. 2839-2849.
IEEE DOI 2204
Task analysis, Visualization, Transformers, Feature extraction, Object detection, Image segmentation, Context modeling, video grounding BibRef

Wang, X.[Xing], Xie, D.[De], Zheng, Y.S.[Yuan-Shi],
Referring expression grounding by multi-context reasoning,
PRL(160), 2022, pp. 66-72.
Elsevier DOI 2208
Referring expression grounding, Reasoning, Graph networks BibRef

Shen, H.T.[Heng Tao], Chen, C.[Cheng], Wang, P.[Peng], Gao, L.L.[Lian-Li], Wang, M.[Meng], Song, J.K.[Jing-Kuan],
Continual Referring Expression Comprehension via Dual Modular Memorization,
IP(31), 2022, pp. 6694-6706.
IEEE DOI 2211
Task analysis, Training, Benchmark testing, Training data, Grounding, Data models, Visualization, Continual learning, lifelong learning, visual grounding BibRef

Chen, Y.W.[Yi-Wen], Tsai, Y.H.[Yi-Hsuan], Yang, M.H.[Ming-Hsuan],
Understanding Synonymous Referring Expressions via Contrastive Features,
IJCV(130), No. 10, October 2022, pp. 2501-2516.
Springer DOI 2209
BibRef

Suo, W.[Wei], Sun, M.Y.[Meng-Yang], Wang, P.[Peng], Zhang, Y.N.[Yan-Ning], Wu, Q.[Qi],
Rethinking and Improving Feature Pyramids for One-Stage Referring Expression Comprehension,
IP(32), 2023, pp. 854-864.
IEEE DOI 2301
Task analysis, Visualization, Head, Semantics, Object detection, Neck, Computational modeling, Referring expression comprehension, feature pyramids network BibRef

Liu, X.J.[Xue-Jing], Li, L.[Liang], Wang, S.H.[Shu-Hui], Zha, Z.J.[Zheng-Jun], Li, Z.C.[Ze-Chao], Tian, Q.[Qi], Huang, Q.M.[Qing-Ming],
Entity-Enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding,
PAMI(45), No. 3, March 2023, pp. 3003-3018.
IEEE DOI 2302
Proposals, Image reconstruction, Grounding, Visualization, Collaboration, Context modeling, Training, Entity enhancement, referring expression grounding BibRef

Liu, X.J.[Xue-Jing], Li, L.[Liang], Wang, S.H.[Shu-Hui], Zha, Z.J.[Zheng-Jun], Meng, D.C.[De-Chao], Huang, Q.M.[Qing-Ming],
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding,
ICCV19(2611-2620)
IEEE DOI 2004
Localize the object in the image from a query. feature extraction, image classification, image reconstruction, image retrieval, Adaptive systems BibRef

Feng, G.[Guang], Zhang, L.[Lihe], Sun, J.[Jiayu], Hu, Z.W.[Zhi-Wei], Lu, H.C.[Hu-Chuan],
Referring Segmentation via Encoder-Fused Cross-Modal Attention Network,
PAMI(45), No. 6, June 2023, pp. 7654-7667.
IEEE DOI 2305
BibRef
Earlier: A1, A4, A2, A5, Only:
Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation,
CVPR21(15501-15510)
IEEE DOI 2111
Visualization, Image segmentation, Decoding, Feature extraction, Linguistics, Task analysis, Correlation, Referring segmentation, asymmetric cross-frame attention module. Measurement, Visualization, Grounding, Semantics, Transforms, Information representation BibRef

Liu, D.Z.[Dai-Zong], Zhou, P.[Pan], Xu, Z.[Zichuan], Wang, H.Z.[Hao-Zhao], Li, R.X.[Rui-Xuan],
Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning,
CirSysVideo(33), No. 5, May 2023, pp. 2491-2505.
IEEE DOI 2305
Semantics, Grounding, Task analysis, Training, Visualization, Proposals, Logic gates, Temporal sentence grounding, memory-augmented network BibRef

Sun, M.J.[Ming-Jie], Xiao, J.[Jimin], Lim, E.G.[Eng Gee], Zhao, Y.[Yao],
Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning,
MultMed(25), 2023, pp. 1611-1621.
IEEE DOI 2306
Task analysis, Training, Pipelines, Linguistics, Visualization, Optimization, Image reconstruction, self-paced learning BibRef

Sun, M.Y.[Meng-Yang], Suo, W.[Wei], Wang, P.[Peng], Zhang, Y.N.[Yan-Ning], Wu, Q.[Qi],
A Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention,
MultMed(25), 2023, pp. 2446-2458.
IEEE DOI 2306
Task analysis, Visualization, Computational modeling, Proposals, Annotations, Detectors, Feature extraction, one-stage method BibRef

Sun, Y.F.[Yan-Feng], Zhang, Y.[Yunru], Jiang, H.[Huajie], Hu, Y.L.[Yong-Li], Yin, B.C.[Bao-Cai],
Multi-level attention for referring expression comprehension,
PRL(172), 2023, pp. 252-258.
Elsevier DOI 2309
Context information, Multilevel attention, Attribute information BibRef

Wang, R.[Rong], Tang, Z.[Zongheng], Zhou, Q.L.[Qian-Li], Liu, X.Q.[Xiao-Qian], Hui, T.R.[Tian-Rui], Tan, Q.[Quange], Liu, S.[Si],
Unified Transformer with Isomorphic Branches for Natural Language Tracking,
CirSysVideo(33), No. 9, September 2023, pp. 4529-4541.
IEEE DOI 2310
Localize the target object referred to by a language description. BibRef

Li, H.[Hui], Sun, M.J.[Ming-Jie], Xiao, J.[Jimin], Lim, E.G.[Eng Gee], Zhao, Y.[Yao],
Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning,
CirSysVideo(33), No. 10, October 2023, pp. 5999-6012.
IEEE DOI Code:
WWW Link. 2310
BibRef

Liu, C.[Chang], Jiang, X.D.[Xu-Dong], Ding, H.H.[Heng-Hui],
Instance-Specific Feature Propagation for Referring Segmentation,
MultMed(25), 2023, pp. 3657-3667.
IEEE DOI 2310
BibRef

Song, Y.Z.[Yun-Zhu], Chen, Y.S.[Yi-Syuan], Shuai, H.H.[Hong-Han],
Decoupling-Cooperative Framework for Referring Expression Comprehension,
SPLetters(30), 2023, pp. 1542-1546.
IEEE DOI 2311
BibRef

Hua, G.G.[Guo-Guang], Liao, M.[Muxin], Tian, S.[Shishun], Zhang, Y.H.[Yu-Hang], Zou, W.B.[Wen-Bin],
Multiple Relational Learning Network for Joint Referring Expression Comprehension and Segmentation,
MultMed(25), 2023, pp. 8805-8816.
IEEE DOI 2312
BibRef

Wang, W.B.[Wen-Bin], Pagnucco, M.[Maurice], Xu, C.P.[Cheng-Pei], Song, Y.[Yang],
InterREC: An Interpretable Method for Referring Expression Comprehension,
MultMed(25), 2023, pp. 9330-9342.
IEEE DOI 2312
BibRef

Ke, J.C.[Jing-Cheng], Wang, J.[Jia], Chen, J.C.[Jun-Cheng], Jhuo, I.H.[I-Hong], Lin, C.W.[Chia-Wen], Lin, Y.Y.[Yen-Yu],
CLIPREC: Graph-Based Domain Adaptive Network for Zero-Shot Referring Expression Comprehension,
MultMed(26), 2024, pp. 2480-2492.
IEEE DOI 2402
Task analysis, Visualization, Adaptation models, Cognition, Adaptive systems, Object detection, Training data, CLIP BibRef

Li, X.C.[Xiao-Chuan], Fan, B.Y.[Bao-Yu], Zhang, R.[Runze], Zhao, K.[Kun], Guo, Z.H.[Zhen-Hua], Zhao, Y.Q.[Ya-Qian], Li, R.[Rengang],
Inexactly Matched Referring Expression Comprehension With Rationale,
MultMed(26), 2024, pp. 3937-3950.
IEEE DOI 2402
Task analysis, Grounding, Visualization, Pipelines, Transformers, Training, Annotations, Referring expression comprehension, multimodal learning BibRef

Luo, G.[Gen], Zhou, Y.[Yiyi], Sun, J.[Jiamu], Sun, X.S.[Xiao-Shuai], Ji, R.R.[Rong-Rong],
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension,
MultMed(26), 2024, pp. 3689-3700.
IEEE DOI 2402
Task analysis, Visualization, Training, Head, Cognition, Systematics, Sun, object recognition BibRef

Miao, P.[Peihan], Su, W.[Wei], Wang, G.[Gaoang], Li, X.[Xuewei], Xi, L.[Li],
Self-Paced Multi-Grained Cross-Modal Interaction Modeling for Referring Expression Comprehension,
IP(33), 2024, pp. 1497-1507.
IEEE DOI 2402
Visualization, Linguistics, Transformers, Location awareness, Task analysis, Training, Learning systems, self-paced sample informativeness learning BibRef

Liu, Z.T.[Zong-Tao], Xu, T.Y.[Tian-Yang], Song, X.N.[Xiao-Ning], Wu, X.J.[Xiao-Jun],
Unified Referring Expression Generation for Bounding Boxes and Segmentations,
SPLetters(31), 2024, pp. 636-640.
IEEE DOI 2403
Transformers, Visualization, Task analysis, Image segmentation, Search problems, Object segmentation, Feature extraction, segmentation BibRef


Wu, Y.X.[Yi-Xuan], Zhang, Z.[Zhao], Xie, C.[Chi], Zhu, F.[Feng], Zhao, R.[Rui],
Advancing Referring Expression Segmentation Beyond Single Image,
ICCV23(2628-2638)
IEEE DOI Code:
WWW Link. 2401
BibRef

Kurita, S.[Shuhei], Katsura, N.[Naoki], Onami, E.[Eri],
RefEgo: Referring Expression Comprehension Dataset from First-Person Perception of Ego4D,
ICCV23(15168-15178)
IEEE DOI 2401
BibRef

Qiao, Y.[Yanyuan], Qi, Y.[Yuankai], Yu, Z.[Zheng], Liu, J.[Jing], Wu, Q.[Qi],
March in Chat: Interactive Prompting for Remote Embodied Referring Expression,
ICCV23(15712-15721)
IEEE DOI Code:
WWW Link. 2401
BibRef

Chen, Y.[Yitao], Du, R.[Ruoyi], Liang, K.[Kongming], Ma, Z.Y.[Zhan-Yu],
Self-Enhanced Training Framework for Referring Expression Grounding,
ICIP23(3060-3064)
IEEE DOI Code:
WWW Link. 2312
BibRef

Sun, J.[Jiamu], Luo, G.[Gen], Zhou, Y.[Yiyi], Sun, X.S.[Xiao-Shuai], Jiang, G.[Guannan], Wang, Z.[Zhiyu], Ji, R.R.[Rong-Rong],
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension,
CVPR23(19144-19154)
IEEE DOI 2309
BibRef

Tang, J.J.[Jia-Jin], Zheng, G.[Ge], Shi, C.[Cheng], Yang, S.[Sibei],
Contrastive Grouping with Transformer for Referring Image Segmentation,
CVPR23(23570-23580)
IEEE DOI 2309
BibRef

Liu, J.[Jiang], Ding, H.[Hui], Cai, Z.W.[Zhao-Wei], Zhang, Y.T.[Yu-Ting], Satzoda, R.K.[Ravi Kumar], Mahadevan, V.[Vijay], Manmatha, R.,
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation,
CVPR23(18653-18663)
IEEE DOI 2309
BibRef

Xu, L.[Li], Huang, M.H.[Mark He], Shang, X.[Xindi], Yuan, Z.H.[Ze-Huan], Sun, Y.[Ying], Liu, J.[Jun],
Meta Compositional Referring Expression Segmentation,
CVPR23(19478-19487)
IEEE DOI 2309
BibRef

Liu, C.[Chang], Ding, H.H.[Heng-Hui], Jiang, X.D.[Xu-Dong],
GRES: Generalized Referring Expression Segmentation,
CVPR23(23592-23601)
IEEE DOI 2309
BibRef

Song, S.[Sijie], Lin, X.D.[Xu-Dong], Liu, J.Y.[Jia-Ying], Guo, Z.M.[Zong-Ming], Chang, S.F.[Shih-Fu],
Co-Grounding Networks with Semantic Attention for Referring Expression Comprehension in Videos,
CVPR21(1346-1355)
IEEE DOI 2111
Visualization, Correlation, Grounding, Computational modeling, Semantics, Benchmark testing BibRef

Sun, M.J.[Ming-Jie], Xiao, J.[Jimin], Lim, E.G.[Eng Gee],
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning,
CVPR21(14055-14064)
IEEE DOI 2111
Art, Grounding, Reinforcement learning, Cognition, Pattern recognition, Proposals BibRef

Wang, P.[Peng], Wu, Q.[Qi], Cao, J.W.[Jie-Wei], Shen, C.H.[Chun-Hua], Gao, L.L.[Lian-Li], van den Hengel, A.J.[Anton J.],
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks,
CVPR19(1960-1968).
IEEE DOI 2002
BibRef

Yang, S.B.[Si-Bei], Li, G.B.[Guan-Bin], Yu, Y.Z.[Yi-Zhou],
Dynamic Graph Attention for Referring Expression Comprehension,
ICCV19(4643-4652)
IEEE DOI 2004
graph theory, image representation, inference mechanisms, learning (artificial intelligence), Object recognition BibRef

Zhang, H.W.[Han-Wang], Niu, Y.L.[Yu-Lei], Chang, S.F.[Shih-Fu],
Grounding Referring Expressions in Images by Variational Context,
CVPR18(4158-4166)
IEEE DOI 1812
Grounding, Context modeling, Task analysis, Visualization, Pediatrics, Bayes methods, Natural languages BibRef

Yu, L.C.[Li-Cheng], Lin, Z.[Zhe], Shen, X.H.[Xiao-Hui], Yang, J.M.[Ji-Mei], Lu, X.[Xin], Bansal, M.[Mohit], Berg, T.L.[Tamara L.],
MAttNet: Modular Attention Network for Referring Expression Comprehension,
CVPR18(1307-1315)
IEEE DOI 1812
Localize image region described by natural language expression. Visualization, Computational modeling, Task analysis, Cats, Adaptation models, Feature extraction, Knowledge discovery BibRef

Luo, R.[Ruotian], Shakhnarovich, G.[Gregory],
Comprehension-Guided Referring Expressions,
CVPR17(3125-3134)
IEEE DOI 1711
Context modeling, Generators, Training, Visualization BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
CLIP, Contrastive Language-Image Pre-Training .


Last update:Apr 18, 2024 at 11:38:49