8.3.4.3.5 Referring Image Segmentation

Chapter Contents (Back)
Referring Image Segmentation.
See also Neural Networks for Semantic Segmentation.
See also Semantic Segmentation, Label and Segment Together.

Qiu, S., Zhao, Y., Jiao, J., Wei, Y., Wei, S.,
Referring Image Segmentation by Generative Adversarial Learning,
MultMed(22), No. 5, May 2020, pp. 1333-1344.
IEEE DOI 2005
Image segmentation, Semantics, Feature extraction, Natural languages, Generators, Generative adversarial networks, Adversarial training BibRef

Ye, L.W.[Lin-Wei], Liu, Z.[Zhi], Wang, Y.[Yang],
Dual Convolutional LSTM Network for Referring Image Segmentation,
MultMed(22), No. 12, December 2020, pp. 3224-3235.
IEEE DOI 2011
Image segmentation, Visualization, Decoding, Linguistics, Task analysis, Logic gates, deep learning BibRef

Liu, S.[Si], Hui, T.R.[Tian-Rui], Huang, S.F.[Shao-Fei], Wei, Y.C.[Yun-Chao], Li, B.[Bo], Li, G.B.[Guan-Bin],
Cross-Modal Progressive Comprehension for Referring Segmentation,
PAMI(44), No. 9, September 2022, pp. 4761-4775.
IEEE DOI 2208
Image segmentation, Feature extraction, Cognition, Visualization, Semantics, Task analysis, Linguistics, Referring segmentation, multimodal feature fusion BibRef

Huang, S.F.[Shao-Fei], Hui, T.R.[Tian-Rui], Liu, S.[Si], Li, G.B.[Guan-Bin], Wei, Y.C.[Yun-Chao], Han, J.Z.[Ji-Zhong], Liu, L.Q.[Luo-Qi], Li, B.[Bo],
Referring Image Segmentation via Cross-Modal Progressive Comprehension,
CVPR20(10485-10494)
IEEE DOI 2008
segmenting the foreground masks of the entities that can well match the description given in the natural language expression. Visualization, Cognition, Image segmentation, Linguistics, Feature extraction, Convolution, Semantics BibRef

Baffour, A.A.[Adu Asare], Qin, Z.[Zhen], Wang, Y.[Yong], Qin, Z.G.[Zhi-Guang], Choo, K.K.R.[Kim-Kwang Raymond],
Spatial Self-Attention Network with Self-Attention Distillation for Fine-Grained Image Recognition,
JVCIR(81), 2021, pp. 103368.
Elsevier DOI 2112
Fine-grained recognition, Spatial self-attention, Knowledge distillation, Convolutional neural network BibRef

Ye, L.W.[Lin-Wei], Rochan, M.[Mrigank], Liu, Z.[Zhi], Zhang, X.Q.[Xiao-Qin], Wang, Y.[Yang],
Referring Segmentation in Images and Videos With Cross-Modal Self-Attention Network,
PAMI(44), No. 7, July 2022, pp. 3719-3732.
IEEE DOI 2206
BibRef
Earlier: A1, A2, A3, A5, Only:
Cross-Modal Self-Attention Network for Referring Image Segmentation,
CVPR19(10494-10503).
IEEE DOI 2002
Videos, Image segmentation, Visualization, Task analysis, Linguistics, Feature extraction, Semantics, Referring segmentation, self-attention BibRef

Wang, Z.H.[Zhen-Hua], Ye, L.W.[Lin-Wei],
Referring Image Segmentation with Two-Stage Multi-Modal Interaction,
ICIP24(2543-2549)
IEEE DOI 2411
Location awareness, Image segmentation, Visualization, Text to image, Linguistics, Feature extraction, Task analysis, Referring Image Segmentation BibRef

Kim, N.[Namyup], Hwang, S.[Sehyun], Kwak, S.[Suha],
Learning to Detect Semantic Boundaries with Image-Level Class Labels,
IJCV(130), No. 9, September 2022, pp. 2131-2148.
Springer DOI 2208
BibRef

Kim, D.[Dongwon], Kim, N.[Namyup], Lan, C.L.[Cui-Ling], Kwak, S.[Suha],
Shatter and Gather: Learning Referring Image Segmentation with Text Supervision,
ICCV23(15501-15511)
IEEE DOI 2401
BibRef

Ahn, J.[Jiwoon], Cho, S.[Sunghyun], Kwak, S.[Suha],
Weakly Supervised Learning of Instance Segmentation With Inter-Pixel Relations,
CVPR19(2204-2213).
IEEE DOI 2002
BibRef
Earlier: A1, A3, Only:
Learning Pixel-Level Semantic Affinity with Image-Level Supervision for Weakly Supervised Semantic Segmentation,
CVPR18(4981-4990)
IEEE DOI 1812
Image segmentation, Semantics, Training, Shape, Visualization, Pipelines, Motion segmentation BibRef

Shang, C.[Chao], Li, H.L.[Hong-Liang], Qiu, H.Q.[He-Qian], Wu, Q.B.[Qing-Bo], Meng, F.M.[Fan-Man], Zhao, T.[Taijin], Ngan, K.N.[King Ngi],
Cross-Modal Recurrent Semantic Comprehension for Referring Image Segmentation,
CirSysVideo(33), No. 7, July 2023, pp. 3229-3242.
IEEE DOI 2307
Semantics, Visualization, Feature extraction, Image segmentation, Cognition, Task analysis, Linguistics, global semantic reasoning BibRef

Ding, H.X.[Hai-Xin], Zhang, S.C.[Sheng-Chuan], Wu, Q.[Qiong], Yu, S.L.[Song-Lin], Hu, J.[Jie], Cao, L.J.[Liu-Juan], Ji, R.R.[Rong-Rong],
Bilateral Knowledge Interaction Network for Referring Image Segmentation,
MultMed(26), 2024, pp. 2966-2977.
IEEE DOI 2402
Image segmentation, Visualization, Kernel, Knowledge engineering, Feature extraction, Semantics, Convolution, vision-language BibRef

Wu, J.Z.[Jian-Zong], Li, X.T.[Xiang-Tai], Li, X.[Xia], Ding, H.H.[Heng-Hui], Tong, Y.H.[Yun-Hai], Tao, D.C.[Da-Cheng],
Toward Robust Referring Image Segmentation,
IP(33), 2024, pp. 1782-1794.
IEEE DOI Code:
WWW Link. 2403
Image segmentation, Task analysis, Robustness, Transformers, Measurement, Fuses, Benchmark testing, image segmentation, natural language processing BibRef

Cho, Y.B.[Yu-Bin], Yu, H.W.[Hyun-Woo], Kang, S.J.[Suk-Ju],
Cross-Aware Early Fusion With Stage-Divided Vision and Language Transformer Encoders for Referring Image Segmentation,
MultMed(26), 2024, pp. 5823-5833.
IEEE DOI 2404
Feature extraction, Image segmentation, Task analysis, Transformers, Linguistics, Decoding, Visualization, feature-based cross-modal alignment BibRef

Zhang, Z.L.[Zhen-Liang], Teng, Z.[Zhu], Fan, J.[Jack], Zhang, B.P.[Bao-Peng], Fan, J.P.[Jian-Ping],
Token-word mixer meets object-aware transformer for referring image segmentation,
PR(155), 2024, pp. 110719.
Elsevier DOI 2408
Transformer, Multi-modal fusion, Referring image segmentation BibRef

Liu, Y.J.[Ya-Jie], Ge, P.[Pu], Ma, H.X.[Hao-Xiang], Fan, S.C.[Shi-Chao], Liu, Q.J.[Qing-Jie], Huang, D.[Di], Wang, Y.H.[Yun-Hong],
Towards Generalizable Referring Image Segmentation Via Target Prompt And Visual Coherence,
ICIP24(2599-2605)
IEEE DOI 2411
Visualization, Image segmentation, Protocols, Limiting, Coherence, Predictive models, Linguistics, Referring image segmentation, zero-shot cross-dataset BibRef

Wang, Y.[Yehui], Lei, F.[Fang], Wang, B.[Baoyan], Zhang, Q.[Qiang], Zhen, X.T.[Xian-Tong], Zhang, L.[Lei],
De-noising mask transformer for referring image segmentation,
IVC(154), 2025, pp. 105356.
Elsevier DOI 2502
Referring image segmentation, De-noising mask, Multi-modal fusion BibRef

Li, W.H.[Wen-Hui], Pang, C.[Chao], Nie, W.Z.[Wei-Zhi], Tian, H.[Hongshuo], Liu, A.A.[An-An],
Bidirectional Mask Selection for Zero-Shot Referring Image Segmentation,
CirSysVideo(35), No. 1, January 2025, pp. 911-921.
IEEE DOI Code:
WWW Link. 2502
Image segmentation, Visualization, Feature extraction, Semantics, Circuits and systems, Training, Annotations, mask adaptive fusion strategy BibRef


Wang, Y.T.[Yao-Ting], Sun, P.[Peiwen], Li, Y.C.[Yuan-Chao], Zhang, H.G.[Hong-Gang], Hu, D.[Di],
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?,
ECCV24(LXXIV: 340-356).
Springer DOI 2412
BibRef

Wang, Y.T.[Yao-Ting], Sun, P.[Peiwen], Zhou, D.Z.[Dong-Zhan], Li, G.Y.[Guang-Yao], Zhang, H.G.[Hong-Gang], Hu, D.[Di],
REF-AVS: Refer and Segment Objects in Audio-visual Scenes,
ECCV24(LXXIV: 196-213).
Springer DOI 2412
BibRef

Lyu, H.X.[Hao-Xin], Zhong, T.X.[Tian-Xiong], Zhao, S.[Sanyuan],
Gtms: A Gradient-driven Tree-guided Mask-free Referring Image Segmentation Method,
ECCV24(LXVI: 288-304).
Springer DOI 2412
BibRef

Yu, S.[Seonghoon], Seo, P.H.[Paul Hongsuck], Son, J.[Jeany],
Pseudo-ris: Distinctive Pseudo-supervision Generation for Referring Image Segmentation,
ECCV24(LXVIII: 18-36).
Springer DOI 2412
BibRef

Ha, S.[Seongsu], Kim, C.[Chaeyun], Kim, D.[Donghwa], Lee, J.[Junho], Lee, S.H.[Sang-Ho], Lee, J.[Joonseok],
Finding Nemo: Negative-mined Mosaic Augmentation for Referring Image Segmentation,
ECCV24(LXIX: 121-137).
Springer DOI 2412
BibRef

Dai, Q.Y.[Qi-Yuan], Yang, S.[Sibei],
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation,
CVPR24(13711-13722)
IEEE DOI 2410
Image segmentation, Annotations, Semantics, Noise, Natural languages, Generators BibRef

Yuan, L.F.[Lin-Feng], Shi, M.J.[Miao-Jing], Yue, Z.J.[Zi-Jie], Chen, Q.J.[Qi-Jun],
LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation,
CVPR24(14001-14010)
IEEE DOI 2410
Optical losses, Visualization, Codes, Pipelines, Object segmentation, Predictive models BibRef

Wu, J.N.[Jian-Nan], Jiang, Y.[Yi], Yan, B.[Bin], Lu, H.C.[Hu-Chuan], Yuan, Z.H.[Ze-Huan], Luo, P.[Ping],
Segment Every Reference Object in Spatial and Temporal Spaces,
ICCV23(2538-2550)
IEEE DOI 2401
Referring image segmentation BibRef

Hu, Y.[Yutao], Wang, Q.X.[Qi-Xiong], Shao, W.Q.[Wen-Qi], Xie, E.[Enze], Li, Z.G.[Zhen-Guo], Han, J.G.[Jun-Gong], Luo, P.[Ping],
Beyond One-to-One: Rethinking the Referring Image Segmentation,
ICCV23(4044-4054)
IEEE DOI Code:
WWW Link. 2401
BibRef

Lee, J.[Jungbeom], Lee, S.J.[Sung-Jin], Nam, J.[Jinseok], Yu, S.[Seunghak], Do, J.[Jaeyoung], Taghavi, T.[Tara],
Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency,
ICCV23(21813-21824)
IEEE DOI 2401
BibRef

Liu, F.[Fang], Liu, Y.H.[Yu-Hao], Kong, Y.Q.[Yu-Qiu], Xu, K.[Ke], Zhang, L.[Lihe], Yin, B.C.[Bao-Cai], Hancke, G.[Gerhard], Lau, R.[Rynson],
Referring Image Segmentation Using Text Supervision,
ICCV23(22067-22077)
IEEE DOI Code:
WWW Link. 2401
BibRef

Xu, Z.[Zunnan], Chen, Z.H.[Zhi-Hong], Zhang, Y.[Yong], Song, Y.B.[Yi-Bing], Wan, X.[Xiang], Li, G.B.[Guan-Bin],
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation,
ICCV23(17457-17466)
IEEE DOI Code:
WWW Link. 2401
BibRef

Yu, S.[Seonghoon], Seo, P.H.[Paul Hongsuck], Son, J.[Jeany],
Zero-shot Referring Image Segmentation with Global-Local Context Features,
CVPR23(19456-19465)
IEEE DOI 2309
BibRef

Kim, N.[Namyup], Kim, D.[Dongwon], Kwak, S.[Suha], Lan, C.L.[Cui-Ling], Zeng, W.J.[Wen-Jun],
ReSTR: Convolution-free Referring Image Segmentation Using Transformers,
CVPR22(18124-18133)
IEEE DOI 2210
Image segmentation, Adaptation models, Visualization, Semantics, Benchmark testing, Transformers, Feature extraction, grouping and shape analysis BibRef

Wang, Z.Q.[Zhao-Qing], Lu, Y.[Yu], Li, Q.[Qiang], Tao, X.Q.[Xun-Qiang], Guo, Y.D.[Yan-Dong], Gong, M.M.[Ming-Ming], Liu, T.L.[Tong-Liang],
CRIS: CLIP-Driven Referring Image Segmentation,
CVPR22(11676-11685)
IEEE DOI 2210
Representation learning, Image segmentation, Visualization, Image analysis, Shape, Semantics, Segmentation, Vision+language BibRef

Yang, Z.[Zhao], Wang, J.Q.[Jia-Qi], Tang, Y.S.[Yan-Song], Chen, K.[Kai], Zhao, H.S.[Heng-Shuang], Torr, P.H.S.[Philip H.S.],
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation,
CVPR22(18134-18144)
IEEE DOI 2210
Image segmentation, Visualization, Image coding, Shape, Linguistics, Transformers, Feature extraction, Segmentation, grouping and shape analysis BibRef

Yang, S.[Sibei], Xia, M.[Meng], Li, G.B.[Guan-Bin], Zhou, H.Y.[Hong-Yu], Yu, Y.Z.[Yi-Zhou],
Bottom-Up Shift and Reasoning for Referring Image Segmentation,
CVPR21(11261-11270)
IEEE DOI 2111
Location awareness, Visualization, Image segmentation, Fuses, Message passing, Computational modeling BibRef

Jing, Y.[Ya], Kong, T.[Tao], Wang, W.[Wei], Wang, L.[Liang], Li, L.[Lei], Tan, T.N.[Tie-Niu],
Locate then Segment: A Strong Pipeline for Referring Image Segmentation,
CVPR21(9853-9862)
IEEE DOI 2111
Location awareness, Image segmentation, Visualization, Fuses, Pipelines, Object segmentation, Feature extraction BibRef

Hui, T.R.[Tian-Rui], Liu, S.[Si], Huang, S.F.[Shao-Fei], Li, G.B.[Guan-Bin], Yu, S.[Sansi], Zhang, F.[Faxi], Han, J.Z.[Ji-Zhong],
Linguistic Structure Guided Context Modeling for Referring Image Segmentation,
ECCV20(X:59-75).
Springer DOI 2011
BibRef

Li, X., Liu, Y., Xu, K., Zhao, Z., Liu, S.,
A Context-Based Network For Referring Image Segmentation,
ICIP20(1436-1440)
IEEE DOI 2011
Image segmentation, Visualization, Linguistics, Feature extraction, Convolution, Decoding, Referring Image Segmentation, Dense Convolution BibRef

Li, R., Li, K., Kuo, Y., Shu, M., Qi, X., Shen, X., Jia, J.,
Referring Image Segmentation via Recurrent Refinement Networks,
CVPR18(5745-5753)
IEEE DOI 1812
Image segmentation, Semantics, Natural languages, Task analysis, Feature extraction, Logic gates, Training BibRef

Liu, C.X.[Chen-Xi], Lin, Z.[Zhe], Shen, X.H.[Xiao-Hui], Yang, J.M.[Ji-Mei], Lu, X.[Xin], Yuille, A.L.[Alan L.],
Recurrent Multimodal Interaction for Referring Image Segmentation,
ICCV17(1280-1289)
IEEE DOI 1802
convolution, image segmentation, learning (artificial intelligence), Visualization BibRef

Chapter on 2-D Region Segmentation Techniques, Snakes, Active Contours continues in
Other Complete Systems .


Last update:Mar 12, 2025 at 14:27:03