8.6.3.1 Open-Vocabulary, Open-World Semantic Segmentation

Chapter Contents (Back)
Semantic Segmentation. Open-Vocabulary. Open-World.

Dao, S.D.[Son Duy], Shi, H.[Hengcan], Phung, D.[Dinh], Cai, J.F.[Jian-Fei],
Class Enhancement Losses With Pseudo Labels for Open-Vocabulary Semantic Segmentation,
MultMed(26), 2024, pp. 8442-8453.
IEEE DOI 2408
Proposals, Training, Semantic segmentation, Annotations, Semantics, Predictive models, Visualization, zero-shot semantic segmentation BibRef

Li, Z.H.[Zhi-Heng], Zhong, Y.J.[Yu-Jie], Song, R.[Ran], Li, T.J.[Tian-Jiao], Ma, L.[Lin], Zhang, W.[Wei],
DeTAL: Open-Vocabulary Temporal Action Localization With Decoupled Networks,
PAMI(46), No. 12, December 2024, pp. 7728-7741.
IEEE DOI 2411
Location awareness, Task analysis, Visualization, Proposals, Training, Adaptation models, Semantics, Open-Vocabulary, temporal action localization BibRef

Han, C.[Cong], Zhong, Y.J.[Yu-Jie], Li, D.J.[Deng-Jie], Han, K.[Kai], Ma, L.[Lin],
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network,
ICCV23(1086-1096)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zhu, C.Y.[Chao-Yang], Chen, L.[Long],
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future,
PAMI(46), No. 12, December 2024, pp. 8954-8975.
IEEE DOI 2411
Survey, Open-Vocabulary. Task analysis, Visualization, Training, Semantics, Image segmentation, Vocabulary, Transfer learning, Open-vocabulary, future directions BibRef

Rai, S.N.[Shyam Nandan], Cermelli, F.[Fabio], Caputo, B.[Barbara], Masone, C.[Carlo],
Mask2Anomaly: Mask Transformer for Universal Open-Set Segmentation,
PAMI(46), No. 12, December 2024, pp. 9286-9302.
IEEE DOI 2411
Image segmentation, Task analysis, Semantic segmentation, Semantics, Transformers, Training, Noise measurement, mask architecture BibRef

Pan, T.[Ting], Tang, L.[Lulu], Wang, X.L.[Xin-Long], Shan, S.G.[Shi-Guang],
Tokenize Anything via Prompting,
ECCV24(XLVII: 330-348).
Springer DOI 2412
Code:
WWW Link. segmenting, recognizing, and captioning anything. BibRef

Yang, Y.H.[Yu-Huan], Ma, C.F.[Chao-Fan], Ju, C.[Chen], Zhang, F.[Fei], Yao, J.C.[Jiang-Chao], Zhang, Y.[Ya], Wang, Y.F.[Yan-Feng],
Multi-modal Prototypes for Open-World Semantic Segmentation,
IJCV(132), No. 12, December 2024, pp. 6004-6020.
Springer DOI 2501
BibRef

Tang, L.[Lv], Jiang, P.T.[Peng-Tao], Xiao, H.[Haoke], Li, B.[Bo],
Towards Training-Free Open-World Segmentation via Image Prompt Foundation Models,
IJCV(133), No. 1, January 2025, pp. 1-15.
Springer DOI 2501
BibRef


Zheng, J.W.[Jun-Wei], Liu, R.P.[Rui-Ping], Chen, Y.F.[Yu-Fan], Peng, K.Y.[Kun-Yu], Wu, C.Z.[Cheng-Zhi], Yang, K.L.[Kai-Lun], Zhang, J.[JiaMing], Stiefelhagen, R.[Rainer],
Open Panoramic Segmentation,
ECCV24(XXXIX: 164-182).
Springer DOI 2412
BibRef

Karazija, L.[Laurynas], Laina, I.[Iro], Vedaldi, A.[Andrea], Rupprecht, C.[Christian],
Diffusion Models for Open-vocabulary Segmentation,
ECCV24(V: 299-317).
Springer DOI 2412
BibRef

Wilms, C.[Christian], Rolff, T.[Tim], Hillemann, M.[Maris], Johanson, R.[Robert], Frintrop, S.[Simone],
Sos: Segment Object System for Open-world Instance Segmentation with Object Priors,
ECCV24(XXVII: 165-182).
Springer DOI 2412
BibRef

Jiao, S.[Siyu], Zhu, H.G.[Hong-Guang], Huang, J.N.[Jian-Nan], Zhao, Y.[Yao], Wei, Y.C.[Yun-Chao], Shi, H.[Humphrey],
Collaborative Vision-text Representation Optimizing for Open-vocabulary Segmentation,
ECCV24(XXXIII: 399-416).
Springer DOI 2412
BibRef

Wysoczanska, M.[Monika], Siméoni, O.[Oriane], Ramamonjisoa, M.[Michaël], Bursuc, A.[Andrei], Trzcinski, T.[Tomasz], Pérez, P.[Patrick],
CLIP-dinoiser: Teaching CLIP a Few Dino Tricks for Open-vocabulary Semantic Segmentation,
ECCV24(LXI: 320-337).
Springer DOI 2412
BibRef

Shao, T.[Tong], Tian, Z.[Zhuotao], Zhao, H.[Hang], Su, J.[Jingyong],
Explore the Potential of CLIP for Training-free Open Vocabulary Semantic Segmentation,
ECCV24(LXXXVI: 139-156).
Springer DOI 2412
BibRef

Lan, M.C.[Meng-Cheng], Chen, C.F.[Chao-Feng], Ke, Y.P.[Yi-Ping], Wang, X.J.[Xin-Jiang], Feng, L.[Litong], Zhang, W.[Wayne],
Proxyclip: Proxy Attention Improves CLIP for Open-vocabulary Segmentation,
ECCV24(LXVIII: 70-88).
Springer DOI 2412
BibRef

Jiang, L.[Li], Shi, S.S.[Shao-Shuai], Schiele, B.[Bernt],
Open-Vocabulary 3D Semantic Segmentation with Foundation Models,
CVPR24(21284-21294)
IEEE DOI 2410
Text recognition, Semantic segmentation, 3D Semantic Segmentation, Open Vocabulary, Foundation Models BibRef

Zhao, W.J.[Wen-Jie], Li, J.[Jia], Dong, X.[Xin], Xiang, Y.[Yu], Guo, Y.H.[Yun-Hui],
Segment Every Out-of-Distribution Object,
CVPR24(3910-3920)
IEEE DOI Code:
WWW Link. 2410
Codes, Semantic segmentation, Face recognition, Benchmark testing, Out-of-distribution detection, Semantic segmentation, prompt-based segmentation BibRef

Liu, Y.[Yong], Bai, S.[Sule], Li, G.B.[Guan-Bin], Wang, Y.T.[Yi-Tong], Tang, Y.S.[Yan-Song],
Open-Vocabulary Segmentation with Semantic-Assisted Calibration,
CVPR24(3491-3500)
IEEE DOI 2410
Measurement, Image segmentation, Visualization, Semantics, Benchmark testing, Predictive models, open-vocabulary segmentation BibRef

Bousselham, W.[Walid], Petersen, F.[Felix], Ferrari, V.[Vittorio], Kuehne, H.[Hilde],
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers,
CVPR24(3828-3837)
IEEE DOI Code:
WWW Link. 2410
Location awareness, Training, Codes, Grounding, Semantic segmentation, Pipelines, open-vocabulary zero-shot, CLIP BibRef

Wang, Y.[Yuan], Sun, R.[Rui], Luo, N.[Naisong], Pan, Y.[Yuwen], Zhang, T.Z.[Tian-Zhu],
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation,
CVPR24(3952-3963)
IEEE DOI 2410
Visualization, Image recognition, Semantic segmentation, Benchmark testing, Lead, Diffusion models, Training-free BibRef

Nguyen, P.[Phuc], Ngo, T.D.[Tuan Duc], Kalogerakis, E.[Evangelos], Gan, C.[Chuang], Tran, A.[Anh], Pham, C.[Cuong], Nguyen, K.[Khoi],
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance,
CVPR24(4018-4028)
IEEE DOI 2410
Instance segmentation, Point cloud compression, Location awareness, Shape, Performance gain, 3D Instance Segmentation BibRef

Luo, J.[Jiayun], Khandelwal, S.[Siddhesh], Sigal, L.[Leonid], Li, B.Y.[Bo-Yang],
Emergent Open-Vocabulary Semantic Segmentation from Off-the-Shelf Vision-Language Models,
CVPR24(4029-4040)
IEEE DOI Code:
WWW Link. 2410
Training, Vocabulary, Visualization, Image resolution, Semantic segmentation, Text to image, training-free BibRef

Bourouis, A.[Ahmed], Fan, J.E.[Judith E.], Gryaditskaya, Y.[Yulia],
Open Vocabulary Semantic Scene Sketch Understanding,
CVPR24(4176-4186)
IEEE DOI 2410
Training, Vocabulary, Visualization, Semantics, Pipelines, Psychology BibRef

Wang, X.Q.[Xiao-Qi], He, W.B.[Wen-Bin], Xuan, X.[Xiwei], Sebastian, C.[Clint], Ono, J.P.[Jorge Piazentin], Li, X.[Xin], Behpour, S.[Sima], Doan, T.[Thang], Gou, L.[Liang], Shen, H.W.[Han-Wei], Ren, L.[Liu],
USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation,
CVPR24(4187-4196)
IEEE DOI 2410
Representation learning, Image segmentation, Vocabulary, Semantic segmentation, Scalability, Pipelines, foundation model BibRef

Marcos-Manchon, P.[Pablo], Alcover-Couso, R.[Roberto], SanMiguel, J.C.[Juan C.], Martinez, J.M.[Jose M.],
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models,
CVPR24(9242-9252)
IEEE DOI 2410
Training, Vocabulary, Text recognition, Semantic segmentation, Computational modeling, Text to image, Diffusion models, Attention BibRef

Sun, S.Y.[Shu-Yang], Li, R.[Runjia], Torr, P.[Philip], Gu, X.[Xiuye], Li, S.Y.[Si-Yang],
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor,
CVPR24(13171-13182)
IEEE DOI 2410
Training, Vocabulary, Visualization, Filters, Semantic segmentation, Semantics, open-vocabulary, image segmentation, training-free methods BibRef

Kong, L.D.[Ling-Dong], Liu, Y.Q.[You-Quan], Ng, L.X.[Lai Xing], Cottereau, B.R.[Benoit R.], Ooi, W.T.[Wei Tsang],
OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies,
CVPR24(15686-15698)
IEEE DOI 2410
Representation learning, Vocabulary, Image resolution, Semantic segmentation, Scalability, Semantics, Event Camera, Multi-Modal Learning BibRef

Xu, J.X.[Jing-Xuan], Chen, W.Y.[Wu-Yang], Zhao, Y.[Yao], Wei, Y.C.[Yun-Chao],
Transferable and Principled Efficiency for Open-Vocabulary Segmentation,
CVPR24(15814-15824)
IEEE DOI Code:
WWW Link. 2410
Training, Convolutional codes, Costs, Computational modeling, Object detection, Solids BibRef

Barsellotti, L.[Luca], Amoroso, R.[Roberto], Cornia, M.[Marcella], Baraldi, L.[Lorenzo], Cucchiara, R.[Rita],
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation,
CVPR24(3689-3698)
IEEE DOI 2410
Training, Location awareness, Visualization, Semantic segmentation, Source coding, Semantics, Open-Vocabulary, Segmentation, Unsupervised BibRef

Xie, B.[Bin], Cao, J.[Jiale], Xie, J.[Jin], Khan, F.S.[Fahad Shahbaz], Pang, Y.W.[Yan-Wei],
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation,
CVPR24(3426-3436)
IEEE DOI Code:
WWW Link. 2410
Degradation, Adaptation models, Costs, Accuracy, Semantic segmentation, Source coding, Semantics, Open-Vocabulary, Encoder-Decoder BibRef

Sodano, M.[Matteo], Magistri, F.[Federico], Nunes, L.[Lucas], Behley, J.[Jens], Stachniss, C.[Cyrill],
Open-World Semantic Segmentation Including Class Similarity,
CVPR24(3184-3194)
IEEE DOI Code:
WWW Link. 2410
Training, Semantic segmentation, Machine vision, Training data, Computer architecture, Data models, Autonomous Driving BibRef

Choe, S.A.[Seun-An], Shin, A.H.[Ah-Hyung], Park, K.H.[Keon-Hee], Choi, J.[Jinwoo], Park, G.M.[Gyeong-Moon],
Open-Set Domain Adaptation for Semantic Segmentation,
CVPR24(23943-23953)
IEEE DOI Code:
WWW Link. 2410
Industries, Adaptation models, Limiting, Shape, Semantic segmentation, Computational modeling, Domain Adaptation BibRef

Shan, X.H.[Xiang-Heng], Wu, D.Y.[Dong-Yue], Zhu, G.L.[Gui-Lin], Shao, Y.J.[Yuan-Jie], Sang, N.[Nong], Gao, C.X.[Chang-Xin],
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing,
CVPR24(28412-28421)
IEEE DOI Code:
WWW Link. 2410
Training, Adaptation models, Vocabulary, Image resolution, Image recognition, Semantic segmentation, Semantics BibRef

Li, Z.[Ziyi], Zhou, Q.[Qinye], Zhang, X.Y.[Xiao-Yun], Zhang, Y.[Ya], Wang, Y.F.[Yan-Feng], Xie, W.[Weidi],
Open-vocabulary Object Segmentation with Diffusion Models,
ICCV23(7633-7642)
IEEE DOI 2401
BibRef

Zhu, M.[Muzhi], Li, H.T.[Heng-Tao], Chen, H.[Hao], Fan, C.X.[Cheng-Xiang], Mao, W.[Weian], Jing, C.C.[Chen-Chen], Liu, Y.F.[Yi-Fan], Shen, C.H.[Chun-Hua],
SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning,
ICCV23(999-1008)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zhang, H.[Hao], Li, F.[Feng], Zou, X.[Xueyan], Liu, S.[Shilong], Li, C.Y.[Chun-Yuan], Yang, J.W.[Jian-Wei], Zhang, L.[Lei],
A Simple Framework for Open-Vocabulary Segmentation and Detection,
ICCV23(1020-1031)
IEEE DOI Code:
WWW Link. 2401
BibRef

Huang, K.[Kai], Wang, F.[Feigege], Xi, Y.[Ye], Gao, Y.[Yutao],
Prototypical Kernel Learning and Open-set Foreground Perception for Generalized Few-shot Semantic Segmentation,
ICCV23(19199-19208)
IEEE DOI 2401
BibRef

Cai, K.X.[Kai-Xin], Ren, P.Z.[Peng-Zhen], Zhu, Y.[Yi], Xu, H.[Hang], Liu, J.Z.[Jian-Zhuang], Li, C.[Changlin], Wang, G.R.[Guang-Run], Liang, X.D.[Xiao-Dan],
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation,
ICCV23(1196-1205)
IEEE DOI 2401
BibRef

Chen, J.[Jun], Zhu, D.[Deyao], Qian, G.C.[Guo-Cheng], Ghanem, B.[Bernard], Yan, Z.C.[Zhi-Cheng], Zhu, C.C.[Chen-Chen], Xiao, F.[Fanyi], Culatana, S.C.[Sean Chang], Elhoseiny, M.[Mohamed],
Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only,
ICCV23(699-710)
IEEE DOI Code:
WWW Link. 2401
BibRef

Han, K.Y.[Kun-Yang], Liu, Y.[Yong], Liew, J.H.[Jun Hao], Ding, H.H.[Heng-Hui], Liu, J.J.[Jia-Jun], Wang, Y.T.[Yi-Tong], Tang, Y.S.[Yan-Song], Yang, Y.[Yujiu], Feng, J.S.[Jia-Shi], Zhao, Y.[Yao], Wei, Y.C.[Yun-Chao],
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation,
ICCV23(797-807)
IEEE DOI 2401
BibRef

Barsellotti, L.[Luca], Amoroso, R.[Roberto], Baraldi, L.[Lorenzo], Cucchiara, R.[Rita],
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval,
WACV24(1453-1462)
IEEE DOI 2404
BibRef
Earlier:
Enhancing Open-vocabulary Semantic Segmentation with Prototype Retrieval,
CIAP23(II:196-208).
Springer DOI 2312
Training, Visualization, Sensitivity, Semantic segmentation, Semantics, Prototypes, Predictive models, Algorithms, Image recognition and understanding BibRef

Xu, J.[Jilan], Hou, J.L.[Jun-Lin], Zhang, Y.[Yuejie], Feng, R.[Rui], Wang, Y.[Yi], Qiao, Y.[Yu], Xie, W.[Weidi],
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision,
CVPR23(2935-2944)
IEEE DOI 2309
BibRef

Cha, J.[Junbum], Mun, J.[Jonghwan], Roh, B.[Byungseok],
Learning to Generate Text-Grounded Mask for Open-World Semantic Segmentation from Only Image-Text Pairs,
CVPR23(11165-11174)
IEEE DOI 2309
BibRef

Mukhoti, J.[Jishnu], Lin, T.Y.[Tsung-Yu], Poursaeed, O.[Omid], Wang, R.[Rui], Shah, A.[Ashish], Torr, P.H.S.[Philip H.S.], Lim, S.N.[Ser-Nam],
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning,
CVPR23(19413-19423)
IEEE DOI 2309
BibRef

Liang, F.[Feng], Wu, B.[Bichen], Dai, X.L.[Xiao-Liang], Li, K.[Kunpeng], Zhao, Y.[Yinan], Zhang, H.[Hang], Zhang, P.Z.[Pei-Zhao], Vajda, P.[Peter], Marculescu, D.[Diana],
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP,
CVPR23(7061-7070)
IEEE DOI 2309
BibRef

Zabari, N.[Nir], Hoshen, Y.[Yedid],
Open-vocabulary Semantic Segmentation Using Test-time Distillation,
LLID22(56-72).
Springer DOI 2304
BibRef

Nunes, I.[Ian], Pereira, M.B.[Matheus B.], Oliveira, H.[Hugo], dos Santos, J.A.[Jefersson A.], Poggi, M.[Marcus],
Conditional Reconstruction for Open-Set Semantic Segmentation,
ICIP22(946-950)
IEEE DOI 2211
Adaptation models, Semantics, Time series analysis, Data integration, Decoding, Task analysis, Image reconstruction, open world BibRef

Liu, Q.D.[Quan-De], Wen, Y.P.[You-Peng], Han, J.H.[Jian-Hua], Xu, C.J.[Chun-Jing], Xu, H.[Hang], Liang, X.D.[Xiao-Dan],
Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding,
ECCV22(XX:275-292).
Springer DOI 2211
BibRef

Chapter on 2-D Region Segmentation Techniques, Snakes, Active Contours continues in
Vision Transformers for Semantic Segmentation .


Last update:Jan 15, 2025 at 14:36:47