7.1.7.1 Detection Transformer, DETR Applications

Chapter Contents (Back)
Detection. DETR. Transformer.
See also Vision Transformers, ViT.
See also Vision Transformers for Semantic Segmentation, .
See also SWIN Transformer.

Wen, F.[Feng], Wang, M.[Mei], Hu, X.J.[Xiao-Jie],
DFAM-DETR: Deformable Feature Based Attention Mechanism DETR on Slender Object Detection,
IEICE(E106-D), No. 3, March 2023, pp. 401-409.
WWW Link. 2303
BibRef

Li, S.B.[Shi-Bao], Jia, Z.K.[Ze-Kun], Liu, Y.X.[Yi-Xuan], Cui, X.R.[Xue-Rong], Liu, J.H.[Jian-Hang], Huang, T.P.[Ting-Pei], Xu, J.[Jiuyun],
CLS-DETR: A DETR-series object detection network using classification information to accelerate convergence,
PRL(165), 2023, pp. 168-175.
Elsevier DOI 2301
Artificial intelligence, Deep learning, Computer vision, Object detection BibRef

Dai, L.H.[Lin-Hui], Liu, H.[Hong], Tang, H.[Hao], Wu, Z.W.[Zhi-Wei], Song, P.[Pinhao],
AO2-DETR: Arbitrary-Oriented Object Detection Transformer,
CirSysVideo(33), No. 5, May 2023, pp. 2342-2356.
IEEE DOI 2305
Proposals, Transformers, Feature extraction, Object detection, Detectors, Task analysis, Pipelines, Oriented object detection, feature refinement BibRef

Dai, Z.G.[Zhi-Gang], Cai, B.[Bolun], Lin, Y.G.[Yu-Geng], Chen, J.Y.[Jun-Ying],
Unsupervised Pre-Training for Detection Transformers,
PAMI(45), No. 11, November 2023, pp. 12772-12782.
IEEE DOI 2310
BibRef
Earlier:
UP-DETR: Unsupervised Pre-training for Object Detection with Transformers,
CVPR21(1601-1610)
IEEE DOI 2111
Location awareness, Training data, Object detection, Transformers, Feature extraction, Natural language processing, Pattern recognition BibRef

Zhang, G.J.[Gong-Jie], Luo, Z.P.[Zhi-Peng], Cui, K.W.[Kai-Wen], Lu, S.J.[Shi-Jian], Xing, E.P.[Eric P.],
Meta-DETR: Image-Level Few-Shot Detection With Inter-Class Correlation Exploitation,
PAMI(45), No. 11, November 2023, pp. 12832-12843.
IEEE DOI 2310
BibRef

Huang, L.J.[Lin-Jiang], Lu, K.X.[Kai-Xin], Song, G.L.[Guang-Lu], Wang, L.[Liang], Liu, S.[Si], Liu, Y.[Yu], Li, H.S.[Hong-Sheng],
Teach-DETR: Better Training DETR With Teachers,
PAMI(45), No. 12, December 2023, pp. 15759-15771.
IEEE DOI 2311
BibRef

Feng, Y.X.[Yun-Xiang], You, Y.[Yanan], Tian, J.[Jing], Meng, G.[Gang],
OEGR-DETR: A Novel Detection Transformer Based on Orientation Enhancement and Group Relations for SAR Object Detection,
RS(16), No. 1, 2024, pp. xx-yy.
DOI Link 2401
BibRef

Shi, F.Y.[Feng-Yuan], Gao, R.P.[Ruo-Peng], Huang, W.L.[Wei-Lin], Wang, L.M.[Li-Min],
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding,
PAMI(46), No. 2, February 2024, pp. 1181-1198.
IEEE DOI 2401
Visual grounding, multimodal transformer, 2D adaptive sampling, text guided decoding, dynamic structure BibRef

Wang, Y.[Yao], Ha, J.E.[Jong-Eun],
Improved Object Detection with Content and Position Separation in Transformer,
RS(16), No. 2, 2024, pp. 353.
DOI Link 2402
BibRef

Li, F.[Feng], Zhang, H.[Hao], Liu, S.[Shilong], Guo, J.[Jian], Ni, L.M.[Lionel M.], Zhang, L.[Lei],
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising,
PAMI(46), No. 4, April 2024, pp. 2239-2251.
IEEE DOI 2403
BibRef
Earlier: CVPR22(13609-13617)
IEEE DOI 2210
Training, Noise reduction, Convergence, Decoding, Transformers, Task analysis, Detectors, DETR, denoising training, vision transformer. Codes, Pattern recognition, Recognition: detection, Vision applications and systems BibRef


Liu, S.L.[Shi-Long], Ren, T.[Tianhe], Chen, J.[Jiayu], Zeng, Z.Y.[Zhao-Yang], Zhang, H.[Hao], Li, F.[Feng], Li, H.Y.[Hong-Yang], Huang, J.[Jun], Su, H.[Hang], Zhu, J.[Jun], Zhang, L.[Lei],
Detection Transformer with Stable Matching,
ICCV23(6468-6477)
IEEE DOI Code:
WWW Link. 2401
BibRef

He, L.Q.[Li-Qiang], Wang, W.[Wei], Chen, A.[Albert], Sun, M.[Min], Kuo, C.H.[Cheng-Hao], Todorovic, S.[Sinisa],
Bidirectional Alignment for Domain Adaptive Detection with Transformers,
ICCV23(18729-18739)
IEEE DOI Code:
WWW Link. 2401
BibRef

Liu, F.[Feng], Zhang, X.S.[Xiao-Song], Peng, Z.L.[Zhi-Liang], Guo, Z.[Zonghao], Wan, F.[Fang], Ji, X.Y.[Xiang-Yang], Ye, Q.X.[Qi-Xiang],
Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection,
ICCV23(6802-6811)
IEEE DOI Code:
WWW Link. 2401
BibRef

Fang, Y.X.[Yu-Xin], Yang, S.[Shusheng], Wang, S.J.[Shi-Jie], Ge, Y.X.[Yi-Xiao], Shan, Y.[Ying], Wang, X.G.[Xing-Gang],
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection,
ICCV23(6221-6230)
IEEE DOI Code:
WWW Link. 2401
BibRef

Li, L.Q.[Liang-Qi], Miao, J.X.[Jia-Xu], Shi, D.[Dahu], Tan, W.M.[Wen-Ming], Ren, Y.[Ye], Yang, Y.[Yi], Pu, S.L.[Shi-Liang],
Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection,
ICCV23(6478-6487)
IEEE DOI Code:
WWW Link. 2401
BibRef

Lin, Y.T.[Yu-Tong], Yuan, Y.H.[Yu-Hui], Zhang, Z.[Zheng], Li, C.[Chen], Zheng, N.N.[Nan-Ning], Hu, H.[Han],
DETR Does Not Need Multi-Scale or Locality Design,
ICCV23(6522-6531)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zhang, M.[Manyuan], Song, G.[Guanglu], Liu, Y.[Yu], Li, H.S.[Hong-Sheng],
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection,
ICCV23(6578-6587)
IEEE DOI 2401
BibRef

Chen, Q.[Qiang], Chen, X.K.[Xiao-Kang], Wang, J.[Jian], Zhang, S.[Shan], Yao, K.[Kun], Feng, H.C.[Hao-Cheng], Han, J.Y.[Jun-Yu], Ding, E.[Errui], Zeng, G.[Gang], Wang, J.D.[Jing-Dong],
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment,
ICCV23(6610-6619)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zheng, D.H.[De-Hua], Dong, W.H.[Wen-Hui], Hu, H.L.[Hai-Lin], Chen, X.H.[Xing-Hao], Wang, Y.H.[Yun-He],
Less is More: Focus Attention for Efficient DETR,
ICCV23(6651-6660)
IEEE DOI 2401
BibRef

Ye, M.Q.[Ming-Qiao], Ke, L.[Lei], Li, S.Y.[Si-Yuan], Tai, Y.W.[Yu-Wing], Tang, C.K.[Chi-Keung], Danelljan, M.[Martin], Yu, F.[Fisher],
Cascade-DETR: Delving into High-Quality Universal Object Detection,
ICCV23(6681-6691)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zong, Z.[Zhuofan], Song, G.[Guanglu], Liu, Y.[Yu],
DETRs with Collaborative Hybrid Assignments Training,
ICCV23(6725-6735)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zhang, L.[Lu], Zhang, C.[Chenbo], Zhao, J.J.[Jia-Jia], Guan, J.H.[Ji-Hong], Zhou, S.[Shuigeng],
Meta-ZSDETR: Zero-shot DETR with Meta-learning,
ICCV23(6822-6831)
IEEE DOI 2401
BibRef

Chang, J.H.[Jia-Hao], Wang, S.[Shuo], Xu, H.M.[Hai-Ming], Chen, Z.[Zehui], Yang, C.[Chenhongyi], Zhao, F.[Feng],
DETRDistill: A Universal Knowledge Distillation Framework for DETR-families,
ICCV23(6875-6885)
IEEE DOI 2401
BibRef

Zhang, R.[Renrui], Qiu, H.[Han], Wang, T.[Tai], Guo, Z.[Ziyu], Cui, Z.T.[Zi-Teng], Qiao, Y.[Yu], Li, H.S.[Hong-Sheng], Gao, P.[Peng],
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection,
ICCV23(9121-9132)
IEEE DOI Code:
WWW Link. 2401
BibRef

Kim, J.[Jihwan], Lee, M.[Miso], Heo, J.P.[Jae-Pil],
Self-Feedback DETR for Temporal Action Detection,
ICCV23(10252-10262)
IEEE DOI 2401
BibRef

Bulat, A.[Adrian], Guerrero, R.[Ricardo], Martinez, B.[Brais], Tzimiropoulos, G.[Georgios],
FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training,
ICCV23(11759-11768)
IEEE DOI 2401
BibRef

Senthivel, T.[Tharsan], Vu, N.S.[Ngoc-Son], Borzic, B.[Boris],
Detection Transformer with Diversified Object Queries,
ICIP23(2515-2519)
IEEE DOI 2312
BibRef

Qiu, T.[Tian], Zhou, L.Y.[Lin-Yun], Xu, W.X.[Wen-Xiang], Cheng, L.[Lechao], Feng, Z.[Zunlei], Song, M.L.[Ming-Li],
Team DETR: Guide Queries as a Professional Team in Detection Transformers,
ICIP23(450-454)
IEEE DOI Code:
WWW Link. 2312
BibRef

Zhang, J.Y.[Jing-Yi], Huang, J.X.[Jia-Xing], Luo, Z.P.[Zhi-Peng], Zhang, G.[Gongjie], Zhang, X.Q.[Xiao-Qin], Lu, S.J.[Shi-Jian],
DA-DETR: Domain Adaptive Detection Transformer with Information Fusion,
CVPR23(23787-23798)
IEEE DOI 2309
BibRef

Tang, Z.H.[Zong-Heng], Sun, Y.F.[Yi-Fan], Liu, S.[Si], Yang, Y.[Yi],
DETR with Additional Global Aggregation for Cross-domain Weakly Supervised Object Detection,
CVPR23(11422-11432)
IEEE DOI 2309
BibRef

Liu, Y.[Yang], Zhang, Y.[Yao], Wang, Y.X.[Yi-Xin], Zhang, Y.[Yang], Tian, J.[Jiang], Shi, Z.C.[Zhong-Chao], Fan, J.P.[Jian-Ping], He, Z.Q.[Zhi-Qiang],
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency,
CVPR23(15539-15547)
IEEE DOI 2309
BibRef

Chen, Z.[Zeren], Huang, G.[Gengshi], Li, W.[Wei], Teng, J.N.[Jia-Ning], Wang, K.[Kun], Shao, J.[Jing], Loy, C.C.[Chen Change], Sheng, L.[Lu],
Siamese DETR,
CVPR23(15722-15731)
IEEE DOI 2309
BibRef

Li, F.[Feng], Zeng, A.[Ailing], Liu, S.[Shilong], Zhang, H.[Hao], Li, H.Y.[Hong-Yang], Zhang, L.[Lei], Ni, L.M.[Lionel M.],
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR,
CVPR23(18558-18567)
IEEE DOI 2309
BibRef

Jia, D.[Ding], Yuan, Y.H.[Yu-Hui], He, H.[Haodi], Wu, X.P.[Xiao-Pei], Yu, H.[Haojun], Lin, W.H.[Wei-Hong], Sun, L.[Lei], Zhang, C.[Chao], Hu, H.[Han],
DETRs with Hybrid Matching,
CVPR23(19702-19712)
IEEE DOI 2309
BibRef

Chu, S.Y.[Shih-Yun], Lee, M.S.[Ming-Sui],
MT-DETR: Robust End-to-end Multimodal Detection with Confidence Fusion,
WACV23(5241-5250)
IEEE DOI 2302
Meteorological radar, Laser radar, Scalability, Training data, Object detection, Sensor phenomena and characterization, Vision + language and/or other modalities BibRef

Xu, S.[Sheng], Li, Y.J.[Yag-Jing], Lin, M.[Mingbao], Gao, P.[Peng], Guo, G.D.[Guo-Dong], Lü, J.[Jinhu], Zhang, B.C.[Bao-Chang],
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer,
CVPR23(3842-3851)
IEEE DOI 2309

WWW Link. BibRef

Li, J.M.[Jia-Ming], Lin, X.R.[Xiang-Ru], Zhang, W.[Wei], Tan, X.[Xiao], Li, Y.Y.[Ying-Ying], Han, J.Y.[Jun-Yu], Ding, E.[Errui], Wang, J.D.[Jing-Dong], Li, G.B.[Guan-Bin],
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection,
ICCV23(16344-16354)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zhang, J.C.[Jia-Cheng], Lin, X.R.[Xiang-Ru], Zhang, W.[Wei], Wang, K.[Kuo], Tan, X.[Xiao], Han, J.Y.[Jun-Yu], Ding, E.[Errui], Wang, J.D.[Jing-Dong], Li, G.B.[Guan-Bin],
Semi-DETR: Semi-Supervised Object Detection with Detection Transformers,
CVPR23(23809-23818)
IEEE DOI 2309
BibRef

Liu, C.[Chang], Zhang, W.M.[Wei-Ming], Lin, X.R.[Xiang-Ru], Zhang, W.[Wei], Tan, X.[Xiao], Han, J.Y.[Jun-Yu], Li, X.M.[Xiao-Mao], Ding, E.[Errui], Wang, J.D.[Jing-Dong],
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection,
CVPR23(15579-15588)
IEEE DOI 2309
BibRef

Zhang, G.J.[Gong-Jie], Luo, Z.P.[Zhi-Peng], Yu, Y.C.[Ying-Chen], Cui, K.W.[Kai-Wen], Lu, S.J.[Shi-Jian],
Accelerating DETR Convergence via Semantic-Aligned Matching,
CVPR22(939-948)
IEEE DOI 2210
Code, Detection Transformer.
WWW Link. Training, Costs, Semantics, Object detection, Transformers, Feature extraction, Recognition: detection, categorization, Motion and tracking BibRef

Gupta, A.[Akshita], Narayan, S.[Sanath], Joseph, K.J.[K J], Khan, S.[Salman], Khan, F.S.[Fahad Shahbaz], Shah, M.[Mubarak],
OW-DETR: Open-world Detection Transformer,
CVPR22(9225-9234)
IEEE DOI 2210
Training, Object detection, Transformers, Pattern recognition, Proposals, Object recognition, retrieval, categorization, Recognition: detection BibRef

Lou, Q.[Qian], Hsu, Y.C.[Yen-Chang], Uzkent, B.[Burak], Hua, T.[Ting], Shen, Y.[Yilin], Jin, H.X.[Hong-Xia],
Lite-MDETR: A Lightweight Multi-Modal Detector,
CVPR22(12196-12205)
IEEE DOI 2210
nTraining, Performance evaluation, Visualization, Dictionaries, Grounding, Detectors, Transformers, BibRef

La Bonte, T.[Tyler], Song, Y.[Yale], Wang, X.[Xin], Vineet, V.[Vibhav], Joshi, N.[Neel],
Scaling Novel Object Detection with Weakly Supervised Detection Transformers,
WACV23(85-96)
IEEE DOI 2302
Training, Costs, Surveillance, Object detection, Detectors, Transformers, Data models, and un-supervised learning) BibRef

Bar, A.[Amir], Wang, X.[Xin], Kantorov, V.[Vadim], Reed, C.J.[Colorado J.], Herzig, R.[Roei], Chechik, G.[Gal], Rohrbach, A.[Anna], Darrell, T.J.[Trevor J.], Globerson, A.[Amir],
DETReg: Unsupervised Pretraining with Region Priors for Object Detection,
CVPR22(14585-14595)
IEEE DOI 2210
Location awareness, Training, Object detection, Detectors, Transformers, Generators, Representation learning BibRef

Wang, P.[Pei], Cai, Z.W.[Zhao-Wei], Yang, H.[Hao], Swaminathan, G.[Gurumurthy], Vasconcelos, N.M.[Nuno M.], Schiele, B.[Bernt], Soatto, S.[Stefano],
Omni-DETR: Omni-Supervised Object Detection with Transformers,
CVPR22(9357-9366)
IEEE DOI 2210
Costs, Codes, Annotations, Filtering, Object detection, Computer architecture, Recognition: detection, categorization, retrieval BibRef

Zang, Y.H.[Yu-Hang], Li, W.[Wei], Zhou, K.Y.[Kai-Yang], Huang, C.[Chen], Loy, C.C.[Chen Change],
Open-Vocabulary DETR with Conditional Matching,
ECCV22(IX:106-122).
Springer DOI 2211
BibRef

Cai, Z.W.[Zhao-Wei], Kwon, G.[Gukyeong], Ravichandran, A.[Avinash], Bas, E.[Erhan], Tu, Z.W.[Zhuo-Wen], Bhotika, R.[Rahul], Soatto, S.[Stefano],
X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks,
ECCV22(XXXVI:290-308).
Springer DOI 2211
BibRef

Yang, A.[Antoine], Miech, A.[Antoine], Sivic, J.[Josef], Laptev, I.[Ivan], Schmid, C.[Cordelia],
TubeDETR: Spatio-Temporal Video Grounding with Transformers,
CVPR22(16421-16432)
IEEE DOI 2210
Location awareness, Grounding, Natural languages, Object detection, Benchmark testing, Vision + language BibRef

Kamath, A.[Aishwarya], Singh, M.[Mannat], Le Cun, Y.L.[Yann L.], Synnaeve, G.[Gabriel], Misra, I.[Ishan], Carion, N.[Nicolas],
MDETR: Modulated Detection for End-to-End Multi-Modal Understanding,
ICCV21(1760-1770)
IEEE DOI 2203
Visualization, Vocabulary, Image segmentation, Grounding, Detectors, Vision + language, Visual reasoning and logical representation BibRef

Liu, F.F.[Fan-Fan], Wei, H.R.[Hao-Ran], Zhao, W.Z.[Wen-Zhe], Li, G.Z.[Guo-Zhen], Peng, J.Q.[Jing-Quan], Li, Z.[Zihao],
WB-DETR: Transformer-Based Detector without Backbone,
ICCV21(2959-2967)
IEEE DOI 2203
Pipelines, Detectors, Object detection, Transformers, Feature extraction, Decoding, BibRef

Wang, T.[Tao], Yuan, L.[Li], Chen, Y.P.[Yun-Peng], Feng, J.S.[Jia-Shi], Yan, S.C.[Shui-Cheng],
PnP-DETR: Towards Efficient Visual Analysis with Transformers,
ICCV21(4641-4650)
IEEE DOI 2203
Adaptation models, Visualization, Image segmentation, Image recognition, Computational modeling, Redundancy, Detection and localization in 2D and 3D BibRef

Sun, Z.Q.[Zhi-Qing], Cao, S.[Shengcao], Yang, Y.M.[Yi-Ming], Kitani, K.[Kris],
Rethinking Transformer-based Set Prediction for Object Detection,
ICCV21(3591-3600)
IEEE DOI 2203
Training, Codes, Computational modeling, Object detection, Transformers, Optimization, BibRef

Dai, X.Y.[Xi-Yang], Chen, Y.P.[Yin-Peng], Yang, J.W.[Jian-Wei], Zhang, P.C.[Peng-Chuan], Yuan, L.[Lu], Zhang, L.[Lei],
Dynamic DETR: End-to-End Object Detection with Dynamic Attention,
ICCV21(2968-2977)
IEEE DOI 2203
Training, Visualization, Object detection, Color, Transformers, Feature extraction, Encoding, Vision applications and systems BibRef

Gao, P.[Peng], Zheng, M.H.[Ming-Hang], Wang, X.G.[Xiao-Gang], Dai, J.F.[Ji-Feng], Li, H.S.[Hong-Sheng],
Fast Convergence of DETR with Spatially Modulated Co-Attention,
ICCV21(3601-3610)
IEEE DOI 2203
Training, Representation learning, Visualization, Schedules, Costs, Object detection, Transformers, Vision applications and systems BibRef

Meng, D.[Depu], Chen, X.K.[Xiao-Kang], Fan, Z.[Zejia], Zeng, G.[Gang], Li, H.Q.[Hou-Qiang], Yuan, Y.H.[Yu-Hui], Sun, L.[Lei], Wang, J.D.[Jing-Dong],
Conditional DETR for Fast Training Convergence,
ICCV21(3631-3640)
IEEE DOI 2203

WWW Link. Encoder-decoder object detection. Training, Codes, Pose estimation, Object detection, Transformers, Decoding, Detection and localization in 2D and 3D, Recognition and classification BibRef

Chapter on 2-D Feature Analysis, Extraction and Representations, Shape, Skeletons, Texture continues in
Blob Detection .


Last update:Apr 18, 2024 at 11:38:49