19.4.3.2.2 Visual Grounding, Grounding Expressions

Question Answer. Grounding. Visual Grounding. Visual Dialog. Mostly a subset of the related topics:
See also Visual Question Answering, Query, VQA, Visual Dialog.

Visual7W visual question answering,
Large-scale visual question answering (QA) dataset, with object-level groundings and multimodal answers. WWW Link.
Dataset, Visual Question Answering.

Liang, J.W.[Jun-Wei], Jiang, L.[Lu], Cao, L.L.[Liang-Liang], Kalantidis, Y.[Yannis], Li, L.J.[Li-Jia], Hauptmann, A.G.[Alexander G.],
Focal Visual-Text Attention for Memex Question Answering,
PAMI(41), No. 8, August 2019, pp. 1893-1908.
IEEE DOI 1907
BibRef
Earlier: A1, A2, A3, A5, A6, Only:
Focal Visual-Text Attention for Visual Question Answering,
CVPR18(6135-6143)
IEEE DOI 1812
Task analysis, Knowledge discovery, Visualization, Grounding, Metadata, Cognition, Photo albums, question answering, memex. Visualization, Videos, Computational modeling, Correlation. BibRef

Riquelme, F.[Felipe], de Goyeneche, A.[Alfredo], Zhang, Y.D.[Yun-Dong], Niebles, J.C.[Juan Carlos], Soto, A.[Alvaro],
Explaining VQA predictions using visual grounding and a knowledge base,
IVC(101), 2020, pp. 103968.
Elsevier DOI 2009
Deep Learning, Attention, Supervision, Knowledge Base, Interpretability, Explainability BibRef

Niu, Y.L.[Yu-Lei], Zhang, H.W.[Han-Wang], Lu, Z.W.[Zhi-Wu], Chang, S.F.[Shih-Fu],
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions,
PAMI(43), No. 1, January 2021, pp. 347-359.
IEEE DOI 2012
Grounding, Context modeling, Visualization, Task analysis, Pediatrics, Bayes methods, Annotations, referring expression generation BibRef

Yang, S.[Sibei], Li, G.[Guanbin], Yu, Y.Z.[Yi-Zhou],
Relationship-Embedded Representation Learning for Grounding Referring Expressions,
PAMI(43), No. 8, August 2021, pp. 2765-2779.
IEEE DOI 2107
BibRef
Earlier:
Cross-Modal Relationship Inference for Grounding Referring Expressions,
CVPR19(4140-4149).
IEEE DOI 2002
Locate the object instance in an image described by a referring expression. Visualization, Semantics, Grounding, Proposals, Data mining, Logic gates, Feature extraction, Referring expressions, gated graph convolutional network. Locate target object based on natural language descriptions. BibRef

Yang, Z.Y.[Zheng-Yuan], Kumar, T.[Tushar], Chen, T.L.[Tian-Lang], Su, J.S.[Jing-Song], Luo, J.B.[Jie-Bo],
Grounding-Tracking-Integration,
CirSysVideo(31), No. 9, September 2021, pp. 3433-3443.
IEEE DOI 2109
Grounding, Target tracking, Visualization, History, Task analysis, Object tracking, Annotations, Tracking by language BibRef

Zhang, W.X.[Wei-Xia], Ma, C.[Chao], Wu, Q.[Qi], Yang, X.K.[Xiao-Kang],
Language-Guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning,
CirSysVideo(31), No. 9, September 2021, pp. 3469-3481.
IEEE DOI 2109
Navigation, Training, Trajectory, Visualization, Task analysis, Grounding, Generators, Vision-and-language, embodied navigation, adversarial learning BibRef

Zhai, S.L.[Song-Lin], Guo, G.B.[Gui-Bing], Yuan, F.J.[Fa-Jie], Liu, Y.[Yuan], Wang, X.W.[Xing-Wei],
VSE-fs: Fast Full-Sample Visual Semantic Embedding,
IEEE_Int_Sys(36), No. 4, July 2021, pp. 3-12.
IEEE DOI 2109
Construct a joint embedding space between visual features and semantic information. Computational modeling, Training, Integrated circuits, Time complexity, Semantics, Visualization, Intelligent systems, Negative Sampling BibRef

Sun, M.J.[Ming-Jie], Xiao, J.[Jimin], Lim, E.G.[Eng Gee], Liu, S.[Si], Goulermas, J.Y.[John Y.],
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding,
PAMI(43), No. 11, November 2021, pp. 4189-4195.
IEEE DOI 2110
Image reconstruction, Training, Proposals, Visualization, Task analysis, Linguistics, Grounding, discriminative triad matching BibRef

Bargal, S.A.[Sarah Adel], Zunino, A.[Andrea], Petsiuk, V.[Vitali], Zhang, J.M.[Jian-Ming], Saenko, K.[Kate], Murino, V.[Vittorio], Sclaroff, S.[Stan],
Guided Zoom: Zooming into Network Evidence to Refine Fine-Grained Model Decisions,
PAMI(43), No. 11, November 2021, pp. 4196-4202.
IEEE DOI 2110
Grounding, Training, Predictive models, Annotations, Location awareness, Correlation, Visualization, Explainable AI, convolutional neural networks BibRef

Yang, W.F.[Wen-Fei], Zhang, T.Z.[Tian-Zhu], Zhang, Y.D.[Yong-Dong], Wu, F.[Feng],
Local Correspondence Network for Weakly Supervised Temporal Sentence Grounding,
IP(30), 2021, pp. 3252-3262.
IEEE DOI 2103
Grounding, Annotations, Training, Feature extraction, Computational modeling, Task analysis, temporal sentence grounding BibRef

Luo, W.[Wang], Zhang, T.Z.[Tian-Zhu], Yang, W.[Wenfei], Liu, J.G.[Jin-Gen], Mei, T.[Tao], Wu, F.[Feng], Zhang, Y.D.[Yong-Dong],
Action Unit Memory Network for Weakly Supervised Temporal Action Localization,
CVPR21(9964-9974)
IEEE DOI 2111
Location awareness, Training, Knowledge engineering, Motion segmentation, Refining, Interference, Benchmark testing BibRef

Hong, R.[Richang], Liu, D.[Daqing], Mo, X.Y.[Xiao-Yu], He, X.N.[Xiang-Nan], Zhang, H.[Hanwang],
Learning to Compose and Reason with Language Tree Structures for Visual Grounding,
PAMI(44), No. 2, February 2022, pp. 684-696.
IEEE DOI 2201
Grounding, Visualization, Dogs, Natural languages, Cognition, Computational modeling, Semantics, Fine-grained detection, visual reasoning BibRef

Bin, Y.[Yi], Ding, Y.[Yujuan], Peng, B.[Bo], Peng, L.[Liang], Yang, Y.[Yang], Chua, T.S.[Tat-Seng],
Entity Slot Filling for Visual Captioning,
CirSysVideo(32), No. 1, January 2022, pp. 52-62.
IEEE DOI 2201
Task analysis, Visualization, Neural networks, Adaptation models, Filling, Grounding, Training, Image captioning, dataset BibRef

Chu, C.[Chenhui], Oliveira, V.[Vinicius], Virgo, F.G.[Felix Giovanni], Otani, M.[Mayu], Garcia, N.[Noa], Nakashima, Y.[Yuta],
The semantic typology of visually grounded paraphrases,
CVIU(215), 2022, pp. 103333.
Elsevier DOI 2201
Vision and language, Image interpretation, Visual grounded paraphrases, Semantic typology, Dataset BibRef


Soldan, M.[Mattia], Xu, M.M.[Meng-Meng], Qu, S.[Sisi], Tegner, J.[Jesper], Ghanem, B.[Bernard],
VLG-Net: Video-Language Graph Matching Network for Video Grounding,
CVEU21(3217-3227)
IEEE DOI 2112
Location awareness, Grounding, Semantics, Syntactics, Graph neural networks BibRef

Lu, X.P.[Xiao-Peng], Fan, Z.[Zhen], Wang, Y.[Yansen], Oh, J.[Jean], Rosé, C.P.[Carolyn P.],
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling,
XSAnim21(2631-2639)
IEEE DOI 2112
Integrated optics, Visualization, Grounding, Computational modeling, Knowledge discovery BibRef

Song, S.[Sijie], Lin, X.D.[Xu-Dong], Liu, J.Y.[Jia-Ying], Guo, Z.M.[Zong-Ming], Chang, S.F.[Shih-Fu],
Co-Grounding Networks with Semantic Attention for Referring Expression Comprehension in Videos,
CVPR21(1346-1355)
IEEE DOI 2111
Visualization, Correlation, Grounding, Computational modeling, Semantics, Benchmark testing BibRef

Tian, Y.P.[Ya-Peng], Hu, D.[Di], Xu, C.L.[Chen-Liang],
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation,
CVPR21(2744-2753)
IEEE DOI 2111
Training, Visualization, Codes, Grounding, Computational modeling, Pattern recognition BibRef

Nan, G.[Guoshun], Qiao, R.[Rui], Xiao, Y.[Yao], Liu, J.[Jun], Leng, S.[Sicong], Zhang, H.[Hao], Lu, W.[Wei],
Interventional Video Grounding with Dual Contrastive Learning,
CVPR21(2764-2774)
IEEE DOI 2111
Visualization, Correlation, Grounding, Benchmark testing, Knowledge discovery, Data models, Pattern recognition BibRef

Zhao, Y.[Yang], Zhao, Z.[Zhou], Zhang, Z.[Zhu], Lin, Z.J.[Zhi-Jie],
Cascaded Prediction Network via Segment Tree for Temporal Video Grounding,
CVPR21(4195-4204)
IEEE DOI 2111
Costs, Grounding, Navigation, Fuses, Benchmark testing, Pattern recognition BibRef

Liu, Y.[Yongfei], Wan, B.[Bo], Ma, L.[Lin], He, X.M.[Xu-Ming],
Relation-aware Instance Refinement for Weakly Supervised Visual Grounding,
CVPR21(5608-5617)
IEEE DOI 2111
Location awareness, Learning systems, Visualization, Grounding, Semantics, Noise reduction, Benchmark testing BibRef

Liu, H.[Haolin], Lin, A.[Anran], Han, X.G.[Xiao-Guang], Yang, L.[Lei], Yu, Y.Z.[Yi-Zhou], Cui, S.G.[Shu-Guang],
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images,
CVPR21(6028-6037)
IEEE DOI 2111
Heating systems, Geometry, Visualization, Grounding, Fuses, Feature extraction BibRef

Lin, X.R.[Xiang-Ru], Li, G.[Guanbin], Yu, Y.Z.[Yi-Zhou],
Scene-Intuitive Agent for Remote Embodied Visual Grounding,
CVPR21(7032-7041)
IEEE DOI 2111
Training, Visualization, Grounding, Navigation, Fuses, Semantics, Pipelines BibRef

Liu, D.[Daizong], Qu, X.Y.[Xiao-Ye], Dong, J.F.[Jian-Feng], Zhou, P.[Pan], Cheng, Y.[Yu], Wei, W.[Wei], Xu, Z.[Zichuan], Xie, Y.[Yulai],
Context-aware Biaffine Localizing Network for Temporal Sentence Grounding,
CVPR21(11230-11239)
IEEE DOI 2111
Location awareness, Codes, Grounding, Cognition, Pattern recognition, Task analysis BibRef

Meng, Z.[Zihang], Yu, L.C.[Li-Cheng], Zhang, N.[Ning], Berg, T.[Tamara], Damavandi, B.[Babak], Singh, V.[Vikas], Bearman, A.[Amy],
Connecting What to Say With Where to Look by Modeling Human Attention Traces,
CVPR21(12674-12683)
IEEE DOI 2111
Measurement, Visualization, Grounding, Unified modeling language, Training data, Computer architecture, Transformers BibRef

Sun, M.J.[Ming-Jie], Xiao, J.[Jimin], Lim, E.G.[Eng Gee],
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning,
CVPR21(14055-14064)
IEEE DOI 2111
Art, Grounding, Reinforcement learning, Cognition, Pattern recognition, Proposals BibRef

Wang, L.[Liwei], Huang, J.[Jing], Li, Y.[Yin], Xu, K.[Kun], Yang, Z.Y.[Zheng-Yuan], Yu, D.[Dong],
Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation,
CVPR21(14085-14095)
IEEE DOI 2111
Training, Visualization, Technological innovation, Costs, Grounding, Detectors BibRef

Feng, G.[Guang], Hu, Z.W.[Zhi-Wei], Zhang, L.[Lihe], Lu, H.C.[Hu-Chuan],
Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation,
CVPR21(15501-15510)
IEEE DOI 2111
Measurement, Visualization, Image segmentation, Grounding, Semantics, Transforms, Information representation BibRef

Huang, B.[Binbin], Lian, D.Z.[Dong-Ze], Luo, W.X.[Wei-Xin], Gao, S.H.[Sheng-Hua],
Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding,
CVPR21(16883-16892)
IEEE DOI 2111
Visualization, Grounding, Convolution, Heuristic algorithms, Computational modeling, Linguistics BibRef

Zhou, H.[Hao], Zhang, C.Y.[Chong-Yang], Luo, Y.[Yan], Chen, Y.J.[Yan-Jun], Hu, C.P.[Chuan-Ping],
Embracing Uncertainty: Decoupling and De-bias for Robust Temporal Grounding,
CVPR21(8441-8450)
IEEE DOI 2111
Performance evaluation, Uncertainty, Grounding, Annotations, Feature extraction, Robustness BibRef

Whitehead, S.[Spencer], Wu, H.[Hui], Ji, H.[Heng], Feris, R.[Rogerio], Saenko, K.[Kate],
Separating Skills and Concepts for Novel Visual Question Answering,
CVPR21(5628-5637)
IEEE DOI 2111
Training, Visualization, Grounding, Annotations, Knowledge discovery, Encoding BibRef

Khan, A.U.[Aisha Urooj], Kuehne, H.[Hilde], Duarte, K.[Kevin], Gan, C.[Chuang], Lobo, N.[Niels], Shah, M.[Mubarak],
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules,
CVPR21(8461-8470)
IEEE DOI 2111
Training, Visualization, Vocabulary, Grounding, Focusing, Detectors, Knowledge discovery BibRef

Zhang, S.Y.[Sheng-Yu], Jiang, T.[Tan], Wang, T.[Tan], Kuang, K.[Kun], Zhao, Z.[Zhou], Zhu, J.[Jianke], Yu, J.[Jin], Yang, H.X.[Hong-Xia], Wu, F.[Fei],
DeVLBert: Out-of-distribution Visio-Linguistic Pretraining with Causality,
CiV21(1744-1747)
IEEE DOI 2109
Visualization, Correlation, Image retrieval, Computer architecture, Knowledge discovery BibRef

Nguyen, A.T.[Andre T.], Richards, L.E.[Luke E.], Kebe, G.Y.[Gaoussou Youssouf], Raff, E.[Edward], Darvish, K.[Kasra], Ferraro, F.[Frank], Matuszek, C.[Cynthia],
Practical Cross-modal Manifold Alignment for Robotic Grounded Language Learning,
MULA21(1613-1622)
IEEE DOI 2109
Manifolds, Measurement, Learning systems, Natural languages, Robot sensing systems BibRef

Shrestha, A.[Amar], Pugdeethosapol, K.[Krittaphat], Fang, H.[Haowen], Qiu, Q.[Qinru],
MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level,
ICPR21(8275-8282)
IEEE DOI 2105
Visualization, Grounding, Fuses, Magnetic resonance imaging, Natural languages, Games, Pattern recognition BibRef

Zhang, Z., Zhao, Z., Zhao, Y., Wang, Q., Liu, H., Gao, L.,
Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences,
CVPR20(10665-10674)
IEEE DOI 2008
Grounding, Task analysis, Visualization, Cognition, Feature extraction, Natural languages BibRef

Burns, A.[Andrea], Tan, R.[Reuben], Saenko, K.[Kate], Sclaroff, S.[Stan], Plummer, B.[Bryan],
Language Features Matter: Effective Language Representations for Vision-Language Tasks,
ICCV19(7473-7482)
IEEE DOI 2004
Code, Visualization.
WWW Link. data visualisation, graph theory, image representation, learning (artificial intelligence), Grounding BibRef

Sadhu, A.[Arka], Chen, K.[Kan], Nevatia, R.[Ram],
Video Object Grounding Using Semantic Roles in Language Description,
CVPR20(10414-10424)
IEEE DOI 2008
Grounds objects in videos referred to in natural language descriptions. Semantics, Encoding, Proposals, Grounding, Visualization, Task analysis, Feature extraction BibRef

Ma, C.Y.[Chih-Yao], Kalantidis, Y.[Yannis], AlRegib, G.[Ghassan], Vajda, P.[Peter], Rohrbach, M.[Marcus], Kira, Z.[Zsolt],
Learning to Generate Grounded Visual Captions Without Localization Supervision,
ECCV20(XVIII:353-370).
Springer DOI 2012
BibRef

Gouthaman, K.V., Mittal, A.[Anurag],
Reducing Language Biases in Visual Question Answering with Visually-grounded Question Encoder,
ECCV20(XIII:18-34).
Springer DOI 2011
BibRef

Zeng, R.H.[Run-Hao], Xu, H.M.[Hao-Ming], Huang, W.B.[Wen-Bing], Chen, P.H.[Pei-Hao], Tan, M.K.[Ming-Kui], Gan, C.[Chuang],
Dense Regression Network for Video Grounding,
CVPR20(10284-10293)
IEEE DOI 2008
Grounding, Training, Task analysis, Proposals, Semantics, Magnetic heads, Feature extraction BibRef

Gupta, T.[Tanmay], Vahdat, A.[Arash], Chechik, G.[Gal], Yang, X.D.[Xiao-Dong], Kautz, J.[Jan], Hoiem, D.[Derek],
Contrastive Learning for Weakly Supervised Phrase Grounding,
ECCV20(III:752-768).
Springer DOI 2012
BibRef

Tan, H.L., Leong, M.C., Xu, Q., Li, L., Fang, F., Cheng, Y., Gauthier, N., Sun, Y., Lim, J.H.,
Task-Oriented Multi-Modal Question Answering For Collaborative Applications,
ICIP20(1426-1430)
IEEE DOI 2011
Task analysis, Collaboration, Grounding, Visualization, Cognition, Training, Machine learning, question answering, corpora BibRef

Yang, S.[Sibei], Li, G.B.[Guan-Bin], Yu, Y.Z.[Yi-Zhou],
Propagating Over Phrase Relations for One-stage Visual Grounding,
ECCV20(XIX:589-605).
Springer DOI 2011
BibRef

Xiao, J.B.[Jun-Bin], Shang, X.[Xindi], Yang, X.[Xun], Tang, S.[Sheng], Chua, T.S.[Tat-Seng],
Visual Relation Grounding in Videos,
ECCV20(VI:447-464).
Springer DOI 2011
Code, Relations.
WWW Link. BibRef

Mun, J., Cho, M., Han, B.,
Local-Global Video-Text Interactions for Temporal Grounding,
CVPR20(10807-10816)
IEEE DOI 2008
Semantics, Feature extraction, Grounding, Visualization, Proposals, Task analysis, Context modeling BibRef

Wu, C., Lin, Z., Cohen, S., Bui, T., Maji, S.,
PhraseCut: Language-Based Image Segmentation in the Wild,
CVPR20(10213-10222)
IEEE DOI 2008
Visualization, Grounding, Image segmentation, Task analysis, Genomics, Bioinformatics, Natural languages BibRef

Selvaraju, R.R., Tendulkar, P., Parikh, D., Horvitz, E., Tulio Ribeiro, M., Nushi, B., Kamar, E.,
SQuINTing at VQA Models: Introspecting VQA Models With Sub-Questions,
CVPR20(10000-10008)
IEEE DOI 2008
Cognition, Task analysis, Visualization, Image color analysis, Grounding, Text recognition, Computational modeling BibRef

Chen, L.[Lei], Zhai, M.Y.[Meng-Yao], He, J.W.[Jia-Wei], Mori, G.[Greg],
Object Grounding via Iterative Context Reasoning,
MDALC19(1407-1415)
IEEE DOI 2004
Localize a set of queries in the image. image classification, image representation, image segmentation, inference mechanisms, iterative methods, query processing, weakly supervised learning BibRef

Sinha, A.[Abhishek], Akilesh, B., Sarkar, M.[Mausoom], Krishnamurthy, B.[Balaji],
Attention Based Natural Language Grounding by Navigating Virtual Environment,
WACV19(236-244)
IEEE DOI 1904
learning (artificial intelligence), natural language processing, virtual reality, Grounding BibRef

Selvaraju, R.R., Lee, S., Shen, Y., Jin, H., Ghosh, S., Heck, L., Batra, D., Parikh, D.,
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded,
ICCV19(2591-2600)
IEEE DOI 2004
gradient methods, image retrieval, natural language processing, neural nets, question answering (information retrieval), HINT, Correlation BibRef

Zhang, Y., Niebles, J.C., Soto, A.,
Interpretable Visual Question Answering by Visual Grounding From Attention Supervision Mining,
WACV19(349-357)
IEEE DOI 1904
data mining, data visualisation, image representation, learning (artificial intelligence), Computer architecture BibRef

Shi, J.[Jing], Xu, J.[Jia], Gong, B.[Boqing], Xu, C.L.[Chen-Liang],
Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and Visual Clustering Losses,
CVPR19(10436-10444).
IEEE DOI 2002
BibRef

Datta, S.[Samyak], Sikka, K.[Karan], Roy, A.[Anirban], Ahuja, K.[Karuna], Parikh, D.[Devi], Divakaran, A.[Ajay],
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment,
ICCV19(2601-2610)
IEEE DOI 2004
image representation, image retrieval, learning (artificial intelligence), Image coding BibRef

Fang, Z.Y.[Zhi-Yuan], Kong, S.[Shu], Fowlkes, C.C.[Charless C.], Yang, Y.Z.[Ye-Zhou],
Modularized Textual Grounding for Counterfactual Resilience,
CVPR19(6371-6381).
IEEE DOI 2002
BibRef

Liu, X.J.[Xue-Jing], Li, L.[Liang], Wang, S.H.[Shu-Hui], Zha, Z.J.[Zheng-Jun], Meng, D.C.[De-Chao], Huang, Q.M.[Qing-Ming],
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding,
ICCV19(2611-2620)
IEEE DOI 2004
Localize the object in the image from a query. feature extraction, image classification, image reconstruction, image retrieval, Adaptive systems BibRef

Zhuang, B., Wu, Q., Shen, C., Reid, I.D., van den Hengel, A.J.[Anton J.],
Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries,
CVPR18(4252-4261)
IEEE DOI 1812
Visualization, Task analysis, Cognition, Proposals, Grounding, Correlation BibRef

Yang, Z.Y.[Zheng-Yuan], Chen, T.L.[Tian-Lang], Wang, L.[Liwei], Luo, J.B.[Jie-Bo],
Improving One-Stage Visual Grounding by Recursive Sub-query Construction,
ECCV20(XIV:387-404).
Springer DOI 2011
Code, Query.
WWW Link. BibRef

Zhang, H.W.[Han-Wang], Niu, Y.L.[Yu-Lei], Chang, S.F.[Shih-Fu],
Grounding Referring Expressions in Images by Variational Context,
CVPR18(4158-4166)
IEEE DOI 1812
Grounding, Context modeling, Task analysis, Visualization, Pediatrics, Bayes methods, Natural languages BibRef

Yu, L.C.[Li-Cheng], Lin, Z.[Zhe], Shen, X.H.[Xiao-Hui], Yang, J.M.[Ji-Mei], Lu, X.[Xin], Bansal, M.[Mohit], Berg, T.L.[Tamara L.],
MAttNet: Modular Attention Network for Referring Expression Comprehension,
CVPR18(1307-1315)
IEEE DOI 1812
Localize image region described by natural language expression. Visualization, Computational modeling, Task analysis, Cats, Adaptation models, Feature extraction, Knowledge discovery BibRef

Liu, D.Q.[Da-Qing], Zhang, H.W.[Han-Wang], Zha, Z.J.[Zheng-Jun], Wu, F.[Feng],
Learning to Assemble Neural Module Tree Networks for Visual Grounding,
ICCV19(4672-4681)
IEEE DOI 2004
approximation theory, data visualisation, grammars, learning (artificial intelligence), Training BibRef

Sadhu, A., Chen, K., Nevatia, R.,
Zero-Shot Grounding of Objects From Natural Language Queries,
ICCV19(4693-4702)
IEEE DOI 2004
image classification, learning (artificial intelligence), Visualization, natural language processing, object detection, query processing. BibRef

Yang, Z.Y.[Zheng-Yuan], Gong, B.Q.[Bo-Qing], Wang, L.W.[Li-Wei], Huang, W.B.[Wen-Bing], Yu, D.[Dong], Luo, J.B.[Jie-Bo],
A Fast and Accurate One-Stage Approach to Visual Grounding,
ICCV19(4682-4692)
IEEE DOI 2004
document image processing, feature extraction, image fusion, image segmentation, natural language processing, Encoding BibRef

Rohrbach, A.[Anna], Rohrbach, M.[Marcus], Tang, S.[Siyu], Oh, S.J.[Seong Joon], Schiele, B.[Bernt],
Generating Descriptions with Grounded and Co-referenced People,
CVPR17(4196-4206)
IEEE DOI 1711
Movie description. Grounding, Head, Joining processes, Motion pictures, Videos, Visualization BibRef

Zhu, Y., Kiros, R., Zemel, R., Salakhutdinov, R., Urtasun, R., Torralba, A.B., Fidler, S.,
Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books,
ICCV15(19-27)
IEEE DOI 1602
Grounding BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Internet Label Information.


Last update: Jan 20, 2022 at 13:32:42