15.3.1.10 Vision-Language Navigation

Chapter Contents (Back)
Navigation. Vision-Language.

Bajcsy, R., and Nagel, H.H.,
Descriptive and Prescriptive Languages for Mobility Tasks: Are They Different?,
AIU96(280-300). BibRef 9600

Zhu, M., Chen, W., Xia, J., Ma, Y., Zhang, Y., Luo, Y., Huang, Z., Liu, L.,
Location2Vec: A Situation-Aware Representation for Visual Exploration of Urban Locations,
ITS(20), No. 10, October 2019, pp. 3981-3990.
IEEE DOI 1910
Trajectory, Visualization, Sociology, Statistics, Vehicle dynamics, Mobile handsets, Natural language processing, Human mobility, visual exploration BibRef

Li, P.[Pei], Li, X.[Xinde], Li, X.H.[Xiang-Hui], Pan, H.[Hong], Khyam, M.O., Noor-A-Rahim, M., Ge, S.S.[Shuzhi Sam],
Place perception from the fusion of different image representation,
PR(110), 2021, pp. 107680.
Elsevier DOI 2011
Indoor place perception, CNN, LSTM, Convolutional auto-encoder, Natural language BibRef

Wang, X.[Xin], Huang, Q.Y.[Qiu-Yuan], Celikyilmaz, A.[Asli], Gao, J.F.[Jian-Feng], Shen, D.[Dinghan], Wang, Y.F.[Yuan-Fang], Wang, W.Y.[William Yang], Zhang, L.[Lei],
Vision-Language Navigation Policy Learning and Adaptation,
PAMI(43), No. 12, December 2021, pp. 4205-4216.
IEEE DOI 2112
BibRef
Earlier:
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation,
CVPR19(6622-6631).
IEEE DOI Award, CVPR, Student. 2002
Navigation, Visualization, Cognition, Reinforcement learning, Natural languages, Benchmark testing, Natural languages, multimodal machine learning BibRef


Guhur, P.L.[Pierre-Louis], Tapaswi, M.[Makarand], Chen, S.Z.[Shi-Zhe], Laptev, I.[Ivan], Schmid, C.[Cordelia],
Airbert: In-Domain Pretraining for Vision-and-Language Navigation,
ICCV21(1614-1623)
IEEE DOI 2203
Adaptation models, Navigation, Atmospheric modeling, Computational modeling, Natural languages, Training data, Vision for robotics and autonomous vehicles BibRef

Liu, C.[Chong], Zhu, F.[Fengda], Chang, X.J.[Xiao-Jun], Liang, X.D.[Xiao-Dan], Ge, Z.[Zongyuan], Shen, Y.D.[Yi-Dong],
Vision-Language Navigation with Random Environmental Mixup,
ICCV21(1624-1634)
IEEE DOI 2203
Visualization, Navigation, Natural languages, Benchmark testing, Data models, Task analysis, Vision + language, BibRef

Qi, Y.[Yuankai], Pan, Z.Z.[Zi-Zheng], Hong, Y.C.[Yi-Cong], Yang, M.H.[Ming-Hsuan], van den Hengel, A.J.[Anton J.], Wu, Q.[Qi],
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation,
ICCV21(1635-1644)
IEEE DOI 2203
Visualization, TV, Navigation, Roads, Bit error rate, Predictive models, Linguistics, Vision + language, BibRef

Liu, Z.Y.[Zhe-Yuan], Rodriguez-Opazo, C.[Cristian], Teney, D.[Damien], Gould, S.[Stephen],
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models,
ICCV21(2105-2114)
IEEE DOI 2203
Visualization, Limiting, Codes, Image retrieval, Natural languages, Computer architecture, Vision + language, Representation learning BibRef

Pashevich, A.[Alexander], Schmid, C.[Cordelia], Sun, C.[Chen],
Episodic Transformer for Vision-and-Language Navigation,
ICCV21(15922-15932)
IEEE DOI 2203
Training, Visualization, Navigation, Natural languages, Detectors, Benchmark testing, Transformers, Vision + language BibRef

Ding, H.H.[Heng-Hui], Liu, C.[Chang], Wang, S.[Suchen], Jiang, X.D.[Xu-Dong],
Vision-Language Transformer and Query Generation for Referring Segmentation,
ICCV21(16301-16310)
IEEE DOI 2203
Convolutional codes, Image segmentation, Visualization, Computational modeling, Computer architecture, Transformers, Vision + language BibRef

Chen, K.[Kevin], Chen, J.K.[Junshen K.], Chuang, J.[Jo], Vázquez, M.[Marynel], Savarese, S.[Silvio],
Topological Planning with Transformers for Vision-and-Language Navigation,
CVPR21(11271-11281)
IEEE DOI 2111
Backtracking, Navigation, Natural languages, Buildings, Transformers, Planning BibRef

Badki, A.[Abhishek], Gallo, O.[Orazio], Kautz, J.[Jan], Sen, P.[Pradeep],
Binary TTC: A Temporal Geofence for Autonomous Navigation,
CVPR21(12941-12950)
IEEE DOI 2111
Quantization (signal), Estimation, Tools, Observers, Cameras, Real-time systems BibRef

Wang, H.Q.[Han-Qing], Wang, W.G.[Wen-Guan], Liang, W.[Wei], Xiong, C.M.[Cai-Ming], Shen, J.B.[Jian-Bing],
Structured Scene Memory for Vision-Language Navigation,
CVPR21(8451-8460)
IEEE DOI 2111
Visualization, Recurrent neural networks, Navigation, Decision making, Layout, Memory architecture BibRef

Wang, H.Q.[Han-Qing], Wang, W.[Wenguan], Shu, T.[Tianmin], Liang, W.[Wei], Shen, J.B.[Jian-Bing],
Active Visual Information Gathering for Vision-language Navigation,
ECCV20(XXII:307-322).
Springer DOI 2011
BibRef

Cao, J.[Jize], Gan, Z.[Zhe], Cheng, Y.[Yu], Yu, L.C.[Li-Cheng], Chen, Y.C.[Yen-Chun], Liu, J.J.[Jing-Jing],
Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-language Models,
ECCV20(VI:565-580).
Springer DOI 2011
BibRef

Moghaddam, M.K.[Mahdi Kazemi], Abbasnejad, E.[Ehsan], Wu, Q.[Qi], Shi, J.Q.[Javen Qinfeng], van den Hengel, A.J.[Anton J.],
ForeSI: Success-Aware Visual Navigation Agent,
WACV22(3401-3410)
IEEE DOI 2202
Training, Visualization, Navigation, Detectors, Reinforcement learning, Predictive models, Analysis and Understanding BibRef

Qi, Y., Wu, Q., Anderson, P., Wang, X., Wang, W.Y., Shen, C., van den Hengel, A.J.,
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments,
CVPR20(9979-9988)
IEEE DOI 2008
Task analysis, Navigation, Robots, Natural languages, Visualization, Object recognition, Indoor environments BibRef

Qi, Y.K.[Yuan-Kai], Pan, Z.Z.[Zi-Zheng], Zhang, S.P.[Sheng-Ping], van den Hengel, A.J.[Anton J.], Wu, Q.[Qi],
Object-and-action Aware Model for Visual Language Navigation,
ECCV20(X:303-317).
Springer DOI 2011
BibRef

Krantz, J.[Jacob], Wijmans, E.[Erik], Majumdar, A.[Arjun], Batra, D.[Dhruv], Lee, S.[Stefan],
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments,
ECCV20(XXVIII:104-120).
Springer DOI 2011
Agents must execute low-level actions to follow natural language navigation directions. BibRef

Wang, H.[Hu], Wu, Q.[Qi], Shen, C.H.[Chun-Hua],
Soft Expert Reward Learning for Vision-and-Language Navigation,
ECCV20(IX:126-141).
Springer DOI 2011
BibRef

Kim, J., Moon, S., Rohrbach, A., Darrell, T.J., Canny, J.,
Advisable Learning for Self-Driving Vehicles by Internalizing Observation-to-Action Rules,
CVPR20(9658-9667)
IEEE DOI 2008
Visualization, Semantics, Natural languages, Image segmentation, Generators, Training, Roads BibRef

Fu, T.J.[Tsu-Jui], Wang, X.E.[Xin Eric], Peterson, M.F.[Matthew F.], Grafton, S.T.[Scott T.], Eckstein, M.P.[Miguel P.], Wang, W.Y.[William Yang],
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler,
ECCV20(VI:71-86).
Springer DOI 2011
Based on language descriptions, relate them to the environment. BibRef

Majumdar, A.[Arjun], Shrivastava, A.[Ayush], Lee, S.[Stefan], Anderson, P.[Peter], Parikh, D.[Devi], Batra, D.[Dhruv],
Improving Vision-and-language Navigation with Image-text Pairs from the Web,
ECCV20(VI:259-274).
Springer DOI 2011
BibRef

Zhu, F.D.[Feng-Da], Zhu, Y.[Yi], Chang, X.J.[Xiao-Jun], Liang, X.D.[Xiao-Dan],
Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks,
CVPR20(10009-10019)
IEEE DOI 2008
Task analysis, Navigation, Cognition, Trajectory, Semantics, Training, Natural languages BibRef

Hao, W., Li, C., Li, X., Carin, L., Gao, J.,
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training,
CVPR20(13134-13143)
IEEE DOI 2008
Task analysis, Navigation, Visualization, Trajectory, Presses, Head, Predictive models BibRef

Yu, F., Deng, Z., Narasimhan, K., Russakovsky, O.,
Take the Scenic Route: Improving Generalization in Vision-and-Language Navigation,
VL3W20(4000-4004)
IEEE DOI 2008
Navigation, Benchmark testing, Task analysis, Natural languages, Visualization, Training data, Markov processes BibRef

Ma, C.Y.[Chih-Yao], Wu, Z.X.[Zu-Xuan], Al Regib, G.[Ghassan], Xiong, C.M.[Cai-Ming], Kira, Z.[Zsolt],
The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation,
CVPR19(6725-6733).
IEEE DOI 2002
Navigating to a goal purely from language instructions and visual information. BibRef

Ke, L.Y.M.[Li-Yi-Ming], Li, X.J.[Xiu-Jun], Bisk, Y.[Yonatan], Holtzman, A.[Ari], Gan, Z.[Zhe], Liu, J.J.[Jing-Jing], Gao, J.F.[Jian-Feng], Choi, Y.J.[Ye-Jin], Srinivasa, S.[Siddhartha],
Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation,
CVPR19(6734-6742).
IEEE DOI 2002
BibRef

Wang, X.[Xin], Xiong, W.H.[Wen-Han], Wang, H.M.[Hong-Min], Wang, W.Y.[William Yang],
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation,
ECCV18(XVI: 38-55).
Springer DOI 1810
BibRef

Anderson, P.[Peter], Wu, Q.[Qi], Teney, D.[Damien], Bruce, J.[Jake], Johnson, M.[Mark], Sünderhauf, N.[Niko], Reid, I.D.[Ian D.], Gould, S.[Stephen], van den Hengel, A.J.[Anton J.],
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments,
CVPR18(3674-3683)
IEEE DOI 1812
Navigation, Task analysis, Robots, Visualization, Cameras, Natural languages BibRef

Chen, H.[Howard], Suhr, A.[Alane], Misra, D.[Dipendra], Snavely, N.[Noah], Artzi, Y.[Yoav],
TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments,
CVPR19(12530-12539).
IEEE DOI 2002
BibRef

Nguyen, K.[Khanh], Dey, D.[Debadeepta], Brockett, C.[Chris], Dolan, B.[Bill],
Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention,
CVPR19(12519-12529).
IEEE DOI 2002
BibRef

Khoshelham, K., Díaz-Vilarińo, L.,
3D Modelling of Interior Spaces: Learning the Language of Indoor Architecture,
CloseRange14(321-326).
DOI Link 1411
BibRef

van Laere, O.[Olivier], Schockaert, S.[Steven], Dhoedt, B.[Bart],
Finding locations of Flickr resources using language models and similarity search,
ICMR11(48).
DOI Link 1301
estimate where a given photo or video was taken, using only the tags that a user has assigned BibRef

Chapter on Active Vision, Camera Calibration, Mobile Robots, Navigation, Road Following continues in
Visual SLAM: Simultaneous Location and Mapping or Matching .


Last update:Jun 27, 2022 at 12:58:02