11.14.4.2.2 Instructional, Training Videos, How To, Teach Machine How

Chapter Contents (Back)
Video Analysis. Instructional. Education. How-To. Instructional Video.

How2 Dataset,
2019
WWW Link. Instructional videos Used in How2 Challenge at ICML 2009 Dataset, Instructional Video.

YouCook2,
2018
WWW Link. Cooking videos Dataset, Instructional Video.

Hoshino, K.[Kiyoshi],
Dexterous Robot Hand Control with Data Glove by Human Imitation,
IEICE(E89-D), No. 6, June 2006, pp. 1820-1825.
DOI Link 0606
BibRef

Choudary, C., Liu, T.C.[Tie-Cheng],
Summarization of Visual Content in Instructional Videos,
MultMed(9), No. 7, November 2007, pp. 1443-1455.
IEEE DOI 0905
BibRef
Earlier: A2, A1:
Content Extraction and Summarization of Instructional Videos,
ICIP06(149-152).
IEEE DOI 0610
BibRef

Liu, T.C.[Tie-Cheng], Katpelly, R.,
Content-Adaptive Video Summarization Combining Queueing and Clustering,
ICIP06(145-148).
IEEE DOI 0610
BibRef

Alayrac, J.B.[Jean-Baptiste], Bojanowski, P.[Piotr], Agrawal, N.[Nishant], Sivic, J.[Josef], Laptev, I.[Ivan], Lacoste-Julien, S.[Simon],
Learning from Narrated Instruction Videos,
PAMI(40), No. 9, September 2018, pp. 2194-2208.
IEEE DOI 1808
Dataset, Instructional Video.
WWW Link. BibRef
Earlier:
Unsupervised Learning from Narrated Instruction Videos,
CVPR16(4575-4583)
IEEE DOI 1612
Videos, Automobiles, Visualization, Tires, YouTube, Internet, Pragmatics, Step discovery, narrated instruction videos, unsupervised learning. Text and images from video for learning the steps. BibRef

Doering, M., Glas, D.F., Ishiguro, H.,
Modeling Interaction Structure for Robot Imitation Learning of Human Social Behavior,
HMS(49), No. 3, June 2019, pp. 219-231.
IEEE DOI 1906
Robot sensing systems, Hidden Markov models, Data collection, Unsupervised learning, Training, Man-machine systems, unsupervised learning BibRef

Wu, A., Piergiovanni, A.J., Ryoo, M.S.,
Model-Based Robot Imitation with Future Image Similarity,
IJCV(128), No. 5, May 2020, pp. 1360-1374.
Springer DOI 2005
BibRef
And: Correction: IJCV(128), No. 5, May 2020, pp. 1375.
Springer DOI 2005
Imitation learning. BibRef

Tang, Y.S.[Yan-Song], Lu, J.W.[Ji-Wen], Zhou, J.[Jie],
Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation,
PAMI(43), No. 9, September 2021, pp. 3138-3153.
IEEE DOI 2108
Task analysis, Tires, YouTube, Automobiles, Fasteners, Benchmark testing, Computed tomography, Instructional video, large-scale benchmark BibRef

Tang, Y.S.[Yan-Song], Ding, D.J.[Da-Jun], Rao, Y.M.[Yong-Ming], Zheng, Y.[Yu], Zhang, D.Y.[Dan-Yang], Zhao, L.[Lili], Lu, J.W.[Ji-Wen], Zhou, J.[Jie],
COIN: A Large-Scale Dataset for Comprehensive Instructional Video Analysis,
CVPR19(1207-1216).
IEEE DOI 2002
Dataset, Instructional Video.
WWW Link. BibRef


Nagasinghe, K.R.Y.[Kumaranage Ravindu Yasas], Zhou, H.[Honglu], Gunawardhana, M.[Malitha], Min, M.R.[Martin Renqiang], Harari, D.[Daniel], Khan, M.H.[Muhammad Haris],
Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos,
CVPR24(18816-18826)
IEEE DOI Code:
WWW Link. 2410
Training, Visualization, Sequential analysis, Training data, Knowledge graphs, Probabilistic logic, Planning BibRef

Ashutosh, K.[Kumar], Xue, Z.[Zihui], Nagarajan, T.[Tushar], Grauman, K.[Kristen],
Detours for Navigating Instructional Videos,
CVPR24(18804-18815)
IEEE DOI 2410
Training, Navigation, Computational modeling, Pipelines, Natural languages, Buildings, video understanding, video language models BibRef

Nagarajan, T.[Tushar], Torresani, L.[Lorenzo],
Step Differences in Instructional Video,
CVPR24(18740-18750)
IEEE DOI 2410
Visualization, Annotations, Training data, Benchmark testing, Data models, Cognition BibRef

Cui, J.M.[Jie-Ming], Liu, T.[Tengyu], Liu, N.[Nian], Yang, Y.D.[Yao-Dong], Zhu, Y.X.[Yi-Xin], Huang, S.Y.[Si-Yuan],
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents,
CVPR24(852-862)
IEEE DOI 2410
Training, Visualization, Imitation learning, Computational modeling, Humanoid robots, Manuals, open-vocabular, interactive agent BibRef

Bansal, S.[Siddhant], Arora, C.[Chetan], Jawahar, C.V.,
United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos,
WACV24(6495-6505)
IEEE DOI 2404
Computational modeling, Clustering algorithms, Benchmark testing, Task analysis, Videos, Algorithms BibRef

Ben-Shabat, Y.Z.[Yi-Zhak], Paul, J.[Jonathan], Segev, E.[Eviatar], Shrout, O.[Oren], Gould, S.[Stephen],
IKEA Ego 3D Dataset: Understanding furniture assembly actions from ego-view 3D Point Clouds,
WACV24(4343-4352)
IEEE DOI 2404
Point cloud compression, Performance evaluation, Focusing, Benchmark testing, Task analysis, Algorithms, 3D computer vision BibRef

Schoonbeek, T.J.[Tim J.], Houben, T.[Tim], Onvlee, H.[Hans], de With, P.H.N.[Peter H.N.], van der Sommen, F.[Fons],
IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting,
WACV24(4353-4362)
IEEE DOI Code:
WWW Link. 2404
Solid modeling, Scalability, Benchmark testing, Reproducibility of results, Robustness, Task analysis, Algorithms, Video recognition and understanding BibRef

Abdelslam, M.A.[Mohamed A.], Rangrej, S.B.[Samrudhdhi B.], Hadji, I.[Isma], Dvornik, N.[Nikita], Derpanis, K.G.[Konstantinos G.], Fazly, A.[Afsaneh],
GePSAn: Generative Procedure Step Anticipation in Cooking Videos,
ICCV23(2976-2985)
IEEE DOI 2401
BibRef

Zhong, Y.[Yiwu], Yu, L.C.[Li-Cheng], Bai, Y.[Yang], Li, S.W.[Shang-Wen], Yan, X.T.[Xue-Ting], Li, Y.[Yin],
Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations,
CVPR23(14825-14835)
IEEE DOI 2309
BibRef

Zhang, J.H.[Jia-Hao], Cherian, A.[Anoop], Liu, Y.[Yanbin], Ben-Shabat, Y.Z.[Yi-Zhak], Rodriguez, C.[Cristian], Gould, S.[Stephen],
Aligning Step-by-Step Instructional Diagrams to Video Demonstrations,
CVPR23(2483-2492)
IEEE DOI 2309
BibRef

Kosaka, T.[Takayuki], Kosaka, M.[Mari],
Development and Discussion of an Authentic Game to Develop Cleaning Skills,
VAMR23(33-42).
Springer DOI 2307
BibRef

Pan, Y.[Yueran], Wu, J.X.[Jia-Xin], Ju, R.[Ran], Zhou, Z.[Ziang], Gu, J.[Jiayue], Zeng, S.T.[Song-Tian], Yuan, L.[Lynn], Li, M.[Ming],
A Multimodal Framework for Automated Teaching Quality Assessment of One-to-many Online Instruction Videos,
ICPR22(1777-1783)
IEEE DOI 2212
Technical management, Protocols, Distance learning, Education, Supervised learning, Feature extraction, Quality assessment, Emotion Recognition BibRef

Qin, Y.Z.[Yu-Zhe], Wu, Y.H.[Yueh-Hua], Liu, S.W.[Shao-Wei], Jiang, H.W.[Han-Wen], Yang, R.[Ruihan], Fu, Y.[Yang], Wang, X.L.[Xiao-Long],
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos,
ECCV22(XXIX:570-587).
Springer DOI 2211
BibRef

Sener, F.[Fadime], Chatterjee, D.[Dibyadip], Shelepov, D.[Daniel], He, K.[Kun], Singhania, D.[Dipika], Wang, R.[Robert], Yao, A.[Angela],
Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities,
CVPR22(21064-21074)
IEEE DOI 2210
Training, Toy manufacturing industry, Recording, Synchronization, Datasets and evaluation, Action and event recognition BibRef

Ghoddoosian, R.[Reza], Dwivedi, I.[Isht], Agarwal, N.[Nakul], Dariush, B.[Behzad],
Weakly-Supervised Action Segmentation and Unseen Error Detection in Anomalous Instructional Videos,
ICCV23(10094-10104)
IEEE DOI 1806
BibRef

Ghoddoosian, R.[Reza], Dwivedi, I.[Isht], Agarwal, N.[Nakul], Choi, C.[Chiho], Dariush, B.[Behzad],
Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos,
CVPR22(13770-13780)
IEEE DOI 2210
Training, Costs, Annotations, Computational modeling, Benchmark testing, Self- semi- meta- unsupervised learning BibRef

Ghoddoosian, R.[Reza], Sayed, S.[Saif], Athitsos, V.[Vassilis],
Hierarchical Modeling for Task Recognition and Action Segmentation in Weakly-Labeled Instructional Videos,
WACV22(120-130)
IEEE DOI 2202
Training, Measurement, Runtime, Semantics, Task analysis, Videos, Action and Behavior Recognition action segmentation BibRef

Ramrakhya, R.[Ram], Undersander, E.[Eric], Batra, D.[Dhruv], Das, A.[Abhishek],
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale,
CVPR22(5163-5173)
IEEE DOI 2210
Navigation, Training data, Reinforcement learning, Search problems, Behavioral sciences, Trajectory, Vision+language BibRef

Zhao, H.[He], Hadji, I.[Isma], Dvornik, N.[Nikita], Derpanis, K.G.[Konstantinos G.], Wildes, R.P.[Richard P.], Jepson, A.D.[Allan D.],
P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision,
CVPR22(2928-2938)
IEEE DOI 2210
Training, Measurement, Visualization, Uncertainty, Transforms, Probabilistic logic, Transformers, Vision+language BibRef

Li, M.[Muheng], Chen, L.[Lei], Duarr, Y.[Yueqi], Hu, Z.[Zhilan], Feng, J.J.[Jian-Jiang], Zhou, J.[Jie], Lu, J.W.[Ji-Wen],
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos,
CVPR22(19848-19857)
IEEE DOI 2210
Codes, Semantics, Benchmark testing, Task analysis, Context modeling, Action and event recognition BibRef

Singh, K.P.[Kunal Pratap], Bhambri, S.[Suvaansh], Kim, B.[Byeonghwi], Mottaghi, R.[Roozbeh], Choi, J.H.[Jong-Hyun],
Factorizing Perception and Policy for Interactive Instruction Following,
ICCV21(1868-1877)
IEEE DOI 2203
Art, Navigation, Buildings, Benchmark testing, Task analysis, Collision avoidance, Vision+language, Vision for robotics and autonomous vehicles BibRef

Bi, J.[Jing], Luo, J.B.[Jie-Bo], Xu, C.L.[Chen-Liang],
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning,
ICCV21(15591-15600)
IEEE DOI 2203
Computational modeling, Decision making, Focusing, Inference algorithms, Planning, Bayes methods, Video analysis and understanding BibRef

Diaz, M.[Manfred], Fevens, T.[Thomas], Paull, L.[Liam],
Uncertainty-Aware Policy Sampling and Mixing for Safe Interactive Imitation Learning,
CRV21(72-78)
IEEE DOI 2108
Teaching robots how to execute tasks. Uncertainty, Supervised learning, Measurement uncertainty, Education, Training data, Safety, Trajectory, imitation learning, uncertainty estimation BibRef

Wang, S.J.[Shao-Jie], Zhao, W.T.[Wen-Tian], Kou, Z.Y.[Zi-Yi], Shi, J.[Jing], Xu, C.L.[Chen-Liang],
How to Make a BLT Sandwich? Learning VQA towards Understanding Web Instructional Videos,
WACV21(1129-1138)
IEEE DOI 2106
Measurement, Visualization, Fuses, Knowledge discovery, Motion pictures BibRef

Shen, Y.H.[Yu-Han], Elhamifar, E.[Ehsan],
Semi-Weakly-Supervised Learning of Complex Actions from Instructional Task Videos,
CVPR22(3334-3344)
IEEE DOI 2210
Training, Benchmark testing, Task analysis, Unsupervised learning, Videos, Video analysis and understanding, Self- semi- meta- unsupervised learning BibRef

Elhamifar, E.[Ehsan], Huynh, D.[Dat],
Self-supervised Multi-task Procedure Learning from Instructional Videos,
ECCV20(XVII:557-573).
Springer DOI 2011
BibRef

Yao, C.[Chong], Lou, L.Z.[Li-Zhu], Sui, X.K.[Xiao-Kui], Xu, M.[Ming],
Research on Quality Evaluation Algorithm of Flight Training for National Day Parade Air Echelon,
CVIDL20(130-134)
IEEE DOI 2102
aerospace computing, cameras, computer based training, learning (artificial intelligence), stereo image processing, Vanishing point BibRef

Chang, C.Y.[Chien-Yi], Huang, D.A.[De-An], Xu, D.[Danfei], Adeli, E.[Ehsan], Fei-Fei, L.[Li], Niebles, J.C.[Juan Carlos],
Procedure Planning in Instructional Videos,
ECCV20(XI:334-350).
Springer DOI 2011
BibRef

Miech, A., Zhukov, D., Alayrac, J., Tapaswi, M., Laptev, I., Alayrac, J.B.[Jean-Baptiste],
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips,
ICCV19(2630-2640)
IEEE DOI 2004

WWW Link. Dataset, Instructional Video. Internet, learning (artificial intelligence), natural language processing, social networking (online), Computational modeling BibRef

Qian, M.[Ming], Nicholson, J.[John], Wang, E.[Erin],
Quality of Experience Comparison Between Binocular and Monocular Augmented Reality Display Under Various Occlusion Conditions for Manipulation Tasks with Virtual Instructions,
VAMR19(I:490-499).
Springer DOI 1909
BibRef

Kayser, M.[Maxime], Camburu, O.M.[Oana-Maria], Recasens, A.[Adrià], Luc, P.[Pauline], Alayrac, J.B.[Jean-Baptiste], Wang, L.[Luyu], Strub, F.[Florian], Tallec, C.[Corentin], Malinowski, M.[Mateusz], Patraaucean, V.[Viorica], Altché, F.[Florent], Valko, M.[Michal], Grill, J.B.[Jean-Bastien], van den Oord, A.[Aäron], Zisserman, A.[Andrew],
Broaden Your Views for Self-Supervised Video Learning,
ICCV21(1235-1245)
IEEE DOI 2203
Representation learning, Computational modeling, Crops, Benchmark testing, Kinetic theory, Standards, Representation learning BibRef

Zhukov, D.[Dimitri], Alayrac, J.B.[Jean-Baptiste], Laptev, I.[Ivan], Sivic, J.[Josef],
Learning Actionness via Long-range Temporal Order Verification,
ECCV20(XXIX: 470-487).
Springer DOI 2010
BibRef

Zhukov, D.[Dimitri], Alayrac, J.B.[Jean-Baptiste], Cinbis, R.G.[Ramazan Gokberk], Fouhey, D.[David], Laptev, I.[Ivan], Sivic, J.[Josef],
Cross-Task Weakly Supervised Learning From Instructional Videos,
CVPR19(3532-3540).
IEEE DOI 2002
Dataset, Instructional Video.
WWW Link. BibRef

Sener, F., Yao, A.,
Zero-Shot Anticipation for Instructional Activities,
ICCV19(862-871)
IEEE DOI 2004
educational robots, learning (artificial intelligence), natural language processing, text analysis, Training BibRef

Huang, D., Buch, S., Dery, L., Garg, A., Fei-Fei, L., Niebles, J.C.,
Finding 'It': Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos,
CVPR18(5948-5957)
IEEE DOI 1812
Grounding, Videos, Visualization, Task analysis, Image resolution, Optimization, Joining processes BibRef

Huang, D.A., Lim, J.J., Fei-Fei, L.[Li], Niebles, J.C.[Juan Carlos],
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos,
CVPR17(1032-1041)
IEEE DOI 1711
Optimization, Pragmatics, Spatial resolution, Videos, Visualization BibRef

Dorai, C., Oria, V., Neelavalli, V.,
Structuralizing educational videos based on presentation content,
ICIP03(II: 1029-1032).
IEEE DOI 0312
BibRef

Liu, T.C.[Tie-Cheng], Kender, J.R.,
Semantic mosaic for indexing and compressing instructional videos,
ICIP03(I: 921-924).
IEEE DOI 0312
BibRef
Earlier:
Rule-based semantic summarization of instructional videos,
ICIP02(I: 601-604).
IEEE DOI 0210
BibRef

Chapter on 3-D Object Description and Computation Techniques, Surfaces, Deformable, View Generation, Video Conferencing continues in
Face Synthesis Using Three-Dimensional Models .


Last update:Nov 26, 2024 at 16:40:19