How2 Dataset,
2019
WWW Link. Instructional videos
Used in How2 Challenge at ICML 2009
Dataset, Instructional Video.
YouCook2,
2018
WWW Link. Cooking videos
Dataset, Instructional Video.
Hoshino, K.[Kiyoshi],
Dexterous Robot Hand Control with Data Glove by Human Imitation,
IEICE(E89-D), No. 6, June 2006, pp. 1820-1825.
DOI Link
0606
BibRef
Choudary, C.,
Liu, T.C.[Tie-Cheng],
Summarization of Visual Content in Instructional Videos,
MultMed(9), No. 7, November 2007, pp. 1443-1455.
IEEE DOI
0905
BibRef
Earlier: A2, A1:
Content Extraction and Summarization of Instructional Videos,
ICIP06(149-152).
IEEE DOI
0610
BibRef
Liu, T.C.[Tie-Cheng],
Katpelly, R.,
Content-Adaptive Video Summarization Combining Queueing and Clustering,
ICIP06(145-148).
IEEE DOI
0610
BibRef
Alayrac, J.B.[Jean-Baptiste],
Bojanowski, P.[Piotr],
Agrawal, N.[Nishant],
Sivic, J.[Josef],
Laptev, I.[Ivan],
Lacoste-Julien, S.[Simon],
Learning from Narrated Instruction Videos,
PAMI(40), No. 9, September 2018, pp. 2194-2208.
IEEE DOI
1808
Dataset, Instructional Video.
WWW Link.
BibRef
Earlier:
Unsupervised Learning from Narrated Instruction Videos,
CVPR16(4575-4583)
IEEE DOI
1612
Videos, Automobiles, Visualization, Tires, YouTube, Internet, Pragmatics,
Step discovery, narrated instruction videos, unsupervised learning.
Text and images from video for learning the steps.
BibRef
Doering, M.,
Glas, D.F.,
Ishiguro, H.,
Modeling Interaction Structure for Robot Imitation Learning of Human
Social Behavior,
HMS(49), No. 3, June 2019, pp. 219-231.
IEEE DOI
1906
Robot sensing systems, Hidden Markov models, Data collection,
Unsupervised learning, Training, Man-machine systems,
unsupervised learning
BibRef
Wu, A.,
Piergiovanni, A.J.,
Ryoo, M.S.,
Model-Based Robot Imitation with Future Image Similarity,
IJCV(128), No. 5, May 2020, pp. 1360-1374.
Springer DOI
2005
BibRef
And:
Correction:
IJCV(128), No. 5, May 2020, pp. 1375.
Springer DOI
2005
Imitation learning.
BibRef
Tang, Y.S.[Yan-Song],
Lu, J.W.[Ji-Wen],
Zhou, J.[Jie],
Comprehensive Instructional Video Analysis:
The COIN Dataset and Performance Evaluation,
PAMI(43), No. 9, September 2021, pp. 3138-3153.
IEEE DOI
2108
Task analysis, Tires, YouTube, Automobiles, Fasteners,
Benchmark testing, Computed tomography, Instructional video,
large-scale benchmark
BibRef
Tang, Y.S.[Yan-Song],
Ding, D.J.[Da-Jun],
Rao, Y.M.[Yong-Ming],
Zheng, Y.[Yu],
Zhang, D.Y.[Dan-Yang],
Zhao, L.[Lili],
Lu, J.W.[Ji-Wen],
Zhou, J.[Jie],
COIN: A Large-Scale Dataset for Comprehensive Instructional Video
Analysis,
CVPR19(1207-1216).
IEEE DOI
2002
Dataset, Instructional Video.
WWW Link.
BibRef
Ashutosh, K.[Kumar],
Xue, Z.[Zihui],
Nagarajan, T.[Tushar],
Grauman, K.[Kristen],
Detours for Navigating Instructional Videos,
CVPR24(18804-18815)
IEEE DOI
2410
Training, Navigation, Computational modeling, Pipelines,
Natural languages, Buildings, video understanding, video language models
BibRef
Nagarajan, T.[Tushar],
Torresani, L.[Lorenzo],
Step Differences in Instructional Video,
CVPR24(18740-18750)
IEEE DOI
2410
Visualization, Annotations, Training data, Benchmark testing,
Data models, Cognition
BibRef
Cui, J.M.[Jie-Ming],
Liu, T.[Tengyu],
Liu, N.[Nian],
Yang, Y.D.[Yao-Dong],
Zhu, Y.X.[Yi-Xin],
Huang, S.Y.[Si-Yuan],
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive
Agents,
CVPR24(852-862)
IEEE DOI
2410
Training, Visualization, Imitation learning, Computational modeling,
Humanoid robots, Manuals, open-vocabular, interactive agent
BibRef
Bansal, S.[Siddhant],
Arora, C.[Chetan],
Jawahar, C.V.,
United We Stand, Divided We Fall:
UnityGraph for Unsupervised Procedure Learning from Videos,
WACV24(6495-6505)
IEEE DOI
2404
Computational modeling, Clustering algorithms, Benchmark testing,
Task analysis, Videos, Algorithms
BibRef
Ben-Shabat, Y.Z.[Yi-Zhak],
Paul, J.[Jonathan],
Segev, E.[Eviatar],
Shrout, O.[Oren],
Gould, S.[Stephen],
IKEA Ego 3D Dataset: Understanding furniture assembly actions from
ego-view 3D Point Clouds,
WACV24(4343-4352)
IEEE DOI
2404
Point cloud compression, Performance evaluation, Focusing,
Benchmark testing, Task analysis, Algorithms,
3D computer vision
BibRef
Schoonbeek, T.J.[Tim J.],
Houben, T.[Tim],
Onvlee, H.[Hans],
de With, P.H.N.[Peter H.N.],
van der Sommen, F.[Fons],
IndustReal: A Dataset for Procedure Step Recognition Handling
Execution Errors in Egocentric Videos in an Industrial-Like Setting,
WACV24(4353-4362)
IEEE DOI Code:
WWW Link.
2404
Solid modeling, Scalability, Benchmark testing,
Reproducibility of results, Robustness, Task analysis, Algorithms,
Video recognition and understanding
BibRef
Abdelslam, M.A.[Mohamed A.],
Rangrej, S.B.[Samrudhdhi B.],
Hadji, I.[Isma],
Dvornik, N.[Nikita],
Derpanis, K.G.[Konstantinos G.],
Fazly, A.[Afsaneh],
GePSAn: Generative Procedure Step Anticipation in Cooking Videos,
ICCV23(2976-2985)
IEEE DOI
2401
BibRef
Zhong, Y.[Yiwu],
Yu, L.C.[Li-Cheng],
Bai, Y.[Yang],
Li, S.W.[Shang-Wen],
Yan, X.T.[Xue-Ting],
Li, Y.[Yin],
Learning Procedure-aware Video Representation from Instructional
Videos and Their Narrations,
CVPR23(14825-14835)
IEEE DOI
2309
BibRef
Zhang, J.H.[Jia-Hao],
Cherian, A.[Anoop],
Liu, Y.[Yanbin],
Ben-Shabat, Y.Z.[Yi-Zhak],
Rodriguez, C.[Cristian],
Gould, S.[Stephen],
Aligning Step-by-Step Instructional Diagrams to Video Demonstrations,
CVPR23(2483-2492)
IEEE DOI
2309
BibRef
Kosaka, T.[Takayuki],
Kosaka, M.[Mari],
Development and Discussion of an Authentic Game to Develop Cleaning
Skills,
VAMR23(33-42).
Springer DOI
2307
BibRef
Pan, Y.[Yueran],
Wu, J.X.[Jia-Xin],
Ju, R.[Ran],
Zhou, Z.[Ziang],
Gu, J.[Jiayue],
Zeng, S.T.[Song-Tian],
Yuan, L.[Lynn],
Li, M.[Ming],
A Multimodal Framework for Automated Teaching Quality Assessment of
One-to-many Online Instruction Videos,
ICPR22(1777-1783)
IEEE DOI
2212
Technical management, Protocols, Distance learning, Education,
Supervised learning, Feature extraction, Quality assessment,
Emotion Recognition
BibRef
Qin, Y.Z.[Yu-Zhe],
Wu, Y.H.[Yueh-Hua],
Liu, S.W.[Shao-Wei],
Jiang, H.W.[Han-Wen],
Yang, R.[Ruihan],
Fu, Y.[Yang],
Wang, X.L.[Xiao-Long],
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos,
ECCV22(XXIX:570-587).
Springer DOI
2211
BibRef
Sener, F.[Fadime],
Chatterjee, D.[Dibyadip],
Shelepov, D.[Daniel],
He, K.[Kun],
Singhania, D.[Dipika],
Wang, R.[Robert],
Yao, A.[Angela],
Assembly101: A Large-Scale Multi-View Video Dataset for Understanding
Procedural Activities,
CVPR22(21064-21074)
IEEE DOI
2210
Training, Toy manufacturing industry, Recording,
Synchronization, Datasets and evaluation,
Action and event recognition
BibRef
Ghoddoosian, R.[Reza],
Dwivedi, I.[Isht],
Agarwal, N.[Nakul],
Dariush, B.[Behzad],
Weakly-Supervised Action Segmentation and Unseen Error Detection in
Anomalous Instructional Videos,
ICCV23(10094-10104)
IEEE DOI
1806
BibRef
Ghoddoosian, R.[Reza],
Dwivedi, I.[Isht],
Agarwal, N.[Nakul],
Choi, C.[Chiho],
Dariush, B.[Behzad],
Weakly-Supervised Online Action Segmentation in Multi-View
Instructional Videos,
CVPR22(13770-13780)
IEEE DOI
2210
Training, Costs, Annotations, Computational modeling,
Benchmark testing,
Self- semi- meta- unsupervised learning
BibRef
Ghoddoosian, R.[Reza],
Sayed, S.[Saif],
Athitsos, V.[Vassilis],
Hierarchical Modeling for Task Recognition and Action Segmentation in
Weakly-Labeled Instructional Videos,
WACV22(120-130)
IEEE DOI
2202
Training, Measurement, Runtime, Semantics,
Task analysis, Videos, Action and Behavior Recognition action segmentation
BibRef
Ramrakhya, R.[Ram],
Undersander, E.[Eric],
Batra, D.[Dhruv],
Das, A.[Abhishek],
Habitat-Web: Learning Embodied Object-Search Strategies from Human
Demonstrations at Scale,
CVPR22(5163-5173)
IEEE DOI
2210
Navigation, Training data, Reinforcement learning, Search problems,
Behavioral sciences, Trajectory, Vision+language
BibRef
Zhao, H.[He],
Hadji, I.[Isma],
Dvornik, N.[Nikita],
Derpanis, K.G.[Konstantinos G.],
Wildes, R.P.[Richard P.],
Jepson, A.D.[Allan D.],
P3IV: Probabilistic Procedure Planning from Instructional Videos with
Weak Supervision,
CVPR22(2928-2938)
IEEE DOI
2210
Training, Measurement, Visualization, Uncertainty, Transforms,
Probabilistic logic, Transformers, Vision+language
BibRef
Li, M.[Muheng],
Chen, L.[Lei],
Duarr, Y.[Yueqi],
Hu, Z.[Zhilan],
Feng, J.J.[Jian-Jiang],
Zhou, J.[Jie],
Lu, J.W.[Ji-Wen],
Bridge-Prompt:
Towards Ordinal Action Understanding in Instructional Videos,
CVPR22(19848-19857)
IEEE DOI
2210
Codes, Semantics, Benchmark testing,
Task analysis, Context modeling, Action and event recognition
BibRef
Singh, K.P.[Kunal Pratap],
Bhambri, S.[Suvaansh],
Kim, B.[Byeonghwi],
Mottaghi, R.[Roozbeh],
Choi, J.H.[Jong-Hyun],
Factorizing Perception and Policy for Interactive Instruction
Following,
ICCV21(1868-1877)
IEEE DOI
2203
Art, Navigation, Buildings, Benchmark testing, Task analysis,
Collision avoidance, Vision+language,
Vision for robotics and autonomous vehicles
BibRef
Bi, J.[Jing],
Luo, J.B.[Jie-Bo],
Xu, C.L.[Chen-Liang],
Procedure Planning in Instructional Videos via Contextual Modeling
and Model-based Policy Learning,
ICCV21(15591-15600)
IEEE DOI
2203
Computational modeling, Decision making, Focusing,
Inference algorithms, Planning, Bayes methods,
Video analysis and understanding
BibRef
Diaz, M.[Manfred],
Fevens, T.[Thomas],
Paull, L.[Liam],
Uncertainty-Aware Policy Sampling and Mixing for Safe Interactive
Imitation Learning,
CRV21(72-78)
IEEE DOI
2108
Teaching robots how to execute tasks.
Uncertainty, Supervised learning, Measurement uncertainty,
Education, Training data, Safety, Trajectory, imitation learning, uncertainty estimation
BibRef
Wang, S.J.[Shao-Jie],
Zhao, W.T.[Wen-Tian],
Kou, Z.Y.[Zi-Yi],
Shi, J.[Jing],
Xu, C.L.[Chen-Liang],
How to Make a BLT Sandwich? Learning VQA towards Understanding Web
Instructional Videos,
WACV21(1129-1138)
IEEE DOI
2106
Measurement, Visualization, Fuses,
Knowledge discovery, Motion pictures
BibRef
Shen, Y.H.[Yu-Han],
Elhamifar, E.[Ehsan],
Semi-Weakly-Supervised Learning of Complex Actions from Instructional
Task Videos,
CVPR22(3334-3344)
IEEE DOI
2210
Training, Benchmark testing, Task analysis,
Unsupervised learning, Videos, Video analysis and understanding,
Self- semi- meta- unsupervised learning
BibRef
Elhamifar, E.[Ehsan],
Huynh, D.[Dat],
Self-supervised Multi-task Procedure Learning from Instructional Videos,
ECCV20(XVII:557-573).
Springer DOI
2011
BibRef
Yao, C.[Chong],
Lou, L.Z.[Li-Zhu],
Sui, X.K.[Xiao-Kui],
Xu, M.[Ming],
Research on Quality Evaluation Algorithm of Flight Training for
National Day Parade Air Echelon,
CVIDL20(130-134)
IEEE DOI
2102
aerospace computing, cameras, computer based training,
learning (artificial intelligence), stereo image processing, Vanishing point
BibRef
Chang, C.Y.[Chien-Yi],
Huang, D.A.[De-An],
Xu, D.[Danfei],
Adeli, E.[Ehsan],
Fei-Fei, L.[Li],
Niebles, J.C.[Juan Carlos],
Procedure Planning in Instructional Videos,
ECCV20(XI:334-350).
Springer DOI
2011
BibRef
Miech, A.,
Zhukov, D.,
Alayrac, J.,
Tapaswi, M.,
Laptev, I.,
Alayrac, J.B.[Jean-Baptiste],
HowTo100M: Learning a Text-Video Embedding by Watching Hundred
Million Narrated Video Clips,
ICCV19(2630-2640)
IEEE DOI
2004
WWW Link.
Dataset, Instructional Video. Internet, learning (artificial intelligence),
natural language processing, social networking (online),
Computational modeling
BibRef
Qian, M.[Ming],
Nicholson, J.[John],
Wang, E.[Erin],
Quality of Experience Comparison Between Binocular and Monocular
Augmented Reality Display Under Various Occlusion Conditions for
Manipulation Tasks with Virtual Instructions,
VAMR19(I:490-499).
Springer DOI
1909
BibRef
Kayser, M.[Maxime],
Camburu, O.M.[Oana-Maria],
Recasens, A.[Adrià],
Luc, P.[Pauline],
Alayrac, J.B.[Jean-Baptiste],
Wang, L.[Luyu],
Strub, F.[Florian],
Tallec, C.[Corentin],
Malinowski, M.[Mateusz],
Patraaucean, V.[Viorica],
Altché, F.[Florent],
Valko, M.[Michal],
Grill, J.B.[Jean-Bastien],
van den Oord, A.[Aäron],
Zisserman, A.[Andrew],
Broaden Your Views for Self-Supervised Video Learning,
ICCV21(1235-1245)
IEEE DOI
2203
Representation learning, Computational modeling, Crops,
Benchmark testing, Kinetic theory, Standards, Representation learning
BibRef
Zhukov, D.[Dimitri],
Alayrac, J.B.[Jean-Baptiste],
Laptev, I.[Ivan],
Sivic, J.[Josef],
Learning Actionness via Long-range Temporal Order Verification,
ECCV20(XXIX: 470-487).
Springer DOI
2010
BibRef
Zhukov, D.[Dimitri],
Alayrac, J.B.[Jean-Baptiste],
Cinbis, R.G.[Ramazan Gokberk],
Fouhey, D.[David],
Laptev, I.[Ivan],
Sivic, J.[Josef],
Cross-Task Weakly Supervised Learning From Instructional Videos,
CVPR19(3532-3540).
IEEE DOI
2002
Dataset, Instructional Video.
WWW Link.
BibRef
Sener, F.,
Yao, A.,
Zero-Shot Anticipation for Instructional Activities,
ICCV19(862-871)
IEEE DOI
2004
educational robots, learning (artificial intelligence),
natural language processing, text analysis,
Training
BibRef
Huang, D.,
Buch, S.,
Dery, L.,
Garg, A.,
Fei-Fei, L.,
Niebles, J.C.,
Finding 'It': Weakly-Supervised Reference-Aware Visual Grounding in
Instructional Videos,
CVPR18(5948-5957)
IEEE DOI
1812
Grounding, Videos, Visualization, Task analysis, Image resolution,
Optimization, Joining processes
BibRef
Huang, D.A.,
Lim, J.J.,
Fei-Fei, L.[Li],
Niebles, J.C.[Juan Carlos],
Unsupervised Visual-Linguistic Reference Resolution in Instructional
Videos,
CVPR17(1032-1041)
IEEE DOI
1711
Optimization, Pragmatics, Spatial resolution, Videos, Visualization
BibRef
Dorai, C.,
Oria, V.,
Neelavalli, V.,
Structuralizing educational videos based on presentation content,
ICIP03(II: 1029-1032).
IEEE DOI
0312
BibRef
Liu, T.C.[Tie-Cheng],
Kender, J.R.,
Semantic mosaic for indexing and compressing instructional videos,
ICIP03(I: 921-924).
IEEE DOI
0312
BibRef
Earlier:
Rule-based semantic summarization of instructional videos,
ICIP02(I: 601-604).
IEEE DOI
0210
BibRef
Chapter on 3-D Object Description and Computation Techniques, Surfaces, Deformable, View Generation, Video Conferencing continues in
Face Synthesis Using Three-Dimensional Models .