Co Author Listing
* End-to-End Learning of Visual Representations From Uncurated Instructional Videos
* HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
* Just Ask: Learning to Answer Questions from Millions of Narrated Videos
* Learning from Video and Text via Large-Scale Discriminative Clustering
* Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
* Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
* TubeDETR: Spatio-Temporal Video Grounding with Transformers
Includes: Miech, A.; Miech, A.[Antoine]
7 for Miech, A.