Miech, A.
Standard Author Listing
with: Alayrac, J.: End-to-End Learning of Visual Representations From Uncura...
with: Alayrac, J.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
with: Alayrac, J.B.: HowTo100M: Learning a Text-Video Embedding by Watching ...
with: Alayrac, J.B.: Learning from Video and Text via Large-Scale Discrimina...
with: Alayrac, J.B.: Look for the Change: Learning Object States and State-M...
with: Alayrac, J.B.: Multi-Task Learning of Object States and State-Modifyin...
with: Alayrac, J.B.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
with: Bojanowski, P.: Learning from Video and Text via Large-Scale Discrimin...
with: Chiu, J.: Simple Recipe for Contrastively Pre-Training Video-First Enc...
with: Heyward, J.: Simple Recipe for Contrastively Pre-Training Video-First ...
with: Koppula, S.: Simple Recipe for Contrastively Pre-Training Video-First ...
with: Laptev, I.: End-to-End Learning of Visual Representations From Uncurat...
with: Laptev, I.: HowTo100M: Learning a Text-Video Embedding by Watching Hun...
with: Laptev, I.: Just Ask: Learning to Answer Questions from Millions of Na...
with: Laptev, I.: Learning from Video and Text via Large-Scale Discriminativ...
with: Laptev, I.: Look for the Change: Learning Object States and State-Modi...
with: Laptev, I.: Multi-Task Learning of Object States and State-Modifying A...
with: Laptev, I.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval...
with: Laptev, I.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Laptev, I.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mode...
with: Nagrani, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mod...
with: Nematzdeh, A.: Simple Recipe for Contrastively Pre-Training Video-Firs...
with: Papalampidi, P.: Simple Recipe for Contrastively Pre-Training Video-Fi...
with: Pathak, S.: Simple Recipe for Contrastively Pre-Training Video-First E...
with: Patraucean, V.: Simple Recipe for Contrastively Pre-Training Video-Fir...
with: Pont Tuset, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language ...
with: Schmid, C.: Just Ask: Learning to Answer Questions from Millions of Na...
with: Schmid, C.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Schmid, C.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mode...
with: Seo, P.H.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
with: Shen, J.J.: Simple Recipe for Contrastively Pre-Training Video-First E...
with: Sivic, J.: End-to-End Learning of Visual Representations From Uncurate...
with: Sivic, J.: Just Ask: Learning to Answer Questions from Millions of Nar...
with: Sivic, J.: Learning from Video and Text via Large-Scale Discriminative...
with: Sivic, J.: Look for the Change: Learning Object States and State-Modif...
with: Sivic, J.: Multi-Task Learning of Object States and State-Modifying Ac...
with: Sivic, J.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval ...
with: Sivic, J.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Sivic, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
with: Smaira, L.: End-to-End Learning of Visual Representations From Uncurat...
with: Soucek, T.: Look for the Change: Learning Object States and State-Modi...
with: Soucek, T.: Multi-Task Learning of Object States and State-Modifying A...
with: Tapaswi, M.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
with: Yang, A.: Just Ask: Learning to Answer Questions from Millions of Narr...
with: Yang, A.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Yang, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model ...
with: Zhukov, D.: HowTo100M: Learning a Text-Video Embedding by Watching Hun...
with: Zisserman, A.: End-to-End Learning of Visual Representations From Uncu...
with: Zisserman, A.: Simple Recipe for Contrastively Pre-Training Video-Firs...
with: Zisserman, A.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
50 for Miech, A.