Miech, A.
Standard Author Listing
with: Alayrac, J.: End-to-End Learning of Visual Representations From Uncura...
with: Alayrac, J.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
with: Alayrac, J.B.: HowTo100M: Learning a Text-Video Embedding by Watching ...
with: Alayrac, J.B.: Learning from Video and Text via Large-Scale Discrimina...
with: Alayrac, J.B.: Look for the Change: Learning Object States and State-M...
with: Alayrac, J.B.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
with: Bojanowski, P.: Learning from Video and Text via Large-Scale Discrimin...
with: Laptev, I.: End-to-End Learning of Visual Representations From Uncurat...
with: Laptev, I.: HowTo100M: Learning a Text-Video Embedding by Watching Hun...
with: Laptev, I.: Just Ask: Learning to Answer Questions from Millions of Na...
with: Laptev, I.: Learning from Video and Text via Large-Scale Discriminativ...
with: Laptev, I.: Look for the Change: Learning Object States and State-Modi...
with: Laptev, I.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval...
with: Laptev, I.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Laptev, I.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mode...
with: Nagrani, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mod...
with: Pont Tuset, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language ...
with: Schmid, C.: Just Ask: Learning to Answer Questions from Millions of Na...
with: Schmid, C.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Schmid, C.: Vid2Seq: Large-Scale Pretraining of a Visual Language Mode...
with: Seo, P.H.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
with: Sivic, J.: End-to-End Learning of Visual Representations From Uncurate...
with: Sivic, J.: Just Ask: Learning to Answer Questions from Millions of Nar...
with: Sivic, J.: Learning from Video and Text via Large-Scale Discriminative...
with: Sivic, J.: Look for the Change: Learning Object States and State-Modif...
with: Sivic, J.: Thinking Fast and Slow: Efficient Text-to-Visual Retrieval ...
with: Sivic, J.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Sivic, J.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model...
with: Smaira, L.: End-to-End Learning of Visual Representations From Uncurat...
with: Soucek, T.: Look for the Change: Learning Object States and State-Modi...
with: Tapaswi, M.: HowTo100M: Learning a Text-Video Embedding by Watching Hu...
with: Yang, A.: Just Ask: Learning to Answer Questions from Millions of Narr...
with: Yang, A.: TubeDETR: Spatio-Temporal Video Grounding with Transformers
with: Yang, A.: Vid2Seq: Large-Scale Pretraining of a Visual Language Model ...
with: Zhukov, D.: HowTo100M: Learning a Text-Video Embedding by Watching Hun...
with: Zisserman, A.: End-to-End Learning of Visual Representations From Uncu...
with: Zisserman, A.: Thinking Fast and Slow: Efficient Text-to-Visual Retrie...
37 entries for Miech, A.