Index for nagr

Nagrani, A.[Arsha] Co Author Listing * AutoAD II: The Sequel - Who, When, and What in Movie Audio Description
* AutoAD: Movie Description in Context
* AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR
* Composable Augmentation Encoding for Video Representation Learning
* Condensed Movies: Story Based Retrieval with Contextual Embeddings
* Count, Crop and Recognise: Fine-Grained Recognition in the Wild
* End-to-end Generative Pretraining for Multimodal Video Captioning
* EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
* Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
* Learnable PINs: Cross-modal Embeddings for Person Identity
* Learning Audio-Video Modalities from Image Captions
* Localizing Visual Sounds the Hard Way
* Look Before you Speak: Visually Contextualized Utterances
* Masking Modalities for Cross-modal Video Retrieval
* Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching
* Speech2Action: Cross-Modal Supervision for Action Recognition
* TL;DW? Summarizing Instructional Videos with Task Relevance and Cross-Modal Saliency
* Uncertainty-aware Weakly Supervised Action Detection from Untrimmed Videos
* UnLoc: A Unified Framework for Video Localization Tasks
* Verbs in Action: Improving verb understanding in video-language models
* Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Includes: Nagrani, A.[Arsha] Nagrani, A.
21 for Nagrani, A.

Nagrecha, K.[Kabir] Co Author Listing * Gradient-based Algorithms for Machine Teaching

Index for "n"


Last update:27-Apr-24 11:57:48
Use price@usc.edu for comments.