Maaz, M.[Muhammad]
Co Author Listing * Class-Agnostic Object Detection with Multi-modal Transformer
* Edgenext: Efficiently Amalgamated CNN-transformer Architecture for Mobile Vision Applications
* Fine-tuned CLIP Models are Efficient Video Learners
* GLaMM: Pixel Grounding Large Multimodal Model
* MaPLe: Multi-modal Prompt Learning
* Palo: A Polyglot Large Multimodal Model for 5B People
* SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
* UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation
8 for Maaz, M.
Maazoun, W.[Wissem]
Co Author Listing * Bi-discriminator GAN for tabular data synthesis