_ | unseen | _ |
3d Object Detection and Pose Estimation of | unseen | Objects in Color Images with Local Surface Embeddings |
3D-GAT: 3D-Guided adversarial transform network for person re-identification in | unseen | domains |
3DCD: Scene Independent End-to-End Spatiotemporal Feature Learning Framework for Change Detection in | unseen | Videos |
Active 3d Segmentation through Fixation of Previously | unseen | Objects |
Adversarial Fine-Grained Composition Learning for | unseen | Attribute-Object Recognition |
ALFA: Leveraging All Levels of Feature Abstraction for Enhancing the Generalization of Histopathology Image Classification Across | unseen | Hospitals |
Are These from the Same Place? Seeing the | unseen | in Cross-View Image Geo-Localization |
Attributes as Operators: Factorizing | unseen | Attribute-Object Compositions |
BSUV-Net: A Fully-Convolutional Neural Network for Background Subtraction of | unseen | Videos |
Caption generation on scenes with seen and | unseen | object categories |
COCOA: Context-Conditional Adaptation for Recognizing | unseen | Classes in Unseen Domains |
COCOA: Context-Conditional Adaptation for Recognizing | unseen | Classes in Unseen Domains |
CompoNet: Learning to Generate the | unseen | by Part Synthesis and Composition |
Compound Projection Learning for Bridging Seen and | unseen | Objects |
Computing in Astronomy: To See the | unseen | |
Cross-Domain Similarity Learning for Face Recognition in | unseen | Domains |
CrossFuser: Multi-Modal Feature Fusion for End-to-End Autonomous Driving Under | unseen | Weather Conditions |
Describing | unseen | Classes by Exemplars: Zero-Shot Learning Using Grouped Simile Ensemble |
Describing | unseen | Videos via Multi-Modal Cooperative Dialog Agents |
Detecting | unseen | Visual Relations Using Analogies |
Distinguishing | unseen | from Seen for Generalized Zero-shot Learning |
DoFE: Domain-Oriented Feature Embedding for Generalizable Fundus Image Segmentation on | unseen | Datasets |
Emotional contagion for | unseen | bodily expressions: Evidence from facial EMG |
Encode the | unseen | : Predictive Video Hashing for Scalable Mid-stream Retrieval |
Encoding features robust to | unseen | modes of variation with attentive long short-term memory |
Enroll-to-Verify Approach for Cross-Task | unseen | Emotion Class Recognition, An |
Exploiting Word Embeddings for Recognition of Previously | unseen | Objects |
Exploring Steganography: Seeing the | unseen | |
Exploring the ability of CNNs to generalise to previously | unseen | scales over wide scale ranges |
Fast online incremental approach of | unseen | place classification using disjoint-text attribute prediction |
Few-Shot Classification in | unseen | Domains by Episodic Meta-Learning Across Visual Domains |
Few-shot Keypoint Detection with Uncertainty Learning for | unseen | Species |
From Zero-Shot Learning to Conventional Supervised Classification: | unseen | Visual Data Synthesis |
Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of | unseen | Objects |
Generalization on | unseen | Domains via Inference-time Label-Preserving Target Projections |
Generalized zero-shot classification via iteratively generating and selecting | unseen | samples |
Generalizing Deep Learning for Medical Image Segmentation to | unseen | Domains via Deep Stacked Transformation |
Generalizing Neural Human Fitting to | unseen | Poses with Articulated SE(3) Equivariance |
Generalizing to | unseen | Domains in Diabetic Retinopathy Classification |
Generative Meta-Adversarial Network for | unseen | Object Navigation |
GenLR-Net: Deep Framework for Very Low Resolution Face and Object Recognition with Generalization to | unseen | Categories |
Gradient Estimation for | unseen | Domain Risk Minimization with Pre-Trained Models |
Graph CNN for Moving Object Detection in Complex Environments from | unseen | Videos |
Hyperspectral Image Classification across Different Datasets: A Generalization to | unseen | Categories |
Implicit 3D Human Mesh Recovery using Consistency with Pose and Shape from | unseen | -view |
Infer | unseen | from seen: Relation regularized zero-shot visual dialog |
Inferring | unseen | Views of People |
Joint Intermodal and Intramodal Label Transfers for Extremely Rare or | unseen | Classes |
Knowledge-Driven Saliency: Attention to the | unseen | |
LatentFusion: End-to-End Differentiable Reconstruction and Rendering for | unseen | Object Pose Estimation |
Learning Meta Face Recognition in | unseen | Domains |
Learning Multimodal Representations for | unseen | Activities |
Learning to Adapt to | unseen | Abnormal Activities Under Weak Supervision |
Learning to Better Segment Objects from | unseen | Classes with Unlabeled Videos |
Learning to detect | unseen | object classes by between-class attribute transfer |
Learning to Generalize | unseen | Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification |
Learning to Infer | unseen | Single-/ Multi-Attribute-Object Compositions With Graph Networks |
Learning to Look Around: Intelligently Exploring | unseen | Environments for Unknown Tasks |
Learning to Recognize Objects from | unseen | Modalities |
Learning | unseen | Concepts via Hierarchical Decomposition and Composition |
Leveraging Seen and | unseen | Semantic Relationships for Generative Zero-shot Learning |
Leveraging Test-Time Consensus Prediction for Robustness against | unseen | Noise |
LipFormer: Learning to Lipread | unseen | Speakers Based on Visual-Landmark Transformers |
Measuring Generalisation to | unseen | Viewpoints, Articulations, Shapes and Objects for 3d Hand Pose Estimation Under Hand-object Interaction |
Meta-Knowledge Learning and Domain Adaptation for | unseen | Background Subtraction |
Mixing Zero-Shot Learning Up: Learning | unseen | Classes from Mixed Features |
Multi-view Adversarial Discriminator: Mine the Non-causal Factors for Object Detection in | unseen | Domains |
Mutual Information-Based Disentangled Neural Networks for Classifying | unseen | Categories in Different Domains: Application to Fetal Ultrasound Imaging |
Neural Task Graphs: Generalizing to | unseen | Tasks From a Single Video Demonstration |
Object Priors for Classifying and Localizing | unseen | Actions |
On Decomposing an | unseen | 3D Face into Neutral Face and Expression Deformations |
Pairwise Feature Learning for | unseen | Plant Disease Recognition |
Part Segmentation of | unseen | Objects using Keypoint Guidance |
PEANUT: Predicting and Navigating to | unseen | Targets |
Predicting the Physical Dynamics of | unseen | 3D Objects |
Predicting Visual Exemplars of | unseen | Classes for Zero-Shot Learning |
Predicting with Confidence on | unseen | Distributions |
Progressive randomization: Seeing the | unseen | |
Pseudo distribution on | unseen | classes for generalized zero shot learning |
Radar HRRP | unseen | Class Recognition Based on the Joint Dictionary Learning |
Randomized Spectrum Transformations for Adapting Object Detector in | unseen | Domains |
Real-time method for counting | unseen | stacked objects in mobile |
Recognising faces in | unseen | modes: A tensor based approach |
Recognition of | unseen | Bird Species by Learning from Field Guides |
Recognizing Actions in Videos from | unseen | Viewpoints |
Recognizing | unseen | actions in a domain-adapted embedding space |
Registration of | unseen | images based on the generative manifold modeling of variations of appearance and anatomical shape in brain population |
Safe-Student for Safe Deep Semi-Supervised Learning with | unseen | -Class Unlabeled Data |
Safety-aware Motion Prediction with | unseen | Vehicles for Autonomous Driving |
Saying the | unseen | : Video Descriptions via Dialog Agents |
Scoring Your Prediction on | unseen | Data |
Seeing the | unseen | : Predicting the First-Person Camera Wearer's Location and Pose in Third-Person Scenes |
Seeing the | unseen | : Wifi-based 2D human pose estimation via an evolving attentive spatial-Frequency network |
Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in | unseen | Adverse Weather |
Segmenting Known Objects and | unseen | Unknowns without Prior Knowledge |
Segmenting | unseen | Industrial Components In A Heavy Clutter Using RGB-D Fusion And Synthetic Data |
Self-supervised Monocular Depth Estimation on | unseen | Synthetic Cameras |
Semi-Supervised Background Subtraction Of | unseen | Videos: Minimization of the Total Variation Of Graph Signals |
Shape-merging and interpolation using class estimation for | unseen | voxels with a GPU-based efficient implementation |
Single-View 3D Mesh Reconstruction for Seen and | unseen | Categories |
Speaker Attractor Network: Generalizing Speech Separation to | unseen | Numbers of Sources |
Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models to Learn Any | unseen | Style |
Spoken language identification in | unseen | channel conditions using modified within-sample similarity loss |
Synthesizing Images of Humans in | unseen | Poses |
Synthesizing the | unseen | for Zero-shot Object Detection |
Synthetic Humans for Action Recognition from | unseen | Viewpoints |
Task Agnostic and Post-hoc | unseen | Distribution Detection |
Tensor based completion meets adversarial learning: A win-win solution for change detection on | unseen | videos |
Towards Fine-Grained Open Zero-Shot Learning: Inferring | unseen | Visual Features from Attributes |
Towards realistic symmetry-based completion of previously | unseen | point clouds |
Towards Recognizing | unseen | Categories in Unseen Domains |
Towards Recognizing | unseen | Categories in Unseen Domains |
Towards the | unseen | : Iterative Text Recognition by Distilling from Errors |
Towards Universal Representation for | unseen | Action Recognition |
Training Neural Networks on Remote Edge Devices for | unseen | Class Classification |
TranstextNet: Transducing Text for Recognizing | unseen | Visual Relationships |
Uncertainty-guided Model Generalization to | unseen | Domains |
| unseen | and Adverse Outdoor Scenes Recognition Through Event-based Captions |
| unseen | Appliances Identification |
| unseen | Challenge data sets, The |
| unseen | Classes at a Later Time? No Problem |
| unseen | Face Presentation Attack Detection Using Sparse Multiple Kernel Fisher Null-Space |
| unseen | Land Cover Classification from High-Resolution Orthophotos Using Integration of Zero-Shot Learning and Convolutional Neural Networks |
| unseen | Object Segmentation in Videos via Transferable Representations |
| unseen | Visible Watermarking |
| unseen | ) event recognition via semantic compositionality |
| unseen | : An Investigative Analysis of Thematic and Spatial Coverage of News on the Ongoing Refugee Crisis in West Africa, The |
Video Stream Retrieval of | unseen | Queries using Semantic Memory |
View Synthesis for Recognizing | unseen | Poses of Object Classes |
ViewCLR: Learning Self-supervised Video Representation for | unseen | Viewpoints |
VisageSynTalk: | unseen | Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection |
Vision of the | unseen | : Current trends and challenges in digital image and video forensics |
Weakly-Supervised Action Segmentation and | unseen | Error Detection in Anomalous Instructional Videos |
Worktorial on Vision of the | unseen | |
YOLO-Anti: YOLO-based counterattack model for | unseen | congested object detection |
You always look again: Learning to detect the | unseen | objects |
Zero-Shot Learning Using Synthesised | unseen | Visual Data with Diffusion Regularisation |
Zero-Shot Single-Microphone Sound Classification and Localization in a Building Via the Synthesis of | unseen | Features |
Zero-VAE-GAN: Generating | unseen | Features for Generalized and Transductive Zero-Shot Learning |
139 for unseen