| _ | unseen | _ |
| 3d Object Detection and Pose Estimation of | unseen | Objects in Color Images with Local Surface Embeddings |
| 3D-GAT: 3D-Guided adversarial transform network for person re-identification in | unseen | domains |
| 3DCD: Scene Independent End-to-End Spatiotemporal Feature Learning Framework for Change Detection in | unseen | Videos |
| Active 3d Segmentation through Fixation of Previously | unseen | Objects |
| Adapting Foundation Features via Cross-View Contrastive Learning for | unseen | Object Pose Estimation |
| Adversarial Fine-Grained Composition Learning for | unseen | Attribute-Object Recognition |
| ALFA: Leveraging All Levels of Feature Abstraction for Enhancing the Generalization of Histopathology Image Classification Across | unseen | Hospitals |
| ARC-NeRF: Area Ray Casting for Broader | unseen | View Coverage in Few-Shot Object Rendering |
| Are These from the Same Place? Seeing the | unseen | in Cross-View Image Geo-Localization |
| Attribute-Based Learning for Remote Sensing Image Captioning in | unseen | Scenes |
| Attributes as Operators: Factorizing | unseen | Attribute-Object Compositions |
| AuraFusion360: Augmented | unseen | Region Alignment for Reference-based 360° Unbounded Scene Inpainting |
| BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and | unseen | Rigid Objects |
| BSUV-Net: A Fully-Convolutional Neural Network for Background Subtraction of | unseen | Videos |
| Caption generation on scenes with seen and | unseen | object categories |
| Causality-inspired learning semantic segmentation in | unseen | domain |
| COCOA: Context-Conditional Adaptation for Recognizing | unseen | Classes in Unseen Domains |
| COCOA: Context-Conditional Adaptation for Recognizing | unseen | Classes in Unseen Domains |
| CompoNet: Learning to Generate the | unseen | by Part Synthesis and Composition |
| Compound Projection Learning for Bridging Seen and | unseen | Objects |
| Computing in Astronomy: To See the | unseen | |
| Cross-Domain Similarity Learning for Face Recognition in | unseen | Domains |
| CrossFuser: Multi-Modal Feature Fusion for End-to-End Autonomous Driving Under | unseen | Weather Conditions |
| Describing | unseen | Classes by Exemplars: Zero-Shot Learning Using Grouped Simile Ensemble |
| Describing | unseen | Videos via Multi-Modal Cooperative Dialog Agents |
| Detecting | unseen | Visual Relations Using Analogies |
| DiffDeMorph: Extending Reference-Free Demorphing to | unseen | Faces |
| Distinguishing | unseen | from Seen for Generalized Zero-shot Learning |
| DoFE: Domain-Oriented Feature Embedding for Generalizable Fundus Image Segmentation on | unseen | Datasets |
| Domainfusion: Generalizing to | unseen | Domains with Latent Diffusion Models |
| DVMNet: Computing Relative Pose for | unseen | Objects Beyond Hypotheses |
| Emotional contagion for | unseen | bodily expressions: Evidence from facial EMG |
| Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to | unseen | Domains, An |
| Encode the | unseen | : Predictive Video Hashing for Scalable Mid-stream Retrieval |
| Encoding features robust to | unseen | modes of variation with attentive long short-term memory |
| Enroll-to-Verify Approach for Cross-Task | unseen | Emotion Class Recognition, An |
| Exploiting Word Embeddings for Recognition of Previously | unseen | Objects |
| Exploring Steganography: Seeing the | unseen | |
| Exploring the ability of CNNs to generalise to previously | unseen | scales over wide scale ranges |
| FakeInversion: Learning to Detect Images from | unseen | Text-to-Image Models by Inverting Stable Diffusion |
| Fast online incremental approach of | unseen | place classification using disjoint-text attribute prediction |
| Few-Shot Classification in | unseen | Domains by Episodic Meta-Learning Across Visual Domains |
| Few-shot Keypoint Detection with Uncertainty Learning for | unseen | Species |
| Foundpose: | unseen | Object Pose Estimation with Foundation Features |
| From Zero-Shot Learning to Conventional Supervised Classification: | unseen | Visual Data Synthesis |
| FS-Depth: Focal-and-Scale Depth Estimation From a Single Image in | unseen | Indoor Scene |
| Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of | unseen | Objects |
| Generalization on | unseen | Domains via Inference-time Label-Preserving Target Projections |
| Generalized zero-shot classification via iteratively generating and selecting | unseen | samples |
| Generalizing Deep Learning for Medical Image Segmentation to | unseen | Domains via Deep Stacked Transformation |
| Generalizing Neural Human Fitting to | unseen | Poses with Articulated SE(3) Equivariance |
| Generalizing Single-View 3D Shape Retrieval to Occlusions and | unseen | Objects |
| Generalizing to | unseen | Domains in Diabetic Retinopathy Classification |
| Generalizing to | unseen | Domains via Text-guided Augmentation: A Training-free Approach |
| Generalizing to | unseen | Speakers: Multimodal Emotion Recognition in Conversations With Speaker Generalization |
| Generating Stylized Features for Single-Source Cross-Dataset Palmprint Recognition With | unseen | Target Dataset |
| Generative Meta-Adversarial Network for | unseen | Object Navigation |
| GenLR-Net: Deep Framework for Very Low Resolution Face and Object Recognition with Generalization to | unseen | Categories |
| Gradient Estimation for | unseen | Domain Risk Minimization with Pre-Trained Models |
| Graph CNN for Moving Object Detection in Complex Environments from | unseen | Videos |
| Guess The | unseen | : Dynamic 3D Scene Reconstruction from Partial 2D Glimpses |
| Gumbel-NeRF: Representing | unseen | Objects as Part-Compositional Neural Radiance Fields |
| Hyperspectral Image Classification across Different Datasets: A Generalization to | unseen | Categories |
| iG-6DoF: Model-Free 6DoF Pose Estimation for | unseen | Object via Iterative 3D Gaussian Splatting |
| Imaginary-Connected Embedding in Complex Space for | unseen | Attribute-Object Discrimination |
| Imbuing, Enrichment and Calibration: Leveraging Language for | unseen | Domain Extension |
| Implicit 3D Human Mesh Recovery using Consistency with Pose and Shape from | unseen | -view |
| Individual-Aware Attention Modulation for | unseen | Speaker Emotion Recognition |
| Infer | unseen | from seen: Relation regularized zero-shot visual dialog |
| Inferring | unseen | Views of People |
| Insights from the Use of Previously | unseen | Neural Architecture Search Datasets |
| Joint Intermodal and Intramodal Label Transfers for Extremely Rare or | unseen | Classes |
| Knowledge-Driven Saliency: Attention to the | unseen | |
| LatentFusion: End-to-End Differentiable Reconstruction and Rendering for | unseen | Object Pose Estimation |
| Learning Local Pattern Modularization for Point Cloud Reconstruction from | unseen | Classes |
| Learning Meta Face Recognition in | unseen | Domains |
| Learning Multimodal Representations for | unseen | Activities |
| Learning to Adapt to | unseen | Abnormal Activities Under Weak Supervision |
| Learning to Better Segment Objects from | unseen | Classes with Unlabeled Videos |
| Learning to detect | unseen | object classes by between-class attribute transfer |
| Learning to Generalize | unseen | Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification |
| Learning to Generate Parameters of ConvNets for | unseen | Image Data |
| Learning to Identify Seen, | unseen | and Unknown in the Open World: A Practical Setting for Zero-Shot Learning |
| Learning to Infer | unseen | Single-/ Multi-Attribute-Object Compositions With Graph Networks |
| Learning to Look Around: Intelligently Exploring | unseen | Environments for Unknown Tasks |
| Learning to Recognize Objects from | unseen | Modalities |
| Learning | unseen | Concepts via Hierarchical Decomposition and Composition |
| Leveraging Seen and | unseen | Semantic Relationships for Generative Zero-shot Learning |
| Leveraging Test-Time Consensus Prediction for Robustness against | unseen | Noise |
| LipFormer: Learning to Lipread | unseen | Speakers Based on Visual-Landmark Transformers |
| LocPoseNet: Robust Location Prior for | unseen | Object Pose Estimation |
| Lost in light field compression: Understanding the | unseen | pitfalls in computer vision |
| MatchU: Matching | unseen | Objects for 6D Pose Estimation from RGB-D Images |
| Measuring Generalisation to | unseen | Viewpoints, Articulations, Shapes and Objects for 3d Hand Pose Estimation Under Hand-object Interaction |
| Meta Feature Disentanglement under continuous-valued domain modeling for generalizable remote sensing image segmentation on | unseen | domains |
| Meta-Knowledge Learning and Domain Adaptation for | unseen | Background Subtraction |
| Mixing Zero-Shot Learning Up: Learning | unseen | Classes from Mixed Features |
| Multi-view Adversarial Discriminator: Mine the Non-causal Factors for Object Detection in | unseen | Domains |
| Multimodal 3D Object Detection on | unseen | Domains |
| Mutual Information-Based Disentangled Neural Networks for Classifying | unseen | Categories in Different Domains: Application to Fetal Ultrasound Imaging |
| Navigating the | unseen | : Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features |
| Neural Task Graphs: Generalizing to | unseen | Tasks From a Single Video Demonstration |
| Object Priors for Classifying and Localizing | unseen | Actions |
| On Decomposing an | unseen | 3D Face into Neutral Face and Expression Deformations |
| OoD-Control: Generalizing Control in | unseen | Environments |
| Pairwise Feature Learning for | unseen | Plant Disease Recognition |
| Part Segmentation of | unseen | Objects using Keypoint Guidance |
| PEANUT: Predicting and Navigating to | unseen | Targets |
| Pos3R: 6D Pose Estimation for | unseen | Objects Made Easy |
| PoseIRM: Enhance 3D Human Pose Estimation on | unseen | Camera Settings via Invariant Risk Minimization |
| Predicting the Physical Dynamics of | unseen | 3D Objects |
| Predicting Visual Exemplars of | unseen | Classes for Zero-Shot Learning |
| Predicting with Confidence on | unseen | Distributions |
| Progressive randomization: Seeing the | unseen | |
| Prompt- | unseen | -Emotion: Mixed Emotional Speech Synthesis With Prompt-LLM Contextual Knowledge |
| Pseudo distribution on | unseen | classes for generalized zero shot learning |
| Radar HRRP | unseen | Class Recognition Based on the Joint Dictionary Learning |
| Randomized Spectrum Transformations for Adapting Object Detector in | unseen | Domains |
| Real-time method for counting | unseen | stacked objects in mobile |
| Recognising faces in | unseen | modes: A tensor based approach |
| Recognition of | unseen | Bird Species by Learning from Field Guides |
| Recognizing Actions in Videos from | unseen | Viewpoints |
| Recognizing | unseen | actions in a domain-adapted embedding space |
| Recognizing | unseen | States of Unknown Objects by Leveraging Knowledge Graphs |
| RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of | unseen | Objects |
| Registration of | unseen | images based on the generative manifold modeling of variations of appearance and anatomical shape in brain population |
| Safe-Student for Safe Deep Semi-Supervised Learning with | unseen | -Class Unlabeled Data |
| Safety-aware Motion Prediction with | unseen | Vehicles for Autonomous Driving |
| Saying the | unseen | : Video Descriptions via Dialog Agents |
| Scoring Your Prediction on | unseen | Data |
| See the | unseen | : Grid-Wise Drivable Area Detection Dataset and Network Using LiDAR |
| Seeing the | unseen | : A Frequency Prompt Guided Transformer for Image Restoration |
| Seeing the | unseen | : Predicting the First-Person Camera Wearer's Location and Pose in Third-Person Scenes |
| Seeing the | unseen | : Visual Common Sense for Semantic Placement |
| Seeing the | unseen | : Wifi-based 2D human pose estimation via an evolving attentive spatial-Frequency network |
| Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in | unseen | Adverse Weather |
| Seeing | unseen | : Discover Novel Biomedical Concepts via Geometry-Constrained Probabilistic Modeling |
| Segmenting Known Objects and | unseen | Unknowns without Prior Knowledge |
| Segmenting | unseen | Industrial Components In A Heavy Clutter Using RGB-D Fusion And Synthetic Data |
| Self-supervised Monocular Depth Estimation on | unseen | Synthetic Cameras |
| Semi-Supervised Background Subtraction Of | unseen | Videos: Minimization of the Total Variation Of Graph Signals |
| Shape-merging and interpolation using class estimation for | unseen | voxels with a GPU-based efficient implementation |
| Single-View 3D Mesh Reconstruction for Seen and | unseen | Categories |
| Sparse multi-view hand-object reconstruction for | unseen | environments |
| Speaker Attractor Network: Generalizing Speech Separation to | unseen | Numbers of Sources |
| Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models to Learn Any | unseen | Style |
| Spoken language identification in | unseen | channel conditions using modified within-sample similarity loss |
| SPVLOC: Semantic Panoramic Viewport Matching for 6d Camera Localization in | unseen | Environments |
| Synthesizing Images of Humans in | unseen | Poses |
| Synthesizing the | unseen | for Zero-shot Object Detection |
| Synthetic Humans for Action Recognition from | unseen | Viewpoints |
| TAFL: Task-Agnostic Feature Learner for Efficient Adaptation to | unseen | Clinical Tasks Based on Whole-Slide Histopathological Images |
| Task Agnostic and Post-hoc | unseen | Distribution Detection |
| Temporal downscaling meteorological variables to | unseen | moments: Continuous temporal downscaling via Multi-source Spatial-temporal-wavelet feature Fusion and Time-Continuous Manifold |
| Tensor based completion meets adversarial learning: A win-win solution for change detection on | unseen | videos |
| Test-time Assessment of a Model's Performance on | unseen | Domains via Optimal Transport |
| Towards Fine-Grained Open Zero-Shot Learning: Inferring | unseen | Visual Features from Attributes |
| Towards Generalizing to | unseen | Domains with Few Labels |
| Towards Open-set Face Anti-spoofing with | unseen | Attack Synthesis |
| Towards realistic symmetry-based completion of previously | unseen | point clouds |
| Towards Recognizing | unseen | Categories in Unseen Domains |
| Towards Recognizing | unseen | Categories in Unseen Domains |
| Towards the | unseen | : Iterative Text Recognition by Distilling from Errors |
| Towards Universal Representation for | unseen | Action Recognition |
| Training Neural Networks on Remote Edge Devices for | unseen | Class Classification |
| TranstextNet: Transducing Text for Recognizing | unseen | Visual Relationships |
| Uncertainty-guided Model Generalization to | unseen | Domains |
| UNOPose: | unseen | Object Pose Estimation with an Unposed RGB-D Reference Image |
| unseen | and Adverse Outdoor Scenes Recognition Through Event-based Captions |
| unseen | Appliances Identification |
| unseen | Challenge data sets, The |
| unseen | Classes at a Later Time? No Problem |
| unseen | Face Presentation Attack Detection Using Sparse Multiple Kernel Fisher Null-Space |
| unseen | Land Cover Classification from High-Resolution Orthophotos Using Integration of Zero-Shot Learning and Convolutional Neural Networks |
| unseen | Object Segmentation in Videos via Transferable Representations |
| unseen | Visible Watermarking |
| unseen | Visual Anomaly Generation |
| unseen | ) event recognition via semantic compositionality |
| unseen | : An Investigative Analysis of Thematic and Spatial Coverage of News on the Ongoing Refugee Crisis in West Africa, The |
| USGS: Enhancing sparse view synthesis with | unseen | viewpoint regularization in 3D Gaussian splatting |
| Video Stream Retrieval of | unseen | Queries using Semantic Memory |
| View Synthesis for Recognizing | unseen | Poses of Object Classes |
| ViewCLR: Learning Self-supervised Video Representation for | unseen | Viewpoints |
| VisageSynTalk: | unseen | Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection |
| Vision of the | unseen | : Current trends and challenges in digital image and video forensics |
| Weakly-Supervised Action Segmentation and | unseen | Error Detection in Anomalous Instructional Videos |
| Worktorial on Vision of the | unseen | |
| YOLO-Anti: YOLO-based counterattack model for | unseen | congested object detection |
| You always look again: Learning to detect the | unseen | objects |
| Zero-Shot Learning Using Synthesised | unseen | Visual Data with Diffusion Regularisation |
| Zero-Shot Single-Microphone Sound Classification and Localization in a Building Via the Synthesis of | unseen | Features |
| Zero-VAE-GAN: Generating | unseen | Features for Generalized and Transductive Zero-Shot Learning |
192 for unseen