_ | visual | _ |
1000 fps | visual | Feedback Control of an Active Vision System over a High-Load Network |
2 1/2 D | visual | Servoing |
2 1/2 D | visual | Servoing with Respect to Unknown Objects Through a New Estimation Scheme of Camera Displacement |
2,2) XOR-based | visual | cryptography scheme without pixel expansion, A |
2-manifold reconstruction from sparse | visual | features |
2-Stage High-Precision | visual | Inspection of Surface Mount Devices |
2.5D | visual | relationship detection |
2.5D | visual | Sound |
2.5D | visual | Speech Synthesis Using Appearance Models |
25th ICPR: Real-time | visual | Surveillance as-a-Service (VSaaS) for smart security solutions |
2D Synthesized Image Improves the 3D Search for Foveated | visual | Systems, A |
2D-3D model-based approach to real-time | visual | tracking, A |
3-D Audio- | visual | Corpus of Affective Communication, A |
3-D Interfaces to Improve the Performance of | visual | Known-Item Search |
3-D Modelling and Robot Localization from | visual | and Range Data in Natural Scenes |
3-D motion estimation by integrating | visual | cues in 2-D multi-modal opti-acoustic stereo sequences |
3-D Motion Estimation for | visual | Saliency Modeling |
360VOT: A New Benchmark Dataset for Omnidirectional | visual | Object Tracking |
3D AffordanceNet: A Benchmark for | visual | Object Affordance Understanding |
3D Audio- | visual | Speaker Tracking with A Novel Particle Filter |
3D Audio- | visual | Speaker Tracking with A Two-Layer Particle Filter |
3D deformable model-based framework for the retrieval of near-isometric flattenable objects using Bag-of- | visual | -Words, A |
3D Display Calibration by | visual | Pattern Analysis |
3D gesture recognition framework based on hierarchical | visual | attention and perceptual organization models, A |
3D human pose recovery from image by efficient | visual | feature selection |
3D key-frame extraction method based on | visual | saliency |
3D Mesh Exploration for Smart | visual | Interfaces |
3D Model Reconstruction by Fusing Multiple | visual | Cues |
3D Motion and Shape Representations in | visual | Servo Control |
3D Motion Representations in | visual | Servo Control |
3D object retrieval via range image queries in a bag-of- | visual | -words context |
3D Reconstruction in the Presence of Glass and Mirrors by Acoustic and | visual | Fusion |
3d Reconstruction of On-/offshore Wind Turbines for Manual And Computational | visual | Inspection |
3D shape retrieval by | visual | parts similarity |
3D | visual | Activity Assessment Based on Natural Scene Statistics |
3D | visual | discomfort predictor based on neural activity statistics |
3D | visual | Discomfort Predictor: Analysis of Disparity and Neural Activity Statistics |
3D | visual | Experience Oriented Cross-Layer Optimized Scalable Texture Plus Depth Based 3D Video Streaming Over Wireless Networks |
3D | visual | Homing for Commodity UAVs |
3D | visual | Information from Vanishing Points |
3D | visual | passcode: Speech-driven 3D facial dynamics for behaviometrics |
3D | visual | phrases for landmark recognition |
3D | visual | pronunciation of Mandarine Chinese for language learning |
3D | visual | Proxemics: Recognizing Human Interactions in 3D from a Single Image |
3D | visual | reconstruction of large scale natural sites and their fauna |
3D | visual | Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency? |
3D | visual | Security (3DVS) score to measure the visual security level of selectively encrypted 3D objects, A |
3D | visual | Security (3DVS) score to measure the visual security level of selectively encrypted 3D objects, A |
3D | visual | simulation of Chinese fir based on the influence of different stand spatial structures |
3D | visual | system using electronic holography: Towards ultra-realistic communication |
3D-IntPhys: Towards More Generalized 3D-grounded | visual | Intuitive Physics under Challenging Scenes |
3D-SPS: Single-Stage 3D | visual | Grounding via Referred Point Progressive Selection |
3D-Yoga: A 3D Yoga Dataset for | visual | -based Hierarchical Sports Action Analysis |
3DJCG: A Unified Framework for Joint Dense Captioning and | visual | Grounding on 3D Point Clouds |
3DQ-Nets: | visual | Concepts Emerge in Pose Equivariant 3D Quantized Neural Scene Representations |
3DVG-Transformer: Relation Modeling for | visual | Grounding on Point Clouds |
3DVQA: | visual | Question Answering for 3D Environments |
4D Space-Time Mereotopogeometry-Part Connectivity Calculus for | visual | Object Representation |
6-Degree-of-Freedom Hand Eye | visual | Tracking with Uncertain Parameters |
A-OKVQA: A Benchmark for | visual | Question Answering Using World Knowledge |
AA-Trans: Core Attention Aggregating Transformer with Information Entropy Selector for Fine-Grained | visual | Classification |
Abduction of Sherlock Holmes: A Dataset for | visual | Abductive Reasoning, The |
Aberrance suppressed spatio-temporal correlation filters for | visual | object tracking |
Ablation-CAM++: Grouped Recursive | visual | Explanations for Deep Convolutional Networks |
Ablation-CAM: | visual | Explanations for Deep Convolutional Network via Gradient-free Localization |
Abrupt motion tracking using a | visual | saliency embedded particle filter |
Abstract | visual | Reasoning Enabled by Language |
Abstract | visual | Reasoning: An Algebraic Approach for Solving Raven's Progressive Matrices |
Abstracts of the LIX Fall Colloquium 2008: Emerging Trends in | visual | Computing |
ACAV100M: Automatic Curation of Large-Scale Datasets for Audio- | visual | Video Representation Learning |
Accelerated low-rank | visual | recovery by random projection |
Accelerated Particle Filter for Real-Time | visual | Tracking With Decision Fusion |
Accelerated | visual | Context Classification on a Low-Power Smartwatch |
Accelerating Multimedia Search by | visual | Features |
Accelerating the Divisive Information-Theoretic Clustering of | visual | Words |
Acceleration-Level Pseudo-Dynamic | visual | Servoing of Mobile Robots With Backstepping and Dynamic Surface Control |
Acceptance of | visual | Search Interfaces for the Web: Design and Empirical Evaluation of a Book Search Interface |
Accumulated | visual | Representation for Cognitive Vision |
Accumulation of Different | visual | Feature Descriptors in a Coherent Framework |
Accumulative Errors Optimization for | visual | Odometry of ORB-SLAM2 Based on RGB-D Cameras |
Accuracy Analysis of Augmented Reality Markers for | visual | Mapping and Localization |
Accuracy vs. complexity: A trade-off in | visual | question answering models |
Accurate and efficient cross-domain | visual | matching leveraging multiple feature representations |
Accurate and Fast Pattern Localization Algorithm for Automated | visual | Inspection, An |
Accurate and robust | visual | SLAM with a novel ray-to-ray line measurement model |
Accurate and robust | visual | tracking using bounding box refinement and online sample filtering |
Accurate bounding-box regression with distance-IoU loss for | visual | tracking |
Accurate Ego-Vehicle Global Localization at Intersections Through Alignment of | visual | Data With Digital Map |
Accurate Global Localization Using | visual | Odometry and Digital Maps on Urban Environments |
Accurate keyframe selection and keypoint tracking for robust | visual | odometry |
Accurate Scale Adaptive and Real-Time | visual | Tracking with Correlation Filters |
Accurate Scale Estimation for Robust | visual | Tracking |
Accurate scale estimation for | visual | tracking with significant deformation |
Accurate | visual | word construction using a supervised approach |
Accurate | visual | -Inertial Integrated Geo-Tagging Method for Crowdsourcing-Based Indoor Localization, An |
Accurate, Robust | visual | Odometry and Detail-Preserving Reconstruction System, An |
Achieving a Fitts Law Relationship for | visual | Guided Reaching |
Acquiring | visual | -Motor Models for Precision Manipulation with Robot Hands |
Acquisition of Obstacle Avoidance Behaviors for a Quadruped Robot Using | visual | and Ultrasonic Sensors |
ACSiam: Asymmetric convolution structures for | visual | tracking with Siamese network |
ACT: an ACTNet for | visual | tracking |
Action Reaction Learning: Automatic | visual | Analysis and Synthesis of Interactive Behaviour |
Action Recognition Using | visual | Attention with Reinforcement Learning |
Action Recognition Using | visual | -Neuron Feature |
Action Recognition With Spatio-Temporal | visual | Attention on Skeleton Image Sequences |
Action Recognition with | visual | Attention on Skeleton Images |
Action Search by Example Using Randomized | visual | Vocabularies |
Action-Decision Networks for | visual | Tracking with Deep Reinforcement Learning |
Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for | visual | Recognition |
Active Audio- | visual | Separation of Dynamic Sound Sources |
Active Contour-Based | visual | Tracking by Integrating Colors, Shapes, and Motions |
Active Contours: The Application of Techniques from Graphics, Vision, Control Theory and Statistics to | visual | Tracking of Shapes in Motion |
Active Learning Paradigm for Online Audio- | visual | Emotion Recognition, An |
Active Perception for | visual | -Language Navigation |
Active Vision With Two Differentiated | visual | Fields |
Active Vision, | visual | Attention |
Active | visual | Attention System to Play Where's Waldo, An |
Active | visual | Estimator for Dextrous Manipulation, An |
Active | visual | Inference of Surface Shape |
Active | visual | Information Gathering for Vision-language Navigation |
Active | visual | Navigation Using Non-Metric Structure |
Active | visual | Object Reconstruction using D-, E-, and T-Optimal Next Best Views |
Active | visual | Recognition with Expertise Estimation in Crowdsourcing |
Active | visual | Segmentation |
Active | visual | Sensing of the 3-D Pose of a Flexible Object |
Active | visual | tracking in multi-agent scenarios |
Active | visual | -Based Detection and Tracking of Moving Objects from Clustering and Classification Methods |
Active vs. Passive | visual | Search: Which Is More Efficient? |
ActiVis: Mobile Object Detection and Active Guidance for People with | visual | Impairments |
Activity Recognition using | visual | Tracking and RFID |
AD-VAT+: An Asymmetric Dueling Mechanism for Learning and Understanding | visual | Active Tracking |
Ada-Sal Network: emulate the Human | visual | System |
Adapted Vocabularies for Generic | visual | Categorization |
Adapting a real-time monocular | visual | SLAM from conventional to omnidirectional cameras |
Adapting computer vision systems to the | visual | environment: Topographic mapping |
Adapting Egocentric | visual | Hand Pose Estimation Towards a Robot-Controlled Exoskeleton |
Adapting Grounded | visual | Question Answering Models to Low Resource Languages |
Adapting Shortcut with Normalizing Flow: An Efficient Tuning Framework for | visual | Recognition |
Adapting | visual | Category Models to New Domains |
Adaptive 2.5D | visual | servoing of cartesian robots |
Adaptive anisotropic filtering (AAF) for real-time | visual | enhancement of MPEG-coded video sequences |
Adaptive Appearance Modeling With Point-to-Set Metric Learning for | visual | Tracking |
Adaptive bag-of- | visual | word modelling using stacked-autoencoder and particle swarm optimisation for the unsupervised categorisation of images |
Adaptive Block-Wise Compressive Image Sensing Based on | visual | Perception |
Adaptive Channel Selection for Robust | visual | Object Tracking with Discriminative Correlation Filters |
Adaptive Color Attributes for Real-Time | visual | Tracking |
Adaptive color image compression based on | visual | attention |
Adaptive Context-Aware Discriminative Correlation Filters for Robust | visual | Object Tracking |
Adaptive Control Techniques for Dynamic | visual | Repositioning of Hand-Eye Robotic Systems |
Adaptive convolutional layer selection based on historical retrospect for | visual | tracking |
adaptive coupled-layer | visual | model for robust visual tracking, An |
adaptive coupled-layer | visual | model for robust visual tracking, An |
Adaptive Cross-Modal Prototypes for Cross-Domain | visual | -Language Retrieval |
adaptive dandelion model for reconstructing spherical terrain-like | visual | hull surfaces, An |
adaptive data hiding scheme with high embedding capacity and | visual | image quality based on SMVQ prediction through classification codebooks, An |
Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative | visual | Tracking |
Adaptive Discriminative Deep Correlation Filter for | visual | Object Tracking |
Adaptive discriminative generative model and application to | visual | tracking |
Adaptive Edge Enhancement Using a Neurodynamical Model of | visual | Attention |
Adaptive estimation of | visual | smoke detection parameters based on spatial data and fire risk index |
Adaptive Feature Attention Module for Robust | visual | -LiDAR Fusion-Based Object Detection in Adverse Weather Conditions |
Adaptive feature fusion for | visual | object tracking |
Adaptive feature representation for | visual | tracking |
Adaptive Fitting Approach for the | visual | Detection and Counting of Small Circular Objects in Manufacturing Applications, An |
Adaptive fusion of human | visual | sensitive features for surveillance video summarization |
Adaptive Hierarchical Model of the Ventral | visual | Pathway Implemented on a Mobile Robot, An |
Adaptive Image Feature Prediction and Control for | visual | Tracking with a Hand-Eye Coordinated Camera |
Adaptive Image Segmentation Method with | visual | Nonlinearity Characteristics, An |
Adaptive Ladder Loss for Learning Coherent | visual | -Semantic Embedding |
Adaptive Learning Procedure for a Network of Spiking Neurons and | visual | Pattern Recognition |
Adaptive Method for Efficient Detection of Salient | visual | Object from Color Images, An |
adaptive mixture color model for robust | visual | tracking, An |
Adaptive modeling and segmentation of | visual | image streams |
Adaptive multi-cue fusion for | visual | target tracking based on uncertainly measure |
Adaptive Multi-Feature Reliability Re-Determinative Correlation Filter for | visual | Tracking |
Adaptive Non-rigid Object Tracking by Fusing | visual | and Motional Descriptors |
Adaptive on-line similarity measure for direct | visual | tracking |
Adaptive Part Mining for Robust | visual | Tracking |
Adaptive Partial Differential Equation Learning for | visual | Saliency Detection |
Adaptive Probabilistic | visual | Tracking with Incremental Subspace Update |
Adaptive pyramid mean shift for global real-time | visual | tracking |
Adaptive RGB Image Recognition by | visual | -Depth Embedding |
Adaptive Robotic Contour Following from Low Accuracy RGB-D Surface Profiling and | visual | Servoing |
Adaptive selection of | visual | and infra-red image fusion rules |
Adaptive Speaker Identification with Audio- | visual | Cues for Movie Content Analysis |
Adaptive Synthesis in Progressive Retrieval of Audio- | visual | Data |
Adaptive Techniques for Simultaneous Optimization of | visual | Quality and Battery Power in Video Encoding Sensors |
Adaptive Text Recognition Through | visual | Matching |
Adaptive transmission compensation via human | visual | system for efficient single image dehazing |
Adaptive Unsupervised Multi-view Feature Selection for | visual | Concept Recognition |
Adaptive Updating Probabilistic Model for | visual | Tracking |
Adaptive Video Presentation for Small Display While Maximize | visual | Information |
Adaptive | visual | Inspection Method for Transparent Label Defect Detection of Curved Glass Bottle |
Adaptive | visual | Obstacle Detection for Mobile Robots Using Monocular Camera and Ultrasonic Sensor |
adaptive | visual | quality optimization method for Internet video applications, An |
Adaptive | visual | System for Tracking Low Resolution Colour Targets |
Adaptive | visual | target detection and tracking using incremental appearance learning |
Adaptive | visual | target detection and tracking using weakly supervised incremental appearance learning and RGM-PHD tracker |
Adaptive | visual | Tracking Algorithm and Real Time Implementation |
Adaptive | visual | Tracking Control for Manipulator With Actuator Fuzzy Dead-Zone Constraint and Unmodeled Dynamic |
Adaptive | visual | tracking using the prioritized Q-learning algorithm: MDP-based parameter learning approach |
Adaptive | visual | Tracking with Minimum Uncertainty Gap Estimation |
Adaptive | visual | -Depth Fusion Transfer |
Adaptive, real-time | visual | simultaneous localization and mapping |
Adaptively Clustering-Driven Learning for | visual | Relationship Detection |
ADCCF: Adaptive deep concatenation coder framework for | visual | question answering |
Adding Color Information to Spatially-Enhanced, Bag-of- | visual | -Words Models |
Adding Object Detection Skills to | visual | Dialogue Agents |
Addressing Feature Suppression in Unsupervised | visual | Representations |
Addressing Information Inequality for Text-Based Person Search via Pedestrian-Centric | visual | Denoising and Bias-Aware Alignments |
Addressing | visual | Consistency in Video Retargeting: A Refined Homogeneous Approach |
Addressing | visual | Search in Open and Closed Set Settings |
Adopting Feature-Based | visual | Odometry for Resource-Constrained Mobile Devices |
Advanced Correlation Filters for Face Recognition Using Low-Resolution | visual | and Thermal Imagery |
Advanced modeling of | visual | information processing: A multi-resolution directional-oriented image transform based on Gaussian derivatives |
Advanced | visual | Sensor Systems |
Advanced | visual | Sensor Systems (1998) |
Advanced | visual | Surveillance Using Bayesian Networks |
Advances in the statistical methodology for the selection of image descriptors for | visual | pattern representation and classification |
Advances in | visual | Computing |
Advances in | visual | Information Management: Visual Database Systems |
Advances in | visual | Information Management: Visual Database Systems |
Advances in | visual | information processing |
Advancing | visual | Grounding with Scene Knowledge: Benchmark and Method |
Adversarial Counterfactual | visual | Explanations |
Adversarial Examples in | visual | Object Tracking in Satellite Videos: Cross-Frame Momentum Accumulation for Adversarial Examples Generation |
Adversarial Learning for | visual | Storytelling with Sense Group Partition |
Adversarial Mask Generation for Preserving | visual | Privacy |
Adversarial Training with Bi-Directional Likelihood Regularization for | visual | Classification |
Adversarial-Metric Learning for Audio- | visual | Cross-Modal Matching |
ADVIO: An Authentic Dataset for | visual | -Inertial Odometry |
Ae Textspotter: Learning | visual | and Linguistic Representation for Ambiguous Text Spotting |
Aesthetic assessment of paintings based on | visual | balance |
Affection: Learning Affective Explanations for Real-World | visual | Data |
Affective Audio- | visual | Words and Latent Topic Driving Model for Realizing Movie Affective Scene Classification |
Affective | visual | Perception Using Machine Pareidolia of Facial Expressions |
Affine hull based target representation for | visual | tracking |
Affine invariant | visual | phrases for object instance recognition |
Affine | visual | Servoing |
Affine | visual | Servoing for Robot Relative Positioning and Landmark-Based Docking |
Affine-Invariant | visual | Features Contain Supplementary Information to Enhance Speech Recognition |
Affinity Graph Supervision for | visual | Recognition |
Age interval and gender prediction using PARAFAC2 and SVMs based on | visual | and aural features |
Agent Orientated Annotation in Model Based | visual | Surveillance |
Agent-Centric Relation Graph for Object | visual | Navigation |
Aggregating Global and Local | visual | Representation for Vehicle Re-IDentification |
AI-Based | visual | Aid With Integrated Reading Assistant for the Completely Blind, An |
AiATrack: Attention in Attention for Transformer | visual | Tracking |
AIT 3D Audio / | visual | Person Tracker for CLEAR 2007, The |
AKVSR: Audio Knowledge Empowered | visual | Speech Recognition by Compressing Audio Knowledge of a Pretrained Model |
Algebraic solution for the | visual | hull |
Algorithm/Architecture Co-Exploration of | visual | Computing on Emergent Platforms: Overview and Future Prospects |
Algorithmic Representation of | visual | Information |
Algorithms and Techniques for Automated | visual | Inspection |
Algorithms for Defining | visual | Regions-of-Interest: Comparison with Eye Fixations |
Algorithms for multiplex scheduling of object-based audio- | visual | presentations |
Algorithms for postprocessing OCR results with | visual | inter-word constraints |
Align R-CNN: A Pairwise Head Network for | visual | Relationship Detection |
Aligning Books and Movies: Towards Story-Like | visual | Explanations by Watching Movies and Reading Books |
Aligning Source | visual | and Target Language Domains for Unpaired Video Captioning |
Aligning vision-language for graph inference in | visual | dialog |
AlignNet: A Unifying Approach to Audio- | visual | Alignment |
AlignVE: | visual | Entailment Recognition Based on Alignment Relations |
All in Tokens: Unifying Output Space of | visual | Tasks via Soft Token |
All-Transputer | visual | Autobahn-Autopilot/Copilot, An |
ALSA: Adversarial Learning of Supervised Attentions for | visual | Question Answering |
Alzheimer's disease diagnosis based on the | visual | attention model and equal-distance ring shape context features |
Ambiance in Social Media Venues: | visual | Cue Interpretation by Machines and Crowds |
Ambient Sound Provides Supervision for | visual | Learning |
Amodal volume completion: 3D | visual | completion |
AMOVIP: advanced modeling of | visual | information processing |
AmsterTime: A | visual | Place Recognition Benchmark Dataset for Severe Domain Shift |
analogic single-chip CNN | visual | supercomputer: A review, The |
Analysing User | visual | Implicit Feedback in Enhanced TV Scenarios |
Analysis and Adaptation of Integration Time in PMD Camera for | visual | Servoing |
Analysis and interpretation of | visual | saliency for document functional labeling |
analysis and research of | visual | perception and image processing in visual information design: Take Google Earth for example, The |
analysis and research of | visual | perception and image processing in visual information design: Take Google Earth for example, The |
Analysis of brain-facial muscle connection in the static fractal | visual | stimulation |
Analysis of Compact Features for RGB-D | visual | Search |
Analysis of Human Movement and Its Application for | visual | Surveillance, The |
Analysis of Lip Geometric Features for Audio- | visual | Speech Recognition |
Analysis of Multihypothesis Motion Compensated Prediction (MHMCP) for Robust | visual | Communication |
Analysis of Scores, Datasets, and Models in | visual | Saliency Prediction |
Analysis of Thermal Infrared and | visual | Images for Industrial Inspection Tasks |
Analysis of Various | visual | Cryptographic Techniques and their Issues Based on Optimization Algorithms |
Analysis of | visual | Adaptation and Contrast Perception for Tone Mapping, An |
Analysis Of | visual | Interpretation Of Satellite Data |
Analysis of | visual | Motion by Biological and Computer Systems |
Analysis of | visual | Motion: From Computational Theory to Neuronal Mechanisms, The |
Analysis of | visual | Question Answering Algorithms, An |
Analysis of | visual | risk perception model for braking control behaviour of human drivers: A literature review |
Analysis of | visual | Search Patterns With EMD Metric in Normalized Anatomical Space |
Analyzing Muscle Activity and Force with Skin Shape Captured by Non-contact | visual | Sensor |
Analyzing Sensor Quantization of Raw Images For | visual | SLAM |
Analyzing Vision at the Complexity Level: Constraints on an Architecture, An Explanation for | visual | Search Performance, and Computational Justification for Attentive Processes |
Analyzing | visual | -search observers using eye-tracking data for digital breast tomosynthesis images |
Angle histogram of Hough transform as shape signature for | visual | object classification - (AHOC) |
Animatable 3D Model Generation from 2D Monocular | visual | Data |
Animation Transformer: | visual | Correspondence via Segment Matching, The |
Anisotropic Filtering Operations for Image Enhancement and Their Relation to the | visual | System |
Anisotropies in | visual | Motion Perception: A Fresh Look |
Annotation-free Audio- | visual | Segmentation |
Annotator rationales for | visual | recognition |
Anomaly Detection for Road Traffic: A | visual | Analytics Framework |
Anomaly Detection in Road Traffic Using | visual | Surveillance: A Survey |
Anomaly Matters: An Anomaly-Oriented Model for Medical | visual | Question Answering |
Answer Distillation for | visual | Question Answering |
Answer Them All! Toward Universal | visual | Question Answering Models |
Answer-checking in Context: A Multi-modal Fully Attention Network for | visual | Question Answering |
Answer-Type Prediction for | visual | Question Answering |
Answering knowledge-based | visual | questions via the exploration of Question Purpose |
Answering | visual | What-If Questions: From Actions to Predicted Scene Descriptions |
Anticipating | visual | Representations from Unlabeled Video |
AP-CNN: Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained | visual | Classification |
APEX: an adaptive | visual | information retrieval system |
Appearance and Structure Aware Robust Deep | visual | Graph Matching: Attack, Defense and Beyond |
Appearance Based Indexing for Relocalisation in Real-Time | visual | SLAM |
Appearance Variation Insensitive State Regression for | visual | Tracking |
Appearance-Based Gaze Estimation Using | visual | Saliency |
Appearance-based Loop Closure Detection with Scale-restrictive | visual | Features |
Appearance-Based Particle Filter for | visual | Tracking in Smart Rooms, An |
Appearance-Based | visual | Learning and Object Recognition with Illumination Invariance |
Appearances Can Be Deceiving: Learning | visual | Tracking from Few Trajectory Annotations |
APPLeNet: | visual | Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP |
application and design of neural computation in | visual | perception, The |
Application and Evaluation of Colour Constancy in | visual | Surveillance |
Application of Adaptive Convolution Masking to the Automation of | visual | Inspection |
Application of Color Information to | visual | Perception |
Application of JND | visual | model to SPIHT image coding and performance evaluation |
Application of Lie Algebras to | visual | Servoing |
Application of support vector machines classifiers to | visual | speech recognition |
Application of Vision Algorithms to | visual | Effects Production, The |
application of | visual | Computational Theory in spatial frequency domain: The simulation of dynamic radiated fringes, An |
Applications and Challenges of Wearable | visual | Lifeloggers |
Applications for bio-inspired | visual | processing algorithms |
Applications of Computer Graphics and Image Processing to 2-D and 3-D Modeling of the Functional Architecture of | visual | Cortex |
Applications of non-metric vision to some | visual | guided tasks |
Applications of Sequence Geometry to | visual | Motion |
Applying audio description for context understanding of surveillance videos by people with | visual | impairments |
Applying Detection Proposals to | visual | Tracking for Scale and Aspect Ratio Adaptability |
Applying Preattentive | visual | Guidance in Document Image Analysis |
Applying Segment-Level Attention on Bi-Modal Transformer Encoder for Audio- | visual | Emotion Recognition |
Applying | visual | Object Categorization and Memory Colors for Automatic Color Constancy |
Applying | visual | Processing to GPS Mapping of Trackside Structures |
Applying | visual | User Interest Profiles for Recommendation and Personalisation |
approach for image retrieval based on | visual | saliency, An |
Approach for Preparing Groundtruth Data and Evaluating | visual | Saliency Models, An |
Approach for | visual | Realism Complexity Classification of 3d Models in Virtual and Augmented Reality, An |
approach of | visual | motion analysis, An |
Approach to Investigate an Influence of | visual | Angle Size on Emotional Activation During a Decision-making Task, An |
Approach to Overcome Occlusions in | visual | Tracking: By Occlusion Estimating Agency and Self-Adapting Learning Rate for Filter's Training, An |
Approaches for Event Segmentation of | visual | Lifelog Data |
Approximating the visuomotor function for | visual | servoing |
Approximation-Based Keypoints in Colour Images: A Tool for Building and Searching | visual | Databases |
Approximations of Gaussian Process Uncertainties for | visual | Recognition Problems |
AR Assistive System in Domestic Environment Using HMDs: Comparing | visual | and Aural Instructions |
Arbitrary-Shape Scene Text Detection via | visual | -Relational Rectification and Contour Approximation |
Architecture for Prototyping and Application Development of | visual | Tracking Systems, An |
Architectures and | visual | -Processing Applications of Multimedia DSPs |
Are All Combinations Equal? Combining Textual and | visual | Features with Multiple Space Learning for Text-based Video Retrieval |
Are Current Monocular Computer Vision Systems for Human Action Recognition Suitable for | visual | Surveillance Applications? |
Are Large-Scale 3D Models Really Necessary for Accurate | visual | Localization? |
Are Local Features All You Need for Cross-Domain | visual | Place Recognition? |
Are | visual | Informatics Actually Useful in Practice: A Study in a Film Studies Context |
Are You Talking to Me? Reasoned | visual | Dialog Generation Through Adversarial Learning |
ARM-VO: an efficient monocular | visual | odometry for ground vehicles on ARM CPUs |
Arousal Recognition Using Audio- | visual | Features and FMRI-Based Brain Response |
ArtEmis: Affective Language for | visual | Art |
Artificial intelligence structural imaging techniques in | visual | pattern analysis and medical data understanding |
Artpedia: A New | visual | -Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain |
Artpedia: A New | visual | -Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain |
Ascender II: A | visual | Framework for 3D Reconstruction |
Ask Me Anything: Free-Form | visual | Question Answering Based on Knowledge from External Sources |
Ask Your Neurons: A Deep Learning Approach to | visual | Question Answering |
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for | visual | Question Answering |
Aspect ratio invariant | visual | secret sharing schemes with minimum pixel expansion |
Aspects of | visual | Form Processing |
Assessing Accuracy of Land Cover Change Maps Derived from Automated Digital Processing and | visual | Interpretation in Tropical Forests in Indonesia |
Assessing Essential Qualities of Urban Space with Emotional and | visual | Data Based on GIS Technique |
Assessing Perceived Image Quality Using Steady-State | visual | Evoked Potentials and Spatio-Spectral Decomposition |
Assessing Similarities and Differences between Males and Females in | visual | Behaviors in Spatial Orientation Tasks |
Assessing the contribution of color in | visual | attention |
Assessing the Distinctiveness and Representativeness of | visual | Vocabularies |
Assessing the | visual | effect of non-periodic temporal variation of quantization stepsize in compressed video |
Assessing | visual | attributes of handwriting for prediction of neurological disorders: A case study on Parkinson's disease |
Assessing | visual | Quality of 3-D Polygonal Models |
Assessing | visual | Quality of Omnidirectional Videos |
Assessment and classification of singing quality based on audio- | visual | features |
Assessment of feature fusion strategies in | visual | attention mechanism for saliency detection |
Assessment of MODIS, OMI, MISR and CALIOP Aerosol Products for Estimating Surface | visual | Range: A Mathematical Model for Hong Kong |
Assessment of | visual | Discomfort Caused by Motion-in-Depth in Stereoscopic 3D Video, An |
Assisting human experts in the interpretation of their | visual | process: A case study on assessing copper surface adhesive potency |
Assistive Malaysian Sign Language Application for D/HH Learning Using | visual | Phonics |
Associating audio- | visual | activity cues in a dominance estimation framework |
Association Loss for | visual | Object Detection |
Asymmetric Foveated Just-Noticeable-Difference Model for Images With | visual | Field Inhomogeneities |
asymmetric real-time dense | visual | localisation and mapping system, An |
Asymmetric Sparse Kernel Approximations for Large-Scale | visual | Search |
Asymmetry as a Measure of | visual | Saliency |
Attack Agnostic Adversarial Defense via | visual | Imperceptible Bound |
Attend and Imagine: Multi-Label Image Classification With | visual | Attention and Recurrent Neural Networks |
Attending to | visual | motion |
Attending to | visual | motion: localizing and classifying affine motion patterns |
Attention Based Speaker-independent Audio- | visual | Deep Learning Model for Speech Enhancement, An |
Attention Branch Network: Learning of Attention Mechanism for | visual | Explanation |
Attention Consistency on | visual | Corruptions for Single-Source Domain Generalization |
Attention Convolutional Binary Neural Tree for Fine-Grained | visual | Categorization |
Attention Fusion for Audio- | visual | Person Verification Using Multi-Scale Features |
Attention meets involution in | visual | tracking |
Attention Prediction in Egocentric Video Using Motion and | visual | Saliency |
Attention Where It Matters: Rethinking | visual | Document Understanding with Selective Region Concentration |
Attention-Aware Age-Agnostic | visual | Place Recognition |
Attention-Based Dynamic | visual | Search Using Inner-Scene Similarity: Algorithms and Bounds |
Attention-based Long-term Modeling for Deep | visual | Odometry |
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio- | visual | Quality Assessment |
Attention-Guided Spatial Transformer Networks for Fine-Grained | visual | Recognition |
Attention-shift based deep neural network for fine-grained | visual | categorization |
Attentional Control for | visual | Surveillance |
Attentional Correlation Filter Network for Adaptive | visual | Tracking |
Attentional Kernel Encoding Networks for Fine-Grained | visual | Categorization |
Attentional Pyramid Pooling of Salient | visual | Residuals for Place Recognition |
Attentive Feature Augmentation for Long-Tailed | visual | Recognition |
Attentive | visual | Recognition |
Attentive | visual | Semantic Specialized Network for Video Captioning |
Attentive | visual | Servoing in the MPEG compressed domain for Un-calibrated Motion Parameter Estimation of Road Traffic |
Attentive | visual | Tracking |
Attentive | visual | Tracking |
Attributable | visual | Similarity Learning |
Attribute Embedding with | visual | -Semantic Ambiguity Removal for Zero-shot Learning |
Attribute rating for classification of | visual | objects |
Attribute-Based Classification for Zero-Shot | visual | Object Categorization |
Attribute2Image: Conditional Image Generation from | visual | Attributes |
Atypical Salient Regions Enhancement Network for | visual | saliency prediction of individuals with Autism Spectrum Disorder |
Audio Assisted Robust | visual | Tracking With Adaptive Particle Filtering |
Audio Matters in | visual | Attention |
Audio | visual | isolated Hindi digits recognition using HMM |
Audio | visual | Person Authentication by Multiple Nearest Neighbor Classifiers |
Audio | visual | Scene-Aware Dialog |
Audio | visual | Speaker Verification Based on Hybrid Fusion of Cross Modal Features |
Audio- | visual | Active Speaker Tracking in Cluttered Indoors Environments |
Audio- | visual | Affect Recognition |
Audio- | visual | Affect Recognition through Multi-Stream Fused HMM for HCI |
Audio- | visual | Affective Expression Recognition Through Multistream Fused HMM |
Audio- | visual | attention: Eye-tracking dataset and analysis toolbox |
Audio- | visual | Automatic Group Affect Analysis |
Audio- | visual | based emotion recognition-a new approach |
Audio- | visual | biometric recognition via joint sparse representations |
Audio- | visual | Biometrics |
Audio- | visual | Class-Incremental Learning |
Audio- | visual | Classification and Fusion of Spontaneous Affective Data in Likelihood Space |
Audio- | visual | Classification of Sports Types |
Audio- | visual | Classification Video Browser |
Audio- | visual | Co-Training for Vehicle Classification |
Audio- | visual | content-based violent scene characterization |
Audio- | visual | continuous speech recognition using MPEG-4 compliant visual features |
Audio- | visual | continuous speech recognition using MPEG-4 compliant visual features |
Audio- | visual | Contrastive and Consistency Learning for Semi-Supervised Action Recognition |
Audio- | visual | data association for face expression analysis |
Audio- | visual | Data Fusion Using a Particle Filter in the Application of Face Recognition |
Audio- | visual | Deception Detection: DOLOS Dataset and Parameter-Efficient Crossmodal Learning |
Audio- | visual | Efficient Conformer for Robust Speech Recognition |
Audio- | visual | Emotion Analysis Using Semi-Supervised Temporal Clustering with Constraint Propagation |
Audio- | visual | Emotion Recognition in Video Clips |
Audio- | visual | emotion recognition using Boltzmann Zippers |
Audio- | visual | emotion recognition with boosted coupled HMM |
Audio- | visual | Emotion Recognition With Preference Learning Based on Intended and Multi-Modal Perceived Labels |
Audio- | visual | Emotion, Audiovisual Emotion Recognition |
Audio- | visual | Emotion-Aware Cloud Gaming Framework |
Audio- | visual | event classification via spatial-temporal-audio words |
Audio- | visual | Event Detection using Duration Dependent Input Output Markov Models |
Audio- | visual | Event Localization by Learning Spatial and Semantic Co-Attention |
Audio- | visual | Event Localization in Unconstrained Videos |
Audio- | visual | Event Localization via Recursive Fusion by Joint Co-Attention |
Audio- | visual | Event Recognition in Surveillance Video Sequences |
Audio- | visual | Face Reenactment |
Audio- | visual | Feature Fusion for Vehicles Classification in a Surveillance System |
Audio- | visual | Floorplan Reconstruction |
Audio- | visual | flow: A variational approach to multi-modal flow estimation |
Audio- | visual | Foreground Extraction for Event Characterization |
Audio- | visual | Gated-Sequenced Neural Networks for Affect Recognition |
Audio- | visual | Glance Network for Efficient Video Recognition |
Audio- | visual | Grouping Network for Sound Localization from Mixtures |
Audio- | visual | Hybrid Approach for Filling Mass Estimation |
Audio- | visual | Identity Verification and Robustness to Imposture |
Audio- | visual | Instance Discrimination with Cross-Modal Agreement |
Audio- | visual | Keyword Spotting Based on Multidimensional Convolutional Neural Network |
Audio- | visual | Keyword Spotting for Mandarin Based on Discriminative Local Spatial-Temporal Descriptors |
Audio- | visual | Kinship Verification: A New Dataset and a Unified Adaptive Adversarial Multimodal Learning Approach |
Audio- | visual | Mismatch-Aware Video Retrieval via Association and Adjustment |
Audio- | visual | Model Distillation Using Acoustic Images |
Audio- | visual | Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking |
Audio- | visual | Person Authentication with Multiple Visualized-Speech Features and Multiple Face Profiles |
Audio- | visual | Person Verification |
Audio- | visual | Person-of-Interest DeepFake Detection |
Audio- | visual | Predictive Coding for Self-Supervised Visual Representation Learning |
Audio- | visual | Predictive Coding for Self-Supervised Visual Representation Learning |
Audio- | visual | processing for scene change detection |
Audio- | visual | Quality Assessment for User Generated Content: Database and Method |
Audio- | visual | Recognition System in Compression Domain |
Audio- | visual | saliency prediction for movie viewing in immersive environments: Dataset and benchmarks |
Audio- | visual | saliency prediction with multisensory perception and integration |
Audio- | visual | Scene Analysis with Self-Supervised Multisensory Features |
Audio- | visual | Segmentation |
Audio- | visual | selection process for the synthesis of photo-realistic talking-head animations |
audio- | visual | sensor fusion approach for feature based vehicle identification, An |
Audio- | visual | Sensor Fusion Framework Using Person Attributes Robust to Missing Visual Modality for Person Recognition |
Audio- | visual | Sensor Fusion Framework Using Person Attributes Robust to Missing Visual Modality for Person Recognition |
Audio- | visual | speaker detection using dynamic Bayesian networks |
Audio- | visual | Speaker Diarization Based on Spatiotemporal Bayesian Fusion |
Audio- | visual | Speaker Identification Based on the Use of Dynamic Audio and Visual Features |
Audio- | visual | Speaker Identification Based on the Use of Dynamic Audio and Visual Features |
audio- | visual | speaker identification using coupled hidden Markov models, A |
Audio- | visual | Speaker Identification via Adaptive Fusion Using Reliability Estimates of Both Modalities |
Audio- | visual | speaker identification with multi-view distance metric learning |
Audio- | visual | Speaker Localization Using Graphical Models |
Audio- | visual | speaker tracking with importance particle filters |
Audio- | visual | Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis |
Audio- | visual | Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis |
Audio- | visual | Speech Fusion Using Coupled Hidden Markov Models |
Audio- | visual | Speech Recognition Based on AAM Parameter and Phoneme Analysis of Visual Feature |
Audio- | visual | Speech Recognition Based on AAM Parameter and Phoneme Analysis of Visual Feature |
Audio- | visual | Speech Recognition Scheme Based on Wavelets and Random Forests Classification |
Audio- | visual | speech recognition techniques in augmented reality environments |
Audio- | visual | Speech Recognition Using A Two-Step Feature Fusion Strategy |
Audio- | visual | Speech Recognition Using MPEG-4 Compliant Visual Features |
Audio- | visual | Speech Recognition Using MPEG-4 Compliant Visual Features |
Audio- | visual | speech synchronization detection using a bimodal linear prediction model |
Audio- | visual | Speech Synthesis Based on Chinese Visual Triphone |
Audio- | visual | Speech Synthesis Based on Chinese Visual Triphone |
Audio- | visual | System for Object-Based Audio: From Recording to Listening, An |
Audio- | visual | Temporal Saliency Modeling Validated by fMRI Data |
Audio- | visual | Tracking of Concurrent Speakers |
Audio- | visual | Transformer Based Crowd Counting |
Audio- | visual | Unit Selection for the Synthesis of Photo-Realistic Talking-Heads |
AudioScopeV2: Audio- | visual | Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation |
Audio | visual | Discrimination Between Speech and Laughter: Why and When Visual Information Might Help |
Audio | visual | Transformer with Instance Attention for Audio-visual Event Localization |
Auditory and | visual | Properties in the Virtual Reality Using Haptic Device |
Augmentation Pathways Network for | visual | Recognition |
Augmented Multimodality Fusion for Generalized Zero-Shot Sketch-Based | visual | Retrieval |
Augmented Particle Filtering for Efficient | visual | Tracking |
Augmented Reality Head-Up Display: A | visual | Support During Malfunctions in Partially Automated Driving? |
Augmented Reality Views: Discussing the Utility of | visual | Elements by Mediation Means in Industrial AR from a Design Perspective |
Augmented | visual | -Semantic Embeddings for Image and Sentence Matching |
Augmenting Crop Detection for Precision Agriculture with Deep | visual | Transfer Learning: A Case Study of Bale Detection |
Augmenting Vision Language Pretraining by Learning Codebook with | visual | Semantics |
Australian Centre for | visual | Technologies |
Authenticating | visual | Cryptography Shares Using 2D Barcodes |
Autism Spectrum Disorder Identification from | visual | Exploration of Images |
Auto-Encoder-Based Shared Mid-Level | visual | Dictionary Learning for Scene Classification Using Very High Resolution Remote Sensing Images |
Auto-Grouped Sparse Representation for | visual | Analysis |
Auto-Navigator: Decoupled Neural Architecture Search for | visual | Navigation |
Auto-Organized | visual | Perception Using Distributed Camera Network |
Auto-Parsing Network for Image Captioning and | visual | Question Answering |
AutoBD: Automated Bi-Level Description for Scalable Fine-Grained | visual | Categorization |
Autocalibration of | visual | Sensor Parameters on a Robotic Head |
Autocovariance-based Perceptual Textural Features Corresponding to Human | visual | Perception |
Autofocus window selection algorithm based on | visual | saliency |
AutoFormer: Searching Transformers for | visual | Recognition |
Autogrouped Sparse Representation for | visual | Analysis |
Automated Audio- | visual | Activity Analysis |
Automated Creation of | visual | Routines Using Genetic Programming |
Automated detection of errors and quality issues in audio- | visual | content |
Automated Detection of Human for | visual | Surveillance System |
Automated Estimator of Image | visual | Realism Based on Human Cognition, An |
Automated identification and retrieval of moth images with semantically related | visual | attributes on the wings |
Automated Inspection of Solder Bumps Using | visual | Signatures of Specular Image-Highlights |
Automated Plantation Mapping in Southeast Asia Using MODIS Data and Imperfect | visual | Annotations |
Automated Real-Time | visual | Inspection System for High-Resolution Superimposed Printings |
Automated Temporal Analysis of Gaze Following in a | visual | Tracking Task, The |
Automated | visual | analysis in large scale sensor networks |
Automated | visual | Fin Identification of Individual Great White Sharks |
Automated | visual | fruit detection for harvest estimation and robotic harvesting |
Automated | visual | identification of characters in situation comedies |
Automated | visual | Inspection |
Automated | visual | Inspection of Glass Bottles Using Adapted Median Filtering |
Automated | visual | inspection of imprint quality of pharmaceutical tablets |
Automated | visual | inspection of pharmaceutical tablets in heavily cluttered dynamic environments |
Automated | visual | Inspection of Railroad Tracks |
Automated | visual | inspection of ripple defects using wavelet characteristic based multivariate statistical approach |
Automated | visual | Inspection Of Rolled Metal Surfaces |
Automated | visual | Inspection of Solder Joints Using 2D and 3D Features, An |
Automated | visual | inspection of target parts for train safety based on deep learning |
Automated | visual | Inspection of Textile |
Automated | visual | Inspection System for the Classification of the Phases of Ti-6Al-4V Titanium Alloy, An |
Automated | visual | Inspection Techniques and Applications: A Bibliography |
Automated | visual | Inspection: 1981 to 1987 |
Automated | visual | Inspection: A Survey |
Automated | visual | Perception-Based Web Browser Rendering Results Comparison with Multi-part Fragment Image Matching |
Automated | visual | Recognizability Evaluation of Traffic Sign Based on 3D LiDAR Point Clouds |
Automated | visual | stimuli evoked multi-channel EEG signal classification using EEGCapsNet |
Automated | visual | Surveillance Using Hidden Markov Models |
Automated | visual | Traffic Monitoring and Surveillance Through a Network of Distributed Units |
Automatic Acquisition of | visual | Models for Image Recognition |
Automatic Assessment of Depression Based on | visual | Cues: A Systematic Review |
Automatic Audio- | visual | Fusion for Aggression Detection Using Meta-information |
Automatic Calibration and | visual | Servoing for a Robot Navigation System |
Automatic classification of medical X-ray images using a bag of | visual | words |
Automatic Classification of Optical Defects of Mirrors from Ronchigram Images Using Bag of | visual | Words and Support Vector Machines |
Automatic Concept Discovery from Parallel Text and | visual | Corpora |
Automatic creation of magazine-page-like social media | visual | summary for mobile browsing |
Automatic Deformation Detection for | visual | Post Inspection |
Automatic Detection of Utility Poles Using the Bag of | visual | Words Method for Different Feature Extractors |
Automatic Exact Histogram Specification for Contrast Enhancement and | visual | System Based Quantitative Evaluation |
Automatic Eye Localization; Multi-block LBP vs. Pyramidal LBP Three-Levels Image Decomposition for Eye | visual | Appearance Description |
Automatic Foveation for Video Compression Using a Neurobiological Model of | visual | Attention |
Automatic Group Affect Analysis in Images via | visual | Attribute and Feature Networks |
Automatic Identification Of Diatoms Using | visual | Human-Interpretable Features |
Automatic Identification of Perceptually Important Regions in an Image Using a Model of the Human | visual | System |
Automatic Image Annotation by Ensemble of | visual | Descriptors |
Automatic Image Cropping for | visual | Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression |
Automatic Key-frame Selection Method for | visual | Odometry Based On The Improved PWC-Net, An |
Automatic Measurement of | visual | Attention to Video Content using Deep Learning |
Automatic Method for | visual | Grading of Seed Food Products |
Automatic Multiple | visual | Inspection on Non-calibrated Image Sequence with Intermediate Classifier Block |
Automatic Pear Extraction from High-Resolution Images by a | visual | Attention Mechanism Network |
Automatic Photo Tagging and | visual | Image Search (ALIPR) |
Automatic Prediction of Perceived Traits Using | visual | Cues under Varied Situational Context |
Automatic Rating of Perivascular Spaces in Brain MRI Using Bag of | visual | Words |
Automatic Recognition of Human Emotions Induced by | visual | Contents of Digital Images Based on Color Histogram |
Automatic region of interest tracking for | visual | characterization of the driver's behaviour |
Automatic retrieval of | visual | continuity errors in movies |
Automatic Robust Background Modeling Using Multivariate Non-parametric Kernel Density Estimation for | visual | Surveillance |
Automatic scoring of CDMAM using a model of the recognition threshold of the human | visual | system: R* |
Automatic Segmentation of TV News into Stories Using | visual | and Temporal Information |
Automatic Selection and Detection of | visual | Landmarks Using Multiple Segmentations |
Automatic Selection of Image Features for | visual | Servoing |
Automatic Selection of Visemes for Image-based | visual | Speech Synthesis |
Automatic Shot-Change Detection Algorithm Based on | visual | Rhythm Extraction |
Automatic Statistical Object Detection for | visual | Surveillance |
Automatic Tagging by Leveraging | visual | and Annotated Features in Social Media |
Automatic textile image annotation by predicting emotional concepts from | visual | features |
Automatic Thresholding Based on Human | visual | -Perception |
Automatic Thumbnail Generation Based on | visual | Representativeness and Foreground Recognizability |
Automatic Video Object Segmentation Based on | visual | and Motion Saliency |
Automatic | visual | Concept Learning for Social Event Understanding |
Automatic | visual | dictionary generation through Optimum-Path Forest clustering |
Automatic | visual | Fingerprinting for Indoor Image-Based Localization Applications |
Automatic | visual | Inspection Based upon a Variant of the N-Tuple Technique |
Automatic | visual | Inspection of LSI Photomasks |
Automatic | visual | inspection of thermoelectric metal pipes |
Automatic | visual | Inspection of Wood Surfaces |
Automatic | visual | Inspection System for Integrated Circuit Chips, An |
Automatic | visual | mimicry expression analysis in interpersonal interaction |
Automatic | visual | pattern mining from categorical image dataset |
Automatic | visual | recognition of armed robbery |
Automatic | visual | Recognition of Deformable Objects for Grasping and Manipulation |
Automatic | visual | Solder Joint Inspection |
Automatic | visual | Sorting Method of Compressors with Stamped Marks |
Automatic | visual | speech segmentation and recognition using directional motion history images and Zernike moments |
Automatic | visual | /IR Image Registration |
Automatically discovering local | visual | material attributes |
Automatically Discovering Novel | visual | Categories With Adaptive Prototype Learning |
Automating | visual | Inspection of Print Quality |
Automating | visual | Privacy Protection Using a Smart LED |
Autonomous Audio-Supported Learning of | visual | Classifiers for Traffic Monitoring |
Autonomous Navigation of Vehicles from a | visual | Memory Using a Generic Camera Model |
Autonomous robot exploration and cognitive map building in unknown environments using omnidirectional | visual | information only |
Autonomous Robot Navigation by Active | visual | Motion Analysis and Understanding |
Autonomous Vehicle Localization with Prior | visual | Point Cloud Map Constraints in GNSS-Challenged Environments |
Autonomous | visual | Control of a Mobile Robot |
Autonomous | visual | Events Detection and Classification without Explicit Object-Centred Segmentation and Tracking |
Autonomous | visual | Navigation and Laser-Based Moving Obstacle Avoidance |
Autonomous | visual | Navigation for Mobile Robots: A Systematic Literature Review |
AutoNovel: Automatically Discovering and Learning Novel | visual | Categories |
Autoregressive | visual | Tracking |
AutoTrack: Towards High-Performance | visual | Tracking for UAV With Automatic Spatio-Temporal Regularization |
AV-GAZE: A Study on the Effectiveness of Audio Guided | visual | Attention Estimation for Non-profilic Faces |
AVA: A large-scale database for aesthetic | visual | analysis |
AVA: A Video Dataset of Spatio-Temporally Localized Atomic | visual | Actions |
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio | visual | Event Localization |
AVFace: Towards Detailed Audio- | visual | 4D Face Reconstruction |
AVGZSLNet: Audio- | visual | Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings |
AVID: Adversarial | visual | Irregularity Detection |
Avoiding Robot Joint Limits and Kinematic Singularities in | visual | Servoing |
AVPL: Augmented | visual | perception learning for person Re-identification and beyond |
AVS: Scientific Research Community Audio- | visual | Systems |
AVT: Au-Assisted | visual | Transformer for Facial Expression Recognition |
AWEAR 2.0 system: Omni-directional audio- | visual | data acquisition and processing |
Backbone is All Your Need: A Simplified Architecture for | visual | Object Tracking |
Background and foreground modeling using nonparametric kernel density estimation for | visual | surveillance |
Background replacement using chromatic adaptation transform for | visual | communication |
Background Updating for | visual | Surveillance |
Backward Motion for Estimation Enhancement in Sparse | visual | Odometry |
bag of constrained informative deep | visual | words for image retrieval, A |
Bag of Contextual- | visual | Words for Road Scene Object Detection From Mobile Laser Scanning Data |
bag of relevant regions for | visual | place recognition in challenging environments, A |
Bag of spatio- | visual | words for context inference in scene classification |
Bag of Surrogate Parts Feature for | visual | Recognition |
Bag of | visual | words and fusion methods for action recognition: Comprehensive study and good practice |
Bag of | visual | Words Approach for Bleeding Detection in Wireless Capsule Endoscopy Images |
Bag of | visual | Words: A Soft Clustering Based Exposition |
Bag-of-Features Based Classification of Breast Parenchymal Tissue in the Mammogram via Jointly Selecting and Weighting | visual | Words |
Bag-of-Features Based Medical Image Retrieval via Multiple Assignment and | visual | Words Weighting |
Bag-of-features representations using spatial | visual | vocabularies for object classification |
Bag-of- | visual | -phrases and hierarchical deep models for traffic sign detection and recognition in mobile laser scanning data |
Bag-of- | visual | -Phrases via Local Contexts |
Bag-of- | visual | -Words Approach to Abnormal Image Detection in Wireless Capsule Endoscopy Videos |
Bag-of- | visual | -words models for adult image classification and filtering |
Bag-of-Words Against Nearest-Neighbor Search for | visual | Object Retrieval |
Bagging-based saliency distribution learning for | visual | saliency detection |
Balanced Contrastive Learning for Long-Tailed | visual | Recognition |
Balanced MSE for Imbalanced | visual | Regression |
Bandpass Channels, Zero-Crossings, and Early | visual | Information Processing |
Barlow constrained optimization for | visual | Question Answering |
Baseline Independent Binocular Vergence Control of 2 DOF Pan-Tilt Cameras using a | visual | cortical Model |
Basic | visual | Disciplines in Heritage Conservation: Outline of Selected Perspectives in Teaching And Learning |
BAUM-1: A Spontaneous Audio- | visual | Face Database of Affective and Mental States |
Bayesian Approach to Audio- | visual | Speaker Identification, A |
Bayesian approach to image-based | visual | hull reconstruction, A |
Bayesian Approach to Multimodal | visual | Dictionary Learning, A |
Bayesian Approach to | visual | Size Classification of Everyday Objects, A |
Bayesian Correlation Filter Learning With Gaussian Scale Mixture Model for | visual | Tracking |
Bayesian Denoising of | visual | Images in the Wavelet Domain |
Bayesian evaluation framework for subjectively annotated | visual | recognition tasks, A |
Bayesian feature evaluation for | visual | saliency estimation |
Bayesian Inference of | visual | Motion Boundaries |
Bayesian non-parametric viewpoint to | visual | tracking, A |
Bayesian Reference Model for | visual | Time-Sharing Behaviour in Manual and Automated Naturalistic Driving, A |
Bayesian Relational Memory for Semantic | visual | Navigation |
Bayesian Segmentation Framework for Textured | visual | Images, A |
Bayesian Surface Estimation from Multiple Cameras Using a Prior Based on the | visual | Hull and its Application to Image Based Rendering |
Bayesian | visual | Reranking |
Bayesian | visual | surveillance: A model for detecting and tracking a variable number of moving objects |
Bayesian | visual | Tracking with Existence Process |
BBN: Bilateral-Branch Network With Cumulative Learning for Long-Tailed | visual | Recognition |
Be Everywhere - Hear Everything (BEE): Audio Scene Reconstruction by Sparse Audio- | visual | Samples |
Beat Synchronous Dance Animation Based on | visual | Analysis of Human Motion and Audio Analysis of Music Tempo |
BEHAVE: Behavioral Analysis of | visual | Events for Assisted Living Scenarios |
Behavioral Analysis of Computational Models of | visual | Attention, A |
Behavioral | visual | Motion Analysis |
Being in Two Places at Once: Smooth | visual | Path Following on Globally Inconsistent Pose Graphs |
Benchmark for Automatic | visual | Classification of Clinical Skin Disease Images, A |
Benchmark Platform for Ultra-Fine-Grained | visual | Categorization Beyond Human Performance |
Benchmarking 6DOF Outdoor | visual | Localization in Changing Conditions |
Benchmarking Image Retrieval for | visual | Localization |
Benchmarking Omni-Vision Representation Through the Lens of | visual | Realms |
Benchmarking Out-of-Distribution Detection in | visual | Question Answering |
benchmarking tool for MAV | visual | pose estimation, A |
Benchmarking | visual | Localization for Autonomous Navigation |
Best lighting for | visual | appreciation of artistic paintings: Experiments with real paintings and real illumination |
Best practices for convolutional neural networks applied to | visual | document analysis |
Best Practices for Fine-Tuning | visual | Classifiers to New Domains |
Better Use of Human | visual | Model in Watermarking Based on Linear Prediction Synthesis Filter |
Between Post-Flaneur and Smartphone Zombie: Smartphone Users' Altering | visual | Attention and Walking Behavior in Public Space |
Beyond a Pre-Trained Object Detector: Cross-Modal Textual and | visual | Context for Image Captioning |
Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric | visual | Tasks |
Beyond Correlation Filters: Learning Continuous Convolution Operators for | visual | Tracking |
Beyond Covariance: SICE and Kernel Based | visual | Feature Representation |
Beyond Explicit Codebook Generation: | visual | Representation Using Implicitly Transferred Codebooks |
Beyond Fixation: Dynamic Window | visual | Transformer |
Beyond ICONDENSATION: AICONDENSATION and AFCONDENSATION for | visual | tracking with low-level and high-level cues |
Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global | visual | Representation for Semantic Retrieval |
Beyond Linear Perspective: A Cubist Manifesto for | visual | Science |
Beyond Literal | visual | Modeling: Understanding Image Metaphor Based on Literal-implied Concept Mapping |
Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning | visual | Classifiers |
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in | visual | Question Answering |
Beyond Self-Attention: External Attention Using Two Linear Layers for | visual | Tasks |
Beyond Standard Benchmarks: Parameterizing Performance Evaluation in | visual | Object Tracking |
Beyond tag relevance: Integrating | visual | attention model and multi-instance learning for tag saliency ranking |
Beyond the Euclidean distance: Creating effective | visual | codebooks using the Histogram Intersection Kernel |
Beyond Tracking: Selecting Memory and Refining Poses for Deep | visual | Odometry |
Beyond | visual | Retargeting: A Feature Retargeting Approach for Visual Recognition and Its Applications |
Beyond | visual | Retargeting: A Feature Retargeting Approach for Visual Recognition and Its Applications |
Beyond | visual | semantics: Exploring the role of scene text in image understanding |
Beyond | visual | word ambiguity: Weighted local feature encoding with governing region |
Beyond VQA: Generating Multi-word Answers and Rationales to | visual | Questions |
bi-directional | visual | stereo interface for accessing stereo matching results from a human brain, A |
Bi-modal First Impressions Recognition Using Temporally Ordered Deep Audio and Stochastic | visual | Features |
bi-subspace model for robust | visual | tracking, A |
Bias reduction for stereo based motion estimation with applications to large scale | visual | odometry |
biased selection strategy for information recycling in Boosting cascade | visual | -object detectors, A |
Big Transfer (BIT): General | visual | Representation Learning |
bigVAT: | visual | assessment of cluster tendency for large data sets |
Bijective Weighted Kernel with Connected Component Analysis for | visual | Object Search |
Bilateral Weighted Regression Ranking Model With Spatial-Temporal Correlation Filter for | visual | Tracking |
Bilaterally Slimmable Transformer for Elastic and Efficient | visual | Question Answering |
Bilayer representation for three dimensional | visual | communication |
Bilinear CNN Models for Fine-Grained | visual | Recognition |
Bilinear Convolutional Neural Networks for Fine-Grained | visual | Recognition |
Bilinear Optimized Product Quantization for Scalable | visual | Content Analysis |
Billion-Scale Pretraining with Vision Transformers for Multi-Task | visual | Representations |
BIM-Tracker: A model-based | visual | tracking approach for indoor localisation using a 3D building model |
Bimanual design of deformable objects thanks to the multi-tool | visual | metaphor |
Bimodal fusion in audio- | visual | speech recognition |
Bimodal fusion of low-level | visual | features and high-level semantic features for near-duplicate video clip detection |
Bimodal recognition of affective states with the features inspired from human | visual | and auditory perception system |
Binarized Mode Seeking for Scalable | visual | Pattern Discovery |
Binary cross coupled discriminant analysis for | visual | kinship verification |
Binocular Fusion Net: Deep Learning | visual | Comfort Assessment for Stereoscopic 3D |
Binocular Shading and | visual | Surface Reconstruction |
Binocular | visual | Environment Perception Technology for Unmanned Surface Vehicle |
Bio-inspired algorithm for online | visual | tracking |
Bio-inspired feature extraction and enhancement of targets moving against | visual | clutter during closed loop pursuit |
Bio-Inspired Representation Learning for | visual | Attention Prediction |
Bio-Inspired Robot with | visual | Perception of Affordances, A |
Bio-inspired | visual | attention process using spiking neural networks controlling a camera |
Bioinspired | visual | Motion Estimation |
Biological Inspired | visual | Landmark Recognition Architecture, A |
Biological modeling of human | visual | system for object recognition using GLoP filters and sparse coding on multi-manifolds |
Biological Shape and | visual | Science |
Biologically inspired approaches for | visual | information processing and analysis |
Biologically Inspired Model for | visual | Cognition Achieving Unsupervised Episodic and Semantic Feature Learning |
biologically inspired object-based | visual | attention model, A |
Biologically Inspired Online Learning of | visual | Autonomous Driving |
Biologically Inspired Saliency Map Model for Bottom-up | visual | Attention |
Biologically Inspired | visual | Model With Preliminary Cognition and Active Attention Adjustment |
Biologically Inspired | visual | Motion Detection in VLSI |
Biologically Motivated Local Contextual Modulation Improves Low-Level | visual | Feature Representations |
Biologically-Inspired Top-Down Learning Model Based on | visual | Attention, A |
Biometric surveillance using | visual | question answering |
Biometrics on | visual | preferences: A pump and distill regression approach |
Birdsnap: Large-Scale Fine-Grained | visual | Categorization of Birds |
BirdSoundsDenoising: Deep | visual | Audio Denoising for Bird Sounds |
Bittracker: A Bitmap Tracker for | visual | Tracking under Very General Conditions |
Bixplorer: | visual | Analytics with Biclusters |
Black-box Adversarial Attack against | visual | Interpreters for Deep Neural Networks |
BlackVIP: Black-Box | visual | Prompting for Robust Transfer Learning |
Bleeding Simulation With Improved | visual | Effects for Surgical Simulation Systems |
Blind Audio- | visual | Localization and Separation via Low-Rank and Sparsity |
Blind Image Quality Assessment by | visual | Neuron Matrix |
Blind Invisible Watermarking Technique in DT-CWT Domain Using | visual | Cryptography |
Blind optical aberration correction by exploring geometric and | visual | priors |
Blind Sharpness Prediction for Ultrahigh-Definition Video Based on Human | visual | Resolution |
Blind Stereoscopic Image Quality Evaluator With Segmented Stacked Autoencoders Considering the Whole | visual | Perception Route, A |
Blind tone mapped image quality assessment with image segmentation and | visual | perception |
Blind | visual | inference by composition |
Blind | visual | Motif Removal From a Single Image |
Blind Watermarking Scheme Based on | visual | Model for Copyright Security, A |
Block-based discrete wavelet transform-singular value decomposition image watermarking scheme using human | visual | system characteristics |
Block-based progressive | visual | cryptography scheme with uniform progressive recovery and consistent background |
block-based RDWT-SVD image watermarking method using human | visual | system characteristics, A |
Blur in Human Vision and Increased | visual | Realism in Virtual Environments |
Blurry Video Compression A Trade-off between | visual | Enhancement and Data Compression |
BMaE : Discriminative Density Propagation for | visual | Tracking |
BodyViz: | visual | Medical Solutions |
BoMuDANet: Unsupervised Adaptation for | visual | Scene Understanding in Unstructured Driving Environments |
Bongard-HOI: Benchmarking Few-Shot | visual | Reasoning for Human-Object Interactions |
Boosted Algorithms for | visual | Object Detection on Graphics Processing Units |
Boosted audio- | visual | HMM for speech reading |
Boosted Cross-Domain Dictionary Learning for | visual | Categorization |
Boosted Learning of | visual | Word Weighting Factors for Bag-of-Features Based Medical Image Retrieval |
Boosting and structure learning in dynamic Bayesian networks for audio- | visual | speaker detection |
Boosting bottom-up and top-down | visual | features for saliency estimation |
Boosting Few-Shot | visual | Learning With Self-Supervision |
Boosting Generic | visual | -Linguistic Representation With Dynamic Contexts |
Boosting Positive Segments for Weakly-Supervised Audio- | visual | Video Parsing |
Boosting-Based | visual | Tracking Using Structural Local Sparse Descriptors |
Bootstrapping Objectness from Videos by Relaxed Common Fate and | visual | Grouping |
Bootstrapping | visual | Categorization With Relevant Negatives |
Borrowing Knowledge From Pre-trained Language Model: A New Data-efficient | visual | Learning Paradigm |
Bottleneck Transformers for | visual | Recognition |
Bottom-Up and Top-Down Attention for Image Captioning and | visual | Question Answering |
bottom-up and top-down human | visual | attention approach for hyperspectral anomaly detection, A |
Bottom-up Approach for Learning | visual | Object Detection Models from Unreliable Sources, A |
Bottom-Up Saliency Detection Model Based on Human | visual | Sensitivity and Amplitude Spectrum |
Bottom-up spatiotemporal | visual | attention model for video analysis |
Bottom-Up | visual | Image-Processing Probed with Weighted Hermite-Polynomials |
Bottom-up/top-down coordination in a multiagent | visual | sensor network |
Boundaries of | visual | Motion |
Boundary Localisation Algorithm Consistent with Human | visual | Perception, A |
Bounding-box Channels for | visual | Relationship Detection |
BoVDW: Bag-of- | visual | -and-Depth-Words for gesture recognition |
Brain Dynamics During Arousal-Dependent Pleasant/Unpleasant | visual | Elicitation: An Electroencephalographic Study on the Circumplex Model of Affect |
Brand > Logo: | visual | Analysis of Fashion Brands |
brand new application of | visual | -audio fingerprints: Estimating the position of the pirate in a theater-A case study, A |
Breaking Shortcuts by Masking for Robust | visual | Reasoning |
Breast Cancer: Model Reconstruction and Image Registration From Segmented Deformed Image Using | visual | and Force Based Analysis |
Bridging the Gap Between Computational Photography and | visual | Recognition |
Bridging the Gap Between Computational Photography and | visual | Recognition |
Bridging the | visual | Gap: Wide-Range Image Blending |
Bridging the | visual | Semantic Gap in VLN via Semantically Richer Instructions |
Bringing Semantics into Focus Using | visual | Abstraction |
Broad Study on the Transferability of | visual | Representations with Contrastive Learning, A |
Broadcasting Convolutional Network for | visual | Relational Reasoning |
Broadcasting Oneself: | visual | Discovery of Vlogging Styles |
Browsing | visual | Sentiment Datasets Using Psycholinguistic Groundings |
Bubble Plume Target Detection Method of Multibeam Water Column Images Based on Bags of | visual | Word Features |
BubbLeNet: Foveated Imaging for | visual | Discovery |
Building a Classification Cascade for | visual | Identification from One Example |
Building an Effective | visual | Codebook: Is K-means Clustering Useful? |
Building Detection in Aerial Images Based on Watershed and | visual | Attention Feature Descriptors |
Building Extraction Using Orthophotos and Dense Point Cloud Derived from | visual | Band Aerial Imagery Based on Machine Learning and Segmentation |
Building Qualitative Event Models Automatically from | visual | Input |
Building Roadmaps of Local Minima of | visual | Models |
Building Roadmaps of Minima and Transitions in | visual | Models |
Building | visual | Maps by Combining Noisy Stereo Measurements |
Building | visual | Vocabulary for Image Indexation and Query Formulation |
Building, Registering, and Fusing Noisy | visual | Maps |
Bundle Adjustment for Monocular | visual | Odometry Based on Detected Traffic Sign Features |
C-Conditional Density Propagation for | visual | Tracking |
CA-PMG: Channel attention and progressive multi-granularity training network for fine-grained | visual | classification |
CAAN: Context-Aware attention network for | visual | question answering |
CAD Model | visual | Registration from Closed-Contour Neighborhood Descriptors |
CAGD-Based 3-D | visual | Recognition |
Calibration Algorithm for Multi-camera | visual | Surveillance Systems Based on Single-View Metrology, A |
Calibration and Validation of the Intel T265 for | visual | Localisation and Tracking Underwater |
Calibration from Statistical Properties of the | visual | World |
Calibration of a Mobile Robot with Application to | visual | Navigation |
Calibration of | visual | sensors and actuators in distributed computing platforms |
Calibration-Free | visual | Control Using Projective Invariance |
Calorie Counter: RGB-Depth | visual | Estimation of Energy Expenditure at Home |
CAM: A fine-grained vehicle model recognition method based on | visual | attention model |
Camera and | visual | veiling glare in HDR images |
Camera cooperation for achieving | visual | attention |
Camera motion and | visual | information fusion for 3D target tracking |
Camera placement using particle swarm optimization in | visual | surveillance applications |
Camera Scheduling and Energy Allocation for Lifetime Maximization in User-Centric | visual | Sensor Networks |
Camera selection in | visual | sensor networks |
Camera Sensor Model for | visual | SLAM |
Camera-Projector System for Robot Positioning by | visual | Servoing, A |
Camera-to-Camera Geometry Estimation Requiring no Overlap in their | visual | Fields |
CamShift guided particle filter for | visual | tracking |
Can audio- | visual | integration strengthen robustness under multimodal attacks? |
Can image quality features predict | visual | change blindness? |
Can Saliency Map Models Predict Human Egocentric | visual | Attention? |
Can the Early Human | visual | System Compete with Deep Neural Networks? |
Can | visual | fixation patterns improve image fidelity assessment? |
Can | visual | Recognition Benefit from Auxiliary Information in Training? |
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep | visual | Speech Recognition |
Can You Trust Your Pose? Confidence Estimation in | visual | Localization |
Canonical Correlation Analysis based motion model for probabilistic | visual | tracking, A |
Canonical Image Selection by | visual | Context Learning |
Cantata: | visual | Programming Environment for the Khoros System |
Capturing Relevant Context for | visual | Tracking |
Capturing | visual | Experiences |
car detection system based on hierarchical | visual | features, A |
Carryover effects of calibration to | visual | and proprioceptive information on near field distance judgments in 3D user interaction |
Carved | visual | Hulls for Image-Based Modeling |
Cascade Category-Aware | visual | Search |
Cascaded Generative and Discriminative Learning for | visual | Tracking |
cascaded long short-term memory (LSTM) driven generic | visual | question answering (VQA), A |
CASP-Net: Rethinking Video Saliency Prediction from an Audio- | visual | Consistency Perceptual Perspective |
CAT: Re-Conv Attention in Transformer for | visual | Question Answering |
Category attention transfer for efficient fine-grained | visual | categorization |
Category Attentional Search for Fast Object Detection by Mimicking Human | visual | Perception |
Category Contrast for Unsupervised Domain Adaptation in | visual | Tasks |
Category-specific incremental | visual | codebook training for scene categorization |
CATNet: Cross-modal fusion for audio- | visual | speech recognition |
Causal Attention for Unbiased | visual | Recognition |
Causal Transportability for | visual | Recognition |
CDTB: A Color and Depth | visual | Object Tracking Dataset and Benchmark |
CeDAR: A real-world vision system: Mechanism, control and | visual | processing |
Ceiling-View Semi-Direct Monocular | visual | Odometry with Planar Constraint |
CENTRIST: A | visual | Descriptor for Scene Categorization |
Century of Portraits: A | visual | Historical Record of American High School Yearbooks, A |
ChaboNet: Design of a deep CNN for prediction of | visual | saliency in natural video |
Chaining Convolution and Correlation in Practice: A Case Study in | visual | Tracking |
ChaLearn Joint Contest on Multimedia Challenges Beyond | visual | Analysis: An overview |
Challenging Issues in | visual | Information Understanding Researches |
Change Detection and Land Use: Land Cover Database Updating Using Image Segmentation, GIS Analysis and | visual | Interpretation |
ChangeNet: A Deep Learning Architecture for | visual | Change Detection |
Changes in Surface Convexity and Topology Caused by Distortions of Stereoscopic | visual | Space |
Channel Graph Regularized Correlation Filters for | visual | Object Tracking |
Channel Pruning for | visual | Tracking |
Channel-Wise Bit Allocation for Deep | visual | Feature Quantization |
Chaotic particle filter for | visual | object tracking |
Character Behavior Planning and | visual | Simulation in Virtual 3D Space |
Character Detection in Animated Movies Using Multi-Style Adaptation and | visual | Attention |
Character-Oriented Video Summarization With | visual | and Textual Cues |
Characterization of Human | visual | Sensitivity for Video Imaging Applications |
Characterization of SURF and BRISK Interest Point Distribution for Distributed Feature Extraction in | visual | Sensor Networks |
Characterization of | visual | Appearance Applied to Image Retrieval, A |
Characterization of | visual | Object Representations in Rat Primary Visual Cortex |
Characterization of | visual | Object Representations in Rat Primary Visual Cortex |
Characterizing everyday activities from | visual | lifelogs based on enhancing concept representation |
Characterizing Three-Dimensional Surface Structure from | visual | Images |
Characterizing Tourism Destination Image Using Photos' | visual | Content |
cheat preventing method with efficient pixel expansion for Naor-Shamir's | visual | cryptography, A |
Cheat-Prevention | visual | Secret Sharing Scheme with Minimum Pixel Expansion, A |
Cheating Immune Block-Based Progressive | visual | Cryptography |
Cheating in (halftone-secret) | visual | cryptography: Analysis of blind authentication schemes |
Cheating Prevention in | visual | Cryptography |
cheating prevention scheme for binary | visual | cryptography with homogeneous secret images, A |
Cherry-Picking Gradients: Learning Low-Rank Embeddings of | visual | Data via Differentiable Cross-Approximation |
Chi-Squared-Transformed Subspace of LBP Histogram for | visual | Recognition, A |
Chinese Image Caption Generation via | visual | Attention and Topic Modeling |
Chinese traditional | visual | Cultural Symbols recognition based on SPM muti-feature extraction |
Choosing Basic-Level Concept Names Using | visual | and Language Context |
Chro-Ring: A time-oriented | visual | approach to represent writer's history |
Chromatic | visual | evoked potential responses in preschool children |
Chromatic | visual | evoked potentials in young patients with demyelinating disease |
Circular Reranking for | visual | Search |
Circular-Structured Representation for | visual | Emotion Distribution Learning, A |
CiteTracker: Correlating Image and Text for | visual | Tracking |
CitySensing: Fusing City Data for | visual | Storytelling |
Class Confusability Reduction in Audio- | visual | Speech Recognition Using Random Forests |
Class knowledge overlay to | visual | feature learning for zero-shot image classification |
Class Probability-based | visual | and Contextual Feature Integration for Image Parsing |
Class Representative | visual | Words for Category-Level Object Recognition |
Class-Difficulty Based Methods for Long-Tailed | visual | Recognition |
Class-Incremental Grouping Network for Continual Audio- | visual | Learning |
Class-Specific Reconstruction Transfer Learning for | visual | Recognition Across Domains |
Classification Based on SPACT and | visual | Saliency |
Classification of dual- and single polarized SAR images by incorporating | visual | features |
Classification of Rail Welding Defects Based on the Bag of | visual | Words Approach |
Classification-Specific Parts for Improving Fine-Grained | visual | Categorization |
Classifier Belief Optimization for | visual | Categorization |
Classifying Ambiguities in a | visual | Spatial Language |
Classifying and Detecting Group Behaviour from | visual | Surveillance Data |
Classifying Objects from | visual | Information |
CLEVR-Ref+: Diagnosing | visual | Reasoning With Referring Expressions |
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary | visual | Reasoning |
Clickable real world information retrieval application based on geo- | visual | clustering |
Climate Effects on Vertical Forest Phenology of Fagus sylvatica L., Sensed by Sentinel-2, Time Lapse Camera, and | visual | Ground Observations |
Clinical Valid Pain Database with Biomarker and | visual | Information for Pain Level Analysis |
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for | visual | Grounding |
CLIPath: Fine-tune CLIP with | visual | Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts |
Clipboard: A | visual | Search and Browsing Engine for Tablet and PC |
CLIPTrans: Transferring | visual | Knowledge with Pre-trained Models for Multimodal Machine Translation |
Closed-Form Solution of | visual | -Inertial Structure from Motion |
Closed-Loop | visual | Grasping and Manipulation |
Clothes image caption generation with attribute detection and | visual | attention model |
Clothing retrieval with | visual | attention model |
ClothPose: A Real-world Benchmark for | visual | Analysis of Garment Pose via An Indirect Recording Solution |
Cloud Resource Optimization for Processing Multiple Streams of | visual | Data |
Cloud-Based | visual | SLAM Framework for Low-Cost Agents, A |
Clusformer: A Transformer based Clustering Approach to Unsupervised Large-scale Face and | visual | Landmark Recognition |
Clustered Blockwise PCA for Representing | visual | Data |
Clustered Exemplar-SVM: Discovering sub-categories for | visual | recognition |
ClusterFit: Improving Generalization of | visual | Representations |
Clustering and | visual | izing Audio-Visual Dataset on Mobile Devices in a Topic-Oriented Manner |
Clustering in image space for place recognition and | visual | annotations for human-robot interaction |
Clustering of hierarchical image database to reduce inter-and intra-semantic gaps in | visual | space for finding specific image semantics |
ClusterVO: Clustering Moving Instances and Estimating | visual | Odometry for Self and Surroundings |
CM-BOF: | visual | similarity-based 3D shape retrieval using Clock Matching and Bag-of-Features |
CMAT: Integrating Convolution Mixer and Self-Attention for | visual | Tracking |
CMDM-VAC: Improving A Perceptual Quality Metric for 3D Graphics by Integrating a | visual | Attention Complexity Measure |
CMLocate: A cross-modal automatic | visual | geo-localization framework for a natural environment without GNSS information |
CNN-RNN Framework for Image Annotation from | visual | Cues and Social Network Metadata, A |
CNN-Transformer for | visual | -tactile fusion applied in road recognition of autonomous vehicles |
Co-inference Approach to Robust | visual | Tracking, A |
Co-Learning Meets Stitch-Up for Noisy Multi-Label | visual | Recognition |
Co-occurrence matching of local binary patterns for improving | visual | adaption and its application to smoke recognition |
Co-Separating Sounds of | visual | Objects |
Co-training 2L Submodels for | visual | Recognition |
co-training framework for | visual | tracking with multiple instance learning, A |
Co.Vi.Wo.: Color | visual | Words Based on Non-Predefined Size Codebooks |
Coarse adaptive color image segmentation for | visual | object classification |
Coarse Iris Classification by Learned | visual | Dictionary |
Coarse representation of | visual | object's shape for search/query/filtering applications |
Coarse to Fine Two-Stage Approach to Robust Tensor Completion of | visual | Data |
Coarse | visual | Registration from Closed-Contour Neighborhood Descriptor |
Coarse-to-Fine Description for Fine-Grained | visual | Categorization |
Coarse-to-fine Estimation of | visual | Motion |
Coarse-to-Fine Q-attention: Efficient Learning for | visual | Robotic Manipulation via Discretisation |
Coarse-to-Fine Reasoning for | visual | Question Answering |
Coarse-to-Fine | visual | Question Answering by Iterative, Conditional Refinement |
CoCoLoT: Combining Complementary Trackers in Long-Term | visual | Tracking |
Codebook-Free Compact Descriptor for Scalable | visual | Search |
CodeSLAM: Learning a Compact, Optimisable Representation for Dense | visual | SLAM |
Coding and encryption of | visual | objects for privacy protected surveillance |
Coding Gain and Tuning for Parametrized | visual | Quality Metrics |
Coding Images in the Frequency Domain: Filter Design and Energy Processing Characteristics of the Human | visual | System |
Coding Local and Global Binary | visual | Features Extracted From Video Sequences |
Coding of Image Feature Descriptors for Distributed Rate-efficient | visual | Correspondences |
Coding video sequences of | visual | features |
Coding | visual | Features Extracted From Video Sequences |
Cogni-Net: Cognitive Feature Learning Through Deep | visual | Perception |
Cognitive Approach to | visual | Data Interpretation in Medical Information and Recognition Systems |
Cognitive Mapping and Planning for | visual | Navigation |
Cognitive Techniques in | visual | Data Interpretation |
Cognitive | visual | tracking and camera control |
Coherent Computational Approach to Model Bottom-Up | visual | Attention, A |
Coherent Semantic- | visual | Indexing for Large-Scale Image Retrieval in the Cloud |
Coherent | visual | Storytelling via Parallel Top-Down Visual and Topic Attention |
Coherent | visual | Storytelling via Parallel Top-Down Visual and Topic Attention |
Collaborating frames: Temporally weighted sparse representation for | visual | tracking |
Collaborative Active | visual | Recognition from Crowds: A Distributed Ensemble Approach |
Collaborative Image Relevance Learning for | visual | Re-Ranking |
Collaborative Sampling Using Heterogeneous Marine Robots Driven by | visual | Cues |
Collaborative Sparse Representation in Dissimilarity Space for Classification of | visual | Information |
Collaborative Unsupervised | visual | Representation Learning from Decentralized Data |
Collaborative Video Search Combining Video Retrieval with Human-Based | visual | Inspection |
Collaborative | visual | Cryptography Schemes |
Collaborative | visual | Tracking Architecture for Correlation Filter and Convolutional Neural Network Learning, A |
Collaterally Cued Labelling Framework Underpinning Semantic-Level | visual | Content Descriptor |
Collect Earth: Land Use and Land Cover Assessment through Augmented | visual | Interpretation |
Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio- | visual | Event Perception |
Collection of | visual | Data in Climbing Experiments for Addressing the Role of Multi-modal Exploration in Motor Learning Efficiency |
Collective | visual | Representation of Rainfall-Runoff Difference Model, The |
Collineation estimation from two unmatched views of an unknown planar contour for | visual | servoing |
Collision Anticipation via Deep Reinforcement Learning for | visual | Navigation |
Collusive Attacks to Partition Authentication | visual | Cryptography Scheme |
Color Appearance in the Entire | visual | Field: Color Zone Map Based on the Unique Hue Component |
Color Blindness and a Color Human | visual | System Model |
Color coding in the primate | visual | pathway: a historical view |
Color decorrelation helps | visual | saliency detection |
Color Extended | visual | Cryptography Using Error Diffusion |
Color image denoising with wavelet thresholding based on human | visual | system model |
Color image enhancement with a human | visual | system based adaptive filter |
Color Texture Based | visual | Monitoring System For Automated Surveillance |
Color texture signatures for art-paintings vs. scene-photographs based on human | visual | system |
Color to Gray: | visual | Cue Preservation |
Color uniformity evaluation of electronic displays based on | visual | sensitivity |
Color Vision Cells Found in | visual | Cortex |
Color | visual | Cryptography with Completely Randomly Coded Colors |
Color-Based | visual | Object Tracking with Prediction and Error Judgment |
Color-based | visual | servoing under varying illumination conditions |
Color-Boosted Saliency-Guided Rotation Invariant Bag of | visual | Words Representation with Parameter Transfer for Cross-Domain Scene-Level Classification |
Color-Perception of Aperture Colors Using a Computational Model of the Human | visual | -System |
Colored | visual | cryptography scheme based on additive color mixing |
Colored | visual | tags: a robust approach for augmented reality |
Coloring Channel Representations for | visual | Tracking |
Colorization as a Proxy Task for | visual | Understanding |
Columnar Architecture and Computational Anatomy in Primate | visual | Cortex: Segmentation and Feature Extraction via Spatial Frequence Coded Difference Mapping |
combination of color-black-and-white | visual | cryptography and polynomial based secret image sharing, A |
Combined Audio | visual | Recognition and Analysis |
Combined Audio | visual | Speaker Tracking |
Combined EM and | visual | Tracking Probabilistic Model for Robust Mosaicking: Application to Fetoscopy, A |
Combined feature evaluation for adaptive | visual | object tracking |
Combined Hapto- | visual | and Auditory Rendering of Cultural Heritage Objects |
Combined Rule-Based Machine Learning Audio- | visual | Emotion Recognition Approach, A |
Combined sensor device for measuring both rain-covered area on and | visual | range through a windshield of a motor vehicle |
Combined | visual | attention model for video sequences |
Combined | visual | Exploration of 2d Ground Radar and 3D Point Cloud Data For Road Environments |
Combining Acoustic and | visual | Classifiers for the Recognition of Spoken Sentences |
Combining apparent motion and perspective as | visual | cues for content-based camera motion indexing |
Combining Automated and Interactive | visual | Analysis of Biomechanical Motion Data |
Combining Bottom-Up and Top-Down | visual | Mechanisms for Color Constancy Under Varying Illumination |
Combining Color and Geometry for the Active, | visual | Recognition of Shadows |
Combining complementary trackers for enhanced long-term | visual | object tracking |
Combining Computer Graphics and Computer Vision for Probabilistic | visual | Robot Navigation |
Combining Foreground / Background Feature Points and Anisotropic Mean Shift For Enhanced | visual | Object Tracking |
Combining Image Invariant Features and Clustering Techniques for | visual | Place Classification |
Combining inertial and | visual | sensing for human action recognition in tennis |
Combining knowledge with data for efficient and generalizable | visual | learning |
Combining local and global: Rich and robust feature pooling for | visual | recognition |
Combining Monocular and Stereo Cues for Mobile Robot Localization Using | visual | Words |
Combining MPEG-7 Based | visual | Experts for Reaching Semantics |
Combining Multiple Cues for | visual | Madlibs Question Answering |
Combining multiple | visual | processing streams for locating and classifying objects in video |
Combining Particle Filter and Population-based Metaheuristics for | visual | Articulated Motion Tracking |
Combining passive | visual | cameras and active IMU sensors for persistent pedestrian tracking |
Combining Siamese Network and Regression Network for | visual | Tracking |
Combining Similarity and Adversarial Learning to Generate | visual | Explanation: Application to Medical Image Classification |
Combining textual and | visual | cues for content-based image retrieval on the World Wide Web |
Combining | visual | and acoustic features for audio classification tasks |
Combining | visual | and Detection Models in Spread-spectrum Watermarking |
Combining | visual | and textual features for filtering spam emails |
Combining | visual | Dictionary, Kernel-Based Similarity and Learning Strategy for Image Category Retrieval |
Combining | visual | features with semantics for a more effective image retrieval |
Combining | visual | MPEG tools in the context of video adaptability |
Combining | visual | Tracking and Person Detection for Long Term Tracking on a UAV |
Combining words and object-based | visual | features in image retrieval |
Comment on Cheating Prevention in | visual | Cryptography |
Commentary Paper 1 on | visual | Players Detection and Tracking in Soccer Matches |
Commentary Paper 2 on | visual | Players Detection and Tracking in Soccer Matches |
Commentary Paper 3 on | visual | Players Detection and Tracking in Soccer Matches |
Commentary Paper on Person Tracking With Audio- | visual | Cues Using the Iterative Decoding Framework |
Common and Innovative | visual | s: A Sparsity Modeling Framework for Video |
Common Crucial Feature for Crowdsourcing Based Mobile | visual | Location Recognition |
Common | visual | Pattern Discovery via Directed Graph |
Common | visual | pattern discovery via directed graph model |
Common | visual | Pattern Discovery via Nonlinear Mean Shift Clustering |
Common | visual | pattern discovery via spatially coherent correspondences |
Commonsense | visual | sensemaking for autonomous driving: On generalised neurosymbolic online abduction integrating vision and semantics |
Community Streaming With Interactive | visual | Overlays: System and Optimization |
Compact associative representation of | visual | information |
Compact correlation coding for | visual | object categorization |
Compact Descriptors for | visual | Search |
Compact discriminative object representation via weakly supervised learning for real-time | visual | tracking |
Compact Environment-Invariant Codes for Robust | visual | Place Recognition |
Compact Hash Codes for Efficient | visual | Descriptors Retrieval in Large Scale Databases |
compact integrated | visual | motion sensor for ITS applications, A |
Compact Polarimetric SAR Ship Detection with m-d Decomposition Using | visual | Attention Model |
Compact Representation of | visual | Speech Data Using Latent Variables, A |
Compact Sensor for | visual | Motion Detection, A |
Compact Trilinear Interaction for | visual | Question Answering |
Compact | visual | codebook for action recognition |
Compact VLSI System for Bio-Inspired | visual | Motion Estimation, A |
CompactNets: Compact Hierarchical Compositional Networks for | visual | Recognition |
Comparative Analysis of the Evolution of the IBM Watson's | visual | Recognition API on Android, A |
Comparative Analysis of Thermal and | visual | Modalities for Automated Facial Expression Recognition, A |
Comparative Analysis of | visual | -Inertial SLAM for Assisted Wayfinding of the Visually Impaired, A |
Comparative Error Analysis of Audio- | visual | Source Localization, A |
comparative evaluation of interest point detectors and local descriptors for | visual | SLAM, A |
Comparative Perceptual Assessment of | visual | Signals Using Free Energy Features |
Comparative Research of | visual | Interpretation of Aerial Images and Topographic Maps for Unskilled Users: Searching for Objects Important for Decision-Making in Crisis Situations |
Comparative Study for Known Item | visual | Search Using Position Color Feature Signatures, A |
comparative study of data fusion for RGB-D based | visual | recognition, A |
Comparative Study of | visual | Tracking Method: A Probabilistic Approach for Pose Estimation Using Lines |
comparative study on automatic audio- | visual | fusion for aggression detection using meta-information, A |
Compare and Contrast: Learning Prominent | visual | Differences |
Comparing compact codebooks for | visual | categorization |
Comparing Small | visual | Differences between Conforming Meshes |
Comparing state-of-the-art | visual | features on invariant object recognition tasks |
Comparing the Effects of | visual | Distraction in a High-Fidelity Driving Simulator and on a Real Highway |
Comparing Threshold-Selection Methods for Image Segmentation: Application to Defect Detection in Automated | visual | Inspection Systems |
Comparing | visual | Dapattta Fusion Techniques Using FIR and Visible Light Sensors to Improve Pedestrian Detection |
Comparing | visual | descriptors and automatic rating strategies for video aesthetics prediction |
Comparing | visual | Feature Coding for Learning Disjoint Camera Dependencies |
Comparing | visual | Features for Morphing Based Recognition |
Comparison of Active Shape Model and Scale Decomposition Based Features for | visual | Speech Recognition, A |
comparison of color features for | visual | concept classification, A |
Comparison of different approaches to | visual | terrain classification for outdoor mobile robots |
Comparison of Feature Detectors with Passive and Task-Based | visual | Saliency, A |
comparison of image quality models and metrics based on human | visual | sensitivity, A |
Comparison of Image Transform-Based Features for | visual | Speech Recognition in Clean and Corrupted Videos |
Comparison of local feature descriptors for mobile | visual | search |
comparison of local feature detectors and descriptors for | visual | object categorization by intra-class repeatability and matching, A |
Comparison of mid-level feature coding approaches and pooling strategies in | visual | concept detection |
Comparison of MPEG-4 Facial Animation Parameter Groups with Respect to Audio- | visual | Speech Recognition Performance |
Comparison of Multiclass SVM Decomposition Schemes for | visual | Object Recognition |
Comparison of RGB and HSV Colour Spaces for | visual | Attention Models, A |
Comparison of the Efficiency of Deterministic and Stochastic Algorithms for | visual | Reconstruction |
Comparison of | visual | Registration Approaches of 3D Models for Orthodontics |
Comparison of | visual | saliency models for compressed video |
Comparisons of | visual | Activity Primitives for Voice Activity Detection |
Compass aided | visual | -inertial odometry |
Compatibility-aware Heterogeneous | visual | Search |
Competence-aware Curriculum for | visual | Concepts Learning via Question Answering, A |
Compilation and Sufficient Representation of Object Models for | visual | Representation |
Complementary computing for | visual | tasks: Meshing computer vision with human visual processing |
Complementary computing for | visual | tasks: Meshing computer vision with human visual processing |
Complementary Discriminative Correlation Filters Based on Collaborative Representation for | visual | Object Tracking |
Complementary | visual | Tracking |
Complete and Extendable Approach to | visual | Recognition, A |
Complete | visual | metrology using relative affine structure |
Complex Object Tracking by | visual | Servoing Based on 2D Image Motion |
Complex Terrain Mapping with Multi-camera | visual | Odometry and Realtime Drift Correction |
Complex Volume and Pose Tracking with Probabilistic Dynamical Models and | visual | Hull Constraints |
Complex-Object | visual | Inspection: Empirical Studies on A Multiple Lighting Solution |
Compositional Feature Embedding and Similarity Metric for Ultra-Fine-Grained | visual | Categorization, A |
Compositional models and Structured learning for | visual | recognition |
Compositional | visual | Generation with Composable Diffusion Models |
Comprehensive Data Set for Automatic Single Camera | visual | Speed Measurement |
Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and | visual | Attributes, A |
Comprehensive Survey on Video Saliency Detection With Auditory Information: The Audio- | visual | Consistency Perceptual is the Key!, A |
Comprehensive | visual | Features and Pseudo Labeling for Robust Natural Language-based Vehicle Retrieval |
Comprehensive-perception dynamic reasoning for | visual | question answering |
Compressing | visual | Descriptors of Image Sequences |
Compressing | visual | -linguistic Model via Knowledge Distillation |
Compression of 3-D Point | visual | Data Using Vector Quantization and Rate-Distortion Optimization |
Compressive Sampling-Based Image Coding for Resource-Deficient | visual | Communication |
Computation in the higher | visual | cortices: map-seeking circuit theory and application to machine vision |
Computation of Ego-Motion and Structure from | visual | an Inertial Sensor Using the Vertical Cue |
Computational Analysis of | visual | Motion |
Computational Approach to the Emulation of | visual | Neural Architectures, A |
Computational Approach to | visual | Word Recognition: Hypothesis Generation and Testing, A |
Computational Framework and an Algorithm for the Measurement of | visual | Motion, A |
Computational Framework for the | visual | Correspondence Problem, A |
Computational Model for Object-Based | visual | Saliency: Spreading Attention Along Gestalt Cues, A |
Computational Model for Stereoscopic | visual | Saliency Prediction, A |
computational model for | visual | selection, A |
Computational Model of Early Auditory- | visual | Integration, A |
Computational Model of Stereoscopic 3D | visual | Saliency |
Computational Model of the Human | visual | System for Color Coding: Results with Adaptation and Colored Surrounds, A |
Computational Modeling of Top-down | visual | Attention in Interactive Environments |
Computational Models for Integrating Linguistic and | visual | Information: A Survey |
Computational Models of Human | visual | Attention and Their Implementations: A Survey |
Computational models of | visual | neurons specialised in the detection of periodic and aperiodic oriented visual stimuli: bar and grating cells |
Computational models of | visual | neurons specialised in the detection of periodic and aperiodic oriented visual stimuli: bar and grating cells |
Computational Models of | visual | Processing |
Computational primitives of | visual | perception |
Computational Techniques in the | visual | Segmentation of Static Scenes |
Computational Theory of | visual | Surface Interpolation, A |
Computational UAV Cinematography for Intelligent Shooting Based on Semantic | visual | Analysis |
Computational Understanding of | visual | Interestingness Beyond Semantics: Literature Survey and Analysis of Covariates |
Computational Vision: Information Processing in Perception and | visual | Behavior |
Computational | visual | Distinctness Metric |
Computationally efficient, real-time motion recognition based on bio-inspired | visual | and cognitive processing |
Computations Underlying the Measurement of | visual | Motion |
Computer Analysis of | visual | Properties of Curved Objects |
Computer Analysis of | visual | Textures |
Computer Identification of Textured | visual | Scene |
Computer Identification of | visual | Surfaces |
Computer Platform for Transformation of | visual | Information into Sound Sensations for Vision Impaired Persons |
Computer Recognition of Three-Dimensional Objects in a | visual | Scene |
Computer System for | visual | Recognition Using Active Knowledge, A |
Computer Vision for Audio- | visual | Media |
Computer Vision for General-Purpose | visual | Inspection: A Fuzzy-Logic Approach |
Computer Vision for | visual | Effects |
Computer Vision In | visual | Effects |
Computer vision methods for | visual | MIMO optical system |
computer vision model for | visual | -object-based attention and eye movements, A |
Computer Vision System for | visual | Grape Grading in Wine Cellars, A |
Computer | visual | System Analyzing the Influence of Stimulants on Human Motion |
Computer-Aided Design of Clustered-Dot Color Screens Based on a Human | visual | System Model |
Computer-Generated Holograms and 3-D | visual | Communication |
Computing iconic summaries of general | visual | concepts |
Computing Multi-Colored Polygonal Masks in Pipeline Architectures and Its Application to Automated | visual | Inspection |
Computing the Probability of Target Detection in Dynamic | visual | Scenes Containing Clutter Using Fuzzy Logic Approach |
Computing the | visual | hull of solids of revolution |
Computing the | visual | Potential of an Articulated Assembly of Parts |
Computing | visual | Attention from Scene Depth |
Computing | visual | Correspondence |
Computing | visual | Correspondence with Occlusions via Graph Cuts |
Computing | visual | Correspondence: Incorporating the Probability of a False Match |
Concatenated Frame Image Based CNN for | visual | Speech Recognition |
Concealment of | visual | effects of image transmission errors by a sketch-based recovery approach |
Concept Generalization in | visual | Representation Learning |
Concept of | visual | Classes for Object Classification, The |
Concept-Enhanced Relation Network for Video | visual | Relation Inference |
ConceptLearner: Discovering | visual | concepts from weakly labeled image collections |
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail | visual | Concepts |
Conceptual and | visual | Focusing in the Recognition Process as Induced by Queries |
Conceptual description of | visual | scenes from linguistic models |
Conceptualization and Modeling of | visual | Patterns |
Concurrent CT Reconstruction and | visual | Analysis Using Hybrid Multi-resolution Raycasting in a Cluster Environment |
Condition Monitoring for Image-Based | visual | Servoing Using Kalman Filter |
Conditional Feature Embedding by | visual | Clue Correspondence Graph for Person Re-Identification |
Conditions of Similarity between Hermite and Gabor Filters as Models of the Human | visual | System |
Confidence analysis of feature points for | visual | -inertial odometry of urban vehicles |
Confidence-aware Pseudo-label Learning for Weakly Supervised | visual | Grounding |
confidence-based late fusion framework for audio- | visual | biometric identification, A |
Confidence-based | visual | Dispersal for Few-shot Unsupervised Domain Adaptation |
Conformer: Local Features Coupling Global Representations for | visual | Recognition |
Confounds in the Data: Comments on Decoding Brain Representations by Multimodal Learning of Neural Activity and | visual | Features |
Conic-based algorithm for | visual | line estimation from one image |
Connecting Look and Feel: Associating the | visual | and Tactile Properties of Physical Materials |
Connecting the dots without clues: Unsupervised domain adaptation for cross-domain | visual | classification |
Connecting the dots: Embodied | visual | perception from first-person cameras |
Connection Between Image Processing and Artificial Neural Networks Layers Through a Geometric Model of | visual | Perception, A |
Consensus Analysis and Modeling of | visual | Aesthetic Perception |
Consensus-aware | visual | -semantic Embedding for Image-Text Matching |
Conservative | visual | Learning for Object Detection with Minimal Hand Labeling Effort |
Considerations for a touchscreen | visual | lifelog |
Considering Spherical Refraction in | visual | Ocean Gas Release Quantification |
Consistency of robust estimators in multi-structural | visual | data segmentation |
Consistent | visual | Information Processing Applied to Object Recognition, Landmark Definition, and Real-Time Tracking |
Consistent | visual | Quality Control in Video Coding |
Consistent | visual | words mining with adaptive sampling |
Constrained Utility Maximizations for Generating | visual | Skims |
Constraints on the | visual | Interpretation of Surface Contours |
Constructing Adaptive Complex Cells for Robust | visual | Tracking |
Constructing Category Hierarchies for | visual | Recognition |
Constructing | visual | Taxonomies by Shape |
Construction for | visual | C1 Continuity of Polynomial Surface Patches, A |
Constructions and Properties of General (k, n) Block-Based Progressive | visual | Cryptography |
Constructions of general reversible AMBTC-based | visual | cryptography with two decryption options |
consumer video search system by audio- | visual | concept classification, A |
Contact Geometry and | visual | Factors for Vibrotactile-Grid Location Cues |
Contemplating | visual | Emotions: Understanding and Overcoming Dataset Bias |
Content adaptive video denoising based on human | visual | perception |
Content aware quantization: Requantization of high dynamic range baseband signals based on | visual | masking by noise and texture |
Content Based Access to Video Objects: Temporal Segmentation, | visual | Summary, and Feature Extraction |
Content Based Image Retrieval Based on Modelling Human | visual | Attention |
Content Based Image Retrieval Using | visual | -Words Distribution Entropy |
Content-adaptive encoder optimization of the H.264/AVC deblocking filter for | visual | quality improvement |
Content-aware Ranking for | visual | search |
Content-aware video resizing based on salient | visual | cubes |
Content-Based Attention Ranking Using | visual | and Contextual Attention Model for Baseball Videos |
Content-based image retrieval using computational | visual | attention model |
Content-based image retrieval using local | visual | attention feature |
Content-Based Indexing and Retrieval of | visual | Information |
Content-Based Movie Analysis and Indexing Based on Audio- | visual | Cues |
Content-Based Representation and Retrieval of | visual | Media: A State-of-the-Art Review |
Content-based video parsing and indexing based on audio- | visual | interaction |
Content-Based | visual | Landmark Search via Multimodal Hypergraph Learning |
Content-centric computing in | visual | systems |
Context aware privacy in | visual | surveillance |
Context Disentangling and Prototype Inheriting for Robust | visual | Grounding |
Context multi-task | visual | object tracking via guided filter |
Context Relation Fusion Model for | visual | Question Answering |
Context | visual | Information-based Deliberation Network for Video Captioning |
Context-Aware Correlation Filter Learning Toward Peak Strength for | visual | Tracking |
Context-Aware Deep Feature Compression for High-Speed | visual | Tracking |
Context-Aware Discovery of | visual | Co-Occurrence Patterns |
Context-Aware Graph Inference With Knowledge Distillation for | visual | Dialog |
Context-Aware | visual | Compatibility Prediction |
Context-Aware | visual | Policy Network for Fine-Grained Image Captioning |
Context-Aware | visual | Tracking |
Context-based occlusion detection for robust | visual | tracking |
Context-Based Reasoning Using Ontologies to Adapt | visual | Tracking in Surveillance |
Context-based | visual | Feedback Recognition |
Context-Based | visual | Hand Gesture Recognition |
Context-empowered | visual | Attention Prediction in Pedestrian Scenarios |
Context-VQA: Towards Context-Aware and Purposeful | visual | Question Answering |
Contextual and | visual | modeling for detection of mild traumatic brain injury in MRI |
Contextual Bag-of-Words for Robust | visual | Tracking |
Contextual Bag-of-Words for | visual | Categorization |
Contextual Debiasing for | visual | Recognition with Causal Mechanisms |
Contextual information based | visual | saliency model |
Contextual Learning in the Selective Attention for Identification model (CL-SAIM): Modeling contextual cueing in | visual | search tasks |
Contextual Transformer Networks for | visual | Recognition |
Contextual Translation Embedding for | visual | Relationship Detection and Scene Graph Generation |
Contextual-Guided Bag-of- | visual | -Words Model for Multi-class Object Categorization |
Continual Adaptation of | visual | Representations via Domain Randomization and Meta-learning |
Continual Learning for | visual | Search with Backward Consistent Feature Embedding |
Continuous activity understanding based on accumulative pose-context | visual | patterns |
Continuous Audio- | visual | Speech Recognition |
Continuous Emotion Recognition using | visual | -audio-linguistic Information: A Technical Report for ABAW3 |
Continuous Emotion Recognition with Audio- | visual | Leader-follower Attentive Fusion |
Continuous Manifold Based Adaptation for Evolving | visual | Domains |
Continuous | visual | vocabulary models for pLSA-based scene recognition |
Continuous | visual | World Modeling for Autonomous Robot Manipulation |
Continuous-Time Stereo | visual | Odometry Based on Dynamics Model |
Contour detection model based on neuron behaviour in primary | visual | cortex |
Contour Processing in | visual | Pattern Recognition. Application in Robotics |
Contour/Texture Approach for | visual | Tracking |
Contrast adaptation reveals increased organizational complexity of chromatic processing in the | visual | evoked potential |
contrast improved OR and XOR based (k,n) | visual | cryptography scheme without pixel expansion, A |
Contrast Optimization for Size Invariant | visual | Cryptography Scheme |
Contrastive Positive Sample Propagation Along the Audio- | visual | Event Line |
Contribution of Color Information in | visual | Saliency Model for Videos |
Contributions of Shape, Texture, and Color in | visual | Recognition |
control theoretic method for categorizing | visual | imagery as human motion behaviors, A |
Controllable | visual | -Tactile Synthesis |
Controlled-Smoothness Stabilizers fo the Regularization of Ill-Posed | visual | Problems Involving Discontinuities |
Convected Activation Profiles and the Measurement of | visual | Motion |
Convex reduction of high-dimensional kernels for | visual | classification |
Convexity-Based | visual | Camouflage Breaking |
ConvNets vs. Transformers: Whose | visual | Representations are More Transferable? |
Convolutional Adaptive Particle Filter with Multiple Models for | visual | Tracking |
Convolutional Attention Model For Restaurant Recommendation With Multi-View | visual | Features |
Convolutional Features for Correlation Filter Based | visual | Tracking |
Convolutional Hough Matching Networks for Robust and Efficient | visual | Correspondence |
Convolutional neural net bagging for online | visual | tracking |
Convolutional Neural Network for Blind Mesh | visual | Quality Assessment Using 3D Visual Saliency |
Convolutional Neural Network for Blind Mesh | visual | Quality Assessment Using 3D Visual Saliency |
Convolutional Neural Network for | visual | Security Evaluation |
convolutional neural network framework for blind mesh | visual | quality assessment, A |
Convolutional Neural Networks Based on Multi-Scale Additive Merging Layers for | visual | Smoke Recognition |
Convolutional Neural Networks Can Be Deceived by | visual | Illusions |
Convolutional Neural Networks for | visual | Information Analysis with Limited Computing Resources |
Convolutional Regression for | visual | Tracking |
Cooperative analysis of multiple frames by | visual | echoes |
Coopetitive | visual | surveillance using model predictive control |
Coordinated Joint Multimodal Embeddings for Generalized Audio- | visual | Zero-shot Classification and Retrieval of Videos |
Coordinating Distributed Algorithms for Feature Extraction Offloading in Multi-Camera | visual | Sensor Networks |
Coordinating Motion of Cooperative Mobile Robots Through | visual | Observation |
Coping with change: Learning invariant and minimum sufficient representations for fine-grained | visual | categorization |
Coplanar Shadowgrams for Acquiring | visual | Hulls of Intricate Objects |
Core zone scatterplots: A new approach to feature extraction for | visual | displays |
Corner detection and matching for | visual | tracking during power line inspection |
Corner Finder for | visual | Feedback, A |
Correlation Filter Learning Toward Peak Strength for | visual | Tracking |
Correlation Filter Selection for | visual | Tracking Using Reinforcement Learning |
Correlation filter via random-projection based CNNs features combination for | visual | tracking |
Correlation Filter-Based | visual | Tracking for UAV with Online Multi-Feature Learning |
Correlation filter-based | visual | tracking via adaptive weighted CNN features fusion |
Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust | visual | Question Answering |
Correlation Particle Filter for | visual | Tracking |
Correlation, Kalman filter and adaptive fast mean shift based heuristic approach for robust | visual | tracking |
Correlation-based incremental | visual | tracking |
Correlation-Based Tracker-Level Fusion for Robust | visual | Tracking |
Correlation-Guided Attention for Corner Detection Based | visual | Tracking |
Correlational Gaussian Processes for Cross-Domain | visual | Recognition |
Correlational Image Modeling for Self-Supervised | visual | Pre-Training |
Correlative Scan Matching Position Estimation Method by Fusing | visual | and Radar Line Features |
Corridor Navigation and Obstacle Avoidance using | visual | Potential for Mobile Robot |
Corrupting Neuron Explanations of Deep | visual | Features |
Cortically-inspired Architecture for Event-based | visual | Motion Processing: From Design Principles to Real-world Applications, A |
CoSLAM: Collaborative | visual | SLAM in Dynamic Environments |
Cost-Distortion Optimization and Resource Control in Pseudo-Analog | visual | Communications |
Cost-effective solution to synchronised audio- | visual | data capture using multiple sensors |
Cost-Effective Solution to Synchronized Audio- | visual | Capture Using Multiple Sensors |
Cost-efficient Automated | visual | Inspection system for small manufacturing industries based on SIFT |
Cost-Sensitive Active | visual | Category Learning |
Cost-Sensitive Rank Learning From Positive and Unlabeled Data for | visual | Saliency Estimation |
Cost-Sensitive Two-Stage Depression Prediction Using Dynamic | visual | Clues |
Could early | visual | processes be sufficient to label motions? |
Counterfactual Attention Learning for Fine-Grained | visual | Categorization and Re-identification |
Counterfactual Samples Synthesizing and Training for Robust | visual | Question Answering |
Counterfactual Samples Synthesizing for Robust | visual | Question Answering |
Counterfactual | visual | Dialog: Robust Commonsense Knowledge Learning From Unbiased Training |
Counterfactual Zero-Shot and Open-Set | visual | Recognition |
Counterfactual-based Saliency Map: Towards | visual | Contrastive Explanations for Neural Networks |
Counting-based | visual | question answering with serial cascaded attention deep learning |
Coupled Knowledge Transfer for | visual | Data Recognition |
Coupled Prediction Classification for Robust | visual | Tracking |
Coupled | visual | and Kinematic Manifold Models for Tracking |
Coupled-layer based | visual | tracking via adaptive kernelized correlation filters |
Coupling Attention and Convolution for Heuristic Network in | visual | Dialog |
Covert Photo Classification by Fusing Image Features and | visual | Attributes |
CoVIO: Online Continual Learning for | visual | -Inertial Odometry |
CoVR+: Design of | visual | Effects for Promoting Joint Attention During Shared VR Experiences via a Projection of HMD User's View |
CPC-GSCT: | visual | quality assessment for coloured point cloud based on geometric segmentation and colour transformation |
CR-LDSO: Direct Sparse LiDAR-Assisted | visual | Odometry With Cloud Reusing |
Creating audio-centric, image-centric, and integrated audio- | visual | summaries |
Creating Compact and Discriminative | visual | Vocabularies Using Visual Bits |
Creating Compact and Discriminative | visual | Vocabularies Using Visual Bits |
Creating descriptive | visual | words for tag ranking of compressed social image |
Creating Efficient Codebooks for | visual | Recognition |
Creating Efficient | visual | Codebook Ensembles for Object Categorization |
CREST: Convolutional Residual Learning for | visual | Tracking |
Critical Infrastructure Security Against Drone Attacks Using | visual | Analytics |
CrOC: Cross-View Online Clustering for Dense | visual | Representation Learning |
CroMM-VSR: Cross-Modal Memory Augmented | visual | Speech Recognition |
Cross Attentional Audio- | visual | Fusion for Dimensional Emotion Recognition |
Cross-Dataset Adaptation for | visual | Question Answering |
Cross-Descriptor | visual | Localization and Mapping |
Cross-Dimensional Refined Learning for Real-Time 3D | visual | Perception from Monocular Video |
Cross-Domain Deep Feature Combination for Bird Species Classification with Audio- | visual | Data |
Cross-domain learning methods for high-level | visual | concept classification |
Cross-Domain Recommendation Method Based on Multi-Layer Graph Analysis With | visual | Information |
Cross-domain structure learning for | visual | data recognition |
Cross-Domain | visual | Matching via Generalized Similarity Measure and Feature Learning |
Cross-Layer Optimization with Power Control in DS-CDMA | visual | Sensor Networks |
Cross-layer progressive attention bilinear fusion method for fine-grained | visual | classification |
Cross-modal Background Suppression for Audio- | visual | Event Localization |
Cross-Modal Causal Relational Reasoning for Event-Level | visual | Question Answering |
Cross-modal Deep Learning Applications: Audio- | visual | Retrieval |
Cross-Modal Dense Passage Retrieval for Outside Knowledge | visual | Question Answering |
Cross-modal face matching: Tackling | visual | abstraction using fine-grained attributes |
Cross-modal knowledge reasoning for knowledge-based | visual | question answering |
Cross-modal Relational Reasoning Network for | visual | Question Answering |
Cross-modal Retrieval Using Contrastive Learning of | visual | -Semantic Embeddings |
Cross-Modal Retrieval With CNN | visual | Features: A New Baseline |
Cross-Modal | visual | Question Answering for Remote Sensing Data: the International Conference on Digital Image Computing: Techniques and Applications (DICTA 2021) |
Cross-Modality Pyramid Alignment for | visual | Intention Understanding |
Cross-table linking and brushing: interactive | visual | analysis of multiple tabular data sets |
Cross-X Learning for Fine-Grained | visual | Categorization |
CrossLocate: Cross-modal Large-scale | visual | Geo-Localization in Natural Environments using Rendered Modalities |
Crowd Behavior Analysis Using Local Mid-Level | visual | Descriptors |
Crowd behaviours analysis in dynamic | visual | scenes of complex environment |
Crowd counting and segmentation in | visual | surveillance |
Crowd flow estimation using multiple | visual | features for scenes with changing crowd densities |
CrowdDriven: A New Challenging Dataset for Outdoor | visual | Localization |
CRTransSar: A | visual | Transformer Based on Contextual Joint Representation Learning for SAR Ship Detection |
CS-VQA: | visual | Question Answering with Compressively Sensed Images |
CSG: Classifier-Aware Defense Strategy Based on Compressive Sensing and Generative Networks for | visual | Recognition in Autonomous Vehicle Systems |
CSV: Image quality assessment based on color, structure, and | visual | system |
CUDA accelerated | visual | relative motion estimation |
Cultural Event recognition with | visual | ConvNets and temporal models |
Curiosity Guided Fine-Tuning for Encoder-Decoder-Based | visual | Forecasting |
Curious George: An Integrated | visual | Search Platform |
Curious Robot: Learning | visual | Representations via Physical Interactions, The |
Current challenges in automating | visual | perception |
Current issues and new techniques in | visual | quality assessment |
Curriculum Learning for Multi-task Classification of | visual | Attributes |
Curriculum learning of | visual | attribute clusters for multi-task classification |
Curvature Tensor Distance for Mesh | visual | Quality Assessment, A |
Customized Image Narrative Generation via Interactive | visual | Question Generation and Answering |
Cv4code: Sourcecode Understanding via | visual | Code Representations |
CVIDS: A Collaborative Localization and Dense Mapping Framework for Multi-Agent Based | visual | -Inertial SLAM |
CVM-Cervix: A hybrid cervical Pap-smear image classification framework using CNN, | visual | transformer and multilayer perceptron |
CVNodes: A | visual | Programming Paradigm for Developing Computer Vision Algorithms |
CVonline: Introductory | visual | Psychophysics/Psychology |
CVonline: | visual | Processing Software and Environments |
CVT-SLR: Contrastive | visual | -Textual Transformation for Sign Language Recognition with Variational Alignment |
Cycle-Consistency for Robust | visual | Question Answering |
Cycle-Consistent Weakly Supervised | visual | Grounding With Individual and Contextual Representations |
CycleMLP: A MLP-Like Architecture for Dense | visual | Predictions |
Cyclic Co-Learning of Sounding Object | visual | Grounding and Sound Separation |
D 3 Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and | visual | Grounding |
D-mago: A novel | visual | entity for storing emotional feeling with visual imprint |
D-mago: A novel | visual | entity for storing emotional feeling with visual imprint |
D-VINS: Dynamic Adaptive | visual | -Inertial SLAM with IMU Prior and Semantic Constraints in Dynamic Scenes |
D-ViSA: A Dataset for Detecting | visual | Sentiment from Art Images |
D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular | visual | Odometry |
Da4ad: End-to-end Deep Attention-based | visual | Localization for Autonomous Driving |
dacl-challenge: Semantic Segmentation during | visual | Bridge Inspections |
DanceUnisoner: A Parametric, | visual | , and Interactive Simulation Interface for Choreographic Composition of Group Dance |
DAReN: A Collaborative Approach Towards | visual | Reasoning And Disentangling |
DASTSiam: Spatio-temporal fusion and discriminative enhancement for Siamese | visual | tracking |
Data driven | visual | tracking via representation learning and online multi-class LPBoost learning |
Data hiding in grayscale images by dynamic programming based on a human | visual | model |
Data Hiding in Images With Adaptive Numbers of Least Significant Bits Based on the Human | visual | System |
Data-Driven Lightweight Interest Point Selection for Large-Scale | visual | Search |
Data-Driven Probabilistic Occlusion Mask to Promote | visual | Tracking |
Data-Driven Probability Hypothesis Density Filter for | visual | Tracking |
Data-Driven Spatially-Adaptive Metric Adjustment for | visual | Tracking |
Data-free Knowledge Distillation for Fine-grained | visual | Categorization |
Database of | visual | Color Differences of Modern Smartphone Photography, A |
Dataset and Architecture for | visual | Reasoning with a Working Memory, A |
Dataset and Model for the | visual | Quality Assessment of Inversely Tone-Mapped HDR Videos, A |
DBN-Mix: Training dual branch network using bilateral mixup augmentation for long-tailed | visual | recognition |
DC-VINS: Dynamic Camera | visual | Inertial Navigation System with Online Calibration |
DCCO: Towards Deformable Continuous Convolution Operators for | visual | Tracking |
DCT Regularized Extreme | visual | Recovery |
DDP: Diffusion Model for Dense | visual | Prediction |
Dealing with Missing Modalities in the | visual | Question Answer-Difference Prediction Task through Knowledge Distillation |
Debiased | visual | Question Answering via the perspective of question types |
Decision Level Fusion for Audio- | visual | Speech Recognition in Noisy Conditions |
Decoding Brain Representations by Multimodal Learning of Neural Activity and | visual | Features |
Decoding Generic | visual | Representations from Human Brain Activity Using Machine Learning |
Decoding of multichannel EEG activity from the | visual | cortex in response to pseudorandom binary sequences of visual stimuli |
Decoding of multichannel EEG activity from the | visual | cortex in response to pseudorandom binary sequences of visual stimuli |
Decoding | visual | brain states from fMRI using an ensemble of classifiers |
Decoding | visual | network-related dynamic functional connectivity for eyes-open and eyes-closed using machine learning |
Decoding | visual | Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features |
Decoding | visual | Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features |
Decomposition and Extraction: A New Framework for | visual | Classification |
Decomposition of a | visual | Scene into Bodies |
Decomposition of a | visual | Scene into Three-Dimensional Bodies |
Decomposition Theory and Transformations of | visual | Directions |
Decomposition, discovery and detection of | visual | categories using topic models |
DecomVQANet: Decomposing | visual | question answering deep network via tensor decomposition and regression |
Decorrelating Semantic | visual | Attributes by Resisting the Urge to Share |
Decouple Before Interact: Multi-Modal Prompt Learning for Continual | visual | Question Answering |
Decoupled Mixup for Out-of-distribution | visual | Recognition |
Decoupling Identity and | visual | Quality for Image and Video Anonymization |
Decoupling Sparse Coding with Fusion of Fisher Vectors and Scalable SVMs for Large-Scale | visual | Recognition |
Deep Attention Neural Tensor Network for | visual | Question Answering |
Deep Attention-Based Spatially Recursive Networks for Fine-Grained | visual | Recognition |
Deep Audio- | visual | Beamforming for Speaker Localization |
Deep Audio- | visual | Fusion Neural Network for Saliency Estimation |
deep audio- | visual | model for efficient dynamic video summarization, A |
Deep Audio- | visual | Speech Recognition |
Deep Bayesian Network for | visual | Question Generation |
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual- | visual | Cross Retrieval |
Deep Blind Image Quality Assessment with | visual | Importance Based Patch Score, A |
Deep Boltzmann Machines for i-Vector Based Audio- | visual | Person Identification |
Deep Clustering for Unsupervised Learning of | visual | Features |
Deep Co-occurrence Feature Learning for | visual | Object Recognition |
Deep Compressive Sensing for | visual | Privacy Protection in FlatCam Imaging |
Deep Convolutional Correlation Filters for Forward-Backward | visual | Tracking |
Deep convolutional correlation iterative particle filter for | visual | tracking |
Deep convolutional hashing using pairwise multi-label supervision for large-scale | visual | search |
Deep convolutional particle filter for | visual | tracking |
Deep Convolutional Particle Filter with Adaptive Correlation Maps for | visual | Tracking |
Deep Counterfactual Representation Learning for | visual | Recognition Against Weather Corruptions |
Deep cross-layer activation features for | visual | recognition |
Deep Direct | visual | Odometry |
Deep emotion recognition based on audio- | visual | correlation |
Deep Fair Clustering for | visual | Learning |
Deep Hashing Learning for | visual | and Semantic Retrieval of Remote Sensing Images |
Deep Hierarchies in the Primate | visual | Cortex: What Can We Learn for Computer Vision? |
Deep High-Resolution Representation Learning for | visual | Recognition |
Deep Imbalanced Attribute Classification Using | visual | Attention Aggregation |
Deep Learning Approach to Clustering | visual | Arts, A |
Deep learning assisted robust | visual | tracking with adaptive particle filtering |
Deep Learning Driven | visual | Path Prediction From a Single Image |
Deep learning features exception for cross-season | visual | place recognition |
Deep Learning for Surface Material Classification Using Haptic and | visual | Information |
Deep Learning for | visual | Content Analysis |
Deep Learning for | visual | SLAM |
Deep Learning for | visual | Tracking: A Comprehensive Survey |
Deep Learning for | visual | Understanding |
Deep Learning for | visual | Understanding: Part 2 |
Deep Learning Guided Partitioned Shape Model for Anterior | visual | Pathway Segmentation |
Deep Learning Human Mind for Automated | visual | Classification |
Deep Learning of Human | visual | Sensitivity in Image Quality Assessment Framework |
Deep Learning of | visual | and Textual Data for Region Detection Applied to Item Coding |
Deep Learning on | visual | Data |
Deep Meta Learning for Real-Time Target-Aware | visual | Tracking |
Deep Metric Learning for | visual | Tracking |
Deep Metric Learning for | visual | Understanding: An Overview of Recent Advances |
Deep Mixture of Diverse Experts for Large-Scale | visual | Recognition |
Deep Modular Co-Attention Networks for | visual | Question Answering |
Deep motion and appearance cues for | visual | tracking |
Deep motion features for | visual | tracking |
Deep Multidilation Temporal and Spatial Dependence Modeling in Stereoscopic 3-D EEG for | visual | Discomfort Assessment |
Deep Multimodal Pain Recognition: A Database and Comparison of Spatio-Temporal | visual | Modalities |
Deep mutual learning for | visual | object tracking |
Deep Neural Networks for Full-Reference and No-Reference Audio- | visual | Quality Assessment |
Deep Next-Best-View Planner for Cross-Season | visual | Route Classification |
Deep Pixel Probabilistic Model for Super Resolution Based on Human | visual | Saliency Mechanism |
Deep Poisoning: Towards Robust Image Data Sharing against | visual | Disclosure |
Deep Radial Embedding for | visual | Sequence Learning |
Deep Radio- | visual | Localization |
Deep Reinforced Attention Learning for Quality-Aware | visual | Recognition |
Deep Reinforcement Learning for Autonomous Driving by Transferring | visual | Features |
Deep Reinforcement Learning with Iterative Shift for | visual | Tracking |
Deep Relation Transformer for Diagnosing Glaucoma With Optical Coherence Tomography and | visual | Field Function |
Deep Residual Weight-Sharing Attention Network With Low-Rank Attention for | visual | Question Answering |
Deep RNN Framework for | visual | Sequential Applications |
Deep Robust Subjective | visual | Property Prediction in Crowdsourcing |
Deep Saliency Prior for Reducing | visual | Distraction |
Deep Semantic- | visual | Alignment for zero-shot remote sensing image scene classification |
Deep Spatial and Temporal Network for Robust | visual | Object Tracking |
Deep Unsupervised Learning for Simultaneous | visual | Odometry and Depth Estimation |
Deep unsupervised learning of | visual | similarities |
Deep Variation-Structured Reinforcement Learning for | visual | Relationship and Attribute Detection |
Deep Video Quality Assessor: From Spatio-Temporal | visual | Sensitivity to a Convolutional Neural Aggregation Network |
Deep | visual | Attention Prediction |
Deep | visual | Attention Prediction |
Deep | visual | Correspondence Embedding Model for Stereo Matching Costs, A |
Deep | visual | Discomfort Predictor for Stereoscopic 3D Images |
Deep | visual | Geo-localization Benchmark |
Deep | visual | Odometry With Adaptive Memory |
Deep | visual | Place Recognition for Waterborne Domains |
Deep | visual | Saliency on Stereoscopic Images |
Deep | visual | Teach and Repeat on Path Networks |
Deep | visual | tracking: Review and experimental comparison |
Deep | visual | unsupervised domain adaptation for classification tasks: A survey |
Deep | visual | words: Improved fisher vector for image classification |
Deep | visual | -Genetic Biometrics for Taxonomic Classification of Rare Species |
Deep | visual | -Semantic Alignments for Generating Image Descriptions |
Deep | visual | -Semantic Quantization for Efficient Image Retrieval |
Deep-based fisher vector for mobile | visual | search |
DEEP-CARVING: Discovering | visual | attributes by carving deep neural nets |
Deep-Disaster: Unsupervised Disaster Detection and Localization Using | visual | Data |
Deep-Like Hashing-in-Hash for | visual | Retrieval: An Embarrassingly Simple Method |
DeepBees - Building and Scaling Convolutional Neuronal Nets For Fast and Large-Scale | visual | Monitoring of Bee Hives |
Deeper and Wider Siamese Networks for Real-Time | visual | Tracking |
Deepfake Video Detection Using Audio- | visual | Consistency |
DeepFuseNet of Omnidirectional Far-Infrared and | visual | Stream for Vegetation Detection |
Deeply Supervised Multimodal Attentional Translation Embeddings for | visual | Relationship Detection |
DeePoint: | visual | Pointing Recognition and Direction Estimation |
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and | visual | Explanation |
DeepPermNet: | visual | Permutation Learning |
DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for | visual | Tracking |
DeepTrack: Learning Discriminative Feature Representations Online for Robust | visual | Tracking |
DeepVS2.0: A Saliency-Structured Deep Learning Method for Predicting Dynamic | visual | Attention |
Deepzzle: Solving | visual | Jigsaw Puzzles With Deep Learning and Shortest Path Optimization |
Defect Analysis Method for | visual | Inspection, A |
Defining Image Memorability Using the | visual | Memory Schema |
Deformable Parts Correlation Filters for Robust | visual | Tracking |
Deformable Siamese Attention Networks for | visual | Object Tracking |
Deformation Invariant | visual | Object Recognition: Experiments with a Self-Organizing Neural Architecture |
Deformation | visual | inspection of industrial parts with image sequence |
Degraded | visual | environments present challenges for DARPA |
Dehashing: Server-Side Context-Aware Feature Reconstruction for Mobile | visual | Search |
Delving into Inter-Image Invariance for Unsupervised | visual | Representations |
Demand-Driven | visual | Information Acquisition |
Demonstrating the new compact descriptors for | visual | search (CDVS) standard for image retrieval on mobile devices |
Demonstration of an HMM-based photorealistic expressive audio- | visual | speech synthesis system |
Dense Captioning with Joint Inference and | visual | Context |
Dense Contrastive Learning for Self-Supervised | visual | Pre-Training |
Dense convolutional feature histograms for robust | visual | object tracking |
Dense Modality Interaction Network for Audio- | visual | Event Localization |
Dense non-rigid | visual | tracking with a robust similarity function |
Dense Reppoints: Representing | visual | Objects with Dense Point Sets |
Dense sampling and fast encoding for 3D model retrieval using bag-of- | visual | features |
Dense, Auto-Calibrating | visual | Odometry from a Downward-Looking Camera |
Dense-Localizing Audio- | visual | Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline |
Densely Connected Discriminative Correlation Filters for | visual | Tracking |
Densifying Supervision for Fine-Grained | visual | Comparisons |
Density-Aware Graph for Deep Semi-Supervised | visual | Recognition |
Dependence of chromatic responses in V1 on | visual | field eccentricity and spatial frequency: an fMRI study |
Depth and Video Segmentation Based | visual | Attention for Embodied Question Answering |
Depth as attention to learn image representations for | visual | localization, using monocular images |
Depth estimation and camera calibration of a focused plenoptic camera for | visual | odometry |
Depth Matters: Influence of Depth Cues on | visual | Saliency |
depth perception and | visual | comfort guided computational model for stereoscopic 3D visual saliency, A |
depth perception and | visual | comfort guided computational model for stereoscopic 3D visual saliency, A |
Depth Prediction for Monocular Direct | visual | Odometry |
Depth-Adaptive Computational Policies for Efficient | visual | Tracking |
Depth-Aware and Semantic Guided Relational Attention Network for | visual | Question Answering |
Depth-based local feature selection for mobile | visual | search |
Depth-weighted correlation method for | visual | tracking with occlusion detection |
Describable | visual | Attributes for Face Verification and Image Search |
Describe, Spot and Explain: Interpretable Representation Learning for Discriminative | visual | Reasoning |
Describing Common Human | visual | Actions in Images |
Describing | visual | Scenes Using Transformed Objects and Parts |
Description of Evolutional Changes in Image Time Sequences Using MPEG-7 | visual | Descriptors |
Description of Planar Patterns by Invariant Features: An Attempt Towards the Explanation of | visual | Pattern Recognition |
Descriptive temporal template features for | visual | motion recognition |
Descriptor free | visual | indoor localization with line segments |
Descriptor Scoring for Feature Selection in Real-Time | visual | SLAM |
Design a new | visual | cryptography for human-verifiable authentication in accessing a database |
Design and evaluation of a | visual | acclimation aid for a semi-natural locomotion device |
Design and Experimental Evaluation of an Aerial Solution for | visual | Inspection of Tunnel-like Infrastructures |
design and implementation of a | visual | Workflow Modeling tool based on Eclipse plug-ins, The |
Design and Implementation of High-Speed | visual | Tracking System for Real-Time Motion Analysis |
Design and Implementation of People Tracking Algorithms for | visual | Surveillance Applications |
Design and implementation of the | visual | programming environment for the distributed image processing |
Design Architecture for IMPlayer as a Tool for Supporting | visual | Education Presentation |
Design considerations for a space-variant | visual | sensor with complex-logarithmic geometry |
Design for a | visual | -Motion Transducer, A |
Design Information Extraction and | visual | Representation based on Artificial Intelligence Natural Language Processing Techniques |
Design of a | visual | environment for evaluating and customizing medical image compression techniques |
Designing Category-Level Attributes for Discriminative | visual | Recognition |
Designing JPEG Quantization Tables based on Human | visual | System |
Designing | visual | Systems: Purposive Navigation |
Desktop 3D Scanner Exploiting Rotation and | visual | Rectification of Laser Profiles, A |
Detect | visual | Spoofing in Unicode-Based Text |
Detecting a gazing region by | visual | direction and stereo cameras |
Detecting Ads in Video Streams Using Acoustic and | visual | Cues |
Detecting and classifying online dark | visual | propaganda |
Detecting and Removing | visual | Distractors for Video Aesthetic Enhancement |
Detecting and Suppressing Marine Snow for Underwater | visual | SLAM |
Detecting and Tracking Distant Objects at Night Based on Human | visual | System |
Detecting Attended | visual | Targets in Video |
Detecting Driver Cognition Alertness State From | visual | Activities in Normal and Emergency Scenarios |
Detecting image orientation based on low-level | visual | content |
Detecting local audio- | visual | synchrony in monologues utilizing vocal pitch and facial landmark trajectories |
Detecting Maritime Infrared Targets in Harsh Environment by Improved | visual | Attention Model Preselector and Anti-Jitter Spatiotemporal Filter Discriminator |
Detecting News Reporting Using Audio/ | visual | Information |
Detecting Semantic Concepts Using Context and Audio/ | visual | Features |
Detecting Social Groups in Crowded Surveillance Videos Using | visual | Attention |
Detecting Unseen | visual | Relations Using Analogies |
Detecting | visual | Relationships Using Box Attention |
Detecting | visual | Relationships with Deep Relational Networks |
Detecting Water in | visual | Image Streams from UAV with Flight Constraints |
Detecting, Localizing and Classifying | visual | Traits from Arbitrary Viewpoints Using Probabilistic Local Feature Modeling |
Detection and Measurement of | visual | Motion, The |
Detection and sizing | visual | features in wood using tonal measures and a classification algorithm |
Detection and Three-Dimensional Localization by Stereoscopic | visual | Sensor and Its Application to a Robot for Picking Asparagus |
Detection of complex video events through | visual | rhythm |
Detection of Contours and Their | visual | Motion, The |
Detection of Eye Locations in Unconstrained | visual | Images |
Detection of Vehicles in a Motorway Environment by Means of Telemetric and | visual | Data |
Detection of | visual | attention regions in images using robust subspace analysis |
Detection of | visual | Concepts and Annotation of Images Using Ensembles of Trees for Hierarchical Multi-Label Classification |
Detection of | visual | Defects in Citrus Fruits: Multivariate Image Analysis vs Graph Image Segmentation |
Detection of | visual | pursuits using 1D convolutional neural networks |
Detection of | visual | symmetries |
Deterioration of | visual | information in face classification using Eigenfaces and Fisherfaces |
Determination of ambient light level changes in | visual | images |
Determination of Moment Invariants and Their Application to | visual | Servoing |
Determining characteristic views of a 3D object by | visual | hulls and Hausdorff distance |
Determining driver | visual | attention with one camera |
Deterministic Method of | visual | Servoing: Robust Object Tracking by Drone |
Deterministic Optimality for Robust Vehicle Localization Using | visual | Measurements |
Developing a Cubature Multi-state Constraint Kalman Filter for | visual | -Inertial Navigation System |
Developing a GIS-Based | visual | -Acoustic 3D Simulation for Wind Farm Assessment |
Development and Evaluation of Stochastic-Based | visual | Textures Features |
Development and Plasticity in | visual | Cortex |
Development and utilization of a disgusting image dataset to understand and predict | visual | disgust |
Development of a Low Cost Gamma-Ray Imaging System Using Handheld Scintillation Detectors for | visual | Surveying of Radiation Fields with Robots |
Development of a N-type GM-PHD filter for multiple target, multiple type | visual | tracking |
Development of an Estimation Model for Instantaneous Presence in Audio- | visual | Content |
development of hierarchical | visual | languages, The |
Development of Stereo | visual | Odometry Based on Photogrammetric Feature Optimization |
Device and method for dubbing an audio- | visual | presentation which generates synthesized speech and corresponding facial movements |
Device and method for prosody generation at | visual | synthesis |
Devising a | visual | Inspection System for Canal Tunnels: Preliminary Studies |
DEWS: A Live | visual | Surveillance System for Early Drowning Detection at Pool |
DFT-based Transformation Invariant Pooling Layer for | visual | Classification |
Diagnostic Study of | visual | Question Answering With Analogical Reasoning, A |
Dialog Must Go On: Improving | visual | Dialog via Generative Self-Training, The |
Dictionary learning for a sparse appearance model in | visual | tracking |
Dictionary Learning for | visual | Tracking with Dimensionality Reduction |
Different Binding Strategies for the Different Stages of | visual | Recognition |
Differentiable Adaptive Computation Time for | visual | Reasoning |
Differentiable SLAM-net: Learning Particle SLAM for | visual | Navigation |
Differential Attention for | visual | Question Answering |
Differential Earth Mover's Distance with Its Applications to | visual | Tracking |
Differential transient MEG and fMRI responses to | visual | stimulation onset rate |
DIFNet: Boosting | visual | Information Flow for Image Captioning |
Digging Hierarchical Information For | visual | Place Recognition With Weighting Similarity Metric |
Digital Coding Techniques for | visual | Communications |
Digital Color Image Processing within the Framework of a Human | visual | Model |
Digital halftoning algorithm using | visual | -optimized binary patterns |
Digital halftoning with correlated minimum | visual | modulation patterns |
Digital halftoning with minimum | visual | modulation patterns |
Digital Production of Color Mach Bands Using a Color Human | visual | -System Model |
Digital retina simulating dynamic behavior of | visual | perception |
Digital Retina: A Way to Make the City Brain More Efficient by | visual | Coding |
Digital Watermarking of 3d Medical | visual | Objects |
Dilated Inception Network for | visual | Saliency Prediction, A |
Dilated MultiRes | visual | Attention U-Net for historical document image binarization, A |
DilateFormer: Multi-Scale Dilated Transformer for | visual | Recognition |
DIME: An Online Tool for the | visual | Comparison of Cross-modal Retrieval Models |
Dimensionality reduction of | visual | features using sparse projectors for content-based image retrieval |
Direct Aerial | visual | Geolocalization Using Deep Neural Networks |
Direct Estimation of Affine Deformations Using | visual | Front-End Operators with Automatic Scale Selection |
Direct Estimation of Image Deformations Using | visual | Front-End Operations with Automatic Scale Selection |
Direct Iterative Closest Point for real-time | visual | odometry |
Direct Mapping of | visual | Input to Motor Torques |
Direct methods for 3D reconstruction and | visual | SLAM |
Direct Methods for | visual | Scene Reconstruction |
Direct model based | visual | tracking and pose estimation using mutual information |
Direct perception of three-dimensional motion from patterns of | visual | motion |
Direct Sparse | visual | Odometry with Structural Regularities for Long Corridor Environments |
Direct | visual | Localisation and Calibration for Road Vehicles in Changing City Environments |
Direct | visual | servoing in the non-linear scale space of camera pose |
Direct | visual | tracking under extreme illumination variations using the sum of conditional variance |
Direct | visual | -Inertial Odometry and Mapping for Unmanned Vehicle |
Directing | visual | attention by subliminal cues |
Directional Selectivity and its Use in Early | visual | Processing |
Directional Space-Time Oriented Gradients for 3D | visual | Pattern Analysis |
Disambiguating | visual | Motion by Form-Motion Interaction: A Computational Model |
Disambiguating | visual | relations using loop constraints |
Disambiguating | visual | Verbs |
DisAVR: Disentangled Adaptive | visual | Reasoning Network for Diagram Question Answering |
Discontinuity Detection for | visual | Surface Reconstruction |
Discontinuity Preserving Regularization of Inverse | visual | Problems |
Discontinuity Preserving | visual | Reconstruction by Means of Potential Theory |
Discovering a Fish in a Forest of Trees: False Positives and User Expectations in | visual | Retrieval: Experiments in CBIR and the Visual Arts |
Discovering a Fish in a Forest of Trees: False Positives and User Expectations in | visual | Retrieval: Experiments in CBIR and the Visual Arts |
Discovering Bayesian causality among | visual | events in a complex outdoor scene |
Discovering intrinsic properties of human observers' | visual | search and mathematical observers' scanning |
Discovering joint audio- | visual | codewords for video event detection |
Discovering meaningful multimedia patterns with audio- | visual | concepts and associated text |
Discovering Multi-relational Latent Attributes by | visual | Similarity Networks |
Discovering Planes and Collapsing the State Space in | visual | SLAM |
Discovering Primitive Action Categories by Leveraging Relevant | visual | Context |
Discovering Recurrent | visual | Semantics in Consumer Photographs |
Discovering Respects for | visual | Similarity |
Discovering the Local Co-occurring Patterns in | visual | Categorization |
Discovering Video Clusters from | visual | Features and Noisy Tags |
Discovering | visual | concept structure with sparse and incomplete tags |
Discovering | visual | Patterns in Art Collections With Spatially-Consistent Feature Learning |
Discovery of Collocation Patterns: from | visual | Words to Visual Phrases |
Discovery of Collocation Patterns: from | visual | Words to Visual Phrases |
Discrete Cosine Transform and Its Impact on | visual | Compression: Fifty Years From Its Invention, The |
Discrete Integral Sliding Mode Control in | visual | Object Tracking Using Differential Kinematics |
Discrete | visual | features modeling via leave-one-out likelihood estimation and applications |
Discrete | visual | Perception |
Discriminant Learning Through Multiple Principal Angles for | visual | Recognition |
Discriminant Saliency, the Detection of Suspicious Coincidences, and Applications to | visual | Recognition |
Discriminating Semantic | visual | Words for Scene Classification |
Discrimination Between Native and Non-Native Speech Using | visual | Features Only |
Discrimination Between | visual | Stimuli by Variation of Shape and Relative Position of Volumetric Primitives |
Discriminative Bag-of-Words-Based Adaptive Appearance Model for Robust | visual | Tracking |
Discriminative Bimodal Networks for | visual | Localization and Detection with Natural Language Queries |
Discriminative Cross-Modality Attention Network for Temporal Inconsistent Audio- | visual | Event Localization |
Discriminative deep belief networks for | visual | data classification |
Discriminative Descriptor-Based Observation Model for | visual | Tracking |
Discriminative feature learning from big data for | visual | recognition |
Discriminative Latent | visual | Space For Zero-Shot Object Classification |
Discriminative learning of relaxed hierarchy for large-scale | visual | recognition |
Discriminative learning of | visual | words for 3D human pose estimation |
Discriminative Multi-Task Sparse Learning for Robust | visual | Tracking Using Conditional Random Field |
Discriminative part model for | visual | recognition |
Discriminative Self-Paced Group-Metric Adaptation for Online | visual | Identification |
Discriminative Single-Shot Segmentation Network for | visual | Object Tracking, A |
Discriminative Soft Bag-of- | visual | Phrase for Mobile Landmark Recognition |
Discriminative sparse flexible manifold embedding with novel graph for robust | visual | representation and label propagation |
Discriminative subspace learning with sparse representation view-based model for robust | visual | tracking |
Disentangling Label Distribution for Long-tailed | visual | Recognition |
Disentangling Semantic-to- | visual | Confusion for Zero-Shot Learning |
Disentangling | visual | and written concepts in CLIP |
Disentangling | visual | Embeddings for Attributes and Objects |
Disparity Component Matching for | visual | Correspondence |
Dissimilarity Measures for | visual | Pattern Partitioning |
Distance Measuring Method Using | visual | Image Processing, A |
Distance measuring using passive | visual | means |
Distances between frequency features for 3D | visual | pattern partitioning |
Distant Supervised Centroid Shift: A Simple and Efficient Approach to | visual | Domain Adaptation |
Distilled Siamese Networks for | visual | Tracking |
Distilling Audio- | visual | Knowledge by Compositional Contrastive Learning |
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D | visual | Grounding |
Distilling DETR with | visual | -Linguistic Knowledge for Open-Vocabulary Object Detection |
Distinction between handwritten and machine-printed text based on the bag of | visual | words model |
Distinguishing Photographic Images and Photorealistic Computer Graphics Using | visual | Vocabulary on Local Image Edges |
Distortion estimation techniques in solving | visual | CAPTCHAs |
distortion measure for blocking artifacts in images based on human | visual | sensitivity, A |
Distortion of Steroscopic | visual | Space |
Distortion-Weighing Spatiotemporal | visual | Attention Model for Video Analysis, A |
Distortions of stereoscopic | visual | space and quadratic Cremona transformations |
Distractor-Aware Siamese Networks for | visual | Object Tracking |
Distributed Algorithms for Network Lifetime Maximization in Wireless | visual | Sensor Networks |
Distributed Analysis and Representation of | visual | Motion |
Distributed intelligence for multi-camera | visual | surveillance |
Distributed multi-camera | visual | mapping using topological maps of planar regions |
Distributed Self-Deployment in | visual | Sensor Networks |
distributed solution to detect targets in crowds using | visual | sensor networks, A |
distributed source autoencoder of local | visual | descriptors for 3D reconstruction, A |
Distributed three-level QR codes based on | visual | cryptography scheme |
Distributed | visual | Processing for a Home Visual Sensor Network |
Distributed | visual | Processing for a Home Visual Sensor Network |
Distributed | visual | sensing for virtual top-view trajectory generation in football videos |
distributed | visual | surveillance system, A |
Distributed | visual | -Target-Surveillance System in Wireless Sensor Networks |
Distributed Wireless | visual | Communication with Power Distortion Optimization |
Distribution Alignment: A Unified Framework for Long-tail | visual | Recognition |
Distribution Unified and Probability Space Aligned Teacher-Student Learning for Imbalanced | visual | Recognition |
Distributional semantics of objects in | visual | scenes in comparison to text |
Disturbed Augmentation Invariance for Unsupervised | visual | Representation Learning |
Diva: Diverse | visual | Feature Aggregation for Deep Metric Learning |
Diversified Fisher kernel: encoding discrimination in Fisher features to compete deep neural models for | visual | classification task |
Diversified | visual | Attention Networks for Fine-Grained Object Classification |
Diversity in Ensembles of Codebooks for | visual | Concept Detection |
Diversity-Aware Meta | visual | Prompting |
Divide&Classify: Fine-Grained Classification for City-Wide | visual | Place Recognition |
Divisively Normalized Sparse Coding: Toward Perceptual | visual | Signal Representation |
DLD-SLAM: RGB-D | visual | Simultaneous Localisation and Mapping in Indoor Dynamic Environments Based on Deep Learning |
Do The | visual | Complexity Algorithms Match The Generalization Process In Geographical Displays? |
Do video coding impairments disturb the | visual | attention deployment? |
Do we know what the early | visual | system does? |
Document page similarity based on layout | visual | saliency: application to query by example and document classification |
Does Randomness in the Random | visual | Cryptography Protocol Depend on the Period of Pseudorandom Number Generators? |
Does | visual | Self-Supervision Improve Learning of Speech Representations for Emotion Recognition? |
Does where you Gaze on an Image Affect your Perception of Quality? Applying | visual | Attention to Image Quality Metric |
Domain Adaptation for | visual | Recognition |
Domain Adaptive Fisher Vector for | visual | Recognition |
Domain Adaptive Transfer Learning on | visual | Attention Aware Data Augmentation for Fine-grained Visual Categorization |
Domain Adaptive Transfer Learning on | visual | Attention Aware Data Augmentation for Fine-grained Visual Categorization |
Domain Generalization through Audio- | visual | Relative Norm Alignment in First Person Action Recognition |
Domain Generalized Stereo Matching via Hierarchical | visual | Transformation |
Domain Invariant and Class Discriminative Feature Learning for | visual | Domain Adaptation |
Domain invariant regularization by disentangling content and style Features for | visual | domain generalization |
Domain-Aware | visual | Bias Eliminating for Generalized Zero-Shot Learning |
Domain-Dependent Reasoning for | visual | Navigation of Roadways |
Domain-Specific Codesign for Automated | visual | Inspection Systems |
Don't Just Assume; Look and Answer: Overcoming Priors for | visual | Question Answering |
Don't just listen, use your imagination: Leveraging | visual | common sense for non-visual tasks |
Don't just listen, use your imagination: Leveraging | visual | common sense for non-visual tasks |
Don't Trust Your Eyes: Cutting-Edge | visual | Effects |
Dot-Size Variant | visual | Cryptography |
Double window optimisation for constant time | visual | SLAM |
Double-Domain Adaptation Semantics for Retrieval-Based Long-Term | visual | Localization |
Doubly Right Object Recognition: A Why Prompt for | visual | Rationales |
DPcode: Privacy-Preserving Frequent | visual | Patterns Publication on Cloud |
DPDM: Feature-Based Pose Refinement with Deep Pose and Deep Match for Monocular | visual | Odometry |
DR-KFS: A Differentiable | visual | Similarity Metric for 3D Shape Reconstruction |
DR-Tune: Improving Fine-tuning of Pretrained | visual | Models by Distribution Regularization with Semantic Calibration |
DRAU: Dual Recurrent Attention Units for | visual | Question Answering |
Drawing Scene Perception Model Based On The Human | visual | System, A |
DREAM: | visual | Decoding from REversing HumAn Visual SysteM |
DREAM: | visual | Decoding from REversing HumAn Visual SysteM |
DRIVE: Deep Reinforced Accident Anticipation with | visual | Explanation |
DRIVE: Dynamic Reasoning from Integrated | visual | Evidence |
Driver aggressiveness detection using | visual | information from forward camera |
Driving me around the bend: Learning to drive from | visual | gist |
Driving | visual | Saliency Prediction of Dynamic Night Scenes via a Spatio-Temporal Dual-Encoder Network |
Drone-View Building Identification by Cross-View | visual | Learning and Relative Spatial Estimation |
DSGEM: Dual scene graph enhancement module-based | visual | question answering |
DSGN++: Exploiting | visual | -Spatial Relation for Stereo-Based 3D Detectors |
DSNet: Deep and Shallow Feature Learning for Efficient | visual | Tracking |
DSP: Discriminative Spatial Part modeling for Fine-Grained | visual | Categorization |
Dual Attention Matching for Audio- | visual | Event Localization |
Dual Cross-Attention Learning for Fine-Grained | visual | Categorization and Object Re-Identification |
Dual Deep Network for | visual | Tracking |
Dual integration of multi-model with spatial-temporal occlusion-awareness for | visual | object tracking |
Dual Path Multi-Modal High-Order Features for Textual Content based | visual | Question Answering |
Dual Perspective Network for Audio- | visual | Event Localization |
Dual self-attention with co-attention networks for | visual | question answering |
Dual Transformer With Multi-Grained Assembly for Fine-Grained | visual | Classification |
Dual-Attention Learning Network With Word and Sentence Embedding for Medical | visual | Question Answering, A |
Dual-decoder transformer network for answer grounding in | visual | question answering |
Dual-Key Multimodal Backdoors for | visual | Question Answering |
Dual-Layer | visual | Vocabulary Tree Hypotheses for Object Recognition |
Dual-modality Talking-metrics: 3D | visual | -Audio Integrated Behaviometric Cues from Speakers |
DualRC: A Dual-Resolution Learning Framework With Neighbourhood Consensus for | visual | Correspondences |
DualVGR: A Dual- | visual | Graph Reasoning Unit for Video Question Answering |
Duo-graph: An efficient and robust method for large-scale mapping for | visual | -guided robots |
DV-LOAM: Direct | visual | LiDAR Odometry and Mapping |
Dynamic Adaptation on Non-stationary | visual | Domains |
Dynamic and invisible messaging for | visual | MIMO |
Dynamic and Multiresolution Model of | visual | Attention and Its Application to Facial Landmark Detection, A |
Dynamic appearance model for particle filter based | visual | tracking |
Dynamic Approach To | visual | Data-Compression |
Dynamic Attention-based | visual | Odometry |
Dynamic Audio- | visual | Mapping using Fused Hidden Markov Model Inversion Method |
Dynamic Bayes Network for | visual | Pedestrian Tracking, A |
dynamic bayesian network approach to multi-cue based | visual | tracking, A |
Dynamic Bayesian Networks for Audio- | visual | Speaker Recognition |
Dynamic Bayesian Networks for Audio- | visual | Speech Recognition |
Dynamic camera scheduling for | visual | surveillance in crowded scenes using Markov random fields |
Dynamic Coding of | visual | Information |
Dynamic Computational Time for | visual | Attention |
Dynamic Effects in | visual | Closed-Loop Systems |
Dynamic Eye Movement Datasets and Learnt Saliency Models for | visual | Action Recognition |
Dynamic Feature Interaction Framework for Multi-task | visual | Perception, A |
Dynamic Few-Shot | visual | Learning Without Forgetting |
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for | visual | Question Answering |
Dynamic geodesic snakes for | visual | tracking |
Dynamic Markov Random Field Model for | visual | Tracking |
Dynamic Markov random fields for stochastic modeling of | visual | attention |
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for | visual | Grounding |
Dynamic Memory: Architecture for Real Time Integration of | visual | Perception, Camera Action, and Network Communication |
Dynamic Object-Aware Monocular | visual | Odometry With Local And Global Information Aggregation |
Dynamic objects detection through | visual | odometry and stereo-vision: A study of inaccuracy and improvement sources |
Dynamic Perceiver for Efficient | visual | Recognition |
Dynamic RGB-to-CMYK conversion using | visual | contrast optimisation |
Dynamic Sensor-Based Control of Robots with | visual | Feedback |
Dynamic Stereo in | visual | Navigation |
Dynamic Template Update for | visual | Object Tracking |
Dynamic | visual | attention model in image sequences |
Dynamic | visual | attention on the sphere |
Dynamic | visual | category learning |
Dynamic | visual | motion estimation from subspace constraints |
Dynamic | visual | Search Using Inner-Scene Similarity: Algorithms and Inherent Limitations |
Dynamic | visual | Sequence Prediction with Motion Flow Networks |
Dynamic | visual | tracking control of a mobile robot with image noise and occlusion robustness |
Dynamic-Model of | visual | Recognition Predicts Neural Response Properties in the Visual-Cortex |
Dynamic-Model of | visual | Recognition Predicts Neural Response Properties in the Visual-Cortex |
Dynamical road modeling and matching for direct | visual | navigation |
Dynamically | visual | Learning for People Identification with Sparsely Distributed Cameras |
DynGraph: | visual | Question Answering via Dynamic Scene Graphs |
E2VPT: An Effective and Efficient Approach for | visual | Prompt Tuning |
EACOFT: An energy-aware correlation filter for | visual | tracking |
Early detection of children with Autism Spectrum Disorder based on | visual | exploration of images |
Early Processing of Spatio-Temporal | visual | Information, The |
Early Processing of | visual | Information |
Early | visual | Learning |
Early | visual | Processing for Pattern Recognition in Natural Environments |
EAVA: A 3D Emotive Audio- | visual | Avatar |
EDA: Explicit Text-Decoupling and Dense Alignment for 3D | visual | Grounding |
Edge and Curve Detection for | visual | Scene Analysis |
edge detection with automatic scale selection approach to improve coherent | visual | attention model, An |
Edge Devices Clustering for Federated | visual | Classification: A Feature Norm Based Framework |
Edge Enhanced Direct | visual | Odometry |
Edge Potential Functions (EPF) and Genetic Algorithms (GA) for Edge-Based Matching of | visual | Objects |
Edge SLAM: Edge Points Based Monocular | visual | SLAM |
Edges in | visual | Scenes and Sequences: Application to Filtering, Sampling, and Adaptive DPCM Coding |
Editorial of Special Issue on Shape Representations Meet | visual | Recognition |
Editorial paper for Pattern Recognition Letters VSI on Cross Model Understanding for | visual | Question Answering |
Editorial to special issue on cross-media learning for | visual | question answering |
Editorial: | visual | information engineering |
Editorial: | visual | Information Engineering |
EDVD: Enhanced descriptor for | visual | and depth data |
EEG Based | visual | Classification With Multi-Feature Joint Learning |
EEG Biometrics Using | visual | Stimuli: A Longitudinal Study |
EEG-based brain source localization using | visual | stimuli |
EEG-ConvTransformer for single-trial EEG-based | visual | stimulus classification |
Effect of brightness on the quality of | visual | 3D perception |
Effect of noise on model selection criteria in | visual | applications |
Effect of Nonlinear Human | visual | System Components on Performance of a Channelized Hotelling Observer in Structured Backgrounds, The |
Effect of rod-cone interactions on mesopic | visual | performance mediated by chromatic and luminance pathways |
Effect of Sound on | visual | Fidelity Perception in Stereoscopic 3-D, The |
effect of target size and force feedback on 3D selection within a co-located | visual | -haptic immersive virtual environment, The |
Effect of Various | visual | Speech Units on Language Identification Using Visual Speech Recognition |
Effect of Various | visual | Speech Units on Language Identification Using Visual Speech Recognition |
Effect of | visual | of a Courseware towards Pre-University Students' Learning in Literature, The |
Effective and Efficient Midlevel | visual | Elements-Oriented Land-Use Classification Using VHR Remote Sensing Images |
effective and efficient | visual | quality index based on local edge gradients, An |
Effective browsing of image search results via diversified | visual | summarization by clustering and refining clusters |
Effective End-to-End Vision Language Pretraining With Semantic | visual | Loss |
effective LRTC model integrated with total a-order variation and boundary adjustment for multichannel | visual | data inpainting, An |
Effective near-duplicate image retrieval with image-specific | visual | phrase selection |
Effective News Anchorperson Shot Detection Method Based on Adaptive Audio/ | visual | Model Generation, An |
Effective Resource Management in | visual | Sensor Networks With MPSK |
effective use of adaptive combination of | visual | features to retrieve image semantics from a hierarchical image database, An |
Effective | visual | masking techniques in JPEG200 |
Effective | visual | Scanning of Geographic Information |
Effectively Leveraging Attributes for | visual | Similarity |
Effectiveness of | visual | Interactive Modeling in the Context of Multiple-Criteria Group Decisions |
Effects and optimization of | visual | -proprioceptive discrepancy reduction for virtual grasping |
Effects of ATM network impairments on audio- | visual | broadcast applications |
Effects of Colour Content and Cumulative Area of Outdoor Advertisement Billboards on the | visual | Quality of Urban Streets, The |
Effects of Dynamic | visual | Stimuli on the Development of Carsickness in Real Driving |
Effects of negative afterimages in | visual | illusions |
Effects of sound on | visual | realism perception and task performance |
Effects of | visual | attention on chromatic and achromatic detection sensitivities |
Effects of | visual | conflicts on 3D selection task performance in stereoscopic display environments |
Effects of | visual | Feedback on Out-of-Body Illusory Tactile Sensation When Interacting With Augmented Virtual Objects |
Efficient Adversarial Attacks for | visual | Object Tracking |
Efficient and Accurate Tightly-Coupled | visual | -Lidar SLAM |
Efficient Attention Mechanism for | visual | Dialog that Can Handle All the Interactions Between Multiple Inputs |
Efficient BOF Generation and Compression for On-Device Mobile | visual | Location Recognition |
Efficient Closed-Form Solution to Probabilistic 6D | visual | Odometry for a Stereo Camera, An |
Efficient Clothing Retrieval with Semantic-Preserving | visual | Phrases |
Efficient Computation Sharing for Multi-Task | visual | Scene Understanding |
Efficient Construction for Region Incrementing | visual | Cryptography |
Efficient Content Analysis Engine for | visual | Surveillance Network |
Efficient Counterfactual Debiasing for | visual | Question Answering |
Efficient CU Splitting Method for HEVC Intra Coding Based on | visual | Saliency |
Efficient Deep | visual | and Inertial Odometry with Adaptive Visual Modality Selection |
Efficient Deep | visual | and Inertial Odometry with Adaptive Visual Modality Selection |
Efficient design and implementation of | visual | computing algorithms on the GPU |
Efficient dictionary learning for | visual | categorization |
Efficient Discovery and Effective Evaluation of | visual | Perceptual Similarity: A Benchmark and Beyond |
Efficient Feature Distribution for Object Matching in | visual | -Sensor Networks |
Efficient Feature Parameterisation for | visual | SLAM Using Inverse Depth Bundles |
Efficient framework for extended | visual | object tracking |
Efficient Geometric Re-ranking for Mobile | visual | Search |
Efficient hybrid search for | visual | reconstruction problems |
Efficient image mosaicing for multi-robot | visual | underwater mapping |
Efficient Implementation and Evaluation of Reid's Multiple Hypothesis Tracking Algorithm for | visual | Tracking, An |
Efficient Implementation of Reid's Multiple Hypothesis Tracking Algorithm and Its Evaluation for the Purpose of | visual | Tracking, An |
Efficient indexing for large scale | visual | search |
Efficient Intrusion Detection Approach for | visual | Sensor Networks Based on Traffic Pattern Learning, An |
Efficient Kernels Couple | visual | Words Through Categorical Opponency |
Efficient Large-scale Semantic | visual | Localization in 2d Maps |
Efficient Map Compression for Collaborative | visual | SLAM |
Efficient MAP/ML Similarity Matching for | visual | Recognition |
Efficient Method for Infrared and | visual | Images Fusion Based on Visual Attention Technique, An |
Efficient Method for Infrared and | visual | Images Fusion Based on Visual Attention Technique, An |
Efficient Mining of Optimal AND/OR Patterns for | visual | Recognition |
Efficient Misalignment Method for | visual | Tracking Based on Sparse Representation, An |
Efficient modeling of | visual | saliency based on local sparse representation and the use of hamming distance |
Efficient Monocular 3D Reconstruction from Segments for | visual | Navigation in Structured Environments |
Efficient Multi-level Correlating for | visual | Tracking |
Efficient Multitarget | visual | Tracking Using Random Finite Sets |
Efficient Neural Models for | visual | Attention |
Efficient Online Egomotion Estimation Using | visual | and Inertial Readings |
Efficient Optimal Kernel Placement for Reliable | visual | Tracking |
Efficient Parallel Strategy for Matching | visual | Self-similarities in Large Image Databases, An |
Efficient QR Code Beautification With High Quality | visual | Content |
Efficient quantization parameter coding based on intra/inter prediction for | visual | quality conscious video coders |
Efficient Retrieval from Large-Scale Egocentric | visual | Data Using a Sparse Graph Representation |
Efficient Saliency-Model-Guided | visual | Co-Saliency Detection |
Efficient Scale- and Rotation-Invariant Encoding of | visual | Words for Image Classification |
Efficient Scene Compression for | visual | -based Localization |
Efficient Schmidt-EKF for 3D | visual | -Inertial SLAM, An |
Efficient Search in a Panoramic Image Database for Long-term | visual | Localization |
efficient system for combining complementary kernels in complex | visual | categorization tasks, An |
Efficient Technique for Summarizing Videos using | visual | Contents, An |
Efficient Transfer Learning for | visual | Tasks via Continuous Optimization of Prompts |
Efficient Two-Step Middle-Level Part Feature Extraction for Fine-Grained | visual | Categorization |
Efficient Version-Space Reduction for | visual | Tracking |
Efficient video annotation with | visual | interpolation and frame selection guidance |
Efficient video coding based on audio- | visual | focus of attention |
Efficient Vision-Based Calibration for | visual | Surveillance Systems with Multiple PTZ Cameras |
Efficient | visual | attention based framework for extracting key frames from videos |
Efficient | visual | Event Detection Using Volumetric Features |
Efficient | visual | hull computation for real-time 3D reconstruction using CUDA |
Efficient | visual | memory based navigation of indoor robot with a wide-field of view camera |
Efficient | visual | object detection with spatially global Gaussian mixture models and uncertainties |
Efficient | visual | Object Tracking with Online Nearest Neighbor Classifier |
Efficient | visual | Pretraining with Contrastive Detection |
Efficient | visual | Recognition |
Efficient | visual | Representation and Reconstruction from Generalized Curvature Measures |
Efficient | visual | Search for Objects in Videos |
Efficient | visual | Search of Videos Cast as Text Retrieval |
Efficient | visual | secret sharing scheme for color images |
Efficient | visual | Tracking Based on Fuzzy Inference for Intelligent Transportation Systems |
Efficient | visual | Tracking by Probabilistic Fusion of Multiple Cues |
Efficient | visual | Tracking via Hierarchical Cross-attention Transformer |
Efficient | visual | Tracking with Exemplar Transformers |
Efficient, simultaneous detection of multi-class geospatial targets based on | visual | saliency modeling and discriminative learning of sparse coding |
EfficientAD: Accurate | visual | Anomaly Detection at Millisecond-Level Latencies |
Efficiently Increasing Map Density in | visual | SLAM Using Planar Features with Adaptive Measurement |
Efficiently secure image transmission against tampering in wireless | visual | sensor networks |
Efficiently selecting spatially distributed keypoints for | visual | tracking |
Efficiently training a better | visual | detector with sparse eigenvectors |
EfficientTrain: Exploring Generalized Curriculum Learning for Training | visual | Backbones |
Ego-Exo: Transferring | visual | Representations from Third-person to First-person Videos |
Egocentric Audio- | visual | Object Localization |
Egocentric Deep Multi-Channel Audio- | visual | Active Speaker Localization |
Egocentric Direction and the | visual | Guidance of Robot Locomotion Background, Theory and Implementation |
Egocentric Vision for | visual | Market Basket Analysis |
Egocentric | visual | Event Classification with Location-Based Priors |
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with | visual | Queries |
egoPortray: | visual | Exploration of Mobile Communication Signature from Egocentric Network Perspective |
EGT: A toolbox for multiple view geometry and | visual | servoing |
EigenPlaces: Training Viewpoint Robust Models for | visual | Place Recognition |
EKF-Based | visual | Inertial Navigation Using Sliding Window Nonlinear Optimization |
EKF-Based | visual | SLAM System with Relative Map Orientation Measurements, The |
EKF-SLAM and Machine Learning Techniques for | visual | Robot Navigation |
Electronic Image Stabilization Using Multiple | visual | Cues |
Elementary Computation of Object Approach by a Wide-Field | visual | Neuron |
Elements of a Fuzzy Geometry for | visual | Space |
Eliciting | visual | primitives for detection of elongated shapes |
Elimination of Spatial Incoherency in Bag-of- | visual | Words Image Representation Using Visual Sentence Modelling |
Elimination of Spatial Incoherency in Bag-of- | visual | Words Image Representation Using Visual Sentence Modelling |
ELITE: Encoding | visual | Concepts into Textual Embeddings for Customized Text-to-Image Generation |
Ellipse Detection for | visual | Cyclists Analysis In the Wild |
ELoPE: Fine-Grained | visual | Classification with Efficient Localization, Pooling and Embedding |
EM Algorithms for Weighted-Data Clustering with Application to Audio- | visual | Scene Analysis |
Embarrassingly Simple Approach to | visual | Domain Adaptation, An |
Embedded processing methods for online | visual | analysis of laser welding |
Embedded Real-Time | visual | Search with Visual Distance Estimation |
Embedded Real-Time | visual | Search with Visual Distance Estimation |
Embedded Robust | visual | Obstacle Detection on Autonomous Lawn Mowers |
Embedded Solution to | visual | Mapping for Consumer Drones, An |
Embedded System-on-Chip Architecture for Real-time | visual | Detection and Matching, An |
Embedding deep networks into | visual | explanations |
Embedding Spatial Relations in | visual | Question Answering for Remote Sensing |
Embedding | visual | Hierarchy With Deep Networks for Large-Scale Visual Recognition |
Embedding | visual | Hierarchy With Deep Networks for Large-Scale Visual Recognition |
Embedding | visual | Words into Concept Space for Action and Scene Recognition |
Embodied Language Grounding With 3D | visual | Feature Representations |
Emergence of | visual | Categories: A Computational Perspective, The |
Emergent Issues in Large Amounts of | visual | Data |
Emergent | visual | Sensors for Autonomous Vehicles |
Emerging Trends in | visual | Computing |
EmoSet: A Large-scale | visual | Emotion Dataset with Rich Attributes |
Emotion Recognition Based on Joint | visual | and Audio Cues |
Emotional Attention: A Study of Image Sentiment and | visual | Attention |
Emotional Semantics-Preserved and Feature-Aligned CycleGAN for | visual | Emotion Adaptation |
Empirical Analysis of | visual | Features for Multiple Object Tracking in Urban Scenes, An |
Empirical Comparison of | visual | Descriptors for Content Based X-Ray Image Retrieval |
Empirical Comparison of | visual | Descriptors for Multiple Bleeding Spots Recognition in Wireless Capsule Endoscopy Video |
Empirical Evaluation of a | visual | Interface for Exploring Message Boards |
Empirical Evaluation of | visual | Question Answering for Novel Objects, An |
Empirical Exploration of Extreme SVM-RBF Parameter Values for | visual | Object Classification |
Empirical Mode Decomposition Analysis for | visual | Stylometry |
Empirical mode decomposition based | visual | enhancement of underwater images |
Empirical Study of Audio- | visual | Features Fusion for Gait Recognition |
Empirical Study of End-to-End Video-Language Transformers with Masked | visual | Modeling, An |
Empirical Study of Query Effectiveness Improvement via Multiple | visual | Feature Integration, An |
Empirical study on using adapters for debiased | visual | Question Answering |
Empirical Validation of the Saliency-based Model of | visual | Attention |
Empowering | visual | Categorization With the GPU |
Emulating human | visual | perception for measuring difference in images using an SPN graph approach |
En Plein Air | visual | Agents |
Enable Scale and Aspect Ratio Adaptability in | visual | Tracking with Detection Proposals |
Enabling | visual | analysis in wireless sensor networks |
Encoder-decoder cycle for | visual | question answering based on perception-action cycle |
Encoding and recognition of faces based on the human | visual | model and DCT |
Encoding color information for | visual | tracking: Algorithms and benchmark |
Encoding pairwise Hamming distances of Local Binary Patterns for | visual | smoke recognition |
Encoding Spatial Arrangement of | visual | Words |
Encoding Spatial Arrangements of | visual | Words for Rotation-Invariant Image Classification |
Encoding | visual | Information Using Anisotropic Transformations |
Encoding | visual | Sensitivity by MaxPol Convolution Filters for Image Sharpness Assessment |
Encouraging Eco-Driving With | visual | , Auditory, and Vibrotactile Stimuli |
Encryption Inspired Adversarial Defense For | visual | Classification |
Encryption, | visual | Cryptography, Authentication |
Encyclopedic VQA: | visual | questions about detailed properties of fine-grained categories |
End Points, Complexity, and | visual | Illusions |
End-to-End Blind Video Quality Assessment Based on | visual | and Memory Attention Modeling |
End-to-end deep metric network for | visual | tracking |
End-to-end DeepNCC framework for robust | visual | tracking |
End-to-end feature fusion Siamese network for adaptive | visual | tracking |
End-to-End Learning of Deep | visual | Representations for Image Retrieval |
End-to-End Learning of | visual | Representations From Uncurated Instructional Videos |
End-to-end Multi-Modal Multi-Task Vehicle Control for Self-Driving Cars with | visual | Perceptions |
End-to-End Policy Learning for Active | visual | Categorization |
End-to-End | visual | Editing with a Generatively Pre-Trained Artist |
End-to-end | visual | grounding via region proposal networks and bilinear pooling |
End-To-End | visual | Place Recognition Based on Deep Metric Learning and Self-Adaptively Enhanced Similarity Metric |
End-to-end | visual | speech recognition for small-scale datasets |
End-to-End | visual | Target Tracking in Multi-Robot Systems Based on Deep Convolutional Neural Network |
Endoscope Navigation and 3D Reconstruction of Oral Cavity by | visual | SLAM with Mitigated Data Scarcity |
Endstopped Neurons in the | visual | Cortex as a Substrate for Calculating Curvature |
Energy and Computation Efficient Audio- | visual | Voice Activity Detection Driven by Event-Cameras |
Energy Consumption of | visual | Sensor Networks: Impact of Spatio-Temporal Coverage |
Energy expenditure estimation using | visual | and inertial sensors |
Energy-Aware Mobile Edge Computing and Routing for Low-Latency | visual | Data Processing |
Enhance | visual | Recognition Under Adverse Conditions via Deep Networks |
Enhanced Bag-of- | visual | Word Vector Space Model to Represent Visual Content in Athletics Images, An |
Enhanced Bag-of- | visual | Word Vector Space Model to Represent Visual Content in Athletics Images, An |
Enhanced Bags of | visual | Words Representation Using Spatial Information |
enhanced image quality assessment by synergizing superpixels and | visual | saliency, An |
Enhanced Laplacian Group Sparse Learning with Lifespan Outlier Rejection for | visual | Tracking |
enhanced threshold | visual | secret sharing based on random grids, An |
Enhanced | visual | appearance, punch-style weight and physical characteristics based Leap Motion game |
Enhanced | visual | categorization performances by incorporation of simple features into BIM features |
Enhanced | visual | Experience and Archival Reusability in Personalized Search Based on Modified Spider Graph |
Enhanced | visual | Separation of Clusters by M-Mapping to Facilitate Cluster Analysis |
Enhanced-IPMH as a Robust | visual | Descriptor from H.264/AVC and Evaluation of Parameters Effects |
Enhancement of Compressed Video Using | visual | Quality Measurements |
Enhancement of | visual | Comfort and Sense of Presence on Stereoscopic 3D Images |
Enhancement of | visual | Perception Through Dynamic Cues: An Application to Mammograms |
Enhancement Strategies For Frame-to-frame Uas Stereo | visual | Odometry |
Enhancement-Registration-Homogenization (ERH): A Comprehensive Underwater | visual | Reconstruction Paradigm |
Enhancing Automatic Maritime Surveillance Systems With | visual | Information |
Enhancing CLIP with GPT-4: Harnessing | visual | Descriptions as Prompts |
Enhancing Fairness of | visual | Attribute Predictors |
Enhancing multi-factor cheating prevention in | visual | cryptography based minimum (k, n)-connected graph |
Enhancing Multimodal Compositional Reasoning of | visual | Language Models with Generative Negative Mining |
Enhancing Self-Supervised Monocular Depth Estimation with Traditional | visual | Odometry |
Enhancing the occlusion technique as an assessment tool for driver | visual | distraction |
Enhancing the perception of a hazy | visual | world using a see-through head-mounted device |
Enhancing the Robustness of Skin-Based Face Detection Schemes Through a | visual | Attention Architecture |
Enhancing Training Data Quality With | visual | Analytics |
Enhancing Video Event Recognition Using Automatically Constructed Semantic- | visual | Knowledge Base |
Enhancing | visual | Embeddings through Weakly Supervised Captioning for Zero-Shot Learning |
Enhancing | visual | Grounding in Vision-Language Pre-Training With Position-Guided Text Prompts |
Enriched Deep Recurrent | visual | Attention Model for Multiple Object Recognition |
Enriching | visual | Knowledge Bases via Object Discovery and Segmentation |
ENSEI: Efficient Secure Inference via Frequency-Domain Homomorphic Convolution for Privacy-Preserving | visual | Recognition |
Ensemble deep learning for automated | visual | classification using EEG signals |
Ensemble Model of | visual | Transformer and CNN Helps BA Diagnosis for Doctors in Underdeveloped Areas |
Ensemble Of adaptive correlation filters for robust | visual | tracking |
eNTERFACE-05 Audio- | visual | Emotion Database, The |
Entity Slot Filling for | visual | Captioning |
Entropy based camera control for | visual | object tracking |
Entropy Based Supervised Merging for | visual | Categorization |
Entropy of primitive: A top-down methodology for evaluating the perceptual | visual | information |
Entropy of Primitive: From Sparse Representation to | visual | Information Evaluation |
Environment Agnostic Representation for | visual | Reinforcement learning |
Environmental Sounds Classification Based on | visual | Features |
Environmental-Centered Representation of Spatial Layout: Available | visual | Information from Texture and Perspective |
EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale | visual | Localization |
EPIC-Fusion: Audio- | visual | Temporal Binding for Egocentric Action Recognition |
Epistemic Uncertainty-Weighted Loss for | visual | Bias Mitigation |
Error regulation strategies for Model Based | visual | servoing tasks: Application to autonomous object grasping with Nao robot |
Error Weighted Semi-Coupled Hidden Markov Model for Audio- | visual | Emotion Recognition |
Error-tolerant sign retrieval using | visual | features and maximum a posteriori estimation |
Error-Tolerant | visual | Planning of Planar Grasp |
Erudite Fine-Grained | visual | Classification Model, An |
ESResNet: Environmental Sound Classification Based on | visual | Domain Models |
Establishing | visual | Correspondence from Multi-Resolution Graph Cuts for Stereo-Motion |
Estimating Cohesion in Small Groups Using Audio- | visual | Nonverbal Behavior |
Estimating Contribution of Training Datasets using Shapley Values in Data-scale for | visual | Recognition |
Estimating Human Body and Head Orientation Change to Detect | visual | Attention Direction |
Estimating the information gap between textual and | visual | representations |
Estimating the initial values of unobservable variables in | visual | probabilistic networks |
Estimating vanishing points using | visual | spatial frequencies of textures on planar surfaces |
Estimating | visual | Saliency Through Single Image Optimization |
Estimation and Prediction of the Vehicle's Motion Based on | visual | Odometry and Kalman Filter |
Estimation of Center of Mass for Sports Scene Using Weighted | visual | Hull |
Estimation of Depth from Motion Using an Anthropomorphic | visual | Sensor |
Estimation of Emotion Labels via Tensor-Based Spatiotemporal | visual | Attention Analysis |
Estimation of Gaussian, Poissonian: Gaussian, and Processed | visual | Noise and Its Level Function |
Estimation of Impression Associated With Portraits Using Facial Landmarks and | visual | Features |
Estimation of the human performance for pedestrian detectability based on | visual | search and motion features |
Estimation Of | visual | Contents Based On Question Answering From Human Brain Activity |
ETR: An Efficient Transformer for Re-ranking in | visual | Place Recognition |
European Workshop on | visual | Information Processing |
EVA: Exploring the Limits of Masked | visual | Representation Learning at Scale |
EvAC3D: From Event-Based Apparent Contours to 3D Models via Continuous | visual | Hulls |
Evaluating color vision deficiency daltonization methods using a behavioral | visual | -search method |
Evaluating Feature Importance for Object Classification in | visual | Surveillance |
Evaluating Street Lighting Quality in Residential Areas by Combining Remote Sensing Tools and a Survey on Pedestrians' Perceptions of Safety and | visual | Comfort |
Evaluating the influence of packet loss on | visual | quality of perception for high bandwidth automotive networks |
Evaluating | visual | Impressions Based on Gaze Analysis and Deep Learning: A Case Study of Attractiveness Evaluation of Streets in Densely Built-up Wooden Residential Area |
Evaluating | visual | Inertial Odometry Using the Windy Forest Dataset |
Evaluating | visual | Properties via Robust HodgeRank |
Evaluation Framework for 360-Degree | visual | Content Compression with User View-Dependent Transmission |
Evaluation of a Particle Filter to Track People for | visual | Surveillance |
Evaluation of a | visual | Question Answering Architecture for Pedestrian Attribute Recognition |
Evaluation of a | visual | -Tactile Multimodal Display for Surface Obstacle Avoidance During Walking |
Evaluation of Feature Channels for Correlation-Filter-Based | visual | Object Tracking in Infrared Spectrum |
Evaluation of Haptic and | visual | Cues for Repulsive or Attractive Guidance in Nonholonomic Steering Tasks |
Evaluation of Interest Point Detectors and Feature Descriptors for | visual | Tracking |
Evaluation of local descriptors and CNNs for non-adult detection in | visual | content |
Evaluation of model independent image-based | visual | servoing |
Evaluation of MPEG7 color descriptors for | visual | surveillance retrieval |
Evaluation of multiple motion models for multiple pedestrian | visual | tracking |
evaluation of real-time RGB-D | visual | odometry algorithms on mobile devices, An |
Evaluation of Signal Processing Methods for Attention Assessment in | visual | Content Interaction |
Evaluation of TsHARP Utility for Thermal Sharpening of Sentinel-3 Satellite Images Using Sentinel-2 | visual | Imagery |
Evaluation of | visual | Attention Models for Robots |
Evaluation of | visual | Content Descriptors for Supporting Ad-Hoc Video Search Tasks at the Video Browser Showdown |
Evaluation of | visual | object retrieval datasets |
Evaluation of | visual | saliency analysis algorithms in noisy images |
Evaluation of | visual | Speech Features for the Tasks of Speech and Speaker Recognition, An |
Evaluation of | visual | Tracking Algorithms for Embedded Devices |
Evaluation: A Challenge for | visual | Analytics |
EVE: Software for | visual | Modeling |
Event Analogy Based Privacy Preservation in | visual | Surveillance |
Event classification for automatic | visual | -based surveillance of parking lots |
Event Detection in Field Sports Video Using Audio- | visual | Features and a Support Vector Machine |
Event Fisher Vectors: Robust Encoding | visual | Diversity of Visual Streams |
Event Fisher Vectors: Robust Encoding | visual | Diversity of Visual Streams |
Event-Based Neuromorphic Vision for Autonomous Driving: A Paradigm Shift for Bio-Inspired | visual | Sensing and Perception |
Event-Based | visual | Inertial Odometry |
Event-Driven | visual | Sensor Networks: Issues in Reliability |
Event-Specific Audio- | visual | Fusion Layers: A Simple and New Perspective on Video Understanding |
EVET: Enhancing | visual | Explanations of Deep Neural Networks Using Image Transformations |
Evolvable Biologically Plausible | visual | Architectures |
Evolvable | visual | commercial detector |
Evolving Fuzzy Modeling of an Uncalibrated | visual | Servoing System |
Evolving PCB | visual | inspection programs using genetic programming |
Evolving | visual | Attention Programs through EVO Features |
Evolving | visual | sonar: Depth from monocular images |
Exact polyhedral | visual | hulls |
Exact View-Dependent | visual | Hulls |
Examining | visual | saliency prediction in naturalistic scenes |
Exclusive | visual | Descriptor Quantization |
Exemplar based metric learning for robust | visual | localization |
Exemplar Based Recognition of | visual | Shapes |
Exemplar SVMs as | visual | feature encoders |
Exemplar-Based, Semantic Guided Zero-Shot | visual | Recognition |
Expanding the Frontiers of | visual | Analytics and Visualization |
Expansion of | visual | Hints for Improved Generalization in Stereo Matching |
Experimental Evaluation of Autonomous Driving Based on | visual | Memory and Image-Based Visual Servoing |
Experimental Evaluation of Autonomous Driving Based on | visual | Memory and Image-Based Visual Servoing |
experimental evaluation of feature detectors and descriptors for | visual | SLAM, An |
experimental study of employing | visual | appearance as a phenotype, An |
Experimental Study of the Possible Bandwidth Compression of | visual | Image Signals, An |
experimental study on the universality of | visual | vocabularies, An |
Experimentation of | visual | augmented reality for visiting the historical monuments of the medina of Fez |
Experiments in the Machine Interpretation of | visual | Motion |
Experiments in the | visual | Perception of texture |
Experiments on | visual | loop closing using vocabulary trees |
Experiments with monocular | visual | tracking and environment modeling |
Expert Systems Simulating Human | visual | Perception |
Explainable and Explicit | visual | Reasoning Over Scene Graphs |
Explainable few-shot learning with | visual | explanations on a low resource pneumonia dataset |
Explainable Video Entailment with Grounded | visual | Evidence |
Explaining Autonomous Driving by Learning End-to-End | visual | Attention |
Explaining Deep Convolutional Neural Networks via Latent | visual | -Semantic Filter Attention |
Explaining the Ambiguity of Object Detection and 6D Pose From | visual | Data |
Explaining | visual | Models by Causal Attribution |
Explaining VQA predictions using | visual | grounding and a knowledge base |
Explanation vs. attention: A two-player game to obtain attention for VQA and | visual | dialog |
Explanation-based Weakly-supervised Learning of | visual | Relations with Graph Networks |
Explicit Bias Discovery in | visual | Question Answering Models |
Explicit Cross-Modal Representation Learning for | visual | Commonsense Reasoning |
Explicit ensemble attention learning for improving | visual | question answering |
Explicit Knowledge Incorporation for | visual | Reasoning |
Explicit | visual | Prompting for Low-Level Structure Segmentations |
Exploit | visual | Dependency Relations for Semantic Segmentation |
Exploitation of 3D Information for Directing | visual | Attention and Object Recognition |
Exploitation of Meta Knowledge for Learning | visual | Concepts |
Exploiting Attention for | visual | Relationship Detection |
Exploiting Contextual Motion Cues for | visual | Object Tracking |
Exploiting disparity information in | visual | object tracking |
Exploiting distinctive | visual | landmark maps in pan-tilt-zoom camera networks |
Exploiting evidential theory in the fusion of textual, audio, and | visual | modalities for affective music video retrieval |
Exploiting Graph and Geodesic Distance Constraint for Deep Learning-Based | visual | Odometry |
Exploiting Image Motion for Active Vision in a | visual | Servoing Framework |
Exploiting Long-Term Connectivity and | visual | Motion in CRF-Based Multi-Person Tracking |
Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for | visual | Sentiment Distributions |
Exploiting Semantic and | visual | Context for Effective Video Annotation |
Exploiting Semantic Embedding and | visual | Feature for Facial Action Unit Detection |
Exploiting semantics on external resources to gather | visual | examples for video retrieval |
Exploiting shadows in a | visual | , hand-driven user interface |
Exploiting spatial relationships for | visual | tracking |
Exploiting Spatial Sparsity for Event Cameras with | visual | Transformers |
Exploiting structural constraints for | visual | object tracking |
Exploiting structured high-level knowledge for domain-specific | visual | classification |
Exploiting superpixel and hybrid hash for kernel-based | visual | tracking |
Exploiting textual and | visual | features for image categorization |
Exploiting textual queries for dynamically | visual | disambiguation |
Exploiting the Anisotropy of Correlation Filter Learning for | visual | Tracking |
Exploiting the Complementarity of Audio and | visual | Data in Multi-speaker Tracking |
Exploiting | visual | Artifacts to Expose Deepfakes and Face Manipulations |
Exploiting | visual | Constraints in the Synthesis of Uncertainty-Tolerant Motion Plans |
Exploiting | visual | Context Semantics for Sound Source Localization |
Exploiting | visual | Context to Identify People in TV Programs |
Exploiting | visual | Quasi-periodicity for Automated Chewing Event Detection Using Active Appearance Models and Support Vector Machines |
Exploiting | visual | Saliency Algorithms for Object-Based Attention: A New Color and Scale-Based Approach |
Exploiting | visual | Saliency and Bag-of-Words for Road Sign Recognition |
Exploiting | visual | -Audio-Textual Characteristics for Automatic TV Commercial Block Detection and Segmentation |
Exploiting Web Images for Fine-Grained | visual | Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples |
Exploiting Web Images for Fine-Grained | visual | Recognition via Dynamic Loss Correction and Global Sample Selection |
Exploration and evaluation of individual difference to driving fatigue for high-speed railway: a parametric SVM model based on multidimensional | visual | cue |
Exploration of Embodied | visual | Exploration, An |
exploration of factors that drive vertical vergence movements across the | visual | field, An |
Exploration of | visual | Data |
Exploratory Comparison of the | visual | Quality of Virtual Reality Systems Based on Device-Independent Testsets, An |
Exploratory | visual | Sensing for Determining Spatial Layout with an Agile Stereo Camera System |
Explore and Tell: Embodied | visual | Captioning in 3D Environments |
Explore the potential of deep learning and hyperchaotic map in the meaningful | visual | image encryption scheme |
Exploring Causal Relationships in | visual | Object Tracking |
Exploring Compositional | visual | Generation with Latent Classifier Guidance |
Exploring Context and | visual | Pattern of Relationship for Scene Graph Generation |
Exploring Heterogeneous Clues for Weakly-Supervised Audio- | visual | Video Parsing |
Exploring human eye behaviour using a model of | visual | attention |
Exploring human | visual | system: Study to aid the development of automatic facial expression recognition framework |
Exploring Implicit Image Statistics for | visual | Representativeness Modeling |
Exploring Large Movie Collections: Comparing | visual | Berrypicking and Traditional Browsing |
Exploring Lightweight Hierarchical Vision Transformers for Efficient | visual | Tracking |
Exploring Long Tail | visual | Relationship Recognition with Large Vocabulary |
Exploring Motion Information for Distractor Suppression in | visual | Tracking |
Exploring Multi-Scale Spatiotemporal Twitter User Mobility Patterns with a | visual | -Analytics Approach |
Exploring Predicate | visual | Context in Detecting of Human-Object Interactions |
Exploring region relationships implicitly: Image captioning with | visual | relationship attention |
Exploring relations of | visual | codes for image classification |
Exploring Structural Knowledge for Automated | visual | Inspection of Moving Trains |
Exploring the Benefits of | visual | Prompting in Differential Privacy |
Exploring the diversity and invariance in yourself for | visual | pre-training task |
Exploring the Dynamics of | visual | Events in the Multi-dimensional Semantic Concept Space |
Exploring the effects of 3D | visual | discomfort on viewers' emotions |
Exploring the Effects of Blur and Deblurring to | visual | Object Tracking |
Exploring the Prediction Consistency of Multiple Views for Transductive | visual | Recognition |
Exploring two spaces with one feature: kernelized multidimensional modeling of | visual | alphabets |
Exploring | visual | and Motion Saliency for Automatic Video Object Extraction |
Exploring | visual | attention using random walks based eye tracking protocols |
Exploring | visual | Engagement Signals for Representation Learning |
Exploring | visual | Motion Using Projections of Motion Fields |
Exploring | visual | Relationship for Image Captioning |
Exploring | visual | relationship for social media popularity prediction |
Expressive Modulation of Neutral | visual | Speech |
Expressive Querying for Accelerating | visual | Analytics |
Expressive Talking Head Generation with Granular Audio- | visual | Control |
Expressive | visual | text-to-speech as an assistive technology for individuals with autism spectrum conditions |
Expressive | visual | Text-to-Speech Using Active Appearance Models |
Extended Grammar System for Learning and Recognizing Complex | visual | Events, An |
Extended Non-local Feature for | visual | Saliency Detection in Low Contrast Images |
Extended Robust Feature-Based | visual | Navigation System for UAVs |
Extended | visual | Cryptography for Natural Images |
Extended | visual | Cryptography Scheme for Continuous-Tone Images, An |
Extended | visual | Secret Sharing Schemes with High-Quality Shadow Images Using Gray Sub Pixels |
Extending Correlation Filter-Based | visual | Tracking by Tree-Structured Ensemble and Spatial Windowing |
Extending KDDML with a | visual | Metaphor for the KDD Process |
Extension of the CCITT | visual | communication coding algorithm for operation in ATM networks |
extensive evaluation of deep features of convolutional neural networks for saliency prediction of human | visual | attention, An |
Extracting a Domain Theory from Natural Language to Construct a Knowledge Base for | visual | Recognition |
Extracting Causal | visual | Features for Limited Label Classification |
Extracting dense features for | visual | correspondence with graph cuts |
Extracting Motion Features for | visual | Human Activity Representation |
Extracting Multiple | visual | Senses for Web Learning |
Extracting Semantic Information from Basketball Video Based on Audio- | visual | Features |
Extracting | visual | Knowledge from the Internet: Making Sense of Image Data |
Extraction and Classification of | visual | Motion Patterns for Hand Gesture Recognition |
Extraction and Tracking Moving Objects in Detail Considering | visual | Feature Constraint and Structure Constraint |
extraction method for digital camouflage texture based on human | visual | perception and isoperimetric theory, An |
Extraction of 3D freeform surfaces as | visual | landmarks for real-time tracking |
Extraction of Combined Features from Global/Local Statistics of | visual | Words Using Relevant Operations |
Extraction of Object Representations from Stereo Image Sequences Utilizing Statistical and Deterministic Regularities in | visual | Data |
Extraction of Relevant Information from Document Images Using Measures of | visual | Attention |
Extraction of Salient Apexes from an Image by Using the Function at the Primary | visual | Cortex |
Extraction of Salient Contours Via Excitatory-Inhibitory Interactions in the | visual | Cortex |
Extraction of | visual | Features for Lipreading |
Extraction of | visual | features with eye tracking for saliency driven 2D/3D registration |
Extreme Structure from Motion for Indoor Panoramas without | visual | Overlaps |
Eye Movement and | visual | Cognition |
Eye Tracking the | visual | Attention of Nurses Interpreting Simulated Vital Signs Scenarios: Mining Metrics to Discriminate Between Performance Level |
Eye-In-Hand | visual | Servoing for Accurate Shooting in Pool Robotics |
Eyes in the Back of Your Head: Robust | visual | Teach and Repeat Using Multiple Stereo Cameras |
F-SCP: An automatic prompt generation method for specific classes based on | visual | language pre-training models |
Face as Mouse Through | visual | Face Tracking |
Face detection for | visual | surveillance |
Face Spoofing Detection Through | visual | Codebooks of Spectral Temporal Cubes |
Face tracking and recognition with | visual | constraints in real-world videos |
Facial 3D Shape Estimation from Images for | visual | Speech Animation |
Facial Action Coding Using Multiple | visual | Cues and a Hierarchy of Particle Filters |
Facial Attribute Recognition by Recurrent Learning With | visual | Fixation |
Facial Chirality: From | visual | Self-Reflection to Robust Facial Feature Learning |
Facial Expression Recognition Using | visual | Saliency and Deep Learning |
Facial Expression Recognition With | visual | Transformers and Attentional Selective Fusion |
Facial expression understanding in image sequences using dynamic and active | visual | information fusion |
Factorized Tensor Dictionary Learning for | visual | Tensor Data Completion |
Fair Comparison: Quantifying Variance in Results for Fine-grained | visual | Categorization |
Fair Feature Distillation for | visual | Recognition |
Fair | visual | Recognition in Limited Data Regime using Self-Supervision and Self-Distillation |
Fall Detection and Recognition from Egocentric | visual | Data: A Case Study |
FAM: | visual | Explanations for the Feature Representations from Deep Convolutional Networks |
Familiarity based unified | visual | attention model for fast and robust object recognition |
Families of Stationary Patterns Producing Illusory Movement: Insights into the | visual | System |
FamSearch: | visual | Analysis of Genealogical Data |
Fantastic Answers and Where to Find Them: Immersive Question-Directed | visual | Attention |
Fashion Forward: Forecasting | visual | Style in Fashion |
Fashion-Specific Ambiguous Expression Interpretation with Partial | visual | -Semantic Embedding |
FashionVQA: A Domain-Specific | visual | Question Answering System |
Fast Algorithm for Creating a Compact and Discriminative | visual | Codebook, A |
Fast and Accurate One-Stage Approach to | visual | Grounding, A |
Fast and Adaptive Deep Fusion Learning for Detecting | visual | Objects |
Fast and efficient | visual | codebook construction for multi-label annotation using predictive clustering trees |
Fast and Flexible Computer Vision System for Implanted | visual | Prostheses, A |
Fast and low-complexity reinforcement learning for delay-sensitive energy harvesting wireless | visual | sensing systems |
fast and mobile system for registration of low-altitude | visual | and thermal aerial images using multiple small-scale UAVs, A |
Fast and Precise HOG-Adaboost Based | visual | Support System Capable to Recognize Pedestrian and Estimate Their Distance, A |
Fast and Robust Generation of Feature Maps for Region-Based | visual | Attention |
Fast and Robust Heterologous Image Matching Method for | visual | Geo-Localization of Low-Altitude UAVs, A |
Fast and Robust Object Detection Using | visual | Subcategories |
Fast and robust | visual | inspection system for tire surface thin defect |
Fast and robust | visual | tracking with hard balanced focal loss and guided domain adaption |
Fast and Secured | visual | Content Hiding in Lossy Compressed Images and Video Streams |
Fast common | visual | pattern detection via radiate geometric model |
Fast Computation of a | visual | Hull |
Fast Cost-Volume Filtering for | visual | Correspondence and Beyond |
Fast Depth Saliency from Stereo for Region-Based Artificial | visual | Attention |
Fast Edge Detection Algorithm Matching | visual | Contour Perception, A |
Fast Floor Segmentation Algorithm for | visual | -Based Robot Navigation, A |
Fast Global Reflectional Symmetry Detection for Robotic Grasping and | visual | Tracking |
Fast Inference Vision Transformer for Automatic Pavement Image Classification and Its | visual | Interpretation Method, A |
Fast Knowledge Distillation Framework for | visual | Recognition, A |
Fast Learning of Spatially Regularized and Content Aware Correlation Filter for | visual | Tracking |
Fast mode decision and early termination based on perceptual | visual | quality for HEVC encoders |
Fast Monocular | visual | Place Recognition for Non-Uniform Vehicle Speed and Varying Lighting Environment |
Fast Non-Overlapping Multi-Camera People Re-Identification Algorithm and Tracking Based on | visual | Channel Model, A |
Fast Odometry Integration in Local Bundle Adjustment-Based | visual | SLAM |
Fast Perceptual Learning in | visual | Hyperacuity |
Fast Pixelwise Adaptive | visual | Tracking of Non-Rigid Objects |
Fast Re-ranking of | visual | Search Results by Example Selection |
Fast Reconstruction of 3D Point Cloud Model Using | visual | SLAM on Embedded UAV Development Platform |
Fast relocalization for | visual | odometry using binary features |
Fast Retrieval of Isolated | visual | Shapes |
Fast Rotation-Invariant Video Caption Detection Based on | visual | Rhythm |
Fast Semantic-Aware Motion State Detection for | visual | SLAM in Dynamic Environment |
Fast SIFT Design for Real-Time | visual | Feature Extraction |
Fast Techniques for Monocular | visual | Odometry |
Fast Tensor Nuclear Norm for Structured Low-Rank | visual | Inpainting |
Fast texel size estimation in | visual | texture using homogeneity cues |
Fast turtle shell-based data embedding mechanisms with good | visual | quality |
Fast | visual | object counting via example-based density estimation |
Fast | visual | Object Tracking using Ellipse Fitting for Rotated Bounding Boxes |
Fast | visual | Odometry Based Sparse Geometric Constraint for RGB-D Camera |
Fast | visual | Retrieval Using Accelerated Sequence Matching |
Fast | visual | saliency based on multi-scale difference of Gaussians fusion in frequency domain |
Fast | visual | search using simplified pruning rules: Streamlined Active Search |
Fast | visual | Tracking by Temporal Consensus |
Fast | visual | Tracking via Dense Spatio-temporal Context Learning |
Fast | visual | Tracking With Siamese Oriented Region Proposal Network |
Fast | visual | Vocabulary Construction for Image Retrieval Using Skewed-Split k-d Trees |
Fast volumetric | visual | hull computation |
Fast Wavelet-Based | visual | Classification |
Fast wide baseline matching for | visual | navigation |
Fast, | visual | and Interactive Semi-supervised Dimensionality Reduction |
Faster | visual | -Based Localization with Mobile-PoseNet |
Faster-ADNet for | visual | Tracking |
FastSal: a Computationally Efficient Network for | visual | Saliency Prediction |
FC-vSLAM: Integrating Feature Credibility in | visual | SLAM |
FCC: Feature Clusters Compression for Long-Tailed | visual | Recognition |
FD-CAM: Improving Faithfulness and Discriminability of | visual | Explanation for CNNs |
FEAR: Fast, Efficient, Accurate and Robust | visual | Tracker |
Feasibility Analysis of Ultra High Frame Rate | visual | Servoing on FPGA and SIMD Processor |
Feasibility study for | visual | discomfort assessment on stereo images using EEG |
feasibility study on a novel method of | visual | obstacle detection, A |
Feature Aggregation Networks Based on Dual Attention Capsules for | visual | Object Tracking |
Feature Alignment and Aggregation Siamese Networks for Fast | visual | Tracking |
Feature and Region Selection for | visual | Learning |
Feature Cloud: Improving Deep | visual | Recognition with Probabilistic Feature Augmentation |
Feature Clustering with Fading Affect Bias: Building | visual | Vocabularies on the Fly |
Feature combination with Multi-Kernel Learning for fine-grained | visual | classification |
Feature Comparison Based Channel Attention For Fine-Grained | visual | Classification |
Feature compression: A framework for multi-view multi-person tracking in | visual | sensor networks |
Feature detection algorithm based on a | visual | system model |
Feature Extraction For | visual | Speaker Authentication Against Computer-Generated Video Attacks |
Feature Extraction of Three-Dimensional Objects and | visual | Processing in a Hand-Eye System Using Laser Tracker |
Feature First: Advancing Image-Text Retrieval Through Improved | visual | Features |
Feature fusion network for long-tailed | visual | recognition |
Feature grouping and local soft match for mobile | visual | search |
Feature Planning for Robust Execution of General Robot Tasks using | visual | Servoing |
Feature Repetitiveness Similarity Metrics in | visual | Search |
Feature selection by maximum marginal diversity: Optimality and Implications for | visual | Recognition |
Feature Selection for Big | visual | Data: Overview and Challenges |
Feature selection for reliable data association in | visual | SLAM |
Feature Sets and Dimensionality Reduction for | visual | Object Detection |
Feature Similarity and Frequency-Based Weighted | visual | Words Codebook Learning Scheme for Human Action Recognition |
Feature space video stream consistency estimation for dynamic stream weighting in audio- | visual | speech recognition |
Feature tracking for | visual | servoing purposes |
Feature-Based Image Comparison for Semantic Neighbor Selection in Resource-Constrained | visual | Sensor Networks |
Feature-based object modelling for | visual | surveillance |
Feature2Mass: | visual | Feature Processing in Latent Space for Realistic Labeled Mass Generation |
Features-based approach for Alzheimer's disease diagnosis using | visual | pattern of water diffusion in tensor diffusion imaging |
Federated learning-based colorectal cancer classification by convolutional neural networks and general | visual | representation learning |
Federated | visual | Classification with Real-World Data Distribution |
Feedback Convolutional Neural Network for | visual | Localization and Segmentation |
FET-FGVC: Feature-enhanced transformer for fine-grained | visual | classification |
Few-Shot Classification in Unseen Domains by Episodic Meta-Learning Across | visual | Domains |
Few-Shot Learning with | visual | Distribution Calibration and Cross-Modal Distribution Alignment |
Few-Shot Object Detection by Knowledge Distillation Using Bag-of- | visual | -Words Representations |
Few-Shot | visual | Classification Using Image Pairs With Binary Transformation |
Few-Shot | visual | Relationship Co-Localization |
Field Cognitive Styles on | visual | Cognition in the Event Structure Design of Bivariate Interactive Dorling Cartogram: The Similarities and Differences of Field-Independent and Field-Dependent Users |
Field tests on flat ground of an intensity-difference based monocular | visual | odometry algorithm for planetary rovers |
Filter Flow | visual | Querying Language and Interface for Spatial Databases, A |
Filter for | visual | Tracking Based on a Stochastic Model for Driver Behaviour, A |
FIMF score-CAM: Fast score-CAM based on local multi-feature integration for | visual | interpretation of CNNS |
Finding Beans in Burgers: Deep Semantic- | visual | Embedding with Localization |
Finding Fallen Objects Via Asynchronous Audio- | visual | Integration |
Finding It: Weakly-Supervised Reference-Aware | visual | Grounding in Instructional Videos |
Finding Person X: Correlating Names with | visual | Appearances |
Finding textures by textual descriptions, | visual | examples, and relevance feedbacks |
Finding the Best from the Second Bests: Inhibiting Subjective Bias in Evaluation of | visual | Tracking Algorithms |
Fine-Grained and Semantic-Guided | visual | Attention for Image Captioning |
Fine-grained Image Classification and Retrieval by Combining | visual | and Locally Pooled Textual Features |
Fine-grained Image Style Transfer with | visual | Transformers |
Fine-Grained Image-to-Image Transformation Towards | visual | Recognition |
Fine-Grained Motion Representation For Template-Free | visual | Tracking |
Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term | visual | Localization |
Fine-Grained | visual | Attribute Extraction from Fashion Wear |
Fine-Grained | visual | Categorization by Localizing Object Parts With Single Image |
Fine-Grained | visual | Categorization Using Meta-learning Optimization with Sample Selection of Auxiliary Data |
Fine-grained | visual | categorization via multi-stage metric learning |
Fine-Grained | visual | Categorization with 2D-Warping |
Fine-grained | visual | categorization with fine-tuned segmentation |
Fine-Grained | visual | Categorization: A Spatial-Frequency Feature Fusion Perspective |
Fine-Grained | visual | Classification via Internal Ensemble Learning Transformer |
Fine-grained | visual | Classification via Progressive Multi-granularity Training of Jigsaw Patches |
Fine-Grained | visual | Comparison Based on Relative Attribute Quadratic Discriminant Analysis |
Fine-Grained | visual | Comparisons with Local Learning |
Fine-Grained | visual | Entailment |
Fine-Grained | visual | Recognition and Re-Identification |
Fine-Grained | visual | -Textual Representation Learning |
Finetuning Convolutional Neural Networks for | visual | aesthetics |
Fingerspelling Recognition in the Wild With Iterative | visual | Attention |
First Attempt of SAR | visual | -Inertial Odometry, The |
First Comprehensive Dataset with Multiple Distortion Types for | visual | Just-Noticeable Differences, The |
First Stage of a Human | visual | System Simulator: The Retina |
First | visual | Object Tracking Segmentation VOTS2023 Challenge Results, The |
Fisher Kernels on | visual | Vocabularies for Image Categorization |
FishNet: Fish | visual | recognition with one stage multi-task learning |
Fitting-based optimisation for image | visual | salient object detection |
Fixation Prediction and | visual | Priority Maps for Biped Locomotion |
Fixed-rank representation for unsupervised | visual | learning |
FlexFormer: Flexible Transformer for efficient | visual | recognition |
Flexible Automatic | visual | Inspection Based on the Separation of Detection And Analysis |
Flexible three-dimensional modeling of plants using low- resolution cameras and | visual | odometry |
Flexible | visual | Cryptography Scheme without Distortion |
Flexible | visual | Recognition by Evidential Modeling of Confusion and Ignorance |
Flickr Distance: A Relationship Measure for | visual | Concepts |
Flip | visual | cryptography (FVC) with perfect security, conditionally-optimal contrast, and no expansion |
FLIPDIAL: A Generative Model for Two-Way | visual | Dialogue |
Flow Feedback Traffic Prediction Based on | visual | Quantified Features, A |
Flow Guided Siamese Network for | visual | Tracking |
Flow-guided One-shot Talking Face Generation with a High-resolution Audio- | visual | Dataset |
Flowdometry: An Optical Flow and Deep Learning Based Approach to | visual | Odometry |
FMRI-based perceptual validation of a computational model for | visual | and auditory saliency in videos |
Focal | visual | -Text Attention for Memex Question Answering |
Focal | visual | -Text Attention for Visual Question Answering |
Focal | visual | -Text Attention for Visual Question Answering |
Focus-Aspect-Value model for predicting subjective | visual | attributes, The |
Focused augmented mirror based on human | visual | perception |
Focused Volumetric | visual | Hull with Color Extraction |
Focusing Attention on | visual | Features that Matter |
Focusing | visual | Relation Detection on Relevant Relations with Prior Potentials |
FocusTune: Tuning | visual | Localization through Focus-Guided Sampling |
Folksonomy-Based | visual | Ontology Construction and Its Applications |
Font finder: | visual | recognition of typeface in printed documents |
Force feedback and | visual | constraint for drawing on a terrain: Path type, view complexity, and pseudohaptic effect |
Foreground-Background Distribution Modeling Transformer for | visual | Object Tracking |
ForeSI: Success-Aware | visual | Navigation Agent |
Forest Fire Monitoring Method Based on UAV | visual | and Infrared Image Fusion |
Formalization and Implementation of Topological | visual | Navigation in two Dimensions, A |
Formulation of Radiometric Feasibility Measures for Feature Selection and Planning in | visual | Servoing |
Forward and Backward | visual | Fusion Approach to Motion Estimation with High Robustness and Low Cost |
Forward-Backward Mean-Shift for | visual | Tracking With Local-Background-Weighted Histogram |
Found a Reason for me? Weakly-supervised Grounded | visual | Question Answering using Capsules |
Fourier Descriptors Based on the Structure of the Human Primary | visual | Cortex with Applications to Object Recognition |
Foveated Analysis and Selection of | visual | Fixations in Natural Scenes |
Foveated Model Observers for | visual | Search in 3D Medical Images |
Foveated | visual | Search for Corners |
FPGA based real-time | visual | servoing |
FPGA implementation of Naive Bayes classifier for | visual | object recognition |
Fractal image compression using | visual | -based particle swarm optimization |
Fractal-Based Clustering Approach in Large | visual | Database-Systems, A |
Frame-Based System for Modelling and Executing | visual | Tasks, A |
framework for estimating geometric distortions in video copies based on | visual | -audio fingerprints, A |
Framework For Evaluating | visual | SLAM, A |
Framework for Fast and Robust | visual | Odometry, A |
Framework for Forming Middle Distance Routes Based on Spatial Guidelines, Perceived Accessibility and | visual | Cues in Smart City, A |
Framework for Quick and Accurate Access of Interesting | visual | Events in Surveillance Videos, A |
Framework for Recognizing Multi-Agent Action from | visual | Evidence, A |
Framework for Spatiotemporal Control in the Tracking of | visual | Contours, A |
Framework for Uncertainty Reasoning in Hierarchical | visual | Evidence Space, A |
Framework for | visual | Analytics of Spatio-Temporal Sensor Observations from Data Streams, A |
Framework for | visual | and Haptic Collaboration in Shared Virtual Spaces, A |
Framework for | visual | Motion Understanding, A |
framework for | visual | saliency detection with applications to image thumbnailing, A |
Framework for | visual | Servoing, A |
framework for | visual | -context-aware object detection in still images, A |
Framework of Randomized Distribution Features for | visual | Representation and Categorization |
Free Space Detection from Catadioptric Omnidirectional Images for | visual | Navigation using Optical Flow |
Free-form Description Guided 3D | visual | Graph Network for Object Grounding in Point Cloud |
Frequency Domain | visual | Servoing Using Planar Contours |
Fried Binary Embedding for High-Dimensional | visual | Features |
Fried Binary Embedding: From High-Dimensional | visual | Features to High-Dimensional Binary Codes |
Friendly progressive random-grid-based | visual | secret sharing with adaptive contrast |
Friendly progressive | visual | secret sharing |
From Bikers to Surfers: | visual | Recognition of Urban Tribes |
From captions to | visual | concepts and back |
From cross-country autonomous navigation to intelligent deep space communications: | visual | sensor processing at JPL |
From dictionary of | visual | words to subspaces: Locality-constrained affine subspace coding |
From Discs to Parts of | visual | Form |
From generic to specific deep representations for | visual | recognition |
From Images to Surfaces: A Computational Study of the Human Early | visual | System |
From Images to Textual Prompts: Zero-shot | visual | Question Answering with Frozen Large Language Models |
From known to the unknown: Transferring knowledge to answer questions about novel | visual | and semantic concepts |
From Motion Patterns to | visual | Concepts for Event Analysis in Dynamic Scenes |
From Node to Graph: Joint Reasoning on | visual | -Semantic Relational Graph for Zero-Shot Detection |
From pixels to gestures: learning | visual | representations for human analysis in color and depth data sequences |
From pixels to sentiment: Fine-tuning CNNs for | visual | sentiment prediction |
From Rasters to Vectors: Extracting | visual | Information from Line Drawings |
From Recognition to Cognition: | visual | Commonsense Reasoning |
From Saliency to Eye Gaze: Embodied | visual | Selection for a Pan-Tilt-Based Robotic Head |
From Same Photo: Cheating on | visual | Kinship Challenges |
From Semantic Categories to Fixations: A Novel Weakly-supervised | visual | -auditory Saliency Detection Approach |
From Simulated to | visual | Data: A Robust Low-Rank Tensor Completion Approach Using L_p-Regression for Outlier Resistance |
From Subcategories to | visual | Composites: A Multi-level Framework for Object Detection |
From Two to One: A New Scene Text Recognizer with | visual | Language Modeling Network |
From Uncertainty to | visual | Exploration |
From | visual | Input to Verbal Output in the Visual Translator |
From | visual | Input to Verbal Output in the Visual Translator |
From | visual | patterns to semantic description: A cognitive approach using artificial curiosity as the foundation |
From | visual | Query to Visual Portrayal |
From | visual | Query to Visual Portrayal |
From | visual | ization to Visual Mining: Application to Environmental Data |
From Zero-Shot Learning to Conventional Supervised Classification: Unseen | visual | Data Synthesis |
FSVO: Semi-direct monocular | visual | odometry using fixed maps |
FT-RCNN: Real-time | visual | Face Tracking with Region-based Convolutional Neural Networks |
Full ranking as local descriptor for | visual | recognition: A comparison of distance metrics on S_n |
Full-Reference Quality Metric Based on Neural Network to Assess the | visual | Quality of Remote Sensing Images |
full-reference stereoscopic image quality assessment index based on stable aggregation of monocular and binocular | visual | features, A |
Full-reference | visual | quality assessment for synthetic images: A subjective study |
Full-view: A | visual | Data-mining Environment |
Fully automatic face recognition system using a combined audio- | visual | approach |
Fully Uncalibrated Image-Based | visual | Servoing of 2DOFs Planar Manipulators With a Fixed Camera |
Function from | visual | Analysis and Physical Interaction: A Methodology for Recognition of Generic Classes of Objects |
Function from | visual | Analysis and Physical Interaction: A Methodology for Recognition of Generic Classes of Objects |
Functional architecture of macaque monkey | visual | cortex |
Fundamental | visual | Concept Learning From Correlated Images and Text |
Fundamental | visual | features for aesthetic classification of photographs across datasets |
Funnel Activation for | visual | Recognition |
FUNNRAR: Hybrid rarity/learning | visual | saliency |
Further Constraints on | visual | Articulated Motions |
Fused One-vs-All Features With Semantic Alignments for Fine-Grained | visual | Categorization |
Fusing Audio and | visual | Features of Speech |
Fusing Audio- | visual | Nonverbal Cues to Detect Dominant People in Group Conversations |
Fusing concept detection and geo context for | visual | search |
Fusing Crowd Density Maps and | visual | Object Trackers for People Tracking in Crowd Scenes |
Fusing generic objectness and | visual | saliency for salient object detection |
Fusing integrated | visual | vocabularies-based bag of visual words and weighted colour moments on spatial pyramid layout for natural scene image classification |
Fusing integrated | visual | vocabularies-based bag of visual words and weighted colour moments on spatial pyramid layout for natural scene image classification |
Fusing Keyword Search and | visual | Exploration for Untagged Videos |
Fusing Multiple Independent Estimates via Spectral Clustering for Robust | visual | Tracking |
Fusing Omnidirectional | visual | Data for Probability Matching Prediction |
Fusing Semantics and Motion State Detection for Robust | visual | SLAM |
Fusing target information from multiple views for robust | visual | tracking |
Fusing | visual | and Behavioral Cues for Modeling User Experience in Games |
Fusing | visual | and Inertial Sensors with Semantics for 3D Human Pose Estimation |
Fusing | visual | and range imaging for object class recognition |
Fusing | visual | Features and Metadata to Detect Flooding in Flickr Images |
Fusing | visual | Saliency for Material Recognition |
Fusion Algorithm for Infrared- | visual | Image Sequences |
Fusion in Computer Vision: Understanding Complex | visual | Content |
Fusion Learning using Semantics and Graph Convolutional Network for | visual | Food Recognition |
Fusion of audio and | visual | cues for laughter detection |
Fusion of Audio- and | visual | Cues for Real-Life Emotional Human Robot Interaction |
Fusion of Audio- | visual | Information for Integrated Speech Processing |
Fusion of classifier predictions for audio- | visual | emotion recognition |
Fusion of Individual Tree Detection and | visual | Interpretation in Assessment of Forest Variables from Laser Point Clouds, The |
Fusion of Inertial and | visual | Measurements for RGB-D SLAM on Mobile Devices |
Fusion of laser and | visual | data for robot motion planning and collision avoidance |
Fusion of Magnetic and | visual | Sensors for Indoor Localization: Infrastructure-Free and More Effective |
Fusion of Multiple | visual | Cues for Visual Saliency Extraction from Wearable Camera Settings with Strong Motion |
Fusion of Multiple | visual | Cues for Visual Saliency Extraction from Wearable Camera Settings with Strong Motion |
Fusion of range and | visual | data for the extraction of scene structure information |
Fusion of Template Matching and Foreground Detection for Robust | visual | Tracking |
Fusion of | visual | and Anamnestic Data for the Classification of Skin Lesions with Deep Learning |
Fusion of | visual | and infra-red face scores by weighted power series |
Fusion of | visual | and Range Images for Object Extraction |
Fusion of | visual | and thermal images using complex extension of EMD |
Fusion of | visual | and Thermal Signatures with Eyeglass Removal for Robust Face Recognition |
Fusion of | visual | and Ultrasonic Information for Environmental Modelling |
Fusion of | visual | attention cues by machine learning |
Fusion of | visual | cues of intensity and texture in Markov random fields image segmentation |
Fusion of | visual | salience maps for object acquisition |
Future-Viewer | visual | Environment for Semantic Characterization of Video Sequences, The |
Fuzzy chamfer distance and its probabilistic formulation for | visual | tracking |
fuzzy inference approach to template-based | visual | tracking, A |
Fuzzy Logic | visual | Network (FLVN): A Neuro-symbolic Approach for Visual Features Matching |
Fuzzy Logic | visual | Network (FLVN): A Neuro-symbolic Approach for Visual Features Matching |
Fuzzy VA-Files for multi-label image annotation based on | visual | content of regions |
fuzzy-controlled Kalman filter applied to stereo- | visual | tracking schemes, A |
FVP: Fourier | visual | Prompting for Source-Free Unsupervised Domain Adaptation of Medical Image Segmentation |
G-VIDO: A Vehicle Dynamics and Intermittent GNSS-Aided | visual | -Inertial State Estimator for Autonomous Driving |
G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to | visual | Recognition |
Game-theoretic solutions through intelligent optimization for efficient resource management in wireless | visual | sensor networks |
Game-theoretical occlusion handling for multi-target | visual | tracking |
GAME-theory-based cross-layer optimization for wireless DS-CDMA | visual | sensor networks |
GAN-Supervised Dense | visual | Alignment |
GANalyze: Toward | visual | Definitions of Cognitive Image Properties |
GAPNet: Generic-Attribute-Pose Network For Fine-Grained | visual | Categorization Using Multi-Attribute Attention Module |
GasHis-Transformer: A multi-scale | visual | transformer approach for gastric histopathological image detection |
Gated Channel Transformation for | visual | Recognition |
Gated CNN for | visual | quality assessment based on color perception |
Gated Cross Word- | visual | Attention-driven Generative Adversarial Networks for Text-to-image Synthesis |
Gated Recurrent Capsules for | visual | Word Embeddings |
Gaussian Mixture Model on Tensor Field for | visual | Tracking |
Gaussian Mixture Trees for One Class Classification in Automated | visual | Inspection |
Gaze Selection for Enhanced | visual | Odometry During Navigation |
Gaze Selection for | visual | Search |
Gaze-Based Driver Distraction Warning System and Its Effect on | visual | Behavior, A |
Gender and ethnicity recognition based on | visual | attention-driven deep architectures |
Gender Artifacts in | visual | Datasets |
GenealogyVis: A System for | visual | Analysis of Multidimensional Genealogical Data |
General Dynamic Knowledge Distillation Method for | visual | Analytics, A |
General End-to-End Method for Characterizing Neuropsychiatric Disorders using Free-Viewing | visual | Scanning Tasks, A |
General Facial Representation Learning in a | visual | -Linguistic Manner |
General Framework for Combining | visual | Trackers: The Black Boxes Approach, A |
general framework for trajectory data warehousing and | visual | OLAP, A |
General Spatial Reasoning and Geometric Reasoning Issues, | visual | Relations |
Generalized Gaussian mixture models as a nonparametric Bayesian approach for clustering using class-specific | visual | features |
Generalized Hadamard-Product Fusion Operators for | visual | Question Answering |
Generalized Kernel-Based | visual | Tracking |
Generalized Laplacian Distance and Its Applications for | visual | Matching, The |
Generalized Method to Extract | visual | Time-Sharing Sequences From Naturalistic Driving Data, A |
Generalized Multi-View Embedding for | visual | Recognition and Cross-Modal Retrieval |
Generalized Search Method for Multiple Competing Hypotheses in | visual | Tracking, A |
Generalized Separable Kernel Family with Compact Support, to Improve | visual | Data Quality of Gray Level Images, in Scale Space, A |
Generalized Tensor Total Variation minimization for | visual | data recovery? |
Generalized | visual | Information Analysis Via Tensorial Algebra |
Generalized Zero-Shot Space Target Recognition Based on Global-Local | visual | Feature Embedding Network |
Generalizing to the Open World: Deep | visual | Odometry with Online Adaptation |
Generating 3D | visual | Expression Using Semantic Simplification Description Based on Logical Representation |
Generating and Generalizing Models of | visual | Objects |
Generating Descriptive | visual | Words and Visual Phrases for Large-Scale Image Applications |
Generating Descriptive | visual | Words and Visual Phrases for Large-Scale Image Applications |
Generating Diverse and Descriptive Image Captions Using | visual | Paraphrases |
Generating Knowledge-Enriched Image Annotations for Fine-Grained | visual | Classification |
Generating random grid-based | visual | secret sharing with multi-level encoding |
Generating Reliable Online Adaptive Templates for | visual | Tracking |
Generating Semantic | visual | Templates for Video Databases |
Generating | visual | Explanations |
Generating | visual | Representations for Zero-Shot Classification |
Generating | visual | Scenes from Touch |
Generating | visual | Sensing Strategies in Assembly Tasks |
Generating | visual | story graphs with application to photo album summarization |
Generating | visual | Summaries of Geographic Areas Using Community-Contributed Images |
Generative Bias for Robust | visual | Question Answering |
Generative learning of | visual | concepts using multiobjective genetic programming |
Generative | visual | Manipulation on the Natural Image Manifold |
Generic compact representation through | visual | -semantic ambiguity removal |
Generic Viewpoint Assumption in a Framework for | visual | Perception, The |
Generic | visual | Categorization Using Weak Geometry |
Genetic Programming with Local Improvement for | visual | Learning from Examples |
Genre linked automated assessment and feedback of photographs based on | visual | aesthetics |
Genre-specific modeling of | visual | features for efficient content based video shot classification and retrieval |
Geo-Distinctive | visual | Element Matching for Location Estimation of Images |
Geo-spatial active | visual | surveillance on wireless networks |
Geo- | visual | Analytics Approach To Biological Shepherding: Modelling Animal Movements And Impacts, A |
GeoBot: A High Level | visual | Perception Architecture for Autonomous Robots |
GeoLayoutLM: Geometric Pre-training for | visual | Information Extraction |
Geological Heritage and Conservation: A Case Study of the | visual | Axis Through Digital Terrain Modeling |
Geometric Aspects of | visual | Object Recognition |
Geometric Bargaining Approach for Optimizing Resource Allocation in Wireless | visual | Sensor Networks |
Geometric Consistency for Self-Supervised End-to-End | visual | Odometry |
Geometric Hypergraph Learning for | visual | Tracking |
Geometric Invariant for | visual | Recognition and 3D Reconstruction from Two Perspective/Orthographic Views, A |
Geometric Invariants Construction for Semantic Scene Understanding From Multiple Views Inspired by the Human | visual | System |
Geometric Particle Filter for Template-Based | visual | Tracking, A |
Geometric particle swarm optimization for robust | visual | ego-motion estimation via particle filtering |
Geometric | visual | Similarity Learning in 3D Medical Image Self-Supervised Pre-training |
Geometric-Based Error Concealment for Concealing Transmission Errors and Improving | visual | Quality |
Geometric- | visual | descriptor for improved image based localization |
Geometry and Photometry in 3D | visual | Recognition |
Geometry and Statistics of | visual | Space-Time |
geometry of a scene: On deep semantics for | visual | perception driven cognitive film, studies, The |
Geometry of Distorted | visual | Space and Cremona Transformation |
geometry of | visual | interception, The |
Geometry of | visual | Perception: Retinotopic and Nonretinotopic Representations in the Human Visual System, The |
Geometry of | visual | Perception: Retinotopic and Nonretinotopic Representations in the Human Visual System, The |
Geometry of | visual | Space |
Geometry of | visual | Space: About the Incompatibility Between Science and Mathematics, The |
Geometry of | visual | Space: About the Incompatibility Between Science and Mathematics: Reply, The |
Geometry of | visual | Space: About the Incompatibility Between Science and Mathematics: Reply, The |
Geometry-Aware Similarity Learning on SPD Manifolds for | visual | Recognition |
Geometry-based ranking for mobile 3D | visual | search using hierarchically structured multi-view features |
Geometry-Constrained Scale Estimation for Monocular | visual | Odometry |
GeoSpatial | visual | Analytics |
GeoVLN: Learning Geometry-Enhanced | visual | Representation with Slot Attention for Vision-and-Language Navigation |
Gestural interface to a | visual | computing environment for molecular biologists |
Gesture Localization and Recognition Using Probabilistic | visual | Learning |
GFNet: Global Filter Networks for | visual | Recognition |
GIScience and Historical | visual | Sources: A Promising Look at Past Scenarios and Sceneries |
GisGCN: A | visual | Graph-Based Framework to Match Geographical Areas through Time |
Glance and Focus Networks for Dynamic | visual | Recognition |
Glaucoma Precognition: Recognizing Preclinical | visual | Functional Signs of Glaucoma |
GLAVNet: Global-Local Audio- | visual | Cues for Fine-Grained Material Recognition |
Glimpse-Attend-and-Explore: Self-Attention for Active | visual | Exploration |
Glitch in the matrix: A large scale benchmark for content driven audio- | visual | forgery detection and localization |
Global Affective Video Content Regression Based on Complementary Audio- | visual | Features |
Global and Local Mixture Consistency Cumulative Learning for Long-tailed | visual | Recognitions |
Global Context Extraction for Object Recognition Using a Combination of Range and | visual | Features |
global image fidelity metric: | visual | distance and its properties, A |
Global Performance Evaluation of Image Features for | visual | Servo Control |
Global Processing of | visual | Stimuli in a Neural Network of Coupled Oscillators |
Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using | visual | Context |
Global trajectory reconstruction from distributed | visual | sensors |
Global | visual | saliency: Geometric and colorimetrie saliency fusion and its applications for 3D colored meshes |
Global-Local Self-Distillation for | visual | Representation Learning |
Global-Scale Location Prediction for Social Images Using Geo- | visual | Ranking |
Global-to-local oriented perception on blurry | visual | information |
Globally Consistent Mosaicking for Autonomous | visual | Navigation |
Gloss-free Sign Language Translation: Improving from | visual | -Language Pretraining |
GMC: A general framework of multi-stage context learning and utilization for | visual | detection tasks |
GMFAD: Towards Generalized | visual | Recognition via Multilayer Feature Alignment and Disentanglement |
Goal detection in soccer video using audio/ | visual | keywords |
Goal-oriented top-down probabilistic | visual | attention model for recognition of manipulated objects in egocentric videos |
Goal-Oriented | visual | Question Generation via Intermediate Rewards |
Going Beyond Real Data: A Robust | visual | Representation for Vehicle Re-identification |
Good Continuation of General 2D | visual | Features: Dual Harmonic Models and Computational Inference |
Good Features to Correlate for | visual | Tracking |
Good features to track for | visual | SLAM |
Good, Better, Best: Textual Distractors Generation for Multiple-Choice | visual | Question Answering via Reinforcement Learning |
GPSlam: Marrying Sparse Geometric and Dense Probabilistic | visual | Mapping |
GPU-based parallel construction of compact | visual | hull meshes |
GQA: A New Dataset for Real-World | visual | Reasoning and Compositional Question Answering |
GraB: | visual | Saliency via Novel Graph Model and Background Priors |
Grad-Cam Aware Supervised Attention for | visual | Question Answering for Post-Disaster Damage Assessment |
Grad-CAM++: Generalized Gradient-Based | visual | Explanations for Deep Convolutional Networks |
Grad-CAM: | visual | Explanations from Deep Networks via Gradient-Based Localization |
GradNet: Gradient-Guided Network for | visual | Object Tracking |
Gradual Chroma Reduction and High-Level | visual | Masking in Videos |
Granularity and elasticity adaptation in | visual | tracking |
Grapes | visual | Segmentation for Harvesting Robots Using Local Texture Descriptors |
Graph Convolution Based Efficient Re-Ranking for | visual | Retrieval |
Graph Discovery for | visual | Test Generation |
Graph Neural Networks Based Multi-granularity Feature Representation Learning for Fine-Grained | visual | Categorization |
Graph Regularized and Locality-Constrained Coding for Robust | visual | Tracking |
Graph Representation for Order-aware | visual | Transformation |
Graph-Based Object Semantic Refinement for | visual | Emotion Recognition |
Graph-based transductive learning for robust | visual | tracking |
Graph-Based | visual | Analytic Tools for Parallel Coordinates |
Graph-Based | visual | -Semantic Entanglement Network for Zero-Shot Image Recognition |
Graph-Cut Based Background Subtraction Using | visual | Hull in Multiveiw Images |
Graph-Structured Representations for | visual | Question Answering |
GraphMapper: Efficient | visual | Navigation by Scene Graph Generation |
Grasping | visual | Symmetry |
Grassmann manifold online learning and partial occlusion handling for | visual | object tracking under Bayesian formulation |
Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained | visual | Classification |
Greedy Gradient Ensemble for Robust | visual | Question Answering |
GreenSea: | visual | Soccer Analysis Using Broad Learning System |
GREFIT: | visual | Recognition of Hand Postures |
GRIT: Faster and Better Image Captioning Transformer Using Dual | visual | Features |
Ground-based cloud image categorization using deep convolutional | visual | features |
Ground-Moving-Platform-Based Human Tracking Using | visual | SLAM and Constrained Multiple Kernels |
Ground-plane classification for robot navigation: Combining multiple cues toward a | visual | -based learning system |
Ground-Plane-Based Absolute Scale Estimation for Monocular | visual | Odometry |
Grounding Answers for | visual | Questions Asked by Visually Impaired People |
Grounding Consistency: Distilling Spatial Common Sense for Precise | visual | Relationship Detection |
Grounding | visual | Explanations |
Grounding | visual | Representations with Texts for Domain Generalization |
Group-wise Contrastive Bottleneck for Weakly-Supervised | visual | Representation Learning |
Growing semantically meaningful models for | visual | SLAM |
GSAP: A Global Structure Attention Pooling Method for Graph-Based | visual | Place Recognition |
Guessing State Tracking for | visual | Dialogue |
GuessWhat?! | visual | Object Discovery through Multi-modal Dialogue |
GuessWhich? | visual | dialog with attentive memory network |
Guest Editorial Introduction to the Special Issue on Large-Scale | visual | Sensor Networks: Architectures and Applications |
Guest Editorial Introduction to the Special Issue on | visual | Analysis for ITS |
Guest Editorial Introduction to the Special Section on Intelligent | visual | Content Analysis and Understanding |
Guest Editorial Introduction to the Special Section on Representation Learning for | visual | Content Understanding |
Guest Editorial Special Issue on | visual | Computing in the Cloud: Mobile Computing |
Guest Editorial: Introduction to the Special Section on Fine-Grained | visual | Categorization |
Guest Editorial: Large Scale | visual | Media Geo-Localization |
Guest Editorial: Special Issue on Real-Time | visual | Monitoring and Inspection |
Guest Editorial: | visual | Analytics in Multimedia: Opportunities and Research Challenges |
Guest Editorial: | visual | Domain Adaptation and Generalisation |
Guest Editors' Introduction: Special issue on deep learning with applications to | visual | representation and analysis |
GUID-based mobile | visual | communication using NDN mechanism |
Guidance: A | visual | sensing platform for robotic applications |
Guided by Meta-Set: A Data-Driven Method for Fine-Grained | visual | Recognition |
Guided CNN for generalized zero-shot and open-set recognition using | visual | and semantic prototypes |
Guided Feature Selection for Deep | visual | Odometry |
Guided Importance Sampling Based Particle Filtering for | visual | Tracking |
Guiding a Bottom-Up | visual | Attention Mechanism to Locate Specific Image Regions Using a Distributed Genetic Optimization |
Guiding a robot by | visual | feedback in assembling tasks |
Guiding Early | visual | Processing with a Scale-Space Primal Sketch |
Guiding the focus of attention of blind people with | visual | saliency |
Guiding | visual | Question Answering with Attention Priors |
Guiding | visual | Surveillance by Tracking Human Attention |
H2O: A Benchmark for | visual | Human-human Object Handover Analysis |
HAck: A system for the recognition of human actions by kernels of | visual | strings |
HAIR: Hierarchical | visual | -Semantic Relational Reasoning for Video Question Answering |
Halftone | visual | Cryptography |
Halftone | visual | Cryptography Through Error Diffusion |
Halftone | visual | Cryptography with Complementary Cover Images |
Hallucinating | visual | Instances in Total Absentia |
Hallucination In Object Detection: A Study In | visual | Part Verification |
HammerDrive: A Task-Aware Driving | visual | Attention Model |
Hand Gesture Recognition Based on the Fusion of | visual | and Touch Sensing Data |
Hand-Object Contact Force Estimation from Markerless | visual | Tracking |
Handbook of | visual | Display Technology |
Handcrafted and Deep Trackers: Recent | visual | Object Tracking Approaches and Trends |
Handling Tradeoffs Between Precision and Robustness with Incremental Focus of Attention for | visual | Tracking |
Handling Uncertain Tags in | visual | Recognition |
Handwriting Recognition System Based on | visual | Input, A |
Handwritten and Machine Printed Text Separation in Document Images Using the Bag of | visual | Words Paradigm |
Handwritten Text Generation from | visual | Archetypes |
Haptic Texture Rendering Based on | visual | Texture Information: A Study to Achieve Realistic Haptic Texture Rendering |
Hard negative mining for correlation filters in | visual | tracking |
Hardware for | visual | Image Processing |
Hardware-efficient debanding and | visual | enhancement filter for inverse tone mapped high dynamic range images and videos |
Harmonic Computational Geometry: A new tool for | visual | correspondence |
Harmony Filter: A robust | visual | tracking system using the improved harmony search algorithm |
Harnessing high-level concepts, | visual | , and auditory features for violence detection in videos |
Harvesting Mid-level | visual | Concepts from Large-Scale Internet Images |
Hash-SVM: Scalable Kernel Machines for Large-Scale | visual | Classification |
HCI for Elderly, Measuring | visual | Complexity of Webpages Based on Machine Learning |
HCIL: Hierarchical Class Incremental Learning for Longline Fishing | visual | Monitoring |
HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale | visual | Recognition |
HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale | visual | Recognition |
Head and gaze dynamics in | visual | attention and context learning |
Head-Size Equalization for Improved | visual | Perception in Video Conferencing |
Headprint: Person Reacquisition Using | visual | Features of Hair in Overhead Surveillance Video |
Hear The Flow: Optical Flow-Based Self-Supervised | visual | Sound Source Localization |
Heavy-Tailed Model for | visual | Tracking via Robust Subspace Learning |
Hedging Deep Features for | visual | Tracking |
Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale | visual | recognition |
Hero: A Multi-modal Approach on Mobile Devices for | visual | -aware Conversational Assistance in Industrial Domains |
Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact | visual | Representations |
Heterogeneous feature machines for | visual | recognition |
Heterogeneous Knowledge Network for | visual | Dialog |
Heterogeneous | visual | Codebook Integration Via Consensus Clustering for Visual Categorization |
Heterogeneous | visual | Codebook Integration Via Consensus Clustering for Visual Categorization |
Heterogeneous | visual | Features Fusion via Sparse Multimodal Machine |
Heuristic algorithm for | visual | tracking of deformable objects |
Hidden Conditional Random Fields for | visual | Speech Recognition |
Hiding | visual | Information via Obfuscating Adversarial Perturbations |
Hierarchical Analysis of | visual | Motion |
Hierarchical attention vision transformer for fine-grained | visual | classification |
Hierarchical audio- | visual | cue integration framework for activity analysis in intelligent meeting rooms |
Hierarchical Audio- | visual | Surveillance for Passenger Elevators |
Hierarchical Bilinear Pooling for Fine-Grained | visual | Recognition |
Hierarchical Category Detector for Clothing Recognition from | visual | Data |
Hierarchical Cellular Automata for | visual | Saliency |
Hierarchical classification using a frequency-based weighting and simple | visual | features |
Hierarchical CNN-based real-time fatigue detection system by | visual | -based technologies using MSP model |
Hierarchical Combination of Semantic | visual | Words for Image Classification and Clustering |
Hierarchical convolutional features for end-to-end representation-based | visual | tracking |
Hierarchical Convolutional Features for | visual | Tracking |
Hierarchical convolutional features for | visual | tracking via two combined color spaces with SVM classifier |
Hierarchical Data-Driven Predictive Control of Image-Based | visual | Servoing Systems With Unknown Dynamics, A |
Hierarchical Discriminative Learning Improves | visual | Representations of Biomedical Microscopy |
Hierarchical Dynamic Masks for | visual | Explanation of Neural Networks |
Hierarchical estimation for adaptive | visual | tracking |
Hierarchical Feature Embedding for | visual | Tracking |
Hierarchical Feature Fusion for | visual | Tracking |
hierarchical feature fusion framework for adaptive | visual | tracking, A |
Hierarchical feedback algorithm based on | visual | community discovery for interactive video retrieval |
Hierarchical Graph Attention Network for Few-shot | visual | -Semantic Learning |
Hierarchical Graph Attention Network for | visual | Relationship Detection |
Hierarchical Grocery Store Image Dataset With | visual | and Semantic Labels, A |
Hierarchical Joint Max-Margin Learning of Mid and Top Level Representations for | visual | Recognition |
Hierarchical LSTMs with Adaptive Attention for | visual | Captioning |
Hierarchical Memory Decoder for | visual | Narrating |
Hierarchical Model Compression via Shape-Edge Representation of Feature Maps: An Enlightenment From the Primate | visual | System |
Hierarchical Model For Long-Length Video Summarization With Adversarially Enhanced Audio/ | visual | Features |
Hierarchical Model Predictive Image-Based | visual | Servoing of Underwater Vehicles With Adaptive Neural Network Dynamic Control |
Hierarchical Multimodal LSTM for Dense | visual | -Semantic Embedding |
Hierarchical Navigation and | visual | Search for Video Keyframe Retrieval |
Hierarchical Novelty Detection for | visual | Object Recognition |
Hierarchical Object Representations for | visual | Recognition via Weakly Supervised Learning |
Hierarchical organization for medical video summarization using latent | visual | and semantic analysis |
Hierarchical Part Matching for Fine-Grained | visual | Categorization |
Hierarchical Part-Based | visual | Object Categorization |
Hierarchical Scene Coordinate Classification and Regression for | visual | Localization |
Hierarchical Selectivity for Object-Based | visual | Attention |
Hierarchical Semantic Hashing: | visual | Localization from Buildings on Maps |
Hierarchical sparse coding with geometric prior for | visual | geo-location |
Hierarchical Spatiotemporal Graph Regularized Discriminative Correlation Filter for | visual | Object Tracking |
Hierarchical System Integration Approach with Application to | visual | Scene Exploration for Driver Assistance, A |
Hierarchical Variational Autoencoders for | visual | Counterfactuals |
Hierarchical | visual | Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection |
Hierarchical | visual | comfort assessment for stereoscopic image retargeting |
Hierarchical | visual | Content Modelling and Query based on Trees |
Hierarchical | visual | description Schemes for Still Images and Video Sequences |
Hierarchical | visual | Feature-Based Approach For Image Sonification, A |
Hierarchical | visual | Model for Video Object Summarization, A |
Hierarchical | visual | pattern image coding |
Hierarchical | visual | Primitive Experts for Compositional Zero-Shot Learning |
Hierarchical | visual | Saliency Model for Character Detection in Natural Scenes, A |
Hierarchical | visual | thesaurus building for satellite image retrieval based on semantic region labelling |
Hierarchical | visual | -textual Graph for Temporal Activity Localization via Language |
Hierarchical | visual | -Textual Knowledge Distillation for Life-Long Correlation Learning |
Hierarchy of | visual | features for object recognition |
High capacity multi-scale image sharing scheme by combining | visual | cryptography with data hiding |
High Confidence | visual | Recognition of Persons by a Test of Statistical Independence |
High discriminative SIFT feature and feature pair selection to improve the bag of | visual | words model |
High Dynamic Range Image and Video Compression: Fidelity Matching Human | visual | Performance |
High performance computing for industrial | visual | inspection |
High Performance | visual | Tracking With Siamese Actor-Critic Network |
High Performance | visual | Tracking with Siamese Region Proposal Network |
High quality, low delay foveated | visual | communications over mobile channels |
High resolution | visual | terrain classification for outdoor robots |
High Spatial Resolution | visual | Band Imagery Outperforms Medium Resolution Spectral Imagery for Ecosystem Assessment in the Semi-Arid Brazilian Sertao |
High Speed | visual | Saliency Computation on GPU |
High | visual | Quality Color Image Reversible Data Hiding Scheme Based on B-R-G Embedding Principle and CIEDE2000 Assessment Metric, A |
High-fidelity recording, compression, and replay of | visual | -haptic telepresence sessions |
High-Level and Generic Models for | visual | Search: When Does High Level Knowledge Help? |
High-level event detection system based on discriminant | visual | concepts |
High-Level Vision: Object Recognition and | visual | Cognition |
High-Level | visual | Masking of Image Compression Artefacts |
High-Order Topology Modeling of | visual | Words for Image Classification |
High-performance compression of | visual | information: A tutorial review. I. Still pictures |
High-precision localization using | visual | landmarks fused with range data |
High-Quality Image Captioning With Fine-Grained and Semantic-Guided | visual | Attention |
High-Quality | visual | Experience: Creation, Processing and Interactivity of High-Resolution and High-Dimensional Video Signals |
High-resolution rectified gradient-based | visual | explanations for weakly supervised segmentation |
High-Speed | visual | Tracking of the Nearest Point of an Object Using 1,000-fps Adaptive Pattern Projection |
Higher rank Support Tensor Machines for | visual | recognition |
Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained | visual | Categorization |
Higher-Order Occurrence Pooling for Bags-of-Words: | visual | Concept Detection |
Histogram of Oriented Cameras: A New Descriptor for | visual | SLAM in Dynamic Environments |
Histograms of Optical Flow Orientation for | visual | Abnormal Events Detection |
HIVE: Evaluating the Human Interpretability of | visual | Explanations |
HMM-based temporal difference learning with model-updating capability for | visual | tracking of human communicational behaviors, An |
Holistic Context Models for | visual | Recognition |
Holons | visual | Representation for Image Retrieval |
Home Video | visual | Quality Assessment With Spatiotemporal Factors |
Homogeneous Framework for | visual | Recognition, A |
Homography-based | visual | servo regulation of mobile robots |
Hot spot detection based on feature space representation of | visual | search |
Hough transform-based mouth localization for audio- | visual | speech recognition |
HoughNet: Integrating Near and Long-Range Evidence for | visual | Detection |
How close are we to solving the problem of automated | visual | surveillance?: A review of real-world surveillance, scientific progress and evaluative mechanisms |
How Does Image Content Affect the Added Value of | visual | Attention in Objective Image Quality Assessment? |
How Effectively can Indoor Wireless Positioning Relieve | visual | Tracking Pains: A Cramer-Rao Bound Viewpoi |
How Hard Can It Be? Estimating the Difficulty of | visual | Search in an Image |
How human | visual | systems recognize objects: A Novel Computational Model |
How Many | visual | Concepts? |
How Sound Affects | visual | Attention in Omnidirectional Videos |
How to Design a Three-Stage Architecture for Audio- | visual | Active Speaker Detection in the Wild |
How to Learn the Effect of Non-Uniform Distortion on Perceived | visual | Quality? Case Study Using Convolutional Sparse Coding for Quality Assessment of Synthesized Views |
How to reach top accuracy for a | visual | pedestrian warning system from a car? |
How Useful Is Photo-Realistic Rendering for | visual | Learning? |
How Useful Is Self-Supervised Pretraining for | visual | Tasks? |
How | visual | discomfort affects 3DTV viewers' emotional arousal |
How | visual | salience wins the battle for awareness |
HP-VCS: A high-quality and printer-friendly | visual | cryptography scheme |
HTML page analysis based on | visual | cues |
Human Action Classification Using N-Grams | visual | Vocabulary |
Human action recognition using Histographic methods and hidden Markov models for | visual | martial arts applications |
Human Attention in | visual | Question Answering: Do Humans and Deep Networks Look at the Same Regions? |
Human Carrying Status in | visual | Surveillance |
Human Detection in Indoor Environments Using Multiple | visual | Cues and a Mobile Robot |
Human detection using a mobile platform and novel features derived from a | visual | saliency mechanism |
Human emotional state recognition using real 3D | visual | features from Gabor library |
Human Face Detection in | visual | Scenes |
Human interaction categorization by using audio- | visual | cues |
Human Interaction Recognition Based on the Co-occurrence of | visual | Words |
Human Memory Update Strategy: A Multi-Layer Template Update Mechanism for Remote | visual | Monitoring |
Human Perception of Audio- | visual | Synthetic Character Emotion Expression in the Presence of Ambiguous and Conflicting Information |
Human Perception, Human | visual | System -- General |
Human Pose Regression Through Multiview | visual | Fusion |
Human Smoking Event Detection Using | visual | Interaction Clues |
human texture | visual | field: Fovea-to-periphery pattern recognition, The |
Human tracking & | visual | spatio-temporal statistical analysis |
Human Velocity Control of Admittance-Type Robotic Devices With Scaled | visual | Feedback of Device Motion |
Human | visual | perception and dissimilarity |
Human | visual | system based data embedding method using quadtree partitioning |
Human | visual | System Based Framework for Concealed Weapon Detection |
Human | visual | System Based Mammogram Enhancement and Analysis |
Human | visual | System Based No-Reference Objective Image Sharpness Metric |
Human | visual | system consistent quality assessment for remote sensing image fusion |
Human | visual | System for Complexity Reduction of Image and Video Restoration |
Human | visual | system inspired saliency guided edge preserving tone-mapping for high dynamic range imaging |
Human | visual | System vs Convolution Neural Networks in food recognition task: An empirical comparison |
human | visual | system-based 3D video quality metric, A |
Human | visual | System-Based Fundus Image Quality Assessment of Portable Fundus Camera Photographs |
Human | visual | System-Based Image Enhancement and Logarithmic Contrast Measure |
Human | visual | System-Based Saliency Detection for High Dynamic Range Content |
human | visual | system-driven image segmentation algorithm, A |
Human | visual | Weighted Quantization for Transform/Subband Image-Coding Revisited for Interlaced Pictures |
Human | visual | -System Based Wavelet Decomposition for Image Compression |
Human-Aware Object Placement for | visual | Environment Reconstruction |
Human-Centric Spatio-Temporal Video Grounding With | visual | Transformers |
Human-Centric | visual | Relation Segmentation Using Mask R-CNN and VTransE |
Human-computer interaction based on | visual | hand-gesture recognition using volumetric spatiograms of local binary patterns |
Human-vehicle Cooperative | visual | Perception for Autonomous Driving Under Complex Traffic Environments |
HW/SW co-design of a | visual | SLAM application |
HW/SW Codesign and FPGA Acceleration of | visual | Odometry Algorithms for Rover Navigation on Mars |
Hybrid Approach for 3D Head Reconstruction: Using Neural Networks and | visual | Geometry |
hybrid approach for computing | visual | hulls of complex objects, A |
Hybrid Cascade Filter With Complementary Features for | visual | Tracking |
Hybrid Cascade Structure for License Plate Detection in Large | visual | Surveillance Scenes |
Hybrid CNN-Transformer Features for | visual | Place Recognition |
Hybrid coding of | visual | content and local image features |
Hybrid Generative-Discriminative | visual | Categorization |
Hybrid Integration of | visual | Attention Model into Image Quality Metric |
Hybrid macro-micro | visual | analysis for city-scale state estimation |
Hybrid Mobile | visual | Search System With Compact Global Signatures, A |
hybrid motion and appearance prediction model for robust | visual | object tracking, A |
hybrid multi-modal | visual | data cross fusion network for indoor and outdoor scene segmentation, A |
Hybrid Real-Time | visual | Tracking Using Compressive RGB-D Features, A |
Hybrid Regularization of Diffusion Process for | visual | Re-Ranking |
Hybrid Scene Compression for | visual | Localization |
Hybrid Sparse Subspace Clustering for | visual | Tracking |
Hybrid Structure/Trajectory Constraint for | visual | SLAM, A |
Hybrid Supervised-Unsupervised Vocabulary Generation Algorithm for | visual | Concept Recognition, A |
Hybrid textual- | visual | relevance learning for content-based image retrieval |
Hybrid | visual | and inertial RANSAC for real-time motion estimation |
hybrid | visual | feature extraction method for audio-visual speech recognition, A |
hybrid | visual | feature extraction method for audio-visual speech recognition, A |
Hybrid | visual | Servoing Control for Underwater Vehicle Manipulator Systems With Multiple Cameras |
Hybrid | visual | Transformer for Efficient Deep Human Activity Recognition, A |
Hybrid | visual | -Ranging Servoing for Positioning Based on Image and Measurement Features |
Hybrid-MVS: Robust Multi-View Reconstruction With Hybrid Optimization of | visual | and Depth Cues |
HybVIO: Pushing the Limits of Real-time | visual | -inertial Odometry |
Hyper-Siamese network for robust | visual | tracking |
Hyperbolic Audio- | visual | Zero-shot Learning |
Hyperbolic Contrastive Learning for | visual | Representations beyond Objects |
Hyperbolic | visual | Embedding Learning for Zero-Shot Recognition |
Hyperfeatures: Multilevel Local Coding for | visual | Recognition |
Hypothesis Generation in a Computational Model for | visual | Word Recognition |
Hypothesis Testing in a Computational Theory of | visual | Word Recognition |
I can't believe there's no images!: Learning | visual | Tasks Using Only Language Supervision |
I Saw: A Self-Attention Weighted Method for Explanation of | visual | Transformers |
ICE-BA: Incremental, Consistent and Efficient Bundle Adjustment for | visual | -Inertial SLAM |
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-training for | visual | Recognition |
Iconic Representation of | visual | Data and Models |
Identification and Mapping of Soil Erosion Processes Using the | visual | Interpretation of LiDAR Imagery |
Identification of Expert Tower Controller | visual | Scanning Patterns in Support of the Development of Automated Training Tools |
Identification of story units in audio- | visual | sequences by joint audio and video processing |
Identification of Structurally Damaged Areas in Airborne Oblique Images Using a | visual | -Bag-of-Words Approach |
Identifying and learning | visual | attributes for object recognition |
Identifying deficits of | visual | security metrics for images |
Identifying dominant people in meetings from audio- | visual | sensors |
Identifying Human Behaviors Using Synchronized Audio- | visual | Cues |
Identifying Resonant Poles by | visual | Inspection of Pole-Zero Plots |
Identifying | visual | attributes for object recognition from text and taxonomy |
Identity-Aware Textual- | visual | Matching with Latent Co-attention |
IEEE Interpretation of | visual | Motion Workshop |
Illumination color covariant locale-based | visual | object retrieval |
Illusory Surface Perception Using a Hierarchical Neural Network Model of the | visual | Pathways |
Image Annotation Incorporating Low-Rankness, Tag and | visual | Correlation and Inhomogeneous Errors |
Image based Monument Recognition using Graph based | visual | Saliency |
Image Based | visual | Motion Cue for Autonomous Navigation, An |
Image Based | visual | Servoing: A New Method for the Estimation of the Image Jacobian in Dynamic Environments |
Image Based | visual | Servoing: Estimated Image Jacobian by Using Fundamental Matrix VS Analytic Jacobian |
Image Caption Generation with Hierarchical Contextual | visual | Spatial Attention |
Image Captioning and | visual | Question Answering Based on Attributes and External Knowledge |
Image Captioning Based on | visual | and Semantic Attention |
Image Categorization by Learned Nonlinear Subspace of Combined | visual | -Words and Low-Level Features |
Image categorization through optimum path forest and | visual | words |
Image classification based on bag of | visual | graphs |
Image Classification Using Spatial Pyramid Coding and | visual | Word Reweighting |
Image classification: Classifying distributions of | visual | features |
Image coding and decoding method and apparatus considering human | visual | characteristics |
Image coding with wavelet representations, edge information and | visual | masking |
Image Comparison by Compound Disjoint Information with Applications to Perceptual | visual | Quality Assessment, Image Registration and Tracking |
Image complexity measure based on | visual | attention |
Image Compression Based on | visual | Saliency at Individual Scales |
Image Data and Backbone in Weakly Supervised Fine-Grained | visual | Categorization: A Revisit and Further Thinking, The |
Image Database for Design and Evaluation of | visual | Quality Metrics in Synthetic Scenarios, An |
Image Dynamics-Based | visual | Servoing for Quadrotors Tracking a Target With a Nonlinear Trajectory Observer |
Image Emotion Recognition Using | visual | and Semantic Features Reflecting Emotional and Similar Objects |
Image enhancement based on the statistics of | visual | representation |
Image Enhancement for Reducing LCD Backlight Power Based on Human | visual | Characteristics |
Image Enhancement Using A Human | visual | System Model |
Image Entropy of Primitive and | visual | quality assessment |
Image fusion based on | visual | salient features and the cross-contrast |
Image Gradient Evolution: A | visual | Cue for Collision Avoidance |
Image Hallucination Using Neighbor Embedding over | visual | Primitive Manifolds |
Image Hatching for | visual | Cryptography |
Image Indexing Using Shape Based | visual | Features |
Image Information and | visual | Quality |
image intellectual property protection scheme for gray-level images using | visual | secret sharing strategy, An |
Image Lightness Conversion and Sharpening Taking Account of | visual | Features of Elderly Person |
Image matching with distinctive | visual | vocabulary |
Image Memorability Using Diverse | visual | Features and Soft Attention |
Image modeling using statistical measures for | visual | object categorization |
Image Modification Based on a | visual | Saliency Map for Guiding Visual Attention |
Image Modification Based on a | visual | Saliency Map for Guiding Visual Attention |
Image Modification Based on Spatial Frequency Components for | visual | Attention Retargeting |
Image moment invariants as local features for content based image retrieval using the Bag-of- | visual | -Words model |
Image pattern discovery by using the spatial closeness of | visual | code words |
Image processing for | visual | prostheses: A clinical perspective |
Image Processing in the Context of a | visual | Model |
Image processing system dedicated to a | visual | intra-cortical stimulator |
Image Processing Unit for General-Purpose Representation and Association System for Recognizing Low-Resolution Digits With | visual | Information Variability |
Image Quality Assessment Based on Multi-Order | visual | Comparison |
Image quality assessment based on the | visual | perception of image contents |
Image Quality Assessment by | visual | Gradient Similarity |
Image Quality Assessment Using Human | visual | DOG Model Fused With Random Forest |
Image quality assessment with | visual | attention |
Image Quality Evaluation, Human | visual | System Based, HVS |
Image Quality Evaluation, | visual | Quality, Quality Assessment, and Imaging Models |
Image Reconstruction from Bag-of- | visual | -Words |
Image registration for sequence of | visual | images captured by UAV |
Image registration for | visual | inspection of imprinted pharmaceutical tablets |
Image representation by compressive sensing for | visual | sensor networks |
Image Representational Model for Predicting | visual | Distinctness of Objects |
Image Representations for | visual | Learning |
Image retargeting based on the sensitivity-tuned | visual | significance map |
Image Retargeting for Preserving Robust Local Feature: Application to Mobile | visual | Search |
Image Retrieval for | visual | Localization via Scene Text Detection and Logo Filtering |
Image Retrieval Method Based on | visual | Map Pre-Sampling Construction in Indoor Positioning |
Image retrieval model based on weighted | visual | features determined by relevance feedback |
Image Retrieval with a | visual | Thesaurus |
Image retrieval with geometry-preserving | visual | phrases |
Image Retrieval With Lingual And | visual | Paraphrasing Via Generative Models |
Image Retrieval with Scale Invariant | visual | Phrases |
Image Segmentation Considering Properties of the Human | visual | System |
Image Sharpening Incorporating Human | visual | Response |
Image Size Invariant | visual | Cryptography for General Access Structures Subject to Display Quality Constraints |
Image Spam Filtering Using | visual | Information |
Image Stabilization Based on Fusing the | visual | Information in Differently Exposed Images |
Image super-resolution by textural context constrained | visual | vocabulary |
Image Thresholding Method Based on Human | visual | Perception, An |
Image tone mapping based on clustering and human | visual | system models |
Image | visual | attention computation and application via the learning of object attributes |
Image | visual | Quality Restoration by Cancellation of the Unmasked Noise |
Image | visual | Realism: From Human Perception to Machine Computation |
Image-Based Robot Task Planning and Control Using a Compact | visual | Representation |
Image-Based | visual | Perception and Representation for Collision Avoidance |
image-based | visual | servoing scheme for following paths with nonholonomic mobile robots, An |
Image-Based | visual | Servoing Techniques for Robot Control |
Image-Based | visual | Speech Animation System, An |
Image-Based | visual | Threat Cue for Autonomous Navigation, An |
Image-Matching Method Using Template Updating Based on Statistical Prediction of | visual | Noise, An |
Image-Question-Answer Synergistic Network for | visual | Dialog |
Image-Text Embedding Learning via | visual | and Textual Semantic Reasoning |
ImageCLEF Medical Retrieval Task at ICPR 2010: Information Fusion to Combine | visual | and Textual Information, The |
ImageCLEF: Experimental Evaluation in | visual | Information Retrieval |
ImageNet Large Scale | visual | Recognition Challenge |
Images of Image Machines. | visual | Interpretability in Computer Vision for Art |
Images Speak in Images: A Generalist Painter for In-Context | visual | Learning |
Immersion into | visual | Media: New Applications of Image Understanding |
Immersive | visual | Communication |
Immersive | visual | media-MPEG-I: 360 video, virtual navigation and beyond |
Impact of Fused Visible-Infrared Video Streams on | visual | Tracking |
Impact of image appeal on | visual | attention during photo triaging |
Impact of Reduced Video Quality on | visual | Speech Recognition, The |
Impact of | visual | angle on attention deployment and robustness of visual saliency models in videos: From SD to UHD |
Impact of | visual | angle on attention deployment and robustness of visual saliency models in videos: From SD to UHD |
Impact of | visual | -Haptic Spatial Discrepancy on Targeting Performance |
Implement contour following task of objects with unknown geometric models by using combination of two | visual | servoing techniques |
Implementation of a Computational Theory of | visual | Surface Interpolation, An |
Implementation of a modular real-time feature-based architecture applied to | visual | face tracking |
Implementation of Model-Based | visual | Feedback for Robot Arc Welding of Thin Sheet Steel, An |
Implementation of | visual | Motion Detection in Analog Neuromorphic Circuitry: A Case Study of the Issue of Circuit Precision |
Implicit 3D Approach to Image Generation: Object-Based | visual | Effects by Linear Processing of Multiple Differently Focused Images |
Implicit Arbitrary Shape | visual | Objects via MPEG-4 Scene Description |
Implicit Intention Communication in Human-Robot Interaction Through | visual | Behavior Studies |
Implicit | visual | concept modeling in image/video annotation |
ImPosing: Implicit Pose Encoding for Efficient | visual | Localization |
Improved 3D Depth Image Estimation Algorithm for | visual | Camera |
improved algorithm of median flow for | visual | object tracking and its implementation on ARM platform, An |
Improved Attention for | visual | Question Answering, An |
Improved Audio- | visual | Speaker Recognition via the Use of a Hybrid Combination Strategy |
Improved data hiding in halftone images with cooperating pair toggling human | visual | system |
Improved Defect Detection In Textile | visual | Inspection Using Wavelet Analysis and Support Vector Machines |
Improved entropy of primitive for | visual | information estimation |
improved error diffusion algorithm based on | visual | difference, An |
Improved Few-Shot | visual | Classification |
Improved Fusion of | visual | and Language Representations by Dense Symmetric Co-attention for Visual Question Answering |
Improved Fusion of | visual | and Language Representations by Dense Symmetric Co-attention for Visual Question Answering |
Improved GrabCut Method Based on a | visual | Attention Model for Rare-Earth Ore Mining Area Recognition with High-Resolution Remote Sensing Images, An |
Improved Lightweight Deep Neural Network with Knowledge Distillation for Local Feature Extraction and | visual | Localization Using Images and LiDAR Point Clouds, An |
improved line continuation model in human | visual | perception, An |
Improved Method for Two-UAV Trajectory Planning for Cooperative Target Locating Based on Airborne | visual | Tracking Platform, An |
Improved Point-Line | visual | -Inertial Odometry System Using Helmert Variance Component Estimation |
Improved Poisson Surface Reconstruction with Various Passive | visual | Cues from Multiple Camera Views |
Improved Real-Time Blob Detection for | visual | Surveillance, An |
Improved Tagged | visual | Cryptograms by Using Random Grids |
Improved | visual | Cryptography with Cheating Prevention, An |
Improved | visual | Fine-tuning with Natural Language Supervision |
Improved | visual | information fidelity based on sensitivity characteristics of digital images |
Improved | visual | /infrared colour fusion method with double-opponency colour constancy mechanism |
Improvement in Accuracy of Respiration Pattern Detection on | visual | Sensing System |
Improvement of Road Driving Safety Guided by | visual | Inattentional Blindness, The |
Improvement of the Quality of | visual | Secret Sharing Schemes with Constraints on the Usage of Shares |
Improvement of | visual | perceptual capabilities by feedback structures for robotic system FRIEND |
Improvements in | visual | Autonomous Road Vehicle Guidance |
Improving 3D Active | visual | Tracking |
Improving Autonomous Vehicle | visual | Perception by Fusing Human Gaze and Machine Vision |
Improving Deep | visual | Representation for Person Re-identification by Global and Local Image-language Association |
Improving detection of surface discontinuities in | visual | -force control systems |
Improving Driver Performance and Experience in Assisted and Automated Driving With | visual | Cues in the Steering Wheel |
Improving Fairness in Facial Albedo Estimation via | visual | -Textual Cues |
Improving Fine-Grained | visual | Recognition in Low Data Regimes via Self-boosting Attention Mechanism |
Improving Generalization Ability of Deep Neural Networks for | visual | Recognition Tasks |
Improving Generalization in | visual | Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation |
Improving handgun detection through a combination of | visual | features and body pose-based data |
Improving Image Description with Auxiliary Modality for | visual | Localization in Challenging Conditions |
Improving image quality assessment with modeling | visual | attention |
Improving mix-and-separate training in audio- | visual | sound source separation with an object prior |
Improving NCC-Based Direct | visual | Tracking |
Improving One-Stage | visual | Grounding by Recursive Sub-query Construction |
Improving Rooftop Detection with Interactive | visual | learning |
Improving Scene Recognition through | visual | Attention |
Improving Selective | visual | Question Answering by Learning from Your Peers |
Improving streaming video segmentation with early and mid-level | visual | processing |
Improving superpixel boundaries using information beyond the | visual | spectrum |
Improving Table Structure Recognition with | visual | -Alignment Sequential Coordinate Modeling |
Improving texture description in remote sensing image multi-scale classification tasks by using | visual | words |
Improving the Discriminative Power of Bag of | visual | Words Model |
Improving the efficiency and accuracy of | visual | attention |
Improving the robustness of particle filter-based | visual | trackers using online parameter adaptation |
Improving the selection and detection of | visual | landmarks through object tracking |
Improving the | visual | Comprehension of Point Sets |
Improving the | visual | Quality of Generative Adversarial Network (GAN)-Generated Images Using the Multi-Scale Structural Similarity Index |
Improving the | visual | Quality of JPEG-Encoded Images via Companding |
Improving the | visual | quality of random grid-based visual secret sharing via error diffusion |
Improving the | visual | quality of random grid-based visual secret sharing via error diffusion |
Improving the | visual | quality of size invariant visual cryptography scheme |
Improving the | visual | quality of size invariant visual cryptography scheme |
Improving the | visual | Quality of Size-Invariant Visual Cryptography for Grayscale Images: An Analysis-by-Synthesis (AbS) Approach |
Improving the | visual | Quality of Size-Invariant Visual Cryptography for Grayscale Images: An Analysis-by-Synthesis (AbS) Approach |
Improving the | visual | Quality of Video Frame Prediction Models Using the Perceptual Straightening Hypothesis |
Improving Video Captioning with Temporal Composition of a | visual | -Syntactic Embedding* |
Improving | visual | evoked potential feature classification for person recognition using PCA and normalization |
Improving | visual | Feature Representations by Biasing Restricted Boltzmann Machines with Gaussian Filters |
Improving | visual | Grounding by Encouraging Consistent Gradient-Based Explanations |
Improving | visual | Grounding with Visual-Linguistic Verification and Iterative Reasoning |
Improving | visual | Grounding with Visual-Linguistic Verification and Iterative Reasoning |
Improving | visual | Inertial Odometry with UWB Positioning for UAV Indoor Navigation |
Improving | visual | Matching |
Improving | visual | multi-object tracking algorithm via integrating GM-PHD and correlation filter |
Improving | visual | quality in wireless capsule endoscopy images with contrast-limited adaptive histogram equalization |
Improving | visual | Question Answering using Active Perception on Static Images |
Improving | visual | question answering using dropout and enhanced question encoder |
Improving | visual | Recognition Using Color Normalization in Digital Video Applications |
Improving | visual | Relation Detection using Depth Maps |
Improving | visual | Relationship Detection With Two-Stage Correlation Exploitation |
Improving | visual | Representation Learning Through Perceptual Understanding |
Improving | visual | -semantic embeddings by learning semantically-enhanced hard negatives for cross-modal information retrieval |
Improving Weakly Supervised | visual | Grounding by Contrastive Knowledge Distillation |
In Defense of Grid Features for | visual | Question Answering |
In Search of Information in | visual | Media |
In Vivo Demonstration of Photoacoustic Image Guidance and Robotic | visual | Servoing for Cardiac Catheter-Based Interventions |
Incident-Supporting | visual | Cloud Computing Utilizing Software-Defined Networking |
Incorporating 3D Information Into | visual | Question Answering |
Incorporating Audio Signals into Constructing a | visual | Saliency Map |
Incorporating Contextual Information into Bag-of- | visual | -Words Framework for Effective Object Categorization |
Incorporating Convolution Designs into | visual | Transformers |
Incorporating Geometry Information with Weak Classifiers for Improved Generic | visual | Categorization |
Incorporating Language Syntax in | visual | Text Recognition with a Statistical-Model |
Incorporating multi-stage spatial | visual | cues and active localization offset for pancreas segmentation |
Incorporating object-centered sampling and Delaunay tetrahedrization for | visual | hull reconstruction |
Incorporating | visual | Grounding In GCN For Zero-shot Learning Of Human Object Interaction Actions |
Incorporating | visual | Knowledge Representation in Stereo Reconstruction |
Increasing flexibility for automatic | visual | inspection: the general analysis graph |
Incremental Co-Boost for | visual | tracking |
Incremental Codebook Adaptation for | visual | Representation and Categorization |
Incremental Focus of Attention for Robust | visual | Tracking |
Incremental Hierarchical Discriminating Regression for Indoor | visual | Navigation |
Incremental Hybrid Approach for Unsupervised Classification: Applications to | visual | Landmarks Recognition |
Incremental Learning for Robust | visual | Tracking |
Incremental Learning of 3D-DCT Compact Representations for Robust | visual | Tracking |
Incremental learning of object detectors using a | visual | shape alphabet |
Incremental Learning of | visual | Landmarks for Mobile Robotics |
Incremental PCA for on-line | visual | learning and recognition |
Incremental PCA-HOG Descriptor for Robust | visual | Hand Tracking, An |
Incremental Reconstruction of Manifold Surface from Sparse | visual | Mapping |
Incremental Solid Modeling from Sparse Structure-from-Motion Data with Improved | visual | Artifacts Removal |
Incremental structural modeling on sparse | visual | SLAM |
Incremental Subspace Learning for Cognitive | visual | Processes |
Incremental | visual | Hull Reconstruction |
Independent Encoding of Position and Orientation by Population Responses in Primary | visual | Cortex |
Indexing and Retrieval of Binary Images for | visual | Information Systems |
Indexing for | visual | Recognition from a Large Model Base |
Indexing Images by Trees of | visual | Content |
Indexing Large Online Multimedia Repositories Using Semantic Expansion and | visual | Analysis |
Indexing Large | visual | Vocabulary by Randomized Dimensions Hashing for High Quantization Accuracy: Improving the Object Retrieval Quality |
Indexing of multilingual news telecast using audio- | visual | keywords |
Indexing | visual | Representations Through the Complexity Map |
Individual trait oriented scanpath prediction for | visual | attention analysis |
Indoor autonomous navigation using | visual | memory and pattern tracking |
Indoor Localization in Dynamic Human Environments Using | visual | Odometry and Global Pose Refinement |
Indoor Objects and Outdoor Urban Scenes Recognition by 3D | visual | Primitives |
Indoor Scene Recognition with a | visual | Attention-Driven Spatial Pooling Strategy |
Indoor Target-Driven | visual | Navigation based on Spatial Semantic Information |
Indoor Topological Localization Using a | visual | Landmark Sequence |
Indoor | visual | Localization using Point and Line Correspondences in dense colored point cloud |
Industrial Image Processing: | visual | Quality Control in Manufacturing |
Industrial | visual | perception technology in Smart City |
Industry Begins to Use | visual | Pattern Recognition |
Inertial sensor-aligned | visual | feature descriptors |
Infer unseen from seen: Relation regularized zero-shot | visual | dialog |
Inferring Affective Experience from the Big Picture Metaphor: A Two-dimensional | visual | Breadth Model |
Inferring and Executing Programs for | visual | Reasoning |
Inferring hand pose: A comparative study of | visual | shape features |
Inferring Ignorance from the Locality of | visual | Perception |
Inferring social relations from | visual | concepts |
Inferring | visual | Persuasion via Body Language, Setting, and Deep Features |
Inflated Episodic Memory With Region Self-Attention for Long-Tailed | visual | Recognition |
Influence of color on | visual | saliency in short videos |
Influence of haptic guidance on driving behaviour under degraded | visual | feedback conditions |
Influence of local scene color on fixation position in | visual | search |
Influence of Viewpoint on | visual | Saliency Models for Volumetric Content |
Influence-Balanced Loss for Imbalanced | visual | Classification |
Influencing Human Escape Maneuvers With Perceptual Cues in the Presence of a | visual | Task |
Information Bottleneck Approach to Optimize the Dictionary of | visual | Data, An |
Information Bottleneck Domain Adaptation with Privileged Information for | visual | Recognition |
Information Bottleneck Learning Using Privileged Information for | visual | Recognition |
Information Fusion for Combining | visual | and Textual Image Retrieval |
Information Fusion for Combining | visual | and Textual Image Retrieval in ImageCLEF@ICPR |
Information Fusion in | visual | -Task Inference |
Information in Streetscapes: Research on | visual | Perception Information Quantity of Street Space Based on Information Entropy and Machine Learning |
Information Maximizing | visual | Question Generation |
Information Theoretic Learning for Pixel-Based | visual | Agents |
Information Theoretic Measure for | visual | Target Distinctness |
information theory of | visual | communication, An |
Information-Theoretic Analysis of Input Strokes in | visual | Object Cutout |
Informative | visual | words construction to improve bag of words image representation |
Infrared and Visible Image Fusion Based on TRPCA and | visual | Saliency Detection |
Infrared Small Target Detection Method Based on a Weighted Human | visual | Comparison Mechanism for Safety Monitoring, An |
Infrared- | visual | Image Registration Based on Corners and Hausdorff Distance |
Initialization-Insensitive | visual | Tracking through Voting with Salient Local Features |
InLoc: Indoor | visual | Localization with Dense Matching and View Synthesis |
Inquiry Driven Vision System Based on | visual | and Conceptual Hierarchies, An |
InstaBoost++: | visual | Coherence Principles for Unified 2D/3D Instance Level Data Augmentation |
Instance significance guided multiple instance boosting for robust | visual | tracking |
Instance-Based Feature Pyramid for | visual | Object Tracking |
InstanceRefer: Cooperative Holistic Understanding for | visual | Grounding on Point Clouds through Instance Multi-level Contextual Referring |
Instantaneous Evaluation of the Sense of Presence in Audio- | visual | Content |
Instruct Me More! Random Prompting for | visual | In-Context Learning |
integer programming approach to | visual | compliance, An |
Integral opponent-colors features for computing | visual | target distinctness |
Integrally Migrating Pre-trained Transformer Encoder-decoders for | visual | Object Detection |
Integrated Analysis of Thermal and | visual | Images for Scene Interpretation |
integrated approach to | visual | attention modelling using spatial-temporal saliency and objectness, An |
Integrated Mining of | visual | Features, Speech Features, and Frequent Patterns for Semantic Video Annotation |
integrated model of | visual | attention using shape-based features, An |
Integrated Modelling of Thermal and | visual | Image Generation |
Integrated Motion Detection and Tracking for | visual | Surveillance |
Integrated Spatio-Temporal Approach to Automatic | visual | Guidance of Autonomous Vehicles, An |
Integrated Techniques for Self-organisation, Sampling, Habituation, and Motion-tracking in | visual | Robotics Applications |
Integrated Testing and Algorithms for | visual | Inspection of Integrated Circuits |
Integrated | visual | Language and Software Development Environment, An |
integrated | visual | saliency-based watermarking approach for synchronous image authentication and copyright protection, An |
Integrated | visual | System for Solder Inspection, An |
Integrating Boundary and Center Correlation Filters for | visual | Tracking with Aspect Ratio Variation |
Integrating Boxes and Masks: A Multi-Object Framework for Unified | visual | Tracking and Segmentation |
Integrating foreground-background feature distillation and contrastive feature learning for ultra-fine-grained | visual | classification |
Integrating Historical States and Co-attention Mechanism for | visual | Dialog |
Integrating ILSR to Bag-of- | visual | Words Model Based on Sparse Codes of SIFT Features Representations |
Integrating Information from Thermal and | visual | Images for Scene Analysis |
Integrating Low-Level and Semantic | visual | Cues for Improved Image-to-Video Experiences |
Integrating multi-level deep learning and concept ontology for large-scale | visual | recognition |
Integrating Multiple | visual | Cues for Robust Real-Time 3D Face Tracking |
Integrating Object Affordances with Artificial | visual | Attention |
Integrating Perceptual Properties of the HVS into the Computational Model of | visual | Attention |
Integrating Spatio-Temporal Context With Multiview Representation for Object Recognition in | visual | Surveillance |
Integrating the Projective Transform with Particle Filtering for | visual | Tracking |
Integrating | visual | and Geometric Consistency for Pose Estimation |
Integrating | visual | and range data for road detection |
Integrating | visual | and Range Data for Robotic Object Detection |
Integrating | visual | and semantic contexts for topic network generation and word sense disambiguation |
Integrating | visual | and Textual Cues for Query-by-String Word Spotting |
Integrating | visual | Information across Camera Movements with a Visual-Motor Calibration Map |
Integrating | visual | Information across Camera Movements with a Visual-Motor Calibration Map |
Integrating | visual | Perception With Decision Making in Neuromorphic Fault-Tolerant Quadruplet-Spike Learning Framework |
Integrating | visual | Saliency and Consistency for Re-Ranking Image Search Results |
Integrating | visual | search with visual memory in a knowledge directed image interpretation system |
Integrating | visual | search with visual memory in a knowledge directed image interpretation system |
Integrating | visual | words as bunch of n-grams for effective biomedical image classification |
Integrating | visual | , Audio and Text Analysis for News Video |
Integration and Control of Reactive | visual | Processes |
Integration of audio and | visual | information for content-based video segmentation |
Integration of Audio/ | visual | Information for Use in Human-Computer Intelligent Interaction |
Integration of Bottom-Up and Top-Down Cues for | visual | Attention Using Non-Linear Relaxation |
Integration of GPS/BDS Real-Time Kinematic Positioning and | visual | -Inertial Odometry Based on Smartphones, The |
Integration of semantic and | visual | hashing for image retrieval |
Integration of | visual | and Haptic Feedback for Teleoperation, The |
Integration of | visual | and Inertial Information for Egomotion: A Stochastic Approach |
Integration of | visual | and Shape Attributes for Object Action Complexes |
Integration of | visual | Cues for Robotic Grasping |
Integration of | visual | Modules: An Extension of the Marr Paradigm |
Integration of | visual | Processes |
Integration of | visual | Temporal Information and Textual Distribution Information for News Web Video Event Mining |
Intelligent Vision-Based System Applied to | visual | Quality Inspection of Beans, An |
Intelligent | visual | Acuity Estimation System With Hand Motion Recognition |
intelligent | visual | task system for lateral skull X-ray images, An |
Intensity comparison based compact descriptor for mobile | visual | search |
Intensity Independent Color Models and | visual | Tracking |
Intensity-augmented Ordinal Measure for | visual | Correspondence, An |
Inter-Concept Distance Measurement with Adaptively Weighted Multiple | visual | Features |
Inter-frame contextual modelling for | visual | speech recognition |
Inter-Observer Consistent Deep Adversarial Training for | visual | Scanpath Prediction, An |
Interaction Design for Mobile | visual | Search |
Interaction Region | visual | Transformer for Egocentric Action Anticipation |
Interaction with On-Screen Objects Using | visual | Gesture Recognition |
Interactive components for | visual | exploration of multimedia archives |
Interactive Dynamics for | visual | Analysis |
Interactive Editing of Live | visual | s |
Interactive maps for | visual | exploration of grid and vector geodata |
Interactive Multimodal | visual | Search on Mobile Device |
Interactive | visual | Analytics Platform for Smart Intelligent Transportation Systems Management, An |
Interactive | visual | Dialog |
interactive | visual | environment to understand, model and exploit user subjectivity in image retrieving systems, An |
Interactive | visual | Event Analytics: Opportunities and Challenges |
Interactive | visual | Exploration of Human Mobility Correlation Based on Smart Card Data |
Interactive | visual | Hull Refinement for Specular and Transparent Object Surface Reconstruction |
Interactive | visual | pattern recognition |
Interactive | visual | User Interfaces: A Survey |
Interference Reduction in Reverberant Speech Separation With | visual | Voice Activity Detection |
International Conference on | visual | Information Systems |
International Workshop on | visual | Surveillance and Sensory Networks |
Internet | visual | media processing: A survey with graphics and vision applications |
Interpolation-Based Event | visual | Data Filtering Algorithms |
Interpretable and Fine-Grained | visual | Explanations for Convolutional Neural Networks |
Interpretable Attention Guided Network for Fine-grained | visual | Classification |
Interpretable Basis Decomposition for | visual | Explanation |
Interpretable Representation Learning on Natural Image Datasets via Reconstruction in | visual | -Semantic Embedding Space |
Interpretable | visual | models for human perception-based object retrieval |
Interpretable | visual | Question Answering by Reasoning on Dependency Trees |
Interpretable | visual | Question Answering by Visual Grounding From Attention Supervision Mining |
Interpretable | visual | Question Answering by Visual Grounding From Attention Supervision Mining |
Interpretable | visual | Question Answering Referring to Outside Knowledge |
Interpretable | visual | Question Answering Via Reasoning Supervision |
Interpretable | visual | Reasoning via Induced Symbolic Space |
Interpretable | visual | Reasoning via Probabilistic Formulation Under Natural Supervision |
Interpretable | visual | reasoning: A survey |
Interpretation of | visual | Motion, The |
Interpretation of | visual | Motion: A Computational Study |
Interpretation of | visual | Motion: Recognizing Moving Light Displays, The |
Interpreting and Explaining | visual | AI Models |
Interpreting Deep | visual | Representations via Network Dissection |
Interpreting the Rhetoric of | visual | Advertisements |
interpretive model of line continuation in human | visual | perception, An |
Intersection Recognition Using Results of Semantic Segmentation for | visual | Navigation |
Into the woods: | visual | surveillance of noncooperative and camouflaged targets in complex outdoor settings |
IntPhys 2019: A Benchmark for | visual | Intuitive Physics Understanding |
Intra- and Inter-Class Induced Discriminative Deep Dictionary Learning for | visual | Recognition |
Introducing Memory and Association Mechanism Into a Biologically Inspired | visual | Model |
Introducing temporal order of dominant | visual | word sub-sequences for human action recognition |
Introduction to Active Contours and | visual | Dynamics |
Introduction to Special Issue on Representation and Retrieval of | visual | Media in Multimedia Systems |
Introduction to Special Issue on Representation and Retrieval of | visual | Media in Multimedia Systems II |
Introduction to the special issue on | visual | understanding and applications with RGB-D cameras |
Introduction to the Special Section on Deep Learning for | visual | Surveillance |
Introduction to the Special Section on | visual | Computing in the Cloud: Cloud Gaming and Virtualization |
Introduction to the Special Section on | visual | Computing in the Cloud: Fundamentals and Applications |
invariant pattern representation based on nonuniform sampling in the human | visual | system, An |
Invariant Reconstruction of | visual | Surfaces |
Invariant signatures for omnidirectional | visual | place recognition and robot localization in unknown environments |
Invariant | visual | patterns for video copy detection |
Invarint Image Retrieval using Block-Based | visual | Pattern Matching |
Inverse Nonnegative Local Coordinate Factorization for | visual | Tracking |
Inverse | visual | Question Answering: A New Benchmark and VQA Diagnosis Tool |
Inverting | visual | Representations with Convolutional Networks |
Investigating 3D holoscopic | visual | content upsampling using super-resolution for cultural heritage digitization |
Investigating a New | visual | Cue for Image Motion Estimation: Motion-from-Smear |
Investigating keyframe selection methods in the novel domain of passively captured | visual | lifelogs |
Investigating the Role of Image Retrieval for | visual | Localization |
Investigating | visual | Features for Cognitive Impairment Detection Using In-the-wild Data |
Investigation into the Closure Process in Human | visual | -Perception for Line Patterns, An |
Investigation of Driver Performance With Night Vision and Pedestrian Detection Systems Part I: Empirical Study on | visual | Clutter and Glance Behavior |
Investigation of mobile surroundings for | visual | attention based on image perception model |
Investigation of the Effects of Data Collection on | visual | Stylometry |
Investigation on the peripheral | visual | field for information display with real and virtual wide field-of-view see-through HMDs |
Investigations into the Robustness of Audio- | visual | Gender Classification to Background Noise and Illumination Effects |
Invisible Calibration Pattern Based on Human | visual | Perception Characteristics |
Invisible Calibration Pattern for Print-and-Scan Data Hiding Based on Human | visual | Perception |
Involution: Inverting the Inherence of Convolution for | visual | Recognition |
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for | visual | Object Tracking |
IOU: Siamtrack: IOU Guided Siamese Network For | visual | Object Tracking |
IoUNet++: Spatial cross-layer interaction-based bounding box regression for | visual | tracking |
IQ-VQA: Intelligent | visual | Question Answering |
IQA: | visual | Question Answering in Interactive Environments |
iQuery: Instruments as Queries for Audio- | visual | Sound Separation |
IR Reasoner: Real-time Infrared Object Detection by | visual | Reasoning |
IR saliency detection via a GCF-SB | visual | attention framework |
IR small target detection based on human | visual | attention using pulsed discrete cosine transform |
Iris Image Classification Based on Hierarchical | visual | Codebook |
IRVR: A General Image Restoration Framework for | visual | Recognition |
IR_URFS_VF: image recommendation with user relevance feedback session and | visual | features in vertical image search |
Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on | visual | Language Understanding |
Is Geometry Enough for Matching in | visual | Localization? |
Is GPT-3 All You Need for | visual | Question Answering in Cultural Heritage? |
Is Second-Order Information Helpful for Large-Scale | visual | Recognition? |
Is This the Right Place? Geometric-Semantic Pose Verification for Indoor | visual | Localization |
ISLA: Temporal Segmentation and Labeling for Audio- | visual | Emotion Recognition |
ISNN: Impact Sound Neural Network for Audio- | visual | Object Classification |
Isotropic Versus Anisotropic Encoding of | visual | Information |
Isowarp: The Template-Based | visual | Geometry of Isometric Surfaces, The |
ISR3: A Token Database for Integration of | visual | Modules |
Issues and Directions in | visual | Information Retrieval |
Issues in adapting research algorithms to stereoscopic | visual | effects |
It's Not About the Journey; It's About the Destination: Following Soft Paths Under Question-Guidance for | visual | Reasoning |
Iterative Augmentation of | visual | Evidence for Weakly-Supervised Lesion Localization in Deep Interpretability Frameworks: Application to Color Fundus Images |
Iterative Combination Scheme for multimodal | visual | feature detection, An |
Iterative Context-Aware Graph Inference for | visual | Dialog |
Iterative Object Localization Algorithm Using | visual | Images with a Reference Coordinate |
Iterative particle filter for | visual | tracking |
Iterative Robust | visual | Grounding with Masked Reference based Centerpoint Supervision |
Iterative | visual | Reasoning Beyond Convolutions |
ITT | visual | Information Solutions |
iVQA: Inverse | visual | Question Answering |
Jensen-Shannon divergence for | visual | quality assessment |
Jigsaw Clustering for Unsupervised | visual | Representation Learning |
Joint Answering and Explanation for | visual | Commonsense Reasoning |
Joint audio- | visual | bi-modal codewords for video event detection |
Joint Audio- | visual | Deepfake Detection |
Joint Audio- | visual | Tracking Using Particle Filters |
Joint Channel Reliability and Correlation Filters Learning for | visual | Tracking |
Joint Classification and Regression for | visual | Tracking with Fully Convolutional Siamese Networks |
Joint Coding of Local and Global Deep Features in Videos for | visual | Search |
Joint Coding/Routing Optimization for Distributed Video Sources in Wireless | visual | Sensor Networks |
Joint Compression Scheme of Video Feature Descriptors and | visual | Content, A |
Joint Contours, Corner and T-Junction Detection: An Approach Inspired by the Mammal | visual | System |
Joint Correlation and Attention Based Feature Fusion Network for Accurate | visual | Tracking |
Joint Correlation Filtering for | visual | Tracking |
Joint Cross-Attention Model for Audio- | visual | Fusion in Dimensional Emotion Recognition, A |
Joint Decision Tree and | visual | Feature Rate Control Optimization for VVC UHD Coding |
Joint Embedding of Deep | visual | and Semantic Features for Medical Image Report Generation |
Joint estimation of head pose and | visual | focus of attention |
Joint Gaussian Process Model for Active | visual | Recognition with Expertise Estimation in Crowdsourcing, A |
Joint Geometrical and Statistical Alignment for | visual | Domain Adaptation |
Joint Group Feature Selection and Discriminative Filter Learning for Robust | visual | Object Tracking |
Joint Identification-Verification Model for | visual | Tracking |
Joint Illumination and Shape Model for | visual | Tracking, A |
Joint learning and weighting of | visual | vocabulary for bag-of-feature based tissue classification |
Joint learning hash codes and distance metric for | visual | tracking |
Joint learning of | visual | and spatial features for edit propagation from a single image |
Joint learning of | visual | attributes, object classes and visual saliency |
Joint learning of | visual | attributes, object classes and visual saliency |
Joint Low-Rank and Sparse Principal Feature Coding for Enhanced Robust Representation and | visual | Classification |
Joint Object-Material Category Segmentation from Audio- | visual | Cues |
Joint optimization of JPEG quantization table and coefficient thresholding for low bitrate mobile | visual | search |
Joint Screening Halftoning and | visual | Cryptography for Image Protection |
Joint Sparse Representation and Robust Feature-Level Fusion for Multi-Cue | visual | Tracking |
Joint sparsity-based robust | visual | tracking |
Joint Syntax Representation Learning and | visual | Cue Translation for Video Captioning |
Joint Tensor Feature Analysis For | visual | Object Recognition |
Joint video/depth/FEC rate allocation with considering 3D | visual | saliency for scalable 3D video streaming |
Joint | visual | and Audio Learning for Video Highlight Detection |
Joint | visual | and Temporal Consistency for Unsupervised Domain Adaptive Person Re-identification |
Joint | visual | denoising and classification using deep learning |
Joint | visual | Grounding and Tracking with Natural Language Specification |
Joint | visual | Phrase Detection to Boost Scene Parsing |
Joint | visual | Semantic Reasoning: Multi-Stage Decoder for Text Recognition |
Joint | visual | Sharpness-Contrast-Tone Mapping Model |
Joint | visual | vocabulary for animal classification |
joint | visual | -inertial image registration for mobile HDR imaging, A |
Joint | visual | -Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences |
Joint | visual | -Textual Sentiment Analysis Based on Cross-Modality Attention Mechanism |
Joint Workshop on | visual | and Contextual Learning from Annotated Images and Videos, and Visual Scene Understanding |
Joint Workshop on | visual | and Contextual Learning from Annotated Images and Videos, and Visual Scene Understanding |
Joint-Modal Label Denoising for Weakly-Supervised Audio- | visual | Video Parsing |
Jointly Discovering | visual | Objects and Spoken Words from Raw Sensory Input |
Jointly Discriminating and Frequent | visual | Representation Mining |
Jointly Learning | visual | Motion and Confidence from Local Patches in Event Cameras |
Jointly Learning | visual | Poses and Pose Lexicon for Semantic Action Recognition |
Jointly Learning | visual | ly Correlated Dictionaries for Large-Scale Visual Recognition Applications |
Jointly social grouping and identification in | visual | dynamics with causality-induced hierarchical Bayesian model |
Journal of | visual | Communication and Image Representation |
Journal of | visual | Languages and Computing |
JPEG AI Standard: Providing Efficient Human and Machine | visual | Data Consumption, The |
JPEG Pleno: Toward an Efficient Representation of | visual | Reality |
JRDB: A Dataset and Benchmark of Egocentric Robot | visual | Perception of Humans in Built Environments |
JRDB: | visual | Perception for Navigation in Human Environments: The JackRabbot Social Grouping and Activity Dataset and Benchmark |
Just Noticeable Differences in | visual | Attributes |
k Out of n Region Incrementing Scheme in | visual | Cryptography |
k Out of n Region-Based Progressive | visual | Cryptography |
k,n) threshold partial reversible AMBTC-based | visual | cryptography using one reference image, A |
K-Shot Contrastive Learning of | visual | Features With Multiple Instance Augmentations |
K-VQG: Knowledge-aware | visual | Question Generation for Common-sense Acquisition |
KAN-AV dataset for audio- | visual | face and speech analysis in the wild |
KCRC-LCD: Discriminative kernel collaborative representation with locality constrained dictionary for | visual | categorization |
Keep CALM and Improve | visual | Feature Attribution |
Kernalised Multi-resolution Convnet for | visual | Tracking |
Kernel analysis over Riemannian manifolds for | visual | recognition of actions, pedestrians and textures |
Kernel autoassociator with applications to | visual | classification |
Kernel Based Multiple Cue Adaptive Appearance Model For Robust Real-time | visual | Tracking |
Kernel Fusion of Audio and | visual | Information for Emotion Recognition |
Kernel Particle Filter for | visual | Quality Inspection from Monocular Intensity Images |
Kernel Particle Filter for | visual | Tracking |
Kernel particle filter: iterative sampling for efficient | visual | tracking |
Kernel Subspace Integral Image Based Probabilistic | visual | Object Tracking |
Kernel-based high-dimensional histogram estimation for | visual | tracking |
Kernel-based sliding mode control for | visual | servoing system |
Kernelized temporal locality learning for real-time | visual | tracking |
Kernels for | visual | Words Histograms |
Key frame extraction based on | visual | attention model |
Key Point Sensitive Loss for Long-Tailed | visual | Recognition |
Key Technology of Dynamic | visual | Image Modeling Basing on Mobile Robot Self-Organizing Network |
Keyframe-Based Video Summary Using | visual | Attention Clues |
Keynotes: Deep learning for | visual | understanding: Effectiveness vs. efficiency |
Keypoint induced distance profiles for | visual | recognition |
Keypoint Matching for Non-Rigid Object via Locally Consistent | visual | Pattern Mining |
Keywords to | visual | categories: Multiple-instance learning for weakly supervised object categorization |
King-Kong Effects: Improving sensation of walking in VR with | visual | and tactile vibrations at each step, The |
KIZUKI Processing for | visual | Inspection: A Smart Pattern Pop-Out Algorithm Based on Human Visual Architecture |
KIZUKI Processing for | visual | Inspection: A Smart Pattern Pop-Out Algorithm Based on Human Visual Architecture |
Knowing When to Look: Adaptive Attention via a | visual | Sentinel for Image Captioning |
Knowing when you don't: Bag of | visual | words with reject option for automatic visual inspection of bulk materials |
Knowing when you don't: Bag of | visual | words with reject option for automatic visual inspection of bulk materials |
Knowing Where to Look? Analysis on Attention of | visual | Question Answering System |
Knowledge Acquisition for | visual | Question Answering via Iterative Querying |
Knowledge and the | visual | Process: Content, Form and Use |
Knowledge base graph embedding module design for | visual | question answering model |
Knowledge Distillation and Student-Teacher Learning for | visual | Intelligence: A Review and New Outlooks |
Knowledge Synthesizing Approach for Classification of | visual | Information, A |
Knowledge-Augmented | visual | Question Answering With Natural Language Explanation |
Knowledge-based | visual | Context-Aware Framework for Applications in Robotic Services |
Knowledge-Based | visual | Question Generation |
Knowledge-Embedded Mutual Guidance for | visual | Reasoning |
Knowledge-Enriched Attention Network With Group-Wise Semantic for | visual | Storytelling |
Knowledge-Guided | visual | Perception of 3-D Human Gait from a Single Image Sequence |
Kullback-Leibler Kernel as a Framework for Discriminant and Localized Representations for | visual | Recognition, The |
KVD: Scale invariant keypoints by combining | visual | and depth data |
L*ReLU: Piece-wise Linear Activation Functions for Deep Fine-grained | visual | Categorization |
L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised | visual | Representation Learning |
L0-Regularized Object Representation for | visual | Tracking |
Label Consistent Quadratic Surrogate model for | visual | saliency prediction |
Label-Aware Calibration and Relation-Preserving in | visual | Intention Understanding |
laminar architecture of | visual | cortex and image processing technology, The |
Land Surface Water Mapping Using Multi-Scale Level Sets and a | visual | Saliency Model from SAR Images |
Landmark recognition in VISITO: | visual | Support to Interactive TOurism in Tuscany |
Landscape | visual | Impact Evaluation for Onshore Wind Farm: A Case Study |
Landscape | visual | Sensitivity Assessment of Historic Districts: A Case Study of Wudadao Historic District in Tianjin, China |
Lane Detection Based on | visual | Attention |
Lane Detection Transformer Based on Multi-frame Horizontal and Vertical Attention and | visual | Transformer Module |
Language Adaptive Weight Generation for Multi-Task | visual | Grounding |
Language and | visual | Relations Encoding for Visual Question Answering |
Language and | visual | Relations Encoding for Visual Question Answering |
Language Identification as Improvement for Lip-Based Biometric | visual | Systems |
Language modeling for bag-of- | visual | words image categorization |
Language-Agnostic | visual | -Semantic Embeddings |
Language-Guided Audio- | visual | Source Separation via Trimodal Consistency |
Laplacian embedding and key points topology verification for large scale mobile | visual | identification |
Large Scale Audio- | visual | Video Analytics Platform for Forensic Investigations of Terroristic Attacks |
Large scale continuous | visual | event recognition using max-margin Hough transformation framework |
Large scale image retrieval with | visual | groups |
Large Scale Multi-View RGBD | visual | Affordance Learning Dataset, A |
Large Scale Semi-Supervised Object Detection Using | visual | and Semantic Knowledge Transfer |
Large Scale | visual | Commerce |
Large Scale | visual | Food Recognition |
Large Scale | visual | Geo-Localization of Images in Mountainous Terrain |
Large scale | visual | -based event matching |
Large | visual | Repository Search with Hash Collision Design Optimization |
Large | visual | words for large scale image classification |
Large Vocabulary Audio- | visual | Speech Recognition Using Active Shape Models |
Large Vocabulary Audio- | visual | Speech Recognition Using the Janus Speech Recognition Toolkit |
Large-scale EMM identification based on geometry-constrained | visual | word correspondence voting |
Large-scale functional models of | visual | cortex for remote sensing |
Large-Scale Gaussian Process Inference with Generalized Histogram Intersection Kernels for | visual | Recognition Tasks |
Large-scale image annotation using | visual | synset |
Large-scale Pretraining for | visual | Dialog: A Simple State-of-the-art Baseline |
Large-scale video event classification using dynamic temporal pyramid matching of | visual | semantics |
Large-Scale | visual | Font Recognition |
Large-Scale, Multiple Level-of-Detail Change Detection from Remote Sensing Imagery Using Deep | visual | Feature Clustering |
Larger Receptive Field Based RGB | visual | Relocalization Method Using Convolutional Network |
Laser- | visual | -inertial Odometry Based Solution for 3d Heritage Modeling: The Sanctuary of The Blessed Virgin of Trompone |
LASER: LAtent SpacE Rendering for 2D | visual | Localization |
Late fusion of deep learning and handcrafted | visual | features for biomedical image modality classification |
Latent Feature Disentanglement for | visual | Domain Generalization |
Latent Model Clustering and Applications to | visual | Recognition |
Latent Model for | visual | Disambiguation of Keyword-based Image Search, A |
Latent multi-feature co-regression for | visual | recognition by discriminatively leveraging multi-source models |
Latent Subspace Projection Pursuit with Online Optimization for Robust | visual | Tracking |
Latent Variable Models for | visual | Question Answering |
Latent | visual | context analysis for image re-ranking |
Latent | visual | context learning for web image applications |
Lattice-Support repetitive local feature detection for | visual | search |
LAVA:Label-efficient | visual | Learning and Adaptation |
LAVSS: Location-Guided Audio- | visual | Spatial Audio Separation |
Layout2: A Production System Modeling | visual | Perspective Information |
LDA based color information fusion for | visual | objects tracking |
Leaf Vocabulary: Fine-Grained Leaf Image Retrieval Using Bag-of- | visual | -Words Representation |
Learn decision trees with deep | visual | primitives |
Learn from each other to Classify better: Cross-layer mutual attention learning for fine-grained | visual | classification |
Learn How to Choose: Independent Detectors Versus Composite | visual | Phrases |
Learn More: Sub-Significant Area Learning for Fine-Grained | visual | Classification |
Learn to Match: Automatic Matching Network Design for | visual | Tracking |
Learnable and Nonlearnable | visual | Concepts |
Learnable Descriptors for | visual | Search |
Learnable Hierarchical Label Embedding and Grouping for | visual | Intention Understanding |
Learned Event-based | visual | Perception for Improved Space Object Detection |
Learned Monocular Depth Priors in | visual | -Inertial Initialization |
Learning a blind image quality index based on | visual | saliency guided sampling and Gabor filtering |
Learning a Combined Model of | visual | Saliency for Fixation Prediction |
Learning a Dictionary of Shape-Components in | visual | Cortex: Comparison with Neurons, Humans and Machines |
Learning a Dynamic Map of | visual | Appearance |
Learning a locality preserving subspace for | visual | recognition |
Learning a Novel Ensemble Tracker for Robust | visual | Tracking |
Learning a scale-and-rotation correlation filter for robust | visual | tracking |
Learning a temporally invariant representation for | visual | tracking |
Learning a Two-Dimensional Fuzzy Discriminant Locality Preserving Subspace for | visual | Recognition |
Learning Adaptive Discriminative Correlation Filters via Temporal Consistency Preserving Spatial Feature Selection for Robust | visual | Object Tracking |
Learning Adaptive Metric for Robust | visual | Tracking |
Learning adaptive motion search for fast versatile video coding in | visual | surveillance systems |
Learning Adaptive Sparse Spatially-Regularized Correlation Filters for | visual | Tracking |
Learning adaptive spatial-temporal regularized correlation filters for | visual | tracking |
Learning Adaptive Target-and-Surrounding Soft Mask for Correlation Filter Based | visual | Tracking |
Learning Affective Features With a Hybrid Deep Model for Audio- | visual | Emotion Recognition |
Learning an Adaptation Function to Assess Image | visual | Similarities |
Learning and Calibrating Per-Location Classifiers for | visual | Place Recognition |
Learning and Evaluating | visual | Features for Pose Estimation |
Learning and Prediction of Soft Object Deformation Using | visual | Analysis of Robot Interactions |
Learning and using taxonomies for fast | visual | categorization |
Learning Answer Embeddings for | visual | Question Answering |
Learning Attentional Recurrent Neural Network for | visual | Tracking |
Learning Attentions: Residual Attentional Siamese Network for High Performance Online | visual | Tracking |
Learning Audio- | visual | Source Localization via False Negative Aware Contrastive Learning |
Learning Background-Aware Correlation Filters for | visual | Tracking |
Learning Bayesian Classifiers for Scene Classification With a | visual | Grammar |
Learning Better | visual | Data Similarities via New Grouplet Non-Euclidean Embedding |
Learning Better | visual | Dialog Agents with Pretrained Visual-Linguistic Representation |
Learning Better | visual | Dialog Agents with Pretrained Visual-Linguistic Representation |
Learning by Watching: Extracting Reusable Task Knowledge from | visual | Observation of Human Performance |
Learning Cascaded Context-Aware Framework for Robust | visual | Tracking |
Learning Cascaded Siamese Networks for High Performance | visual | Tracki |
Learning CLIP Guided | visual | -Text Fusion Transformer for Video-based Pedestrian Attribute Recognition |
Learning Collaborative Model for | visual | Tracking |
Learning Common and Specific | visual | Prompts for Domain Generalization |
Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained | visual | Recognition Problems |
Learning Common Sense through | visual | Abstraction |
Learning Compact Binary Codes for | visual | Tracking |
Learning Compact | visual | Attributes for Large-Scale Image Classification |
Learning Compact | visual | Descriptors for Low Bit Rate Mobile Landmark Search |
Learning complementary Siamese networks for real-time high-performance | visual | tracking |
Learning Compositional | visual | Concepts with Mutual Consistency |
Learning Concise and Descriptive Attributes for | visual | Recognition |
Learning contextual dissimilarity on tensor product graph for | visual | re-ranking |
Learning Contextually Fused Audio- | visual | Representations for Audio-Visual Speech Recognition |
Learning Contextually Fused Audio- | visual | Representations for Audio-Visual Speech Recognition |
Learning Cooperative | visual | Dialog Agents with Deep Reinforcement Learning |
Learning Correlation Filter With Detection Response For | visual | Tracking |
Learning correlation filters in independent feature channels for robust | visual | tracking |
Learning Correspondences Between | visual | Features and Functional Features |
Learning Cross-Domain Semantic- | visual | Relationships for Transductive Zero-Shot Learning |
Learning Crowdsourced User Preferences for | visual | Summarization of Image Collections |
Learning Customized | visual | Models with Retrieval-Augmented Knowledge |
Learning Deep Lucas-Kanade Siamese Network for | visual | Tracking |
Learning Deep Neural Networks for Vehicle Re-ID with | visual | -spatio-Temporal Path Proposals |
Learning Deep Representations of Fine-Grained | visual | Descriptions |
Learning deep-sea substrate types with | visual | topic models |
Learning depth from a single image using | visual | -depth words |
Learning descriptive | visual | representation for image classification and annotation |
Learning Discriminative Hidden Structural Parts for | visual | Tracking |
Learning Discriminative | visual | N-grams from Mid-level Image Features |
Learning discriminative | visual | semantic embedding for zero-shot recognition |
Learning Distances for Arbitrary | visual | Features |
Learning Domain-Agnostic | visual | Representation for Computational Pathology Using Medically-Irrelevant Style Transfer Augmentation |
Learning Dual Encoding Model for Adaptive | visual | Understanding in Visual Dialogue |
Learning Dual Encoding Model for Adaptive | visual | Understanding in Visual Dialogue |
Learning Dynamic Siamese Network for | visual | Object Tracking |
Learning Efficient Multi-Agent Cooperative | visual | Exploration |
Learning Everything about Anything: Webly-Supervised | visual | Concept Learning |
Learning explicit and implicit | visual | manifolds by information projection |
Learning Feature Channel Weighting for Real-Time | visual | Tracking |
Learning Federated | visual | Prompt in Null Space for MRI Reconstruction |
Learning Foresightful Dense | visual | Affordance for Deformable Object Manipulation |
Learning From | visual | Demonstrations via Replayed Task-Contrastive Model-Agnostic Meta-Learning |
Learning Fundamental | visual | Concepts Based on Evolved Multi-Edge Concept Graph |
Learning generalized | visual | odometry using position-aware optical flow and geometric bundle adjustment |
Learning Generative | visual | Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories |
Learning Generic Prior Models for | visual | Computation |
Learning Graph Features for Colored Mesh | visual | Quality Assessment |
Learning Graphs to Model | visual | Objects across Different Depictive Styles |
Learning Hierarchical Features for | visual | Object Tracking With Recursive Neural Networks |
Learning inter-related | visual | dictionary for object recognition |
Learning Invariant | visual | Representations for Compositional Zero-Shot Learning |
Learning Joint Top-Down and Bottom-up Processes for 3D | visual | Inference |
Learning Joint | visual | Semantic Matching Embeddings for Language-guided Retrieval |
Learning Kernels for Unsupervised Domain Adaptation with Applications to | visual | Object Recognition |
Learning Knowledge-Directed | visual | Strategies |
Learning language to symbol and language to vision mapping for | visual | grounding |
Learning Latent Temporal Connectionism of Deep Residual | visual | Abstractions for Identifying Surgical Tools in Laparoscopy Procedures |
Learning Like a Child: Fast Novel | visual | Concept Learning from Sentence Descriptions of Images |
Learning Linear Regression via Single-Convolutional Layer for | visual | Object Tracking |
Learning Local Appearances With Sparse Representation for Robust and Fast | visual | Tracking |
Learning Localization-Aware Target Confidence for Siamese | visual | Tracking |
Learning Low-Rank and Sparse Discriminative Correlation Filters for Coarse-to-Fine | visual | Object Tracking |
Learning mappings for face synthesis from near infrared to | visual | light images |
Learning Meshes for Dense | visual | SLAM |
Learning Monocular | visual | Odometry via Self-supervised Long-term Modeling |
Learning more discriminative clues with gradual attention for fine-grained | visual | categorization |
Learning Motion-Perceive Siamese Network for Robust | visual | Object Tracking |
Learning Multi-domain Convolutional Neural Networks for | visual | Tracking |
Learning Multi-feature Based Spatially Regularized and Scale Adaptive Correlation Filters for | visual | Tracking |
Learning multi-scale sparse representation for | visual | tracking |
Learning Multi-Task Correlation Particle Filters for | visual | Tracking |
Learning multimodal relationship interaction for | visual | relationship detection |
Learning multiple | visual | tasks while discovering their structure |
Learning Navigational | visual | Representations with Semantic Map Supervision |
Learning non-metric | visual | similarity for image retrieval |
Learning object intrinsic structure for robust | visual | tracking |
Learning Object Models From | visual | Observation and Background Knowledge |
Learning Object Relation Graph and Tentative Policy for | visual | Navigation |
Learning occlusion with likelihoods for | visual | tracking |
Learning of | visual | Modules from Examples: A Framework for Understanding Adaptive Visual Performance |
Learning of | visual | Modules from Examples: A Framework for Understanding Adaptive Visual Performance |
Learning of | visual | Relations: The Devil is in the Tails |
Learning On-Road | visual | Control for Self-Driving Vehicles With Auxiliary Tasks |
Learning Pairwise Dissimilarity Profiles for Appearance Recognition in | visual | Surveillance |
Learning Partial Correlation based Deep | visual | Representation for Image Classification |
Learning Pre-attentive Driving Behaviour from Holistic | visual | Features |
Learning Quintuplet Loss for Large-Scale | visual | Geolocalization |
Learning Recurrent Memory Activation Networks for | visual | Tracking |
Learning Relationship-Aware | visual | Features |
Learning Reliable | visual | Saliency For Model Explanations |
Learning reliable-spatial and spatial-variation regularization correlation filters for | visual | tracking |
Learning Representation on Optimized High-Order Manifold for | visual | Classification |
Learning Representations by Predicting Bags of | visual | Words |
Learning Robust Deep | visual | Representations from EEG Brain Recordings |
Learning Robust | visual | -Semantic Embeddings |
Learning Rotation Adaptive Correlation Filters in Robust | visual | Object Tracking |
Learning Rotation-Equivariant Features for | visual | Correspondence |
Learning salient | visual | word for scalable mobile image retrieval |
Learning Science Using AR Book: A Preliminary Study on | visual | Needs of Deaf Learners |
Learning Self-supervised Audio- | visual | Representations for Sound Recommendations |
Learning Semantic and | visual | Similarity for Endomicroscopy Video Retrieval |
Learning Semantic Scene Models From Observing Activity in | visual | Surveillance |
Learning semantic | visual | concepts from video |
Learning semantic | visual | vocabularies using diffusion distance |
Learning Semantic-Aware Local Features for Long Term | visual | Localization |
Learning Semantic- | visual | Embeddings with a Priority Queue |
Learning Semantics for | visual | Place Recognition Through Multi-scale Attention |
Learning Semantics-Guided | visual | Attention for Few-Shot Image Classification |
Learning Shape Descriptions: Generating and Generalizing Models of | visual | Objects |
Learning Shared, Discriminative, and Compact Representations for | visual | Recognition |
Learning Sight from Sound: Ambient Sound Provides Supervision for | visual | Learning |
Learning spatial self-attention information for | visual | tracking |
Learning Spatial-Aware Regressions for | visual | Tracking |
Learning Spatial-Context-Aware Global | visual | Feature Representation for Instance Image Retrieval |
Learning Spatial-Frequency Transformer for | visual | Object Tracking |
Learning Spatial-Temporal Regularized Correlation Filters for | visual | Tracking |
Learning Spatially Regularized Correlation Filters for | visual | Tracking |
Learning spatially regularized similarity for robust | visual | tracking |
Learning Spatio-Appearance Memory Network for High-Performance | visual | Tracking |
Learning spatio-temporal context via hierarchical features for | visual | tracking |
Learning Spatio-Temporal Transformer for | visual | Tracking |
Learning statistically relevant edge structure improves low-level | visual | descriptors |
Learning Structured | visual | Detectors From User Input At Multiple Levels |
Learning structured | visual | dictionary for object tracking |
Learning Structures of | visual | Patterns from Single Instances |
Learning Support Correlation Filters for | visual | Tracking |
Learning target-aware correlation filters for | visual | tracking |
Learning Target-specific Response Attention for Siamese Network Based | visual | Tracking |
Learning Temporal-Correlated and Channel- Decorrelated Siamese Networks for | visual | Tracking |
Learning temporally correlated representations using LSTMS for | visual | tracking |
Learning the Best Pooling Strategy for | visual | Semantic Embedding |
Learning the Compositional Nature of | visual | Object Categories for Recognition |
Learning the Compositional Nature of | visual | Objects |
Learning the Distribution-Based Temporal Knowledge with Low Rank Response Reasoning for UAV | visual | Tracking |
Learning the easy things first: Self-paced | visual | category discovery |
Learning the model update with local trusted templates for | visual | tracking |
Learning the Multilinear Structure of | visual | Data |
Learning the Roots of | visual | Domain Shift |
Learning the | visual | Interpretation of Sentences |
Learning to Adversarially Blur | visual | Object Tracking |
Learning to Aggregate and Refine Noisy Labels for | visual | Sentiment Analysis |
Learning to Answer Questions in Dynamic Audio- | visual | Scenarios |
Learning to Ask Informative Sub-Questions for | visual | Question Answering |
Learning to Assemble Neural Module Tree Networks for | visual | Grounding |
Learning to Collocate | visual | -Linguistic Neural Modules for Image Captioning |
Learning to Compose and Reason with Language Tree Structures for | visual | Grounding |
Learning to Compose Dynamic Tree Structures for | visual | Contexts |
Learning to Compose Hypercolumns for | visual | Correspondence |
Learning to describe color composition of | visual | objects |
Learning to Detect Salient Objects in Natural Scenes Using | visual | Attention |
Learning to Diffuse: A New Perspective to Design PDEs for | visual | Analysis |
Learning to Discover Novel | visual | Categories via Deep Transfer Clustering |
Learning to Distribute Vocabulary Indexing for Scalable | visual | Search |
Learning to Generate Grounded | visual | Captions Without Localization Supervision |
Learning to Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained | visual | -Semantic Space |
Learning to Learn How to Learn: Self-Adaptive | visual | Navigation Using Meta-Learning |
Learning to Learn Image Classifiers With | visual | Analogy |
Learning to Learn Words from | visual | Scenes |
Learning to Localize Sound Source in | visual | Scenes |
Learning to Localize Sound Sources in | visual | Scenes: Analysis and Applications |
Learning to Locate Informative Features for | visual | Identification |
Learning to Match Anchors for | visual | Object Detection |
Learning to Perform | visual | Tasks from Human Demonstrations |
Learning to Predict Salient Faces: A Novel | visual | -Audio Saliency Model |
Learning to predict the perceived | visual | quality of photos |
Learning to Predict | visual | Attributes in the Wild |
Learning to rank approach for refining image retrieval in | visual | arts |
Learning to Rank Proposals for Siamese | visual | Tracking |
Learning to Reason: End-to-End Module Networks for | visual | Question Answering |
Learning to recognize generic | visual | categories using a hybrid structural approach |
Learning to Recognize | visual | Dynamic Events from Examples |
Learning to Remember Past to Predict Future for | visual | Tracking |
Learning to Segment Actions from | visual | and Language Instructions via Differentiable Weak Sequence Alignment |
Learning to Select Long-Track Features for Structure-From-Motion and | visual | SLAM |
Learning to share | visual | appearance for multiclass object detection |
Learning to Supervise Knowledge Retrieval Over a Tree Structure for | visual | Question Answering |
Learning to Track the | visual | -Motion of Contours |
Learning to Weight Color and Depth for RGB-D | visual | Search |
Learning top down scene context for | visual | attention modeling in natural images |
Learning Unsupervised Video Object Segmentation Through | visual | Attention |
Learning Variance Kernelized Correlation Filters for Robust | visual | Object Tracking |
Learning Versatile Convolution Filters for Efficient | visual | Recognition |
Learning Video Object Segmentation with | visual | Memory |
Learning Video Preferences Using | visual | Features and Closed Captions |
Learning | visual | and textual representations for multimodal matching and classification |
Learning | visual | Attention to Identify People with Autism Spectrum Disorder |
Learning | visual | Behavior for Gesture Analysis |
Learning | visual | Body-shape-Aware Embeddings for Fashion Compatibility |
Learning | visual | categories through a sparse representation classifier based cross-category knowledge transfer |
Learning | visual | Clothing Style with Heterogeneous Dyadic Co-Occurrences |
Learning | visual | Commonsense for Robust Scene Graph Generation |
Learning | visual | Compound Models from Parallel Image-Text Datasets |
Learning | visual | Concepts for Content Based Retrieval |
Learning | visual | Context by Comparison |
Learning | visual | Contexts for Image Annotation From Flickr Groups |
Learning | visual | dictionaries and decision lists for object recognition |
Learning | visual | Dictionaries from Class-Specific Superpixel Segmentation |
Learning | visual | Emotion Representations From Web Data |
Learning | visual | Engagement for Trauma Recovery |
Learning | visual | Explanations for DCNN-based Image Classifiers Using an Attention Mechanism |
Learning | visual | features for relational CBIR |
Learning | visual | Features from Large Weakly Supervised Data |
Learning | visual | flows: A Lie algebraic approach |
Learning | visual | Free Space Detection for Deep-diving Robots |
Learning | visual | Ideals |
Learning | visual | Instance Retrieval from Failure: Efficient Online Local Metric Adaptation from Negative Samples |
Learning | visual | Knowledge Memory Networks for Visual Question Answering |
Learning | visual | Knowledge Memory Networks for Visual Question Answering |
Learning | visual | Landmarks for Localization with Minimal Supervision |
Learning | visual | Landmarks for Pose Estimation |
Learning | visual | Models for Lipreading |
Learning | visual | Models from Shape Contours Using Multiscale Convex/Concave Structure Matching |
Learning | visual | models of semantic concepts |
Learning | visual | Motion Segmentation Using Event Surfaces |
Learning | visual | N-Grams from Web Data |
Learning | visual | operators from examples: a new paradigm in image processing |
Learning | visual | Quality Inspection from Multiple Humans Using Ensembles of Classifiers |
Learning | visual | Question Answering by Bootstrapping Hard Attention |
Learning | visual | question answering on controlled semantic noisy labels |
Learning | visual | Recognition With Bayesian Networks |
Learning | visual | Relationship and Context-Aware Attention for Image Captioning |
Learning | visual | Representation from Modality-Shared Contrastive Language-Image Pre-training |
Learning | visual | Representations using Images with Captions |
Learning | visual | Representations via Language-Guided Sampling |
Learning | visual | Representations with Caption Annotations |
Learning | visual | saliency using topographic independent component analysis |
Learning | visual | Shape Lexicon for Document Image Content Recognition |
Learning | visual | Similarity Measures for Comparing Never Seen Objects |
Learning | visual | Speech |
Learning | visual | Storylines with Skipping Recurrent Neural Networks |
Learning | visual | Styles from Audio-Visual Associations |
Learning | visual | Styles from Audio-Visual Associations |
Learning | visual | variation for object recognition |
Learning | visual | Voice Activity Detection with an Automatically Annotated Dataset |
Learning | visual | -Spatial Saliency for Multiple-Shot Person Re-Identification |
Learning-based compression of | visual | objects for smart surveillance |
Learning-Based Multi-UAV Flocking Control With Limited | visual | Field and Instinctive Repulsion |
Learning-Based Object Segmentation Using Regional Spatial Templates and | visual | Features |
Learning-Based Prediction of | visual | Attention for Video Signals |
Learning-based | visual | Compression |
Least squares estimation-based adaptive observation model for aerial | visual | tracking applications |
Lens Model Selection for | visual | Tracking |
Less is More: Pursuing the | visual | Turing Test with the Kuleshov Effect |
Lessons from the Primate | visual | System |
Let's Weave the | visual | Web |
Leveraging attention-based | visual | clue extraction for image classification |
Leveraging High Level | visual | Information for Matching Images and Captions |
Leveraging Human | visual | Perception for an Optimized Virtual Reality Experience |
Leveraging image based prior for | visual | place recognition |
Leveraging Local and Global Cues for | visual | Tracking via Parallel Interaction Network |
Leveraging local and global descriptors in parallel to search correspondences for | visual | localization |
Leveraging observation uncertainty for robust | visual | tracking |
Leveraging over prior knowledge for online learning of | visual | categories |
Leveraging recent advances in deep learning for audio- | visual | emotion recognition |
Leveraging Semantic Scene Characteristics and Multi-Stream Convolutional Architectures in a Contextual Approach for Video-Based | visual | Emotion Recognition in the Wild |
Leveraging Tacit Information Embedded in CNN Layers for | visual | Tracking |
Leveraging TCN and Transformer for effective | visual | -audio fusion in continuous emotion recognition |
Leveraging the Video-Level Semantic Consistency of Event for Audio- | visual | Event Localization |
Leveraging | visual | Attention for out-of-distribution Detection |
Leveraging | visual | concepts and query performance prediction for semantic-theme-based video retrieval |
Leveraging | visual | Prompts To Guide Language Modeling for Referring Video Object Segmentation |
Leveraging | visual | Question Answering for Image-Caption Ranking |
Lexicon-driven recognition of one-stroke character strings in | visual | gesture |
LFI-CAM: Learning Feature Importance for Better | visual | Explanation |
LGCOAMix: Local and Global Context-and-Object-Part-Aware Superpixel-Based Data Augmentation for Deep | visual | Recognition |
LiDAR- | visual | -Inertial Odometry Based on Optimized Visual Point-Line Features |
LiDAR- | visual | -Inertial Odometry Based on Optimized Visual Point-Line Features |
Lidar- | visual | -Inertial Odometry Using Point and Line Features |
LiDAR/ | visual | SLAM Backend with Loop Closure Detection and Graph Optimization, A |
Lifelong robotic | visual | -tactile perception learning |
Lifelong | visual | -Tactile Spectral Clustering for Robotic Object Perception |
Lighter and Faster Cross-Concatenated Multi-Scale Residual Block Based Network for | visual | Saliency Prediction |
Lightweight Deep Neural Network for Real-Time | visual | Tracking with Mutual Learning |
Lightweight Pixel Difference Networks for Efficient | visual | Representation Learning |
Lightweight Protection of | visual | Data Using High-Dimensional Wavelet Parametrization |
Lightweight real-time error-resilient encoding of | visual | sensor data |
Lightweight Salient Object Detection via Hierarchical | visual | Perception Learning |
Lightweight single image deraining algorithm incorporating | visual | saliency |
Likelihood Map Fusion for | visual | Object Tracking |
Likelihood tuning for particle filter in | visual | tracking |
Line-based | visual | odometry using local gradient fitting |
Linear color-separable human | visual | system models for vector error diffusion halftoning |
Linear Demosaicing Inspired by the Human | visual | System |
Linear Neural Circuitry Model for | visual | Receptive Fields |
Linear Scale-Space II: Early | visual | Operators |
Linear Velocity-Free | visual | Servoing Control for Unmanned Helicopter Landing on a Ship With Visibility Constraint |
Linearization to Nonlinear Learning for | visual | Tracking |
Linguistic Feature Vector for the | visual | Interpretation of Sign Language, A |
Linguistic Structures as Weak Supervision for | visual | Scene Graph Generation |
Linguistically Routing Capsule Network for Out-of-distribution | visual | Question Answering |
Linker: Learning Long Short-term Associations for Robust | visual | Tracking |
Linking text and | visual | concepts semantically for cross modal multimedia search |
Linking | visual | concept detection with viewer demographics |
Linking | visual | saliency deviation to image quality degradation: A saliency deviation-based image quality index |
Lip Reading: Automatic | visual | Recognition of Spoken Words |
Lip2Vec: Efficient and Robust | visual | Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping |
Lip2Vec: Efficient and Robust | visual | Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping |
LipFormer: Learning to Lipread Unseen Speakers Based on | visual | -Landmark Transformers |
Listen and Look: Audio- | visual | Matching Assisted Speech Source Separation |
Listening with Your Eyes: Towards a Practical | visual | Speech Recognition System Using Deep Boltzmann Machines |
Live Demonstration: Event-based | visual | Microphone |
LiveSketch: Query Perturbations for Guided Sketch-Based | visual | Search |
LiVLR: A Lightweight | visual | -Linguistic Reasoning Framework for Video Question Answering |
LM-Reloc: Levenberg-Marquardt Based Direct | visual | Relocalization |
Local 3D Symmetry for | visual | Saliency in 2.5D Point Clouds |
Local Aggregation for Unsupervised Learning of | visual | Embeddings |
Local co-occurrence features in subspace obtained by KPCA of local blob | visual | words for scene classification |
Local Dissimilarity Measures of Frames in | visual | Substitution System for Blind People |
Local Feature Reliability Measure Consistent with Match Conditions for Mobile | visual | Search |
Local Features and | visual | Words Emerge in Activations |
Local Hypersphere Coding Based on Edges between | visual | Words |
Local relation network with multilevel attention for | visual | question answering |
Local to Global: Efficient | visual | Localization for a Monocular Camera |
Local | visual | features extraction from texture+depth content based on depth image analysis |
Local | visual | Operator Which Recognizes Edges and Lines, A |
Local | visual | patch for 3d shape retrieval |
Local | visual | Primitives (LVP) for Face Modelling and Recognition |
Local-feature-based similarity measure for stochastic resonance in | visual | perception of spatially structured images |
Local-Guided Global: Paired Similarity Representation for | visual | Reinforcement Learning |
local-motion-based probabilistic model for | visual | tracking, A |
Locale-based | visual | Object Retrieval Under Illumination Change |
Locality-Constrained Collaborative Model for Robust | visual | Tracking |
Localization and Manipulation of Immoral | visual | Cues for Safe Text-to-Image Generation |
Localization Based on Semantic Map and | visual | Inertial Odometry |
Localize to Binauralize: Audio Spatialization from | visual | Sound Source Localization |
Localized Receptive Fields May Mediate Transformation-Invariant Recognition in the | visual | Cortex |
Localizing a polyhedral object in a robot hand by integrating | visual | and tactile data |
Localizing | visual | Sounds the Easy Way |
Localizing | visual | Sounds the Hard Way |
Locally adaptive subspace and similarity metric learning for | visual | data clustering and retrieval |
Locally Adaptive Support-Weight Approach for | visual | Correspondence Search |
Locally discriminative stable model for | visual | tracking with clustering and principle component analysis |
Locally Varying Distance Transform for Unsupervised | visual | Anomaly Detection |
Locally Weighted Fixation Density-Based Metric for Assessing the Quality of | visual | Saliency Predictions, A |
Locating | visual | Explanations for Video Question Answering |
Location and recognition of flashlight projections for | visual | interfaces |
Location-Wise Predetermined Deployment for Optimizing Lifetime in | visual | Sensor Networks, A |
Location2Vec: A Situation-Aware Representation for | visual | Exploration of Urban Locations |
LocTex: Learning Data-Efficient | visual | Representations from Localized Textual Supervision |
Logarithmic Spiral Grid and Gaze Control for the Development of Strategies of | visual | Segmentation on a Document |
Logical Implications for | visual | Question Answering Consistency |
LogicSeg: Parsing | visual | Semantics with Neural Logic Learning and Reasoning |
Logistic regression projection-based feature representation for | visual | domain adaptation |
LoGoPrompt: Synthetic Text Images Can Be Good | visual | Prompts for Vision-Language Models |
LOH and Behold: Web-Scale | visual | Search, Recommendation and Clustering Using Locally Optimized Hashing |
LOIS: Looking Out of Instance Semantics for | visual | Question Answering |
Long and Short Memory Balancing in | visual | Co-Tracking Using Q-Learning |
Long-Range Comprehensive Modeling for Fine-Grained | visual | Classification |
Long-Tailed Multi-Label | visual | Recognition by Collaborative Training on Uniform and Re-balanced Samplings |
Long-tailed | visual | Recognition via Gaussian Clouded Logit Adjustment |
Long-Tailed | visual | Recognition via Self-Heterogeneous Integration with Knowledge Excavation |
Long-Term Incremental Web-Supervised Learning of | visual | Concepts via Random Savannas |
Long-Term Recurrent Convolutional Networks for | visual | Recognition and Description |
Long-Term Variability of Atmospheric | visual | Range (1980-2020) over Diverse Topography of Pakistan |
Long-Term | visual | Localization Revisited |
Long-Term | visual | Localization with Mobile Sensors |
Long-term | visual | Map Sparsification with Heterogeneous GNN |
Long-Term | visual | Object Tracking Benchmark |
Long-term | visual | Place Recognition |
Longitudinal error improvement by | visual | odometry trajectory trail and road segment matching |
Look and Think Twice: Capturing Top-Down | visual | Attention with Feedback Convolutional Neural Networks |
Look Before You Leap: Learning Landmark Features for One-Stage | visual | Grounding |
Look Here! A Parametric Learning Based Approach to Redirect | visual | Attention |
Look, Imagine and Match: Improving Textual- | visual | Cross-Modal Retrieval with Generative Models |
Look, Radiate, and Learn: Self-Supervised Localisation via Radio- | visual | Correspondence |
Looking and Hearing Into Details: Dual-Enhanced Siamese Adversarial Network for Audio- | visual | Matching |
Looking into Your Speech: Learning Cross-modal Affinity for Audio- | visual | Speech Separation |
Looktel: A comprehensive platform for computer-aided | visual | assistance |
Loop Closing for | visual | Pose Tracking during Close-Range 3-D Modeling |
Lossless Tagged | visual | Cryptography Scheme, A |
Lossless Watermarking Considering the Human | visual | System |
Lost! Leveraging the Crowd for Probabilistic | visual | Self-Localization |
Low bit rate audio- | visual | communication having improved face and lip region detection |
Low bit-rate compression of underwater image based on human | visual | system |
Low bit-rate video coding via mode-dependent adaptive regression for wireless | visual | communications |
Low Delay Foveated | visual | Communications over Wireless Channels |
Low Dimensional | visual | Attributes: An Interpretable Image Encoding |
Low Latency 2D Position Estimation with a Line Scan Camera for | visual | Servoing |
Low Transmission Overhead Framework of Mobile | visual | Search Based on Vocabulary Decomposition, A |
Low | visual | difference virtual high dynamic range image synthesizer from a single legacy image |
Low-complexity block tree image coder for | visual | sensor networks |
low-cost | visual | sensor network for elderly care, A |
Low-cost | visual | tracking with an intelligent wheelchair for innovative assistive care |
low-cost, accurate strain measurement using multi-view amplification mechanism and | visual | polydimethylsiloxane lens, A |
Low-level features for | visual | attribute recognition: An evaluation |
Low-Power Neuromorphic System for Real-Time | visual | Activity Recognition, A |
Low-Rank High-Order Tensor Completion With Applications in | visual | Data |
Low-rank regularized multi-view inverse-covariance estimation for | visual | sentiment distribution prediction |
Low-Rank Representation with Graph Constraints for Robust | visual | Tracking |
Low-Rank Sparse Learning for Robust | visual | Tracking |
Low-rank tensor completion for | visual | data recovery via the tensor train rank-1 decomposition |
Low-Rank Tensor Completion Method for Implicitly Low-Rank | visual | Data |
Low-resolution color-based | visual | tracking with state-space model identification |
Low-Shot | visual | Recognition by Shrinking and Hallucinating Features |
Low-Visibility Vehicle-Road Environment Perception Based on the Multi-Modal | visual | Features Fusion of Polarization and Infrared |
Lower-level Estimates and Interpretation of | visual | Motion |
LPCL: Localized prominence contrastive learning for self-supervised dense | visual | pre-training |
LRGAN: | visual | anomaly detection using GAN with locality-preferred recoding |
LSDT: Latent Sparse Domain Transfer Learning for | visual | Adaptation |
LSTM for Image Annotation with Relative | visual | Importance |
LUCFER: A Large-Scale Context-Sensitive Image Dataset for Deep Learning of | visual | Emotions |
Lung Nodule Classification by Jointly Using | visual | Descriptors and Deep Features |
L_inf Norm Based Solution for | visual | Odometry |
M-CoTransT: Adaptive spatial continuity in | visual | tracking |
M-O SiamRPN with Weight Adaptive Joint MIoU for UAV | visual | Localization |
M-SBIR: An Improved Sketch-Based Image Retrieval Method Using | visual | Word Mapping |
MA-VIED: A Multisensor Automotive | visual | Inertial Event Dataset |
Machine Learning and Computing for | visual | Semantic Analysis |
Machine learning for big | visual | analysis |
Machine Vision: Automated | visual | Inspection and Robot Vision |
Machine-to-Machine | visual | Dialoguing with ChatGPT for Enriched Textual Image Description |
MADE: A Composite | visual | -Based 3D Shape Descriptor |
MagicLensVS: Towards a flexible framework for quick setup of | visual | feedback in a virtual studio |
Maintaining Reasoning Consistency in Compositional | visual | Question Answering |
Maintaining Trajectories of Salient Objects for Robust | visual | Tracking |
Make-A-Story: | visual | Memory Conditioned Consistent Story Generation |
Making Convolutional Networks Recurrent for | visual | Sequence Learning |
Making Heads or Tails: Towards Semantically Consistent | visual | Counterfactuals |
Making History Matter: History-Advantage Sequence Training for | visual | Dialog |
Making machine intelligence less scary for criminal analysts: reflections on designing a | visual | comparative case analysis tool |
Making the V in VQA Matter: Elevating the Role of Image Understanding in | visual | Question Answering |
Making | visual | Object Categorization More Challenging: Randomized Caltech-101 Data Set |
Man Made Object Recognition Based on | visual | Perception |
Man-Machine Communication System Based on the | visual | Analysis of Dynamic Gestures, A |
Manifold constraint transfer for | visual | structure-driven optimization |
Manifold Siamese Network: A Novel | visual | Tracking ConvNet for Autonomous Vehicles |
Manifold-Based | visual | Object Counting |
ManipulaTHOR: A Framework for | visual | Object Manipulation |
Manipulating Template Pixels for Model Adaptation of Siamese | visual | Tracking |
Map Archive Mining: | visual | -Analytical Approaches to Explore Large Historical Map Collections |
MAP-based image tag recommendation using a | visual | folksonomy |
Map-Based Probabilistic | visual | Self-Localization |
Map-Free | visual | Relocalization: Metric Pose Relative to a Single Image |
Mapping | visual | features to semantic profiles for retrieval in medical imaging |
Mapping | visual | field with positron emission tomography by mathematical modeling of the retinotopic organization in the calcarine cortex |
Mapping, Localization and Path Planning for Image-Based Navigation Using | visual | Features and Map |
Margin-based discriminant dimensionality reduction for | visual | recognition |
Marked Point Process Model For | visual | Perceptual Groups Extraction, A |
Markerless Motion Capture through | visual | Hull, Articulated ICP and Subject Specific Model Generation |
Markerless | visual | Servoing Using Virtual Face Tag and Image Moment Invariants |
Markerless | visual | Tracking of a Container Crane Spreader |
Markov chain based computational | visual | attention model that learns from eye tracking data |
MART: Motion-Aware Recurrent Neural Network for Robust | visual | Tracking |
Mask-Guided Feature Extraction and Augmentation for Ultra-Fine-Grained | visual | Categorization |
Mask-Vit: an Object Mask Embedding in Vision Transformer for Fine-Grained | visual | Classification |
MaskCOV: A random mask covariance network for ultra-fine-grained | visual | categorization |
Masked Feature Prediction for Self-Supervised | visual | Pre-Training |
Masking and quantization laws in a | visual | subband coding scheme |
Massive-scale image retrieval based on deep | visual | feature representation |
Massive-scale | visual | information retrieval towards city residential environment surveillance |
MAT: Multianchor | visual | Tracking With Selective Search Region |
Match Cutting: Finding Cuts with Smooth | visual | Transitions |
Matching | visual | Features to Hierarchical Semantic Topics for Image Paragraph Captioning |
Material Appearance Transfer with | visual | Cortex Image |
MAVA: Multi-Level Adaptive | visual | -Textual Alignment by Cross-Media Bi-Attention Mechanism |
Max Planck Center for | visual | Computing and Communication |
Max-Confidence Boosting With Uncertainty for | visual | Tracking |
Maximal Weighted Coverage Deployment of UAV-Enabled Rechargeable | visual | Sensor Networks |
Maximizing image quality over | visual | Sensor Networks via DCT bit allocation |
Maximum clique based RGB-D | visual | odometry |
Maximum Likelihood Method for Estimating Performance in a Rapid Serial | visual | Presentation Target-Detection Task, A |
Maximum Margin Projection Subspace Learning for | visual | Data Analysis |
Maximum-Likelihood Approach to | visual | Event Classification, A |
Maximum-Likelihood Strategy for Directing Attention during | visual | Search, A |
MBA-VO: Motion Blur Aware | visual | Odometry |
MCMC based sampling technique for robust multi-model fitting and | visual | data segmentation |
MDAN: Multi-level Dependent Attention Network for | visual | Emotion Analysis |
Mead: A Large-scale Audio- | visual | Dataset for Emotional Talking-face Generation |
Mean Box Pooling: A Rich Image Representation and Output Embedding for the | visual | Madlibs Task |
Mean shift-based Bayesian image reconstruction into | visual | subspace |
Mean-Shift and Sparse Sampling-Based SMC-PHD Filtering for Audio Informed | visual | Speaker Tracking |
Mean-shift | visual | Tracking with NP-Windows Density Estimates |
Measurement errors in | visual | servoing |
Measurement Function Design for | visual | Tracking Applications |
Measurement of 3D | visual | Fatigue Using Event-Related Potential (ERP): 3D Oddball Paradigm |
Measurement of | visual | Motion, The |
Measuring conceptual relation of | visual | words for visual categorization |
Measuring conceptual relation of | visual | words for visual categorization |
Measuring robustness of | visual | SLAM |
Measuring the Effect of High-Level | visual | Masking in Subjective Image Quality Assessment with Priming |
Measuring | visual | distraction in driving: The potential of head movement analysis |
Measuring | visual | Motion from Image Sequences |
Measuring | visual | saliency by Site Entropy Rate |
Measuring | visual | Surprise Jointly from Intrinsic and Extrinsic Contexts for Image Saliency Estimation |
Medial | visual | Fragments as an Intermediate Image Representation for Segmentation and Perceptual Grouping |
Median-Pooling Grad-Cam: An Efficient Inference Level | visual | Explanation for CNN Networks in Remote Sensing Image Classification |
Medical Ultrasound Image Similarity Measurement by Human | visual | System (HVS) Modelling |
Medical | visual | Question Answering via Conditional Reasoning and Contrastive Learning |
Medical-Based Pictogram: Comprehension of | visual | Language with Semiotic Theory |
MEDIRL: Predicting the | visual | Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning |
medXGAN: | visual | Explanations for Medical Classifiers through a Generative Latent Space |
Memorable Maps: A Framework for Re-Defining Places in | visual | Place Recognition |
Memory and Expectations in Learning, Language, and | visual | Understanding |
Memory Network With Pixel-Level Spatio-Temporal Learning for | visual | Object Tracking |
Memory-Based Neighbourhood Embedding for | visual | Recognition |
Memory-Based Parameterized Skills Learning for Mapless | visual | Navigation |
Memory-efficient and GPU-oriented | visual | anomaly detection with incremental dimension reduction |
Memory-Efficient Image Databases for Mobile | visual | Search |
MeMu: Metric Correlation Siamese Network and Multi-Class Negative Sampling for | visual | Tracking |
Mental | visual | Browsing |
Mereology of | visual | Form |
Merging thermal and | visual | images by a contrast pyramid |
MeshAdv: Adversarial Meshes for | visual | Recognition |
MeshLoc: Mesh-Based | visual | Localization |
Meta Module Network for Compositional | visual | Reasoning |
Meta-Analysis of Vibrotactile and | visual | Information Displays for Improving Task Performance, A |
Meta-learning Approach for Domain Generalisation across | visual | Modalities in Vehicle Re-identification, A |
Meta-tracker: Fast and Robust Online Adaptation for | visual | Object Trackers |
MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled | visual | Recognition |
MetaCLUE: Towards Comprehensive | visual | Metaphors Research |
MetaSAug: Meta Semantic Augmentation for Long-Tailed | visual | Recognition |
MetaVG: A Meta-Learning Framework for | visual | Grounding |
Method and apparatus for evaluating the | visual | quality of processed digital video sequences |
Method and apparatus for processing both still and moving | visual | pattern images |
Method and apparatus for producing audio- | visual | synthetic speech |
Method and apparatus for summarizing and indexing the contents of an audio- | visual | presentation |
Method and system for detecting conscious hand movement patterns and computer-generated | visual | feedback for facilitating human-computer interaction |
Method and system for generating facial animation values based on a combination of | visual | and audio information |
Method for adapting quantization in video coding using face detection and | visual | eccentricity weighting |
Method for Automating the | visual | Inspection of Printed Wiring Boards, A |
Method for minimizing | visual | artifacts converting two-dimensional motion pictures into three-dimensional motion pictures |
Method for simultaneous | visual | tracking of multiple bodies in a closed structured environment |
method for the | visual | analysis of early-stage Parkinson's disease based on virtual MRI-derived SPECT images, A |
method of | visual | metrology from uncalibrated images, A |
method to combine | visual | and infrared face image verification systems, A |
Methods and apparatuses for segmenting an audio- | visual | recording using image similarity searching and audio speaker recognition |
Methods and devices for producing and using synthetic | visual | speech based on natural coarticulation |
Methods for Multiloop Identification of | visual | and Neuromuscular Pilot Responses |
Methods for reducing | visual | discomfort in stereoscopic 3D: A review |
Methods for Volumetric Reconstruction of | visual | Scenes |
Metric Learning Based Structural Appearance Model for Robust | visual | Tracking |
Metric learning with two-dimensional smoothness for | visual | analysis |
Metric Learning-Based Multimodal Audio- | visual | Emotion Recognition |
MHSAN: Multi-Head Self-Attention Network for | visual | Semantic Embedding |
Microphone Arrays as Generalized Cameras for Integrated Audio | visual | Processing |
mid-level representation of | visual | structures for video compression, A |
Midstream Content Access of | visual | Pattern Coded Imagery |
MIL based | visual | object tracking with kernel and scale adaptation |
MILES: | visual | BERT Pre-training with Injected Language Semantics for Video-Text Retrieval |
Mind's eye: A recurrent | visual | representation for image caption generation |
Minimal Closed-form Solution for the Perspective Three Orthogonal Angles (P3oA) Problem: Application To | visual | Odometry, A |
Minimal Conditions for the | visual | Detection of Structure and Motion in Three Dimensions |
Minimized Database of Unit Selection in | visual | Speech Synthesis without Loss of Naturalness |
Minimizing Pixel Expansion in | visual | Cryptographic Scheme for General Access Structures |
Minimizing the Perceptual Impact of | visual | Distortion in Scalable Wavelet Compressed Video |
Minimum Bayes error features for | visual | recognition |
Minimum Bayes Error Features for | visual | Recognition by Sequential Feature Selection and Extraction |
Minimum Uncertainty Gap for Robust | visual | Tracking |
Mining Compact Bag-of-Patterns for Low Bit Rate Mobile | visual | Search |
Mining Compositional Features From GPS and | visual | Cues for Event Recognition in Photo Collections |
Mining discriminative co-occurrence patterns for | visual | recognition |
Mining exoticism from | visual | content with fusion-based deep neural networks |
Mining Insights From | visual | Assets |
Mining Mid-level | visual | Patterns with Deep CNN Activations |
Mining semantic affordances of | visual | object categories |
Mining Spatial-Temporal Similarity for | visual | Tracking |
Mining | visual | actions from movies |
Mining | visual | Collocation Patterns via Self-Supervised Subspace Learning |
Mirror Detection With the | visual | Chirality Cue |
Mitigating Urban | visual | Pollution through a Multistakeholder Spatial Decision Support System to Optimize Locational Potential of Billboards |
MIVC: Multiple Instance | visual | Component for Visual-Language Models |
MIVC: Multiple Instance | visual | Component for Visual-Language Models |
Mix-ViT: Mixing attentive vision transformer for ultra-fine-grained | visual | categorization |
Mixed Autoencoder for Self-Supervised | visual | Representation Learning |
Mixed Finite Element Based Neural Networks In | visual | Reconstruction |
MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource | visual | Question Answering |
MixSpeech: Cross-Modality Self-Learning with Audio- | visual | Stream Mixup for Visual Speech Translation and Recognition |
MixSpeech: Cross-Modality Self-Learning with Audio- | visual | Stream Mixup for Visual Speech Translation and Recognition |
Mixture Component Identification and Learning for | visual | Recognition |
Mixture of grouped regressors and its application to | visual | mapping |
MixVPR: Feature Mixing for | visual | Place Recognition |
MLVSNet: Multi-level Voting Siamese Network for 3D | visual | Tracking |
Mobile 3D | visual | search using the Helmert transformation of stereo features |
Mobile Manipulator Robot | visual | Servoing and Guidance for Dynamic Target Grasping |
Mobile Robot | visual | Navigation Using Multiple Features |
Mobile | visual | Assistive Apps: Benchmarks of Vision Algorithm Performance |
Mobile | visual | Location Recognition |
Mobile | visual | Search |
Mobile | visual | Search Compression With Grassmann Manifold Embedding |
Mobile | visual | Search from Dynamic Image Databases |
Mobile | visual | search on printed documents using text and low bit-rate features |
Mobile | visual | Search: Architectures, Technologies, and the Emerging MPEG Standard |
Modality-invariant | visual | Odometry for Embodied Vision |
Model based | visual | inspection of pharmaceutical tablets with photometric stereo |
Model for Cell Receptive Fields in the | visual | Striate Cortex, A |
Model for local image velocity detection of early | visual | processing |
model for selection of attributes for automatic pattern recognition. Stepwise data compression monitored by | visual | classification, A |
model for the qualitative description of images based on | visual | and spatial features, A |
Model for | visual | Flow-Field Cueing and Self-Motion Estimation, A |
Model of Attention-Guided | visual | Perception and Recognition, A |
Model of Contour Integration in Early | visual | Cortex, A |
Model of Frequency Analysis in the | visual | Cortex and the Shape from Texture Problem |
Model of Human | visual | -Motion Sensing |
Model of Predictive Control in | visual | Target Tracking, A |
Model of Saliency-Based | visual | Attention for Rapid Scene Analysis, A |
model of the | visual | attention to speed up image analysis, A |
Model of | visual | Knowledge Representation, A |
Model of | visual | Motion Sensing |
Model of | visual | Texture Discrimination Using Multiple Weak Operators and Spatial Averaging, A |
Model Predictive Control to Improve | visual | Control of Motion: Applications in Active Tracking of Moving Targets |
Model Selection for Unsupervised Learning of | visual | Context |
Model-Agnostic | visual | Explanations via Approximate Bilinear Models |
Model-assisted Labeling via Explainability for | visual | Inspection of Civil Infrastructures |
Model-based adaptive preprocessing of images in automatic | visual | inspection |
Model-based integration of | visual | cues for hand tracking |
Model-based stereo- | visual | tracking: Covariance analysis and tracking schemes |
Model-Based | visual | Feedback Control for a Hand-Eye Coordinated Robotic System |
Model-based | visual | self-localization using geometry and graphs |
Model-driven Active | visual | Tracking |
Model-free augmented reality by virtual | visual | servoing |
Model-Free | visual | Servo Swarming of Manned-Unmanned Surface Vehicles With Visibility Maintenance and Collision Avoidance |
Modeling and analysis of driver behaviour under shared control through weighted | visual | and haptic guidance |
Modeling and Representation of | visual | Information: Reply, The |
Modeling Bottom-Up | visual | Attention for Color Images |
Modeling Camera Effects to Improve | visual | Learning from Synthetic Data |
Modeling continuous | visual | features for semantic image annotation and retrieval |
Modeling Driver's | visual | Fixation Behavior Using White-Box Representations |
Modeling Ecologically Specialized Biological | visual | Systems |
Modeling Entities as Semantic Points for | visual | Information Extraction in the Wild |
Modeling of a neural network system for active | visual | perception and recognition |
Modeling of some spatio-temporal aspects of | visual | information processing in the retinal neural network |
Modeling of Unbounded Long-Range Drift in | visual | Odometry |
Modeling the Relative | visual | Tempo for Self-supervised Skeleton-based Action Recognition |
Modeling Two-Stream Correspondence for | visual | Sound Separation |
Modeling Varying Camera-IMU Time Offset in Optimization-Based | visual | -Inertial Odometry |
Modeling | visual | and word-conditional semantic attention for image captioning |
Modeling | visual | Attention's Modulatory Aftereffects on Visual Sensitivity and Quality Evaluation |
Modeling | visual | Attention's Modulatory Aftereffects on Visual Sensitivity and Quality Evaluation |
Modeling | visual | Context Is Key to Augmenting Object Detection Datasets |
Modeling | visual | Impairments with Artificial Neural Networks: a Review |
Modeling | visual | information by spatio-temporal patterns to analyze event tactic in sports video |
Modeling | visual | Information Processing in Brain: A Computer Vision Point of View and Approach |
Modeling | visual | interactive systems through dynamic visual languages |
Modeling | visual | interactive systems through dynamic visual languages |
Modeling | visual | Patterns by Integrating Descriptive and Generative Methods |
Modeling | visual | -Attention Via Selective Tuning |
Modeling, Simulation and | visual | Analysis of Crowds: A Multidisciplinary Perspective |
Modelling and combining emotions, | visual | speech and gestures in virtual head models |
Modelling periodic scene elements for | visual | surveillance |
Modelling Stochastic Context of Audio- | visual | Expressive Behaviour With Affective Processes |
Modelling the Human | visual | Process by Evolving Images from Noise |
Modelling | visual | Appearance of Handwriting |
Modelling | visual | attention and motion effect for visual quality evaluation |
Modelling | visual | attention and motion effect for visual quality evaluation |
Modelling | visual | impressions for Chinese and Pakistani ethnic groups |
Modelling | visual | Objects Invariant to Depictive Style |
Modelling | visual | saliency using degree centrality |
Models for the Perception of Speech and | visual | Form |
Models of Statistical | visual | Motion Estimation |
modified eye-in-hand stereo | visual | control for grasping unknown objects via Scara robot, A |
Modular BDPCA based | visual | feature representation for lip-reading |
Modular Graph Attention Network for Complex | visual | Relational Reasoning |
Modulating Bottom-Up and Top-Down | visual | Processing via Language-Conditional Filters |
MolGrapher: Graph-based | visual | Recognition of Chemical Structures |
Momentum Contrast for Unsupervised | visual | Representation Learning |
Monocular Omnidirectional | visual | Odometry for Outdoor Ground Vehicles |
Monocular Road Terrain Detection by Combining | visual | and Spatial Information |
Monocular | visual | Odometry and Dense 3D Reconstruction for On-Road Vehicles |
Monocular | visual | odometry from frame to frame intensity differences for planetary exploration mobile robots |
Monocular | visual | Scene Understanding: Understanding Multi-Object Traffic Scenes |
Monocular | visual | Traffic Surveillance: A Review |
Monocular | visual | -IMU Odometry: A Comparative Evaluation of Detector-Descriptor-Based Methods |
Monocular | visual | -IMU Odometry: A Comparative Evaluation of the Detector-Descriptor Based Methods |
Monocular | visual | -Inertial Navigation for Dynamic Environment |
Monocular | visual | -Inertial SLAM for Fixed-Wing UAVs Using Sliding Window Based Nonlinear Optimization |
Monocular | visual | -Inertial-Wheel Odometry Using Low-Grade IMU in Urban Areas |
Monte Carlo sampling for | visual | pose tracking |
Monte Carlo | visual | Tracking Using Color Histograms and a Spatially Weighted Oriented Hausdorff Measure |
More Real Than Real: A Study on Human | visual | Perception of Synthetic Faces |
more you learn, the less you store: Memory-controlled incremental SVM for | visual | place recognition, The |
Most and Least Retrievable Images in | visual | -Language Query Systems |
Motion Correlation Discovery for | visual | Tracking |
Motion detection and tracking using belief indicators for an automatic | visual | -surveillance system |
Motion Detection using a Model of | visual | Attention |
Motion Estimation Using a General Purpose Neural Network Simulator for | visual | Attention |
Motion features to enhance scene segmentation in active | visual | attention |
Motion JPEG2000 Coding Scheme Based on Human | visual | System for Digital Cinema |
motion model based on recurrent neural networks for | visual | object tracking, A |
Motion Observability Analysis of the Simplified Color Correlogram for | visual | Tracking |
motion parameters estimating method based on deep learning for | visual | blurred object tracking, A |
Motion Pattern Extraction and Event Detection for Automatic | visual | Surveillance |
Motion priors for multiple target | visual | tracking |
Motion Profiles for Deception Detection Using | visual | Cues |
Motion trajectory based | visual | saliency for video quality assessment |
Motion Trajectory Classification for | visual | Surveillance and Tracking |
Motion Understanding from Qualitative | visual | Dynamics |
motion-based | visual | interface for 3D visualization and robotic control applications, A |
Motion-Compensated | visual | -Pattern Image Sequence Coding for Full-Motion Multisession Videoconferencing on Multimedia Workstations |
Motion-compensated wavelet transform coder for very low bit-rate | visual | telephony |
Motion-Driven | visual | Tempo Learning for Video-Based Action Recognition |
Motion-Related Resource Allocation in Dynamic Wireless | visual | Sensor Network Environments |
Motion-tolerance contextual | visual | saliency preserving for video retargeting |
Mouth Cavity | visual | Analysis Based on Deep Learning for Oropharyngeal Swab Robot Sampling |
Move2Hear: Active Audio- | visual | Source Separation |
Movement-flow-based | visual | servoing and force control fusion for Manipulation Tasks in unstructured environments |
Movie Classifier Based on | visual | Features, A |
Movie genre classification by exploiting audio- | visual | features of previews |
Movie segmentation into scenes and chapters using locally weighted bag of | visual | words |
MovieCLIP: | visual | Scene Recognition in Movies |
Moving | visual | focus in salient object segmentation |
Moving | visual | Representations of Video Objects for Content-Based Search and Browsing |
Moving: A Modular and Flexible Platform for Embodied | visual | Navigation |
MP2020: | visual | quality assessment database for macro photography images |
MPEG-4 Systems and Description Languages: A Way Ahead in Audio | visual | Information Representation, The |
MPEG-7 | visual | Description Framework: Concepts, Accuracy, and Applications, The |
MPEG-7 | visual | Descriptors: Contributions for Automated Feature Extraction in Capsule Endoscopy |
MPEG-7 | visual | motion descriptors |
MPEG-7 | visual | shape descriptors |
MPEG-7 | visual | standard for content description-an overview, The |
MRF-MAP-MFT | visual | object segmentation based on motion boundary field |
MsVRL: Self-Supervised Multiscale | visual | Representation Learning via Cross-Level Consistency for Medical Image Segmentation |
MT-UNET: A Novel U-Net Based Multi-Task Architecture For | visual | Scene Understanding |
MTBI Identification From Diffusion MR Images Using Bag of Adversarial | visual | Features |
MTUNet: Few-shot Image Classification with | visual | Explanations |
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based | visual | Question Answering |
Multi attention module for | visual | tracking |
Multi Event Localization by Audio- | visual | Fusion with Omnidirectional Camera and Microphone Array |
Multi feature-rich synthetic colour to improve human | visual | perception of point clouds |
multi sensory approach using error bounds for improved | visual | odometry, A |
multi-agent architecture to support active fusion in a | visual | sensor network, A |
multi-agent framework for | visual | surveillance, A |
Multi-Attention DenseNet: A Scattering Medium Imaging Optimization Framework for | visual | Data Pre-Processing of Autonomous Driving Systems |
Multi-branch and Multi-scale Attention Learning for Fine-grained | visual | Categorization |
Multi-Camera Surveillance with | visual | Tagging and Generic Camera Placement |
Multi-Camera | visual | Surveillance for Motion Detection, Occlusion Handling, Tracking and Event Recognition |
Multi-Camera | visual | Surveillance System for Tracking of Reoccurrences of People, A |
Multi-class ada-boost classification of object poses through | visual | and infrared image information fusion |
Multi-class Multi-annotator Active Learning with Robust Gaussian Process for | visual | Recognition |
Multi-class Object Detection with Hough Forests Using Local Histograms of | visual | Words |
Multi-Classes and Motion Properties for Concurrent | visual | SLAM in Dynamic Environments |
Multi-cue Based | visual | Tracking in Clutter Scenes with Occlusions |
Multi-cue Correlation Filters for Robust | visual | Tracking |
Multi-cue | visual | Tracking Using Robust Feature-Level Fusion Based on Joint Sparse Representation |
Multi-Difference Image Fusion Change Detection Using a | visual | Attention Model on VHR Satellite Data |
Multi-Dimensional Traffic Congestion Detection Based on Fusion of | visual | Features and Convolutional Neural Network |
Multi-Dimensional | visual | Data Completion via Low-Rank Tensor Representation Under Coupled Transform |
Multi-dimensional | visual | tracking using scatter search particle filter |
Multi-factor cheating prevention in | visual | secret sharing by hybrid codebooks |
multi-factors approach for image quality assessment based on a human | visual | system model, A |
Multi-FoV Viewport-Based | visual | Saliency Model Using Adaptive Weighting Losses for 360° Images, A |
Multi-frame Approach to | visual | Motion Perception, A |
Multi-frame Feature Integration for Multi-camera | visual | Odometry |
Multi-hypothesis motion planning for | visual | object tracking |
Multi-kernel Correlation Filter for | visual | Tracking |
Multi-label | visual | classification with label exclusive context |
Multi-Label | visual | Feature Learning with Attentional Aggregation |
Multi-layer CNN Features Aggregation for Real-time | visual | Tracking |
Multi-layer fusion network for blind stereoscopic 3D | visual | quality prediction |
Multi-layer linear model for top-down modulation of | visual | attention in natural egocentric vision |
Multi-level Attention Networks for | visual | Question Answering |
Multi-level Discriminative Dictionary Learning towards Hierarchical | visual | Categorization |
Multi-level Fusion of Audio and | visual | Features for Speaker Identification |
Multi-Level Knowledge Injecting for | visual | Commonsense Reasoning |
Multi-level Net: A | visual | Saliency Prediction Model |
Multi-level Particle Filter Fusion of Features and Cues for Audio- | visual | Person Tracking |
Multi-level prediction Siamese network for real-time UAV | visual | tracking |
Multi-Level Reconstruction of | visual | Surfaces: Variational Principles and Finite Element Representations |
Multi-Level Signal Fusion for Enhanced Weakly-Supervised Audio- | visual | Video Parsing |
Multi-level | visual | alphabets |
Multi-Manifold Positive and Unlabeled Learning for | visual | Analysis |
Multi-modal Contextual Graph Neural Network for Text | visual | Question Answering |
Multi-Modal Dynamic Graph Transformer for | visual | Grounding |
Multi-modal Factorized Bilinear Pooling with Co-attention Learning for | visual | Question Answering |
Multi-modal spatial relational attention networks for | visual | question answering |
Multi-Modal Structure-Embedding Graph Transformer for | visual | Commonsense Reasoning |
Multi-modal Text Recognition Networks: Interactive Enhancements Between | visual | and Semantic Features |
Multi-modal Variational Faster R-CNN for Improved | visual | Object Detection in Manufacturing |
Multi-modal | visual | concept classification of images via Markov random walk over tags |
Multi-Modal | visual | Place Recognition in Dynamics-Invariant Perception Space |
Multi-Modality Latent Interaction Network for | visual | Question Answering |
Multi-modality Network with | visual | and Geometrical Information for Micro Emotion Recognition |
Multi-Mode Online Knowledge Distillation for Self-Supervised | visual | Representation Learning |
Multi-net System Configuration for | visual | Object Segmentation by Error Backpropagation |
Multi-object Motion Pattern Classification for | visual | Surveillance and Sports Video Retrieval |
Multi-Object Tracking Hierarchically in | visual | Data Taken From Drones |
Multi-object | visual | tracking based on reversible jump Markov chain Monte Carlo |
Multi-Objective Matrix Normalization for Fine-Grained | visual | Recognition |
Multi-objective | visual | Odometry |
Multi-observation | visual | recognition via joint dynamic sparse representation |
Multi-Order Feature Statistical Model for Fine-Grained | visual | Categorization |
Multi-path Neural Networks for On-device Multi-domain | visual | Classification |
Multi-Perspective LSTM for Joint | visual | Representation Learning |
Multi-proxy feature learning for robust fine-grained | visual | recognition |
Multi-resolution dyadic wavelet denoising approach for extraction of | visual | evoked potentials in the brain |
Multi-resolution recognition of 3D objects based on | visual | resolution limits |
Multi-resolutional human | visual | perception optimized pathology image progressive coding based on JPEG2000 |
Multi-Robot Repeated Area Coverage: Performance Optimization Under Various | visual | Ranges |
Multi-robot, EKF-Based | visual | SLAM System |
Multi-scale Color Local Binary Patterns for | visual | Object Classes Recognition |
Multi-scale Direct Sparse | visual | Odometry for Large-Scale Natural Environment |
Multi-Scale Learning Framework for | visual | Categorization, A |
Multi-scale relation reasoning for multi-modal | visual | Question Answering |
Multi-scale Relational Reasoning with Regional Attention for | visual | Question Answering |
Multi-scale | visual | Aggregation Residual Network for Super-Resolution |
Multi-scale | visual | attention & saliency modelling with decision theory |
Multi-scale | visual | attention for attribute disambiguation in zero-shot learning |
Multi-Scale | visual | Perception Based Progressive Feature Interaction Network for Stereo Image Super-Resolution |
Multi-scale | visual | tracking by sequential belief propagation |
Multi-sensor Fire Detection by Fusing | visual | and Non-visual Flame Features |
Multi-sensor Fire Detection by Fusing | visual | and Non-visual Flame Features |
Multi-Sensor Fusion Tracking Using | visual | Information and WI-Fl Location Estimation |
Multi-Source Image Matching Network for UAV | visual | Location, A |
Multi-Speaker Tracking From an Audio- | visual | Sensing Device |
Multi-Spectral | visual | Odometry without Explicit Stereo Matching |
Multi-stage Attention based | visual | Question Answering |
Multi-stage vector quantization towards low bit rate | visual | search |
Multi-Stage | visual | Tracking With Siamese Anchor-Free Proposal Network |
Multi-step Entropy Based Sensor Control for | visual | Object Tracking |
Multi-step Multi-camera View Planning for Real-Time | visual | Object Tracking |
Multi-Streaming of | visual | Scenes with Scalable Partial Reliability |
Multi-task Compositional Network for | visual | Relationship Detection |
Multi-Task Convolution Operators with Object Detection for | visual | Tracking |
Multi-Task Deep Dual Correlation Filters for | visual | Tracking |
Multi-Task Deep Relative Attribute Learning for | visual | Urban Perception |
Multi-task deep | visual | -semantic embedding for video thumbnail selection |
Multi-task hierarchical convolutional network for | visual | -semantic cross-modal retrieval |
Multi-task Learning for Human Affect Prediction with Auditory- | visual | Synchronized Representation |
Multi-Task Occlusion Learning for Real-Time | visual | Object Tracking |
Multi-Task Probabilistic Regression with Overlap Maximization for | visual | Tracking |
Multi-Task Rank Learning for | visual | Saliency Estimation |
Multi-task Self-Supervised | visual | Learning |
Multi-Tier Attention Network using Term-weighted Question Features for | visual | Question Answering |
Multi-type decision fusion network for | visual | Q&A |
Multi-VAE: Learning Disentangled View-common and View-peculiar | visual | Representations for Multi-view Clustering |
Multi-View Active Fine-Grained | visual | Recognition |
Multi-view depth estimation based on | visual | -hull enhanced Hybrid Recursive Matching for 3D video conference systems |
Multi-view Domain Generalization for | visual | Recognition |
Multi-View Image Classification With | visual | , Semantic and View Consistency |
Multi-View Image Registration for Wide-Baseline | visual | Sensor Networks |
Multi-view task-driven recognition in | visual | sensor networks |
Multi-View Transformer for 3D | visual | Grounding |
Multi-View | visual | Saliency-Based MRI Classification for Alzheimer's Disease Diagnosis |
Multi-view | visual | speech recognition based on multi task learning |
Multi- | visual | -Modality Human Activity Understanding |
Multiagent | visual | Surveillance of Dynamic Scenes |
Multichannel Attention Network for Analyzing | visual | Behavior in Public Speaking |
Multiclass Steady-State | visual | Evoked Potential Frequency Evaluation Using Chirp-Modulated Stimuli |
Multidimensional Indexing for Recognizing | visual | Shapes |
Multihypothesis trajectory analysis for robust | visual | tracking |
Multilabel Deep | visual | -Semantic Embedding |
Multilevel Computational Processes for | visual | Surface Reconstruction |
Multilinear Isometric Embedding for | visual | pattern analysis |
Multilingual | visual | sentiment concept clustering and analysis |
Multimedia Analysis + | visual | Analytics = Multimedia Analytics |
Multimedia forensic hash based on | visual | words |
Multimedia Search Without | visual | Analysis: The Value of Linguistic and Contextual Information |
Multimedia translation for linking | visual | data to semantics in videos |
Multimedia, Audio- | visual | Communications, Survey |
Multimodal and Multi-task Audio- | visual | Vehicle Detection and Classification |
Multimodal Continuous | visual | Attention Mechanisms |
Multimodal Contrastive Training for | visual | Representation Learning |
Multimodal Data Augmentation for | visual | -Infrared Person ReID with Corrupted Data |
Multimodal framework based on audio- | visual | features for summarisation of cricket videos |
Multimodal grid features and cell pointers for scene text | visual | question answering |
Multimodal Integration of Human-Like Attention in | visual | Question Answering |
Multimodal person authentication using speech, face and | visual | speech |
Multimodal Prompting with Missing Modalities for | visual | Recognition |
Multimodal recognition of | visual | concepts using histograms of textual concepts and selective weighted late fusion scheme |
Multimodal Saliency and Fusion for Movie Summarization Based on Aural, | visual | , and Textual Attention |
Multimodal Saliency Model for Videos With High Audio- | visual | Correspondence, A |
Multimodal tracking and classification of audio- | visual | features |
Multimodal Transformer With Multi-View | visual | Representation for Image Captioning |
Multimodal Variational Auto-encoder based Audio- | visual | Segmentation |
Multimodal | visual | Concept Learning with Weakly Supervised Techniques |
Multimodal | visual | Data Registration for Web-Based Visualization in Media Production |
Multinomial processing models in | visual | cognitive effort diagnostics |
Multipage document retrieval by textual and | visual | representations |
Multiparty | visual | Co-Occurrences for Estimating Personality Traits in Group Meetings |
Multiperson | visual | Focus of Attention from Head Pose and Meeting Contextual Cues |
Multiple Anchor Learning for | visual | Object Detection |
Multiple and variable target | visual | tracking for video-surveillance applications |
Multiple Context Features in Siamese Networks for | visual | Object Tracking |
Multiple Description Video Coding Based on Human | visual | System Characteristics |
Multiple Dynamic Object Tracking for | visual | SLAM |
Multiple Feature Fusion via Weighted Entropy for | visual | Tracking |
Multiple feature kernel hashing for large-scale | visual | search |
Multiple instance deep learning for weakly-supervised | visual | object tracking |
Multiple Instance Models Regression for Robust | visual | Tracking |
Multiple Kernel Learning for | visual | Object Recognition: A Review |
Multiple Layers of Contrasted Images for Robust Feature-Based | visual | Tracking |
Multiple Line Skew Estimation of Handwritten Images of Documents Based on a | visual | Perception Approach |
Multiple Mask Enhanced Transformer for Robust | visual | Tracking |
Multiple model adaptive | visual | tracking with correlation filters |
Multiple rotation symmetry group detection via saliency-based | visual | attention and Frieze expansion pattern |
Multiple target tracking using cognitive data association of spatiotemporal prediction and | visual | similarity |
Multiple | visual | Models Based Perceptive Analysis Framework for Multilevel Video Summarization, A |
Multiple Watermarking in | visual | Cryptography |
Multiple-Hypothesis Approach for Multiobject | visual | Tracking, A |
Multiple-Kernel, Multiple-Instance Similarity Features for Efficient | visual | Object Detection |
Multiplicative Model of Appearance for | visual | Tracking, A |
Multiplier-Less Stream Processor for 2D Filtering in | visual | Search Applications |
Multiresolution Approach to | visual | Pattern Partitioning of 3D Images |
Multiresolution Gaussian Mixture Models for | visual | Motion Estimation |
Multiscale color invariants based on the human | visual | system |
Multiscale Dictionary Learning via Cross-Scale Cooperative Learning and Atom Clustering for | visual | Signal Processing |
Multiscale Feature Extraction from the | visual | Environment in an Active Vision System |
Multiscale Minimization of Global Energy Functions in Some | visual | Recovery Problems |
Multiscale salient region-based | visual | tracking |
Multiscale spatially regularised correlation filters for | visual | tracking |
Multiscale | visual | Object Detection for Unsupervised Ubiquitous Projection Based on a Portable Projector-Camera System |
Multisensor Fusion for Scene Perception: Integrating Thermal and | visual | Imagery |
Multisensor Integration: Experiments in Integrating Thermal and | visual | Sensors |
Multisensor of thermal and | visual | images to detect concealed weapon using harmony search image fusion approach |
Multisensory integration of a sound with stereo 3-D | visual | events |
Multisensory | visual | Servoing by a Neural Network |
Multistrategical Approach in | visual | Learning |
Multistream Articulatory Feature-Based Models for | visual | Speech Recognition |
Multitarget | visual | Tracking Based Effective Surveillance With Cooperation of Multiple Active Cameras |
Multitone reconstruction | visual | cryptography based on phase periodicity |
Multivalued Default Logic for Identity Maintenance in | visual | Surveillance |
Multiview Label Sharing for | visual | Representations and Classifications |
Multiview Language Bias Reduction for | visual | Question Answering |
Multiview occlusion analysis for tracking densely populated objects based on 2-D | visual | angles |
Multiview Similarity Learning for Robust | visual | Clustering |
MUREL: Multimodal Relational Reasoning for | visual | Question Answering |
Music Gesture for | visual | Sound Separation |
MUTAN: Multimodal Tucker Fusion for | visual | Question Answering |
Mutual Learning and Feature Fusion Siamese Networks for | visual | Object Tracking |
Mutually Textual and | visual | Refinement Network for Image-Text Matching, A |
MV-CDN: Multi- | visual | Collaborative Deep Network for Change Detection of Double-Temporal Hyperspectral Images |
MVP: Multimodality-Guided | visual | Pre-training |
MVSSC: Meta-reinforcement learning based | visual | indoor navigation using multi-view semantic spatial context |
MyPlaces: detecting important settings in a | visual | diary |
Name block location in facsimile images using spatial/ | visual | cues |
Natural color image enhancement and evaluation algorithm based on human | visual | system |
Natural Contrast Statistics and the Selection of | visual | Fixations |
Natural language letter based | visual | cryptography scheme |
nature of the | visual | field, a phenomenological analysis, The |
NaturePix: | visual | Cognitive Modeling Research |
Navigation in Indoor Environments: Does the Type of | visual | Learning Stimulus Matter? |
NCL++: Nested Collaborative Learning for long-tailed | visual | recognition |
Near-duplicate keyframe retrieval with | visual | keywords and semantic context |
Near-Optimal Time Function for Secure Dynamic | visual | Cryptography |
Negative-Driven Training Pipeline for Siamese | visual | Tracking |
NEIL: Extracting | visual | Knowledge from Web Data |
Neonatal Pain Scales and Human | visual | Perception: An Exploratory Analysis Based on Facial Expression Recognition and Eye-tracking |
NeRD: A Neural Response Divergence Approach to | visual | Saliency Detection |
Nested Collaborative Learning for Long-Tailed | visual | Recognition |
Network Dissection: Quantifying Interpretability of Deep | visual | Representations |
Network in network based weakly supervised learning for | visual | tracking |
network of co-operative cameras for | visual | surveillance, A |
Network Uncertainty Informed Semantic Feature Selection for | visual | SLAM |
Networked High-Speed Vision System for 1,000-FPS | visual | Feature Communication, A |
Neural Architecture for | visual | Information-Processing, A |
Neural Descent for | visual | 3D Human Pose and Shape |
Neural Fields Models of | visual | Areas: Principles, Successes, and Caveats |
Neural guided | visual | SLAM system with Laplacian of Gaussian operator |
Neural Image Compression Using Masked Sparse | visual | Representation |
Neural Mechanisms of | visual | Flow Integration and Segregation: Insights from the Pinna-Brelstaff Illusion and Variations of It |
Neural Model for Attentional Modulation of Lateral Interactions in the | visual | Cortex, A |
Neural Model of Human Texture Processing: Texture Segmentation vs. | visual | Search, A |
Neural Model of | visual | Stereomatching: Slant, Transparency and Clouds |
neural network approach to | visual | tracking, A |
Neural network based reinforcement learning for audio- | visual | gaze control in human-robot interaction |
Neural Network for | visual | Pattern Recognition, A |
Neural Structures for | visual | -Motion Tracking |
Neural Topological SLAM for | visual | Navigation |
Neural Volumetric Memory for | visual | Locomotion Control |
Neural-Network Capable of Learning and Inference for | visual | -Pattern Recognition, A |
Neuromimetic Indicators for | visual | Perception of Motion |
Neuromorphic Architecture for Cortical Multilayer Integration of Early | visual | Tasks, A |
New Active | visual | System for Humanoid Robots, A |
New Angle-Based Spatial Modeling for Query by | visual | Thesaurus Composition, A |
New Anthropomorphic Retina-Like | visual | Sensor, A |
New Approach of GPU Accelerated | visual | Tracking, A |
New Approach of | visual | Activity Measuring with Background Subtraction Algorithms |
New Approach to Integrate Audio and | visual | Features of Speech, A |
New Approach to | visual | Servoing in Robotics, A |
New Aspect Ratio Invariant | visual | Secret Sharing Schemes Using Square Block-Wise Operation |
new authentication based cheating prevention scheme in Naor-Shamir’s | visual | cryptography, A |
New Automatic Planning of Inspection of 3D Industrial Parts by Means of | visual | System, A |
new bag of | visual | words encoding method for human action recognition, A |
new computer vision-based system to help clinicians objectively assess | visual | pursuit with the moving mirror stimulus for the diagnosis of minimally conscious state, A |
New Content-Based Image Retrieval System Using Deep | visual | Features, A |
New Datasets and Models for Contextual Reasoning in | visual | Dialog |
New Edge-Detection Method for Automatic | visual | Inspection, A |
New Formulation for Non-Linear Camera Calibration Using Virtual | visual | Servoing, A |
New Framework for Measuring 2D and 3D | visual | Information in Terms of Entropy, A |
New Hardware Module for Automated | visual | Inspection Based on a Cellular-Automaton Architecture, A |
new head pose tracking method based on stereo | visual | SLAM, A |
New Histogram Modification Based Reversible Data Hiding Algorithm Considering the Human | visual | System, A |
new image size reduction model for an efficient | visual | sensor network, A |
New Kalman-Filter-Based Framework for Fast and Accurate | visual | Tracking of Rigid Objects, A |
New local difference binary image descriptor and algorithm for rapid and precise vehicle | visual | localisation |
New Manifold Representation for | visual | Speech Recognition, A |
New Method for Projector Calibration Based on | visual | Servoing, A |
New method for the fusion of complementary information from infrared and | visual | images for object detection |
New Model-Based Approach for Industrial | visual | Inspection, A |
New models of | visual | saliency: Contourlet transform based model and hybrid model |
new multi-purpose audio- | visual | UNMC-VIER database with multiple variabilities, A |
New Partitioned Approach to Image-Based | visual | Servo Control, A |
New Pose-Detection Method for Self-Calibrated Cameras Based on Parallel Lines and Its Application in | visual | Control System |
New Preprocessing Method for Measuring Image | visual | Quality Robust to Rotation and Spatial Shifts, A |
New privilege-based | visual | cryptography with arbitrary privilege levels |
new pyramid-based color image representation for | visual | localization, A |
New reduced-reference objective stereo image quality assessment model based on human | visual | system |
new robust semi-blind image watermarking based on block classification and | visual | cryptography, A |
New strategy for CBIR by combining low-level | visual | features with a colour descriptor |
new technique for geometry based | visual | depth estimation for uncalibrated camera, A |
New Technique for | visual | Motion Alarm, A |
New | visual | Comfort-Based Stereoscopic Image Retargeting Method, A |
New | visual | Invariants for Obstacle Detection Using Optical Flow Induced from General Motion |
New | visual | Invariants for Terrain Navigation without 3D Reconstruction |
New | visual | secret sharing schemes using probabilistic method |
New | visual | Speech Recognition Approach for RGB-D Cameras, A |
new Wronskian change detection model based codebook background subtraction for | visual | surveillance applications, A |
NewsStories: Illustrating Articles with | visual | Summaries |
Next-generation 3D | visual | ization for visual surveillance |
Next-Generation Web Searches from | visual | Content |
Night Rider: | visual | Odometry Using Headlights |
NinjaDesc: Content-Concealing | visual | Descriptors via Adversarial Learning |
Ninth | visual | Object Tracking VOT2021 Challenge Results, The |
NMF-based multimodal image indexing for querying by | visual | example |
no reference texture granularity index and application to | visual | media compression, A |
No-reference image quality assessment based on | visual | codebook |
No-Reference Image Quality Assessment Using | visual | Codebooks |
No-Reference Learning-Based and Human | visual | -Based Image Quality Assessment Metric |
No-reference mesh | visual | quality assessment via ensemble of convolutional neural networks and compact multi-linear pooling |
No-reference perceptual quality assessment of stereoscopic images based on binocular | visual | characteristics |
No-Reference Quality Assessment for Screen Content Images Using | visual | Edge Model and AdaBoosting Neural Network |
No-reference quality assessment of 3D videos based on human | visual | perception |
No-reference quality assessment of HEVC video streams based on | visual | memory modelling |
No-Reference Stereoscopic Image Quality Assessment Based On | visual | Attention Mechanism |
No-reference stereoscopic image quality evaluator based on human | visual | characteristics and relative gradient orientation |
No-reference stereoscopic images quality assessment method based on monocular superpixel | visual | features and binocular visual features? |
No-reference stereoscopic images quality assessment method based on monocular superpixel | visual | features and binocular visual features? |
No-Reference Texture Regularity Metric Based on | visual | Saliency, A |
No-Reference Video Quality Assessment Using Voxel-Wise fMRI Models of the | visual | Cortex |
Nocal-Siam: Refining | visual | Features and Response With Advanced Non-Local Blocks for Real-Time Siamese Tracking |
Noise Adaptive Stream Weighting in Audio- | visual | Speech Recognition |
Noise estimation and adaptive filtering during | visual | tracking |
Noise-Aware Framework for Robust | visual | Tracking |
Noise-Tolerant Learning for Audio- | visual | Action Recognition |
NomMer: Nominate Synergistic Context in Vision Transformer for | visual | Recognition |
Non-invasive Facial | visual | -Infrared Stereo Vision Based Measurement as an Alternative for Physiological Measurement, A |
Non-Linear Direct Multi-Scale Image Enhancement Based on the Luminance and Contrast Masking Characteristics of the Human | visual | System |
Non-Metrical Navigation Through | visual | Path Control |
non-myopic approach to | visual | search, A |
Non-parametric local transforms for computing | visual | correspondence |
Non-restrictive | visual | Respiration Monitoring |
Non-sparse linear representations for | visual | tracking with online reservoir metric learning |
Non- | visual | Sensing of Metallic Pavement Markers From a Moving Vehicle |
Nonlinear Discrete Cross-Modal Hashing for | visual | -Textual Data |
Nonlinear Dynamic Model for | visual | Object Tracking on Grassmann Manifolds With Partial Occlusion Handling |
Nonlinear dynamic range transformation in | visual | communication channels |
Nonlinear Interaction of on and off Data Streams for the Detection of | visual | Structure |
Nonlinear Manifold Learning for | visual | Speech Recognition |
Nonlinear Supervised Locality Preserving Projections for | visual | Pattern Discrimination |
Nonlinear | visual | Mapping Model for 3-D Visual Tracking With Uncalibrated Eye-in-Hand Robotic System |
Nonlinear | visual | Mapping Model for 3-D Visual Tracking With Uncalibrated Eye-in-Hand Robotic System |
Nonlinearity in Simple and Complex Cells in Early Biological | visual | Systems |
Nonnegative Decompositions for Dynamic | visual | Data Analysis |
Nonparametric Treatment for Location/Segmentation Based | visual | Tracking, A |
Normalized classifier fusion for semantic | visual | concept detection |
Normalized Training for HMM-based | visual | Speech Recognition |
Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and | visual | Clustering Losses |
Not All Tokens Are Equal: Human-centric | visual | Analysis via Token Clustering Transformer |
Not Just a Matter of Semantics: The Relationship Between | visual | and Semantic Similarity |
Note on the Paper The | visual | Potential: One Convex Polygon, A |
novel 2D-to-3D scheme by | visual | attention and occlusion analysis, A |
Novel Active Vision-Based | visual | Threat Cue for Autonomous Navigation Tasks |
Novel Affective | visual | ization System for Videos Based on Acoustic and Visual Features, A |
Novel Anti-Drift | visual | Object Tracking Algorithm Based on Sparse Response and Adaptive Spatial-Temporal Context-Aware, A |
Novel Approach for Video Quantization Using the Spatiotemporal Frequency Characteristics of the Human | visual | System, A |
novel approach for | visual | Saliency detection and segmentation based on objectness and top-down attention, A |
Novel Class Activation Map for | visual | Explanations in Multi-Object Scenes, A |
novel feature extractor for human action recognition in | visual | question answering, A |
Novel framework for multimodal biometric image authentication using | visual | share neural network |
Novel general KNN classifier and general nearest mean classifier for | visual | classification |
novel generalization of the gray-scale histogram and its application to the automated | visual | measurement and inspection of wooden Pallets, A |
Novel Heterogeneous Network for Modeling Driver Attention With Multi-Level | visual | Content, A |
Novel High Breakdown M-estimator for | visual | Data Segmentation, A |
Novel Hybrid CNN-AIS | visual | Pattern Recognition Engine, A |
Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term | visual | Place Recognition, A |
Novel Image Quality Assessment With Globally and Locally Consilient | visual | Quality Perception, A |
novel incremental topological mapping using global | visual | features, A |
novel incremental weighted PCA algorithm for | visual | tracking, A |
Novel Lip Descriptor for Audio- | visual | Keyword Spotting Based on Adaptive Decision Fusion, A |
novel locally linear KNN model for | visual | recognition, A |
novel method for image classification based on bag of | visual | words, A |
novel monochromatic cue for detecting regions of | visual | interest, A |
novel Position-Based | visual | Servoing approach for robust global stability with feature points kept within the Field-of-View, A |
Novel Rate Control Algorithm for H.264/AVC Based on Human | visual | System, A |
Novel rate-distortion analysis framework for bit rate and picture quality control in DCT | visual | coding |
Novel Remote | visual | Inspection System for Bridge Predictive Maintenance, A |
Novel Resection-Intersection Algorithm With Fast Triangulation Applied to Monocular | visual | Odometry, A |
Novel Robust Statistical Method for Background Initialization and | visual | Surveillance, A |
Novel Schemes for Hyperbolic PDEs Using Osmosis Filters from | visual | Computing |
novel selective encryption scheme for H.264/AVC video with improved | visual | security, A |
Novel Shadow Detection Algorithm for Real Time | visual | Surveillance Applications, A |
Novel Ship Detection Method For Large-scale Optical Satellite Images Based On | visual | Lbp Feature And Visual Attention Model, A |
Novel Ship Detection Method For Large-scale Optical Satellite Images Based On | visual | Lbp Feature And Visual Attention Model, A |
Novel Smart Lightweight | visual | Attention Model for Fine-Grained Vehicle Recognition, A |
Novel Texture-Less Object Oriented | visual | SLAM System, A |
Novel Three-Dimensional P300 Speller Based on Stereo | visual | Stimuli, A |
Novel Two-in-One Image Secret Sharing Scheme Based on Perfect Black | visual | Cryptography, A |
Novel Video Content Classification Algorithm Based on Combined | visual | Features Model, A |
novel video salient object extraction method based on | visual | attention, A |
Novel | visual | Analysis Oriented Rate Control Scheme for HEVC, A |
Novel | visual | and Statistical Image Features for Microblogs News Verification |
novel | visual | classification framework on panoramic attention mechanism network, A |
novel | visual | codebook model based on fuzzy geometry for large-scale image classification, A |
Novel | visual | Cryptography Scheme with Different Importance of Shadows, A |
Novel | visual | Detecting and Positioning Method for Screw Holes, A |
novel | visual | distortion sensitivity analysis for video encoder bit allocation, A |
Novel | visual | Feature and Gaze Driven Egocentric Video Retargeting, A |
novel | visual | landmark matching for a biologically inspired homing, A |
Novel | visual | Narrative Framework for Tourist Map Design Based on Local Chronicles: A Case Study of the Songshan Scenic Area, A |
Novel | visual | Organization Based on Topological Perception, A |
Novel | visual | Perception Framework, A |
Novel | visual | Representation on Text Using Diverse Conditional GAN for Visual Recognition, A |
Novel | visual | Representation on Text Using Diverse Conditional GAN for Visual Recognition, A |
Novel | visual | Speech Representation and HMM Classification for Visual Speech Recognition, A |
Novel | visual | Speech Representation and HMM Classification for Visual Speech Recognition, A |
Novel | visual | tracking approach via ant lion optimiser |
Novel | visual | Word Co-occurrence Model for Person Re-identification, A |
NRspttemVQA: Real-Time Video Quality Assessment Based on the User's | visual | Perception |
NUS-PRO: A New | visual | Tracking Challenge |
NuWA: | visual | Synthesis Pre-training for Neural visUal World creAtion |
NuWA: | visual | Synthesis Pre-training for Neural visUal World creAtion |
NVAutoNet: Fast and Accurate 360° 3D | visual | Perception For Self Driving |
O(N2) Square Root Unscented Kalman Filter for | visual | Simultaneous Localization and Mapping, An |
Object Bank: An Object-Level Image Representation for High-Level | visual | Recognition |
Object Categorization Based on Kernel Principal Component Analysis of | visual | Words |
Object Categorization by Learned Universal | visual | Dictionary |
Object Category Detection Using Audio- | visual | Cues |
Object Classification and Grasp Planning Using | visual | and Tactile Sensing |
Object classification in 3D baggage security computed tomography imagery using | visual | codebooks |
Object Classification in | visual | Surveillance Using Adaboost |
Object Detection for Embedded Systems Using Tiny Spiking Neural Networks: Filtering Noise Through | visual | Attention |
Object Detection of Tobacco-Related Information Based on | visual | Features |
Object Discovery by Clustering Correlated | visual | Word Sets |
Object displays for identifying multidimensional outliers within a crowded | visual | periphery |
Object drift determination network based on dual-template joint decision-making in long-term | visual | tracking |
Object Formation by Learning in | visual | Databases using Hierarchical Content Description |
Object Level | visual | Reasoning in Videos |
Object Manipulation via | visual | Target Localization |
Object of interest-based | visual | navigation, retrieval, and semantic content identification system |
Object recognition and segmentation in videos by connecting heterogeneous | visual | features |
Object Recognition by Learning Informative, Biologically Inspired | visual | Features |
Object Recognition Model Based on | visual | Grammars and Bayesian Networks, An |
Object Recognition With an Elastic Net-Regularized Hierarchical MAX Model of the | visual | Cortex |
Object Recognition with Features Inspired by | visual | Cortex |
Object recognition with top-down | visual | attention modeling for behavioral studies |
Object Referring in | visual | Scene with Spoken Language |
Object Retrieval Using | visual | Query Context |
Object retrival based on | visual | word pairs |
Object semantic-guided graph attention feature fusion network for Siamese | visual | tracking |
Object sequences: encoding categorical and spatial information for a yes/no | visual | question answering task |
Object Templates for | visual | Place Categorization |
Object Tracking over Multiple Uncalibrated Cameras Using | visual | , Spatial and Temporal Similarities |
Object Tracking Using Deep Convolutional Neural Networks and | visual | Appearance Models |
Object-Adaptive LSTM Network for | visual | Tracking |
Object-and-action Aware Model for | visual | Language Navigation |
Object-Based | visual | 3D Tracking of Articulated Objects via Kinematic Sets |
Object-based | visual | attention for computer vision |
Object-Based | visual | Attention Model for Robotic Applications, An |
Object-based | visual | Attention: a Model for a Behaving Robot |
Object-Based | visual | Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction |
Object-Based | visual | Saliency via Laplacian Regularized Kernel Regression |
Object-Goal | visual | Navigation via Effective Exploration of Relations Among Historical Navigation States |
Object-Graphs for Context-Aware | visual | Category Discovery |
Object-Guided Day-Night | visual | localization in Urban Scenes |
Object-Oriented | visual | Saliency Detection Framework Based on Sparse Coding Representations, An |
objective distortion measure for binary document images based on human | visual | perception, An |
Objective Evaluation of Impression of Faces with Various Female Hairstyles Using Field of | visual | Perception |
Objective validation of a dynamical and plausible computational model of | visual | attention |
Objectness to improve the bag of | visual | words model |
Objectness-based smoothing stochastic sampling and coherence approximate nearest neighbor for | visual | tracking |
Oblivious Spatio-Temporal Watermarking of Digital Video by Exploiting the Human | visual | System |
OBoW: Online Bag-of- | visual | -Words Generation for Self-Supervised Learning |
Obstacle detection for pedestrians with a | visual | impairment based on 3D imaging |
Occam Algorithms for Computing | visual | -Motion |
Occlusion and Deformation Handling | visual | Tracking for UAV via Attention-Based Mask Generative Network |
Occlusion detection and drift-avoidance framework for 2D | visual | object tracking |
Occlusion Edge Blur: A Cue to Relative | visual | Depth |
Occlusion-aware 3D multiple object tracker with two cameras for | visual | surveillance |
Occlusion-robust online multi-object | visual | tracking using a GM-PHD filter with CNN-based re-identification |
OFVL-MS: Once for | visual | Localization across Multiple Indoor Scenes |
OK-VQA: A | visual | Question Answering Benchmark Requiring External Knowledge |
OKIRAKU search: Leaf images based | visual | tree search system |
Omni-directional | visual | surveillance |
Omni-range spatial contexts for | visual | classification |
Omnidirectional Image Stabilization for | visual | Object Recognition |
Omnidirectional Information Gathering for Knowledge Transfer-based Audio- | visual | Navigation |
Omnidirectional | visual | camera |
Omnidirectional | visual | image detector and processor |
Omnivore: A Single Model for Many | visual | Modalities |
On Computing Exact | visual | Hulls of Solids Bounded by Smooth Surfaces |
On Computing | visual | Flows with Boundaries: The Case of Shading and Edges |
On contrast combinations for | visual | saliency detection |
On deformable models for | visual | pattern recognition |
On Embodied | visual | Navigation in Real Environments Through Habitat |
On Environmental Model-Based | visual | Perception for Humanoids |
On Exploring Undetermined Relationships for | visual | Relationship Detection |
On general construction for extended | visual | cryptography schemes |
On Growing Persian Words with L-Systems: | visual | Modeling of Neyname |
On Guiding | visual | Attention with Language Specification |
On Models for the Perception of | visual | Texture |
On Network Design Spaces for | visual | Recognition |
On Parsing | visual | Sequences with the Hidden Markov Model |
On partitioning a dictionary for | visual | text recognition |
On Perceptual Analyzers Underlying | visual | Texture Discrimination: Part I |
On Perceptual Analyzers Underlying | visual | Texture Discrimination: Part II |
On Person Authentication by Fusing | visual | and Thermal Face Biometrics |
On Photometric Issues in 3D | visual | Recognition from a Single 2D Image |
On Pose Recovery for Generalized | visual | Sensors |
On Predicting | visual | Comfort of Stereoscopic Images: A Learning to Rank Based Approach |
On privacy and security in distributed | visual | sensor networks |
On robust image spam filtering via comprehensive | visual | modeling |
On Scale Initialization in Non-overlapping Multi-perspective | visual | Odometry |
On the Analysis and Design of | visual | Cryptography With Error Correcting Capability |
On the Audio- | visual | Synchronization for Lip-to-Speech Synthesis |
On the Bayes fusion of | visual | features |
On the binding mechanism of synchronised | visual | events |
On the burstiness of | visual | elements |
On the coherency of quantitative evaluation of | visual | explanations |
On the Correlation of Automatic Audio and | visual | Segmentations of Music Videos |
On the correspondence between objects and events for the diagnosis of situations in | visual | surveillance tasks |
On the coupled use of signal and semantic concepts to bridge the semantic and user intention gaps for | visual | content retrieval |
On the difficulty of feature-based attentional modulations in | visual | object recognition: A modeling study. |
On the Effect of Observed Subject Biases in Apparent Personality Analysis From Audio- | visual | Signals |
On the Eigenvalues of Global Covariance Pooling for Fine-Grained | visual | Recognition |
On the Estimation of Depth from Motion Using an Anthropomorphic | visual | Sensor |
On the Exploration of Convolutional Fusion Networks for | visual | Recognition |
On the General Value of Evidence, and Bilingual Scene-Text | visual | Question Answering |
On the Geometry of | visual | Correspondence |
On the Importance of | visual | Context for Data Augmentation in Scene Understanding |
On the information-theoretic assessment of | visual | communication |
On the Limits of Fourier Decompositions in | visual | Texture Perception |
On the Limits of Pseudo Ground Truth in | visual | Camera Re-localisation |
On the Medial Axis Function for | visual | Patterns |
On the optimal placement of multiple | visual | sensors |
On the Performance of Pose-Based RGB-D | visual | Navigation Systems |
On the Qualitative Structure of Temporally Evolving | visual | Motion Fields |
On the receptive field misalignment in CAM-based | visual | explanations |
On the relation between probabilistic inference and fuzzy sets in | visual | scene analysis |
On the relationship between | visual | attributes and convolutional networks |
On the Relative Complexity of Active vs. Passive | visual | Search |
On the representation of a digital contour with an unordered point set for | visual | perception |
On the Representation of | visual | Information |
On the Robustness of | visual | Cryptographic Schemes |
On the role of context in probabilistic models of | visual | saliency |
On the role of question encoder sequence model in robust | visual | question answering |
On the sampling of web images for learning | visual | concept classifiers |
On the security of a | visual | cryptography scheme for color images |
On the semantics of | visual | behaviour, structured events and trajectories of human action |
On the Spatial Extents of SIFT Descriptors for | visual | Concept Detection |
On the use of linear camera-object interaction models in | visual | servoing |
On the use of snakes for 3-D robotic | visual | tracking |
On the | visual | discrimination between small objects and large textured patterns |
On the | visual | Mathematics of Tracking |
On Using Shadowgrams for | visual | Hull Reconstruction |
On | visual | ambiguities due to transparency in motion and stereo |
On | visual | BMI analysis from facial images |
On | visual | Detection of Light Sources |
On | visual | gaze tracking based on a single low cost camera |
On | visual | masking estimation for adaptive quantization using steerable filters |
On | visual | Perception and Retinal Motions |
On | visual | Periodicity Estimation Using Singular Value Decomposition |
On | visual | Real Time Mapping for Unmanned Aerial Vehicles |
On | visual | Realism of Synthesized Imagery |
On | visual | Servoing to Improve Performance of Robotic Grasping |
On | visual | similarity based interactive product recommendation for online shopping |
On-chip semidense representation map for dense | visual | features driven by attention processes |
On-Device Mobile | visual | Location Recognition by Integrating Vision and Inertial Sensors |
On-line classroom | visual | tracking and quality evaluation by an advanced feature mining technique |
On-Line Detection of Drowsiness Using Brain and | visual | Information |
On-Line Estimation Of | visual | -Motor Models Using Active Vision |
On-Line Recognition (OLREC): A Novel Approach to | visual | Pattern Recognition |
On-line Simultaneous Learning and Tracking of | visual | Feature Graphs |
on-line | visual | human tracking algorithm using SURF-based dynamic object model, An |
On-the-fly learning for | visual | search of large-scale image and video datasets |
Once for All: A Two-Flow Convolutional Neural Network for | visual | Tracking |
Once upon a Spacetime: | visual | Storytelling in Cognitive and Geotemporal Information Spaces |
One Metric to Measure Them All: Localisation Recall Precision (LRP) for Evaluating | visual | Detection Tasks |
One scan shadow compensation and | visual | enhancement of color images |
One step beyond bags of features: | visual | categorization using components |
One-Shot Adversarial Attacks on | visual | Tracking With Dual Attention |
One-shot and Partially-Supervised Cell Image Segmentation Using Small | visual | Prompt |
One-Shot SADI-EPE: A | visual | Framework of Event Progress Estimation |
One-Stage | visual | Relationship Referring With Transformers and Adaptive Message Passing |
Online Appearance Model Learning and Generation for Adaptive | visual | Tracking |
Online Class Incremental Learning on Stochastic Blurry Task Boundary via Mask and | visual | Prompt Tuning |
Online Collaborative Learning for Open-Vocabulary | visual | Classifiers |
Online Continual Learning For | visual | Food Classification |
Online Continual Learning with Natural Distribution Shifts: An Empirical Study with | visual | Data |
Online Cross-Modal Adaptation for Audio- | visual | Person Identification With Wearable Cameras |
Online discriminative dictionary learning for | visual | tracking |
Online Feature Classification and Clustering for Transformer-based | visual | Tracker |
Online Feature Selection for | visual | Tracking |
Online Gesture Spotting from | visual | Hull Data |
Online Knowledge Distillation via Mutual Contrastive Learning for | visual | Recognition |
Online Learning for PLSA-Based | visual | Recognition |
Online learning of task-driven object-based | visual | attention control |
Online Metric-Weighted Linear Representations for Robust | visual | Tracking |
Online Multi-Expert Learning for | visual | Tracking |
Online multi-modal task-driven dictionary learning and robust joint sparse representation for | visual | tracking |
Online multiple instance gradient feature selection for robust | visual | tracking |
Online Multiple Instance Joint Model for | visual | Tracking |
Online Multiple Kernel Similarity Learning for | visual | Search |
Online Random Ferns for robust | visual | tracking |
Online Regression of Grandmother-Cell Responses with | visual | Experience Learning for Face Recognition |
Online Robust Non-negative Dictionary Learning for | visual | Tracking |
Online Scale Adaptive | visual | Tracking Based on Multilayer Convolutional Features |
Online semi-supervised compressive coding for robust | visual | tracking |
Online Spatio-temporal Structural Context Learning for | visual | Tracking |
Online State-Based Structured SVM Combined With Incremental PCA for Robust | visual | Tracking |
Online SVM and backward model validation based | visual | tracking |
Online unsupervised feature learning for | visual | tracking |
Online | visual | tracking by integrating spatio-temporal cues |
Online | visual | Tracking Using Temporally Coherent Part Cluster |
Online | visual | tracking via correlation filter with convolutional networks |
Online | visual | Tracking via Coupled Object-Context Dictionary |
Online | visual | Tracking via Two View Sparse Representation |
Online | visual | tracking with histograms and articulating blocks |
Online | visual | Vocabularies |
Online | visual | vocabulary pruning using pairwise constraints |
Ontology-Based | visual | Query Formulation: An Industry Experience |
Ontology-Driven | visual | Browsing of Historical Industrial Archives |
Open cross-domain | visual | search |
Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with | visual | Impairments |
Open-domain | visual | Entity Recognition: Towards Recognizing Millions of Wikipedia Entities |
Open-Vocabulary One-Stage Detection with Hierarchical | visual | -Language Knowledge Distillation |
OpenMix: Reviving Known Knowledge for Discovering Novel | visual | Categories in an Open World |
Operator Choice Modeling for Collaborative UAV | visual | Search Tasks |
Optical filter selection for automatic | visual | inspection |
Optical Flow Techniques Applied to the Calibration of | visual | Perception Experiments |
Optical Motion and Transformations as Stimuli for | visual | Perceptions |
Optical Transformations in | visual | Navigation |
Optimal (2,n) and (2,infinity) | visual | secret sharing by generalized random grids |
Optimal Carrier Loading for Maximizing | visual | Entropy Over OFDMA Cellular Networks |
Optimal Estimation of 3D Structures Using | visual | Servoing |
Optimal image transmission over | visual | Sensor Networks |
Optimal motion estimation from | visual | and inertial measurements |
Optimal Multiclass Classifier Threshold Estimation with Particle Swarm Optimization for | visual | Object Recognition |
Optimal power allocation for minimizing | visual | distortion over MIMO communication systems |
Optimal RBF Networks for | visual | Learning |
Optimal Transport View of Class-Imbalanced | visual | Recognition, An |
Optimal View Path Planning for | visual | SLAM |
Optimal | visual | Motion Estimation: A Note |
Optimal Window Size for | visual | Tracking for uniform CCDs |
Optimal XOR Based (2,n)- | visual | Cryptography Schemes |
Optimisation-based training of evolutionary convolution neural network for | visual | classification applications |
Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful | visual | Navigation |
Optimization of JPEG2000 Image Compression by Incorporating Human | visual | System |
Optimization of Robot Self-Localization Accuracy by Automatic | visual | -Landmark Selection |
Optimized Tone Mapping Function for Contrast Enhancement Considering Human | visual | Perception System |
Optimizing a Virtual Re-Convergence System to Reduce | visual | Fatigue in Stereoscopic Camera |
Optimizing Camera Perspective for Stereo | visual | Odometry |
Optimizing JPEG quantization table for low bit rate mobile | visual | search |
Optimizing kd-trees for scalable | visual | descriptor indexing |
Optimizing LBP Structure For | visual | Recognition Using Binary Quadratic Programming |
Optimizing | visual | dictionaries for effective image retrieval |
Optimizing | visual | Search Reranking via Pairwise Learning |
Optimizing | visual | search with implicit user feedback in interactive video retrieval |
Optimizing | visual | Vocabularies Using Soft Assignment Entropies |
Oracle Performance for | visual | Captioning |
Order determination and sparsity-regularized metric learning adaptive | visual | tracking |
Ordering of | visual | Descriptors in a Classifier Cascade Towards Improved Video Concept Detection |
Orderless and Blurred | visual | Tracking via Spatio-temporal Context |
Ordinal Measures for | visual | Correspondence |
Ordinal Representations of | visual | Space |
Organizing and Browsing Image Search Results Based on Conceptual and | visual | Similarities |
Organizing image databases as | visual | -content search trees |
Organizing | visual | Data in Structured Layout by Maximizing Similarity-Proximity Correlation |
Orientation of Point Clouds for Complex Surfaces In Medical Surgery Using Trinocular | visual | Odometry and Stereo Orb-SLAM2 |
Orientation Template Matching for Face Localization in Complex | visual | Scenes |
OrienterNet: | visual | Localization in 2D Public Maps with Neural Matching |
Oropharynx | visual | Detection by Using a Multi-Attention Single-Shot Multibox Detector for Human-Robot Collaborative Oropharynx Sampling |
OSANet: Object Semantic Attention Network for | visual | Sentiment Analysis |
Overcoming Shortcut Learning in a Target Domain by Generalizing Basic | visual | Factors from a Source Domain |
Overcoming | visual | reverberations |
Overt | visual | attention for free-viewing and quality assessment tasks: Impact of the regions of interest on a video quality metric |
Overview of Panoramic Video Projection Schemes in the IEEE 1857.9 Standard for Immersive | visual | Content Coding, An |
Overview of the MPEG-7 Standard and of Future Challenges for | visual | Information Analysis |
overview of the | visual | optimization tools in JPEG 2000, An |
Overview on | visual | SLAM: From Tradition to Semantic, An |
OWP: Objectness Weighted Patch Descriptor for | visual | Tracking |
P ˜ NP, at least in | visual | Question Answering |
P-CNN: Part-Based Convolutional Neural Networks for Fine-Grained | visual | Categorization |
Paddle Juggling of one Ball by Robot Manipulator with | visual | Servo |
PADS: Policy-Adapted Sampling for | visual | Similarity Learning |
PageSense: Toward Stylewise Contextual Advertising via | visual | Analysis of Web Pages |
Paired-Point Lifting for Enhanced Privacy-Preserving | visual | Localization |
Pairwise Confusion for Fine-Grained | visual | Classification |
Panel Session on | visual | Scene Representation |
Pano-AVQA: Grounded Audio- | visual | Question Answering on 360° Videos |
Panoptic-Level Image-to-Image Translation for Object Recognition and | visual | Odometry Enhancement |
Panorama: A What I See Is What I Want Contactless | visual | Interface |
Panoramic | visual | tracking based on adaptive mechanism |
Panoramic | visual | -Inertial SLAM Tightly Coupled with a Wheel Encoder |
Parallel Attention: A Unified Framework for | visual | Object Discovery Through Dialogs and Queries |
Parallel CBIR Implementation using Perceptual Grouping of Block-Based | visual | Patterns, A |
Parallel Coordinates: | visual | Multidimensional Geometry and Its Applications |
Parallel high resolution real-time | visual | Hull on GPU |
Parallel Imperative and Functional Approaches to | visual | Scene Labelling |
Parallel implementation of a spatio-temporal | visual | saliency model |
Parallel Tracking and Verifying: A Framework for Real-Time and High Accuracy | visual | Tracking |
Parallel | visual | Motion Analysis Using Multiscale Markov Random Fields |
Parallel-fusion LSTM with synchronous semantic and | visual | information for image captioning |
Paralleled attention modules and adaptive focal loss for Siamese | visual | tracking |
ParallelEye Pipeline: An Effective Method to Synthesize Images for Improving the | visual | Intelligence of Intelligent Vehicles |
Parametric and | visual | Programming BIM Applied to Museums, Linking Container and Content |
Parametric Eigenspace Representation for | visual | Learning and Recognition |
Parametric Processes for the Implementation of HBIM: | visual | Programming Language for the Digitisation of the Index of Masonry Quality |
Parametric Representations for Nonlinear Modeling of | visual | Data |
Part Context Learning for | visual | Tracking |
Part Segmentation of | visual | Hull for 3D Human Pose Estimation |
Part-Based Convolutional Neural Network for | visual | Recognition |
Part-Based Data Association for | visual | Tracking |
Part-based multi-graph ranking for | visual | tracking |
Part-based | visual | tracking via structural support correlation filter |
Part-Based | visual | Tracking with Online Latent Structural Learning |
Part-based | visual | tracking with spatially regularized correlation filters |
Part-Guided Relational Transformers for Fine-Grained | visual | Recognition |
Part-Object Relational | visual | Saliency |
Part-Stacked CNN for Fine-Grained | visual | Categorization |
Partial Face Matching between Near Infrared and | visual | Images in MBGC Portal Challenge |
Partial Occlusion Handling for | visual | Tracking via Robust Part Matching |
Partial | visual | -Tactile Fused Learning for Robotic Object Recognition |
Partial-Duplicate Clustering and | visual | Pattern Discovery on Web Scale Image Database |
Partial-Duplicate Image Retrieval via Saliency-Guided | visual | Matching |
Participants increasing for threshold random grids-based | visual | secret sharing |
Particle dynamics and multi-channel feature dictionaries for robust | visual | tracking |
Particle filter to track multiple people for | visual | surveillance |
Particle Filter With a Mode Tracker for | visual | Tracking Across Illumination Changes |
Particle filter with occlusion handling for | visual | tracking |
Particle filter-based | visual | tracking with a first order dynamic model and uncertainty adaptation |
particle filtering framework with indirect measurements for | visual | tracking, A |
Particle filtering strategies for data fusion dedicated to | visual | tracking from a mobile robot |
particle swarm optimization inspired tracker applied to | visual | tracking, A |
Parts of | visual | Form: Computational Aspects |
Parts-based multi-task sparse learning for | visual | tracking |
Pascal | visual | Object Classes (VOC) Challenge, The |
PASCAL | visual | Object Classes Challenge 2007 (VOC2007) Results, The |
PASCAL | visual | Object Classes Challenge 2012, The |
Pascal | visual | Object Classes Challenge: A Retrospective, The |
Patch to the Future: Unsupervised | visual | Prediction |
Patch Tracking-Based Streaming Tensor Ring Completion for | visual | Data Recovery |
Patch-based Scale Calculation for Real-time | visual | Tracking |
Patch-Based Separable Transformer for | visual | Recognition |
Patch-based | visual | microphone for improving quality of sound |
Patch-Wise Auto-Encoder for | visual | Anomaly Detection |
PatchMatch Filter: Edge-Aware Filtering Meets Randomized Search for | visual | Correspondence |
Path Coding on Geometric Planar Graph for 2D/3D | visual | Data Partitioning |
PathGAN: | visual | Scanpath Prediction with Generative Adversarial Networks |
Patra: A Novel Document Architecture for Integrating Handwriting with Audio- | visual | Information |
Pattern recognition and understanding for | visual | information media |
Pattern Recognition for Automatic | visual | Inspection |
Pattern regularity as a | visual | key |
Patterns of approximated localised moments for | visual | loop closure detection |
Pay Attention! - Robustifying a Deep Visuomotor Policy Through Task-Focused | visual | Attention |
pDisVPL: Probabilistic Discriminative | visual | Part Learning for Image Classification |
Pedestrian Detectability Estimation Considering | visual | Adaptation to Drastic Illumination Change |
Pedestrian Detection and Tracking Based on Far Infrared | visual | Information |
Pedestrian detection using multi-channel | visual | feature fusion by learning deep quality model |
Pedtrans: A Fine-grained | visual | Classification Model for Self-attention Patch Enhancement and Dropout |
Peek Into the Reasoning of Neural Networks: Interpreting with Structural | visual | Concepts, A |
People, Penguins and Petri Dishes: Adapting Object Counting Models to New | visual | Domains and Object Types Without Forgetting |
Per-Sample Kernel Adaptation for | visual | Recognition and Grouping |
Per-Sample Multiple Kernel Approach for | visual | Concept Learning |
Perceivable Light Fields: Matching the Requirements Between the Human | visual | System and Autostereoscopic 3-D Displays |
Perceived interest and overt | visual | attention in natural images |
Perceiving distortions in | visual | signals |
Perception Enhanced Frame For | visual | Object Tracking |
Perception Model for People with | visual | Impairments |
Perception of order within disorder: 1. | visual | ranking of random textures |
Perception of Springs With | visual | and Proprioceptive Motion Cues: Implications for Prosthetics |
Perception of the | visual | World, The |
Perception-Aware Cross-Modal Signal Reconstruction: From Audio-Haptic to | visual | |
Perception-guided multi-channel | visual | feature fusion for image retargeting |
Perceptive | visual | texture classification and retrieval |
Perceptnet: A Human | visual | System Inspired Neural Network For Estimating Perceptual Distance |
PerceptSent: Exploring Subjectivity in a Novel Dataset for | visual | Sentiment Analysis |
Perceptual Adaptation of Image Based on Chevreul: Mach Bands | visual | Phenomenon |
Perceptual and Pixel-Wise Information for | visual | Novelty Detection |
Perceptual backlight scaling for low power liquid crystal displays based on | visual | saliency |
Perceptual Full-Reference Quality Assessment of Stereoscopic Images by Considering Binocular | visual | Characteristics |
Perceptual Grouping and Attention in | visual | Search for Features and Objects |
Perceptual Grouping Approach for | visual | Interpolation between Good Continuation and Minimal Path using Tensor Voting, A |
Perceptual Hashing With | visual | Content Understanding for Reduced-Reference Screen Content Image Quality Assessment |
Perceptual image quality assessment based on structural similarity and | visual | masking |
Perceptual Measure to Predict the | visual | Distinction Between Two Color Images, A |
Perceptual modeling in the problem of active object recognition in | visual | scenes |
Perceptual Narratives of Space and Motion for Semantic Interpretation of | visual | Data |
Perceptual Optimization for Scalable Video Compression Based on | visual | Masking Principles |
Perceptual Organization and | visual | Recognition |
Perceptual Organization as a Basis for | visual | Recognition |
Perceptual Organization of | visual | Images: Segmentation as a Basis for Recognition, The |
Perceptual | visual | quality assessment using deeply-learned gaze shifting kernel |
Perceptual | visual | quality metrics: A survey |
Perceptual | visual | Signal Compression and Transmission |
Perceptual-based quality assessment for audio- | visual | services: A survey |
perceptually based spatio-temporal computational framework for | visual | saliency estimation, A |
Perceptually-Weighted Cnn for 360-Degree Image Quality Assessment Using | visual | Scan-Path and Jnd |
Performance analysis on | visual | attention using spiking and oscillatory neural model |
Performance and Human Interface Issues of a System for | visual | Interpretation of Hand Gestures |
Performance assessment of a | visual | attention system entirely based on a human vision modeling |
Performance Characterization of Reactive | visual | Systems |
Performance evaluation in | visual | surveillance using the F-measure |
Performance evaluation of a 3D multi-view-based particle filter for | visual | object tracking using GPUs and multicore CPUs |
Performance evaluation of an incorporated DCT block-based watermarking algorithm with human | visual | system model |
Performance Evaluation of Face Recognition using | visual | and Thermal Imagery with Advanced Correlation Filters |
Performance Evaluation of Image Feature Detectors and Descriptors for Outdoor-Scene | visual | Navigation |
Performance Evaluation of Multi-camera | visual | Tracking |
Performance Evaluation of | visual | Object Detection and Tracking Algorithms Used in Remote Photoplethysmography |
Performance improvement of multi-view video coding based on geometric prediction and human | visual | system |
Performance of a Steady-State | visual | Evoked Potential and Eye Gaze Hybrid Brain-Computer Interface on Participants With and Without a Brain Injury |
Performance of | visual | search tasks from various types of contour information |
performance study for camera pose estimation using | visual | marker based tracking, A |
Performance Tests for | visual | Servo Control Systems, with Application to Partitioned Approaches to Visual Servo Control |
Performance Tests for | visual | Servo Control Systems, with Application to Partitioned Approaches to Visual Servo Control |
Peripheral | visual | Field, Fixation and Direction of Heading |
PerSE: | visual | analytics for calendar related spatiotemporal periodicity detection and analysis |
Persistent Data Management for | visual | Applications |
Persistent Stereo | visual | Localization on Cross-Modal Invariant Map |
Person authentication from neural activity of face-specific | visual | self-representation |
Person Identification Based on | visual | Analysis of Soft-Biometric Features in Surveillance Environments |
Person re-identification using | visual | attention |
Person re-identification with coarse-to-fine | visual | attention |
Person Search in Videos with One Portrait Through | visual | and Temporal Links |
Person Surveillance Using | visual | and Infrared Imagery |
Person Tracking with Audio- | visual | Cues Using the Iterative Decoding Framework |
Personal | visual | Assistance Systems and Techniques, Visually Impaired |
Personality Traits Classification Using Deep | visual | Activity-Based Nonverbal Features of Key-Dynamic Images |
Personalized Face Inpainting with Diffusion Models by Parallel | visual | Attention |
Personalizing Fast-Forward Videos Based on | visual | and Textual Features from Social Network |
Perspective correction for improved | visual | registration using natural features |
Perspective Reconstruction by Determining Vanishing Points for Autonomous Mobile Robot | visual | Localization on Supermarkets |
perspective view on | visual | information retrieval systems, A |
PEYE: Toward a | visual | Motion Based Perceptual Interface for Mobile Devices |
PFT | visual | Attention Detection Model Using Bayesian Framework, A |
PG-Net: Pixel to Global Matching Network for | visual | Tracking |
Phase based feature detector consistent with human | visual | system characteristics |
Phenological | visual | rhythms: Compact representations for fine-grained plant species identification |
Phenomenal Coherence of Moving | visual | Patterns |
PhenoTree: Interactive | visual | Analytics for Hierarchical Phenotyping From Large-Scale Electronic Health Records |
phone-viseme dynamic Bayesian network for audio- | visual | automatic speech recognition, A |
Photogeometric Direct | visual | Tracking for Central Omnidirectional Cameras |
Photographic paper texture classification using model deviation of local | visual | descriptors |
Photometric | visual | Gyroscope for Full-View Spherical Camera |
PhotonLabeler: An Inter-Disciplinary Platform for | visual | Interpretation and Labeling of ICESat-2 Geolocated Photon Data |
Photorealistic adaptation and interpolation of facial expressions using HMMS and AAMS for audio- | visual | speech synthesis |
Phrase Localization and | visual | Relationship Detection with Comprehensive Image-Language Cues |
Physical Adversarial Textures That Fool | visual | Object Tracking |
Physical Passive Patch Adversarial Attacks on | visual | Odometry Systems |
Physics-Based | visual | Understanding |
PIC1: A | visual | Database Interface |
PICASSO: | visual | Querying by Color Perceptive Regions |
Piecewise Linear Approximation Method Preserving | visual | Feature Points of Original Figures, A |
PIL-EYE: Integrated System for Sustainable Development of Intelligent | visual | Surveillance Algorithms |
PIVO: Probabilistic Inertial- | visual | Odometry for Occlusion-Robust Navigation |
Pixel domain referenceless | visual | degradation detection and error concealment for mobile video |
Pixel-Wise Histograms for | visual | Segment Description and Applications |
Pixel-Wise Prediction based | visual | Odometry via Uncertainty Estimation |
PixNet: A Localized Feature Representation for Classification and | visual | Search |
PKS: A photogrammetric key-frame selection method for | visual | -inertial systems built on ORB-SLAM3 |
PKUBench: A context rich mobile | visual | search benchmark |
Place Classification Using | visual | Object Categorization and Global Information |
Place Recognition in Gardens by Learning | visual | Representations: Data Set and Benchmark Analysis |
Plane Transform | visual | Cryptography |
Planning of a Multi Stereo | visual | Sensor System: Depth Accuracy and Variable Baseline Approach |
Planning to see: A hierarchical approach to planning | visual | actions on a robot using POMDPs |
Plant Disease Recognition: A Large-Scale Benchmark Dataset and a | visual | Region and Loss Reweighting Approach |
Plenty is Plague: Fine-Grained Learning for | visual | Question Answering |
PLL Powered, Real-Time | visual | Motion Tracking |
PnP-DETR: Towards Efficient | visual | Analysis with Transformers |
Point Me In The Right Direction: Improving | visual | Localization on UAVs with Active Gimballed Camera Pointing |
Point of Interest Detection and | visual | Distance Estimation for Sensor-Rich Video |
Point-Line | visual | Stereo SLAM Using EDlines and PL-BoW |
Point-to-Set Distance Metric Learning on Deep Representations for | visual | Tracking |
Point-wise Extended | visual | Masking for JPEG-2000 Image Compression |
Pointer-type meter automatic reading from complex environment based on | visual | saliency |
Pointing out Human Answer Mistakes in a Goal-Oriented | visual | Dialogue |
Polarized Light Pollution of Fixed-Tilt Photovoltaic Solar Panels Measured by Drone-Polarimetry and Its | visual | -Ecological Importance |
PolarViz: a discriminating | visual | ization and visual analytics tool for high-dimensional data |
Polyhedral Conic Classifiers for | visual | Object Detection and Classification |
Polysemous | visual | -Semantic Embedding for Cross-Modal Retrieval |
Pooling in image representation: The | visual | codeword point of view |
POP-VQA: Privacy preserving, On-device, Personalized | visual | Question Answering |
Pornographic image region detection based on | visual | attention model in compressed domain |
portable geo-aware | visual | surveillance system for vehicles, A |
Portilla-Simoncelli Texture Model: Towards Understanding the Early | visual | Cortex, The |
Pose Aware Fine-Grained | visual | Classification Using Pose Experts |
Pose Correction for Highly Accurate | visual | Localization in Large-scale Indoor Spaces |
Pose Invariant Topological Memory for | visual | Navigation |
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio- | visual | Representation |
Pose-Only Solution to | visual | Reconstruction and Navigation, A |
PoseConvGRU: A Monocular Approach for | visual | Ego-motion Estimation by Learning |
Position Estimation from Outdoor | visual | Landmarks for Teleoperation of Lunar Rovers |
Position Interpolation Using Feature Point Scale for Decimeter | visual | Localization |
Positional Attention Guided Transformer-Like Architecture for | visual | Question Answering |
Positive Sample Propagation along the Audio- | visual | Event Line |
PosterLayout: A New Benchmark and Approach for Content-Aware | visual | -Textual Presentation Layout |
Posture estimation in | visual | surveillance of archaeological sites |
Potential of | visual | ChatGPT for Remote Sensing, The |
Potentialities of Chorems as | visual | Summaries of Geographic Databases Contents |
PourIt!: Weakly-supervised Liquid Perception from a Single Image for | visual | Closed-Loop Robotic Pouring |
Power mean SVM for large scale | visual | classification |
PPR-FCN: Weakly Supervised | visual | Relation Detection via Parallel Pairwise R-FCN |
PQ kernel: A rank correlation kernel for | visual | word histograms |
PQMET: A digital image quality metric based on human | visual | system |
Practical camera auto-calibration based on object appearance and motion for traffic scene | visual | surveillance |
practical classifier for photographs and non-photographic images based on local | visual | features, A |
Practical Considerations of Uncalibrated | visual | Servoing |
Practical Guide to Marker Based and Hybrid | visual | Registration for AR Industrial Applications, A |
Practical Infrared | visual | Odometry |
Practical | visual | Inspection Techniques: Optics, Micro-electronics and Advanced Software Technology |
Practical | visual | Positioning Method for Industrial Overhead Crane Systems, A |
Pre-trained CNNs as | visual | Feature Extractors: A Broad Evaluation |
Preattentive Grouping and Attentive Selection for Early | visual | Computation |
Precise Indoor | visual | Positioning Approach Using a Built Image Feature Database and Single User Image from Smartphone Cameras, A |
Precise Registration of 3D Images Acquired from a Hand-Held | visual | Sensor |
Precise | visual | inspection for LSI wafer patterns using subpixel image alignment |
precision of triangulation in monocular | visual | odometry, The |
Precision of | visual | Localization Using Dynamic Ground Control Points |
PRECOG: PREdiction Conditioned on Goals in | visual | Multi-Agent Settings |
Predicting audio- | visual | salient events based on visual, audio and text modalities for movie summarization |
Predicting audio- | visual | salient events based on visual, audio and text modalities for movie summarization |
Predicting Eye Fixations With Higher-Level | visual | Features |
Predicting Human Scanpaths in | visual | Question Answering |
Predicting Multiple Structured | visual | Interpretations |
Predicting Perceived | visual | and Cognitive Distractions of Drivers With Multimodal Features |
Predicting Social Interactions for | visual | Tracking |
Predicting the Category and Attributes of | visual | Search Targets Using Deep Gaze Pooling |
Predicting the Probability of Target Detection in Static Infrared and | visual | Scenes Using the Fuzzy-Logic Approach |
Predicting User Annoyance Using | visual | Attributes |
Predicting | visual | Attention in Graphic Design Documents |
Predicting | visual | difference maps for computer-generated images by integrating human visual system model and deep learning |
Predicting | visual | difference maps for computer-generated images by integrating human visual system model and deep learning |
Predicting | visual | Discomfort of Stereoscopic Images Using Human Attention Model |
Predicting | visual | Exemplars of Unseen Classes for Zero-Shot Learning |
Predicting | visual | Features From Text for Image and Video Caption Retrieval |
Predicting | visual | Focus of Attention From Intention in Remote Collaborative Tasks |
Predicting | visual | Overlap of Images Through Interpretable Non-metric Box Embeddings |
Predicting | visual | Political Bias Using Webly Supervised Data and an Auxiliary Task |
Predicting | visual | Semantic Descriptive Terms From Radiological Image Data: Preliminary Results With Liver Lesions in CT |
Prediction of Chromatic | visual | Masking with Deep Learning |
Prediction of Driver's | visual | Attention in Critical Moment Using Optical Flow |
Prediction of the Leadership Style of an Emergent Leader Using Audio and | visual | Nonverbal Features |
Prediction of | visual | discomfort in watching 3D video using multiple features |
Prediction of | visual | fatigue from spatiotemporal characteristics in stereoscopic video |
Predictive Coding Light: learning compact | visual | codes by combining excitatory and inhibitory spike timing-dependent plasticity* |
prequantizer with the human | visual | effect for the DPCM, A |
Preserving Motion-Tolerant Contextual | visual | Saliency for Video Resizing |
Pretrained Language Models as | visual | Planners for Human Assistance |
Principal Manifolds and Bayesian Subspaces for | visual | Recognition |
Principal Manifolds and Probabilistic Subspaces for | visual | Recognition |
Principal | visual | Word Discovery for Automatic License Plate Detection |
Principle of a Parallel Vision System Adapted to Textures: A Theoretical Solution for Selecting | visual | Filters |
Principles and Methods of Histogram Modification Adapted for | visual | Perception, The |
Principles Emerging from the Design of | visual | Search Algorithms for Practical Inspection Tasks |
Principles of | visual | Information Retrieval |
Print registration for automated | visual | inspection of transparent pharmaceutical capsules |
Printed Text Featuring Using the | visual | Criteria of Legibility and Complexity |
Prior Guided Dropout for Robust | visual | Localization in Dynamic Environments |
Prior Knowledge, Level Set Representations and | visual | Grouping |
Prior | visual | Relationship Reasoning For Visual Question Answering |
Prior | visual | Relationship Reasoning For Visual Question Answering |
Priority-based cross-layer optimization for multihop DS-CDMA | visual | Sensor Networks |
Privacy Attributes-aware Message Passing Neural Network for | visual | Privacy Attributes Classification |
Privacy Preserving | visual | SLAM |
Privacy Protected Surveillance Using Secure | visual | Object Coding |
Privacy protecting | visual | processing for secure video surveillance |
Privacy-Preserving In-Home Fall Detection Using | visual | Shielding Sensing and Private Information-Embedding |
Privacy-Preserving | visual | Learning Using Doubly Permuted Homomorphic Encryption |
Privacy-Preserving | visual | Recognition PA-HMDB51 |
PrivAttNet: Predicting Privacy Risks in Images Using | visual | Attention |
privilege-based | visual | secret sharing model, A |
Prob-POS: A Framework for Improving | visual | Explanations from Convolutional Neural Networks for Remote Sensing Image Classification |
Probabilistic Appearance-Based Mapping and Localization Using | visual | Features |
Probabilistic Approach for the Adaptive Integration of Multiple | visual | Cues Using an Agent Framework, A |
Probabilistic Approach to Integrating Multiple Cues in | visual | Tracking, A |
Probabilistic camera hand-off for | visual | surveillance |
Probabilistic color | visual | cryptography schemes for black and white secret images |
Probabilistic combination of spatial context with | visual | and co-occurrence information for semantic image analysis |
Probabilistic Combination of | visual | Cues for Object Classification |
Probabilistic Contour Observer For Online | visual | Tracking, A |
Probabilistic Data Association Methods for Tracking Complex | visual | Objects |
Probabilistic data association methods in | visual | tracking of groups |
Probabilistic Framework for 3D | visual | Object Representation, A |
Probabilistic framework for solving | visual | dialog |
Probabilistic fusion-based parameter estimation for | visual | tracking |
Probabilistic Graph-Based Framework for Plug-and-Play Multi-Cue | visual | Tracking, A |
Probabilistic Graphical Model Based on Neural-symbolic Reasoning for | visual | Relationship Detection, A |
Probabilistic Grouping Principle to Go from Pixels to | visual | Structures, A |
Probabilistic Hypothesis for the Prediction of | visual | Fixations, A |
Probabilistic Integration of 2D and 3D Cues for | visual | Servoing |
Probabilistic Kernels for the Classification of Auto-Regressive | visual | Processes |
Probabilistic learning of task-specific | visual | attention |
Probabilistic Learning of | visual | Object Composition from Attended Segments |
Probabilistic Model of Overt | visual | Attention for Cognitive Robots, A |
Probabilistic Model of | visual | Attention and Perceptual Organization for Constructive Object Recognition, A |
Probabilistic Modeling of Scene Dynamics for Applications in | visual | Surveillance |
Probabilistic Multi-Task Learning for | visual | Saliency Estimation in Video |
Probabilistic Regression for | visual | Tracking |
probabilistic representation for efficient large scale | visual | recognition tasks, A |
Probabilistic structure matching for | visual | SLAM with a multi-camera rig |
Probabilistic Topic Model for Context-Driven | visual | Attention Understanding |
probabilistic topic model using deep | visual | word representation for simultaneous image classification and annotation, A |
Probabilistic | visual | Learning for Object Detection |
Probabilistic | visual | Learning For Object Representation |
Probabilistic Voting for Sequence Based | visual | Place Recognition |
Probabilistic-Based Semantic Image Feature Using | visual | Words |
Probability-based Dynamic Time Warping and Bag-of- | visual | -and-Depth-Words for Human Gesture Recognition in RGB-D |
Procedures of | visual | Analysis for Multidimensional Data Volumes, The |
Process Capability of Automated | visual | Inspection Systems |
Processing lidar waveform data for 3D | visual | assessment of forest environments |
Processing | visual | Data with an Automaton Eye |
Producing object-based special | visual | effects by integrating multiple differently focused images: Implicit 3D Approach to Image Content Manipulation |
Product Quantization Network for Fast | visual | Search |
profile of MPEG-7 for | visual | surveillance, A |
Progressive Difference Method for Capturing | visual | Tempos on Action Recognition, A |
Progressive Erasing Network with consistency loss for fine-grained | visual | classification |
Progressive Language-Customized | visual | Feature Learning for One-Stage Visual Grounding |
Progressive Language-Customized | visual | Feature Learning for One-Stage Visual Grounding |
Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained | visual | Classification |
Progressive Reconstruction of | visual | Structure for Image Inpainting |
Progressive Semantic- | visual | Mutual Adaption for Generalized Zero-Shot Learning |
Progressive Unsupervised Learning for | visual | Object Tracking |
Progressive | visual | Cryptography With Unexpanded Shares |
Progressive | visual | Object Detection with Positive Training Examples Only |
Progressive | visual | Secret Sharing with Multiple Decryptions and Unexpanded Shares |
Progressively diffused networks for semantic | visual | parsing |
Projection-Based AR: Effective | visual | Feedback in Gait Rehabilitation |
Projective Depth: A Geometric Invariant for 3D Reconstruction from Two Perspective/Orthographic Views and for | visual | Recognition |
Projective Line Geometry of the | visual | Operator |
Projective reconstruction of all | visual | primitives |
Projective | visual | Hulls |
Prompt Prototype Learning Based on Ranking Instruction For Few-Shot | visual | Tasks |
Prompt-RSVQA: Prompting | visual | context to a language model for Remote Sensing Visual Question Answering |
Prompt-RSVQA: Prompting | visual | context to a language model for Remote Sensing Visual Question Answering |
Prompting Large Language Models with Answer Heuristics for Knowledge-Based | visual | Question Answering |
Prompting | visual | -Language Models for Efficient Video Understanding |
Proof-of-Concept Demonstration of | visual | Teach and Repeat on a Quadrocopter Using an Altitude Sensor and a Monocular Camera, A |
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised | visual | Representation Learning |
Propagating Over Phrase Relations for One-stage | visual | Grounding |
Proper Scale for Modeling | visual | Data |
Properties of Patch Based Approaches for the Recognition of | visual | Object Classes |
Property Analysis of XOR-Based | visual | Cryptography |
Proposal and Evaluation of | visual | Analytics Interface for Time-Series Data Based on Trajectory Representation |
Proprioceptive | visual | Tracking of a Humanoid Robot Head Motion |
Protected Areas from Space Map Browser with Fast | visual | ization and Analytical Operations on the Fly. Characterizing Statistical Uncertainties and Balancing Them with Visual Perception |
Protected Pooling Method of Sparse Coding in | visual | Classification |
Protecting | visual | Secrets Using Adversarial Nets |
Protocol for Evaluating Model Interpretation Methods from | visual | Explanations, A |
Prototype for Data-Driven | visual | Attention, A |
Prototype Towards Modeling | visual | Data Using Decentralized Generative Adversarial Networks, A |
Providing synthetic views for teleoperation using | visual | pose tracking in multiple cameras |
Pruning CNN filters via quantifying the importance of deep | visual | representations |
Pseudo loss active learning for deep | visual | tracking |
Pseudo-Active Vision for Improving Deep | visual | Perception Through Neural Sensory Refinement |
Pseudo-LiDAR From | visual | Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving |
Pseudo-Q: Generating Pseudo Language Queries for | visual | Grounding |
PSS: Progressive Sample Selection for Open-World | visual | Representation Learning |
Pulmonary-Restricted COVID-19 Informative | visual | Screening Using Chest X-ray Images from Portable Devices |
PUnDA: Probabilistic Unsupervised Domain Adaptation for Knowledge Transfer Across | visual | Categories |
Pushing it out of the Way: Interactive | visual | Navigation |
Putting Knowledge into a | visual | Shape Representation |
Putting | visual | Object Recognition in Context |
PVO: Panoptic | visual | Odometry |
PVT++: A Simple End-to-End Latency-Aware | visual | Tracking Framework |
Pyramid Based Interpolation for Face-Video Playback in Audio | visual | Recognition |
Pyramid-Based | visual | Tracking Using Sparsity Represented Mean Transform |
Quadruplet Network With One-Shot Learning for Fast | visual | Object Tracking |
QUALIFIER: Question-Guided Self-Attentive Multimodal Fusion Network for Audio | visual | Scene-Aware Dialog |
Qualitative comparison of | visual | models in an iterative halftoning procedure |
Qualitative Landmark Recognition Using | visual | Cues |
Qualitative | visual | Control of a Robot Manipulator |
Qualitative | visual | Environment Retrieval |
Qualitative | visual | navigation using weighted correlation |
Quality Assessment for Natural and Screen | visual | Contents |
Quality Diversity for | visual | Pre-Training |
Quality Scalability Aware Watermarking for | visual | Content |
Quality-driven power control and resource allocation in wireless multi-rate | visual | Sensor Networks |
Quality-improved threshold | visual | secret sharing scheme by random grids |
QuantArt: Quantizing Image Style Transfer Towards High | visual | Fidelity |
Quantifying Spatial Heterogeneity in Urban Landscapes: Integrating | visual | Interpretation and Object-Based Classification |
Quantifying the Amount of | visual | Information Used by Neural Caption Generators |
Quantifying the relation between perceived interest and | visual | salience during free viewing using trellis based optimization |
Quantifying | visual | Distortion in Low-rate Wavelet-coded Images |
Quantifying | visual | Similarity for Artistic Styles |
Quantitative Analysis of Human-Model Agreement in | visual | Saliency Modeling: A Comparative Study |
Quantitative Evaluation of Feature Extractors for | visual | SLAM |
quantitative evaluation of the conceptual consistency of | visual | words and visual vocabularies, A |
quantitative evaluation of the conceptual consistency of | visual | words and visual vocabularies, A |
Quantization adaptive to the human | visual | system |
Quantizer designed by using human | visual | sensitivity |
Quasi Monte Carlo partitioned filtering for | visual | Human Motion Capture |
Query Based | visual | Analysis: Spatio-Temporal Reasoning in Computer Vision |
Query Bootstrapping: A | visual | Mining Based Query Expansion |
Query-Adaptive Asymmetrical Dissimilarities for | visual | Object Retrieval |
Query-Adaptive Hash Code Ranking for Large-Scale Multi-View | visual | Search |
Query-specific | visual | semantic spaces for web image re-ranking |
Ques-to- | visual | Guided Visual Question Answering |
Ques-to- | visual | Guided Visual Question Answering |
quest for the integration of | visual | saliency models in objective image quality assessment: A distraction power compensated combination strategy, The |
Question Type Guided Attention in | visual | Question Answering |
Question-Agnostic Attention for | visual | Question Answering |
Question-aware dynamic scene graph of local semantic representation learning for | visual | question answering |
Question-Centric Model for | visual | Question Answering in Medical Imaging, A |
Question-Guided Hybrid Convolution for | visual | Question Answering |
QUICKSAL: A small and sparse | visual | saliency model for efficient inference in resource constrained hardware |
R2-trans: Fine-grained | visual | categorization with redundancy reduction |
RadioTransformer: A Cascaded Global-Focal Transformer for | visual | Attention-Guided Disease Classification |
Raindrop detection and removal using salient | visual | features |
random center surround bottom up | visual | attention model useful for salient region detection, A |
Random grid based color | visual | cryptography scheme for black and white secret images with general access structures |
Random grid-based | visual | secret sharing with abilities of OR and XOR decryptions |
Random grid-based | visual | secret sharing with multiple decryptions |
Random Grids-Based Threshold | visual | Secret Sharing with Improved Contrast by Boolean Operations |
Random Grids-Based Threshold | visual | Secret Sharing with Improved Visual Quality |
Random Grids-Based Threshold | visual | Secret Sharing with Improved Visual Quality |
Random Matrix Ensembles of Time Correlation Matrices to Analyze | visual | Lifelogs |
Random-Grid-Based | visual | Cryptography Schemes |
Randomised | visual | secret sharing scheme for grey-scale and colour images |
Randomized Max-Margin Compositions for | visual | Recognition |
Randomized texture flow estimation using | visual | similarity |
Randomized | visual | phrases for object search |
Range Estimation with a Panoramic | visual | Sensor |
Ranking Based Attention Approach for | visual | Tracking, A |
Ranking the local invariant features for the robust | visual | saliencies |
Ranking-Based Siamese | visual | Tracking |
Rao-Blackwellised particle filter for tracking with application in | visual | surveillance |
Rao-Blackwellized particle filtering with Gaussian mixture models for robust | visual | tracking |
Rapid Biologically-Inspired Scene Classification Using Features Shared with | visual | Attention |
Rapid selection of reliable templates for | visual | tracking |
Rapid | visual | Presentation to Support Geospatial Big Data Processing |
Rarity-Based | visual | Attention Map: Application to Texture Description, A |
Rate control for consistent | visual | quality of H.264/AVC encoding |
Rate-accuracy optimization in | visual | wireless sensor networks |
Rate-adaptive Compact Fisher Codes for Mobile | visual | Search |
Rate-efficient | visual | correspondences using random projections |
Rate-Invariant Analysis of Trajectories on Riemannian Manifolds with Application in | visual | Speech Recognition |
Rate-invariant comparisons of covariance paths for | visual | speech recognition |
Rate- | visual | -distortion optimized extraction with Quality Layers for scalable coding of stereo videos |
RAVEN: A Dataset for Relational and Analogical | visual | REasoNing |
Ray Saliency: Bottom-Up | visual | Saliency for a Rotating and Zooming Camera |
Re-Attention for | visual | Question Answering |
Re-identification framework for long term | visual | object tracking based on object detection and classification |
Re-identification of | visual | Targets in Camera Networks: A Comparison of Techniques |
Reading During Fully Automated Driving: A Study of the Effect of Peripheral | visual | and Haptic Information on Situation Awareness and Mental Workload |
Reading Speed and Superiority of Right | visual | Field on Foveated Vision |
Reading-Strategy Inspired | visual | Representation Learning for Text-to-Video Retrieval |
real time adaptive | visual | surveillance system for tracking low-resolution colour targets in dynamically changing scenes, A |
Real Time Direct | visual | Odometry for Flexible Multi-camera Rigs |
Real time hardware architecture for | visual | robot navigation |
Real Time Human | visual | System Based Framework for Image Fusion |
Real Time Implementation of the Saliency-Based Model of | visual | Attention on a SIMD Architecture, A |
Real Time Inventory Management: | visual | Survey of Interior Architecture Elements and Space Making Crafts of Gujarat, India |
Real Time | visual | Cues Extraction for Monitoring Driver Vigilance |
Real time | visual | servoing around a complex Object |
Real time | visual | tracking using a spatially weighted von Mises mixture model |
Real-Time 3-D | visual | Detection-Based Soft Wire Avoidance Scheme for Industrial Robot Manipulators, A |
Real-Time 3D | visual | Singing Synthesis: From Appearance to Internal Articulators, A |
Real-time 6D stereo | visual | Odometry with non-overlapping fields of view |
Real-Time Accurate Geo-Localization of a MAV with Omnidirectional | visual | Odometry and GPS |
Real-Time Active | visual | Surveillance by Integrating Peripheral Motion Detection With Foveated Tracking |
Real-time Activity Recognition by Discerning Qualitative Relationships Between Randomly Chosen | visual | Features |
Real-time adaptive | visual | secret sharing with reversibility and high capacity |
Real-Time and Robust | visual | Tracking with Scene-Perceptual Memory |
Real-time Automated Concurrent | visual | Tracking of Many Animals and Subsequent Behavioral Compilation |
Real-Time automated | visual | inspection of color tablets in pharmaceutical blisters |
Real-time automated | visual | inspection system for contaminant removal from wool |
Real-Time Automated | visual | Inspection System for Hot Steel Slabs, A |
Real-Time Automatic Detection of Violent-Acts by Low-Level Colour | visual | Cues |
Real-Time Color Image Improvement System for | visual | Testing of Nuclear Reactors |
Real-time constant memory | visual | summaries for surveillance |
Real-time data fusion on tracking camera pose for direct | visual | guidance |
Real-time Dense | visual | Tracking under Large Lighting Variations |
Real-Time Detection and Tracking of Multiple Humans from High Bird's-Eye Views in the | visual | and Infrared Spectrum |
Real-Time Driver | visual | Attention Monitoring System, A |
Real-Time Extraction of Colored Segments for Robot | visual | Navigation |
Real-Time Gabor Primal Sketch for | visual | Attention, A |
Real-time Generation of Novel Views of a Dynamic Scene Using Morphing and | visual | Hull |
Real-Time Gesture Recognition by Learning and Selective Control of | visual | Interest Points |
Real-time global localization with a pre-built | visual | landmark database |
Real-Time High Speed Motion Prediction Using Fast Aperture-Robust Event-Driven | visual | Flow |
Real-time human detection using fast contour template matching for | visual | surveillance |
Real-Time Illumination and | visual | Coherence for Photorealistic Augmented/Mixed Reality |
Real-Time Image Alignment for a Gyro- | visual | Hybrid Pointing Device |
Real-time image improvement system for | visual | testing of nuclear reactors |
Real-time image segmentation for | visual | inspection of pharmaceutical tablets |
Real-Time Knowledge-Based Processing of Images: Application of the Online NLPM Method to Perceptual | visual | Analysis |
Real-time large-scale | visual | concept detection with linear classifiers |
Real-time least-squares ensemble | visual | tracking |
Real-Time Lip Tracking for Audio- | visual | Speech Recognition Applications |
Real-Time Model-Based Rigid Object Pose Estimation and Tracking Combining Dense and Sparse | visual | Cues |
Real-Time Multiaperture Omnidirectional | visual | Sensor Based on an Interconnected Network of Smart Cameras, A |
Real-Time Multisensory Image Segmentation Algorithm with an Application to | visual | and X-Ray Inspection, A |
Real-Time Parallel and Cooperative Recognition of Facial Images for an Interactive | visual | Human Interface |
Real-time part-based | visual | tracking via adaptive correlation filters |
Real-Time Processing Stand-Alone Multiple Object | visual | Tracking System, A |
Real-Time Recognition and | visual | Control: Image Understanding Research at Rochester |
Real-Time Scalable | visual | Tracking via Quadrangle Kernelized Correlation Filters |
Real-Time Sparse | visual | Tracking Using Circulant Reverse LASSO Model |
Real-Time Stereo Face Tracking System for | visual | Human Interfaces |
Real-Time Tracking for | visual | Interface Applications in Cluttered and Occluding Situations |
Real-time tracking of multiple objects in space-variant vision based on magnocellular | visual | pathway |
Real-time TV commercial monitoring based on robust | visual | hashing |
Real-Time Vehicle Localization using on-Board | visual | SLAM for Detection and Tracking |
Real-time | visual | analysis of microvascular blood flow for critical care |
Real-Time | visual | Analytics for Text Streams |
Real-Time | visual | attention on a massively parallel SIMD architecture |
real-time | visual | card reader for mobile devices, A |
Real-Time | visual | Concept Classification |
Real-Time | visual | Ground-Truth System for Indoor Robotic Applications |
Real-Time | visual | Inspection of Moulded Plastics Drippers |
Real-Time | visual | Inspection System for Railway Maintenance: Automatic Hexagonal-Headed Bolts Detection, A |
Real-time | visual | Object Tracking with Natural Language Description |
Real-time | visual | odometry from dense RGB-D images |
real-time | visual | postprocessor for MPEG-coded video sequences, A |
Real-time | visual | recognition of facial gestures for human-computer interaction |
Real-Time | visual | Recognition of Objects and Scenes Using P-Channel Matching |
Real-time | visual | Recovery of Pose using Line Tracking in Multiple Cameras |
Real-Time | visual | Rotational Velocity Estimation Using a Biologically-Inspired Algorithm on Embedded Hardware |
Real-time | visual | saliency by Division of Gaussians |
Real-Time | visual | Sensing for Task Planning in a Field Navigation Vehicle |
Real-Time | visual | Servoing |
Real-Time | visual | Servoing Using Controlled Illumination |
Real-Time | visual | SLAM with Resilience to Erratic Motion |
Real-Time | visual | Target Tracking in RGB-D Data for Person-Following Robots |
Real-time | visual | tracking by deep reinforced decision making |
Real-Time | visual | Tracking for Surveillance and Path Planning |
Real-Time | visual | Tracking of Complex Structures |
Real-time | visual | Tracking under Arbitrary Illumination Changes |
Real-time | visual | tracking using compressive sensing |
Real-time | visual | tracking using L2 norm regularization based collaborative representation |
Real-time | visual | Tracking via Incremental Covariance Tensor Learning |
Real-Time | visual | Tracking Via Online Weighted Multiple Instance Learning |
Real-time | visual | tracking with ELM augmented adaptive correlation filter |
Real-Time | visual | Tracking: Promoting the Robustness of Correlation Filter Learning |
Real-time | visual | -Inertial Odometry for Event Cameras using Keyframe-based Nonlinear Optimization |
Real-Time | visual | -Inertial SLAM Based on Adaptive Keyframe Selection for Mobile AR Applications |
Real-world stimuli show perceived hue shifts in the peripheral | visual | field |
Realization of CUDA-based real-time multi-camera | visual | SLAM in embedded systems |
RealPixVSR: Pixel-Level | visual | Representation Informed Super-Resolution of Real-World Videos |
Realtime Edge-Based | visual | Odometry for a Monocular Camera |
Realtime motion segmentation based multibody | visual | SLAM |
Realtime multibody | visual | SLAM with a smoothly moving monocular camera |
Realtime | visual | Tracking of Aircrafts |
Reasonable Effectiveness of Synthetic | visual | Data, The |
Reasoning on the Relation: Enhancing | visual | Representation for Visual Question Answering and Cross-Modal Retrieval |
Reasoning on the Relation: Enhancing | visual | Representation for Visual Question Answering and Cross-Modal Retrieval |
Reasoning | visual | Dialogs With Structural and Partial Observations |
Reasoning with Multi-Structure Commonsense Knowledge in | visual | Dialog |
Rebuilding | visual | Vocabulary via Spatial-temporal Context Similarity for Video Retrieval |
Receding Horizon Estimation for Hybrid Particle Filters and Application for Robust | visual | Tracking |
Receiving a Mediated Touch From Your Partner vs. a Male Stranger: How | visual | Feedback of Touch and Its Sender Influence Touch Experience |
Recent Advances in Structural Pattern Recognition with Applications to | visual | Form Analysis |
Recent Advances in Transfer Learning for Cross-Dataset | visual | Recognition: A Problem-Oriented Perspective |
Recent Advances In Video Content Analysis: From | visual | Features to Semantic Video Segments |
Recent advances in | visual | and infrared face recognition: A review |
Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of | visual | Content |
Receptive fields, binocular interaction and functional architecture in the cat's | visual | cortex |
Recognition of Dynamic Video Contents With Global Probabilistic Models of | visual | Motion |
Recognition of Moving Object in High Dynamic Scene for | visual | Prosthesis |
Recognition of social dancing from auditory and | visual | information |
Recognition of social interactions based on feature selection from | visual | codebooks |
Recognition of | visual | Activities and Interactions by Stochastic Parsing |
Recognition of | visual | speech elements using adaptively boosted hidden Markov models |
Recognition of | visual | -related non-driving activities using a dual-camera monitoring system |
Recognition using | visual | phrases |
Recognizing High-level Audio- | visual | Concepts Using Context |
Recognizing human-human interaction activities using | visual | and textual information |
Recognizing manipulation actions in arts and crafts shows using domain-specific | visual | and textual cues |
Recognizing object manipulation activities using depth and | visual | cues |
Recognizing Objects in Adversarial Clutter: Breaking a | visual | CAPTCHA |
Recognizing the Emotions Evoked by Artworks Through | visual | Features and Knowledge Graph-Embeddings |
Recognizing | visual | Categories with Symbol-Relational Grammars and Bayesian Networks |
Recognizing | visual | Focus of Attention From Head Pose in Natural Meetings |
Recognizing | visual | Signatures of Spontaneous Head Gestures |
Recombining Vision Transformer Architecture for Fine-grained | visual | Categorization |
Recommended keypoint-aware tracker: Adaptive real-time | visual | tracking using consensus feature prior ranking |
reconfigurable architecture for autonomous | visual | -navigation, A |
Reconstructing Natural | visual | Scenes From Spike Times |
Reconstruction by inpainting for | visual | anomaly detection |
Reconstruction of High Resolution 3D | visual | Information Using Sub-pixel Camera Displacements |
Reconstruction of | visual | appearance |
Reconstruction of | visual | surfaces from sparse data using parametric triangular approximants |
Reconstruction-Based | visual | -Acoustic-Semantic Embedding Method for Speech-Image Retrieval, A |
Reconstructive and Discriminative Sparse Representation for | visual | Object Categorization |
Recovering Generalization via Pre-Training-Like Knowledge Distillation for Out-of-Distribution | visual | Question Answering |
Recovering human pose in 3D by | visual | manifolds |
Recovering Very Large | visual | Motion Fields |
Recovery of | visual | structure in illustrated Japanese gardens |
Recurrent bag-of-features for | visual | information analysis |
Recurrent Filter Learning for | visual | Tracking |
Recurrent Neural Network for (Un-)Supervised Learning of Monocular Video | visual | Odometry and Depth |
Recurrent Refinement for | visual | Saliency Estimation in Surveillance Scenarios |
Recurrent Saliency Transformation Network: Incorporating Multi-stage | visual | Cues for Small Organ Segmentation |
Recurrent Topic-Transition GAN for | visual | Paragraph Generation |
Recurrent Vision Transformer for Solving | visual | Reasoning Problems |
Recursive 3-D | visual | -Motion Estimation Using Subspace Constraints |
Recursive Least-Squares Estimator-Aided Online Learning for | visual | Tracking |
Recursive | visual | Attention in Visual Dialog |
Recursive | visual | Attention in Visual Dialog |
Recursive | visual | Sound Separation Using Minus-Plus Net |
Redro: Efficiently Learning Large-sized SPD | visual | Representation |
Reduce shadow size in aspect ratio invariant | visual | secret sharing schemes using a square block-wise operation |
Reduced Analytic Dependency Modeling: Robust Fusion for | visual | Recognition |
Reduced Reference Metric for | visual | Quality Evaluation of Point Cloud Contents, A |
Reduced-Reference Image Quality Assessment with | visual | Information Fidelity |
Reducing Language Biases in | visual | Question Answering with Visually-grounded Question Encoder |
Reducing noisy labels in weakly labeled data for | visual | sentiment analysis |
Reduction of blocking effect in DCT-coded images based on a | visual | perception criterion |
Reduction of Map Information Regulates | visual | Attention without Affecting Route Recognition Performance |
Refer-it-in-RGBD: A Bottom-up Approach for 3D | visual | Grounding in RGBD Images |
Reference Guided Reflection Removal Using Deep | visual | Attribute Cues |
Reference Pose Generation for Long-term | visual | Localization via Learned Features and View Synthesis |
Region based fusion of 3D and 2D | visual | data for Cultural Heritage objects |
Region Based | visual | Object Categorization Using Segment Features and Polynomial Modeling |
Region Incrementing | visual | Cryptography |
Region-based fully convolutional siamese networks for robust real-time | visual | tracking |
Regression based landmark estimation and multi-feature fusion for | visual | speech recognition |
Regression-Selective Feature-Adaptive Tracker for | visual | Object Tracking |
Regularisation learning of correlation filters for robust | visual | tracking |
Regularization of Inverse | visual | Problems Involving Discontinuities |
Regularizing | visual | Semantic Embedding With Contrastive Learning for Image-Text Matching |
Reinforcement learning based | visual | attention with application to face detection |
Reinforcement Learning for | visual | Object Detection |
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained | visual | Recognition |
Reinterpretation of three famous experiments on | visual | perception from the view of spatial frequency multi-channel |
Relatable Clothing: Detecting | visual | Relationships between People and Clothing |
ReLaText: Exploiting | visual | relationships for arbitrary-shaped scene text detection with graph convolutional networks |
Relation-Aware Graph Attention Network for | visual | Question Answering |
Relation-aware Instance Refinement for Weakly Supervised | visual | Grounding |
Relational Models for | visual | Understanding of Graphical Documents. Application to Architectural Drawings. |
Relational | visual | cluster validity (RVCV) |
relationship between information prioritization and | visual | distinctness in two progressive image transmission schemes, The |
Relationship Between | visual | Complexity and Aesthetics: Application to Beauty Prediction of Photos |
Relative 3-D-state Estimation for Autonomous | visual | Guidance of Road Vehicles |
Relative End-Effector Control Using Cartesian Position Based | visual | Servoing |
Relative Forest for | visual | Attribute Prediction |
Relay dueling network for | visual | tracking with broad field-of-view |
Relevance Assessment for | visual | Video Re-ranking |
Relevance Feedback for Keyword and | visual | Feature-Based Image Retrieval |
Relevance of a Feed-Forward Model of | visual | Attention for Goal-Oriented and Free-Viewing Tasks |
Reliability Object Layer for Deep Hashing-Based | visual | Indexing, A |
reliable descriptor for face objects in | visual | content, A |
Reliable Parts Picking System with an Active and Multisensor | visual | System, A |
Reliable Patch Trackers: Robust | visual | tracking by exploiting reliable patches |
Reliable Shot Identification for Complex Event Detection Via | visual | -Semantic Embedding |
Reliable Temporally Consistent Feature Adaptation for | visual | Object Tracking |
Reliable | visual | Question Answering: Abstain Rather Than Answer Incorrectly |
RelTransformer: A Transformer-Based Long-Tail | visual | Relationship Recognition |
Remarkable | visual | Abilities of Nocturnal Insects: Neural Principles and Bioinspired Night-Vision Algorithms, The |
Remembering Key Features of | visual | Images Based on Spike Timing Dependent Plasticity of Spiking Neurons |
Remote Sensing Image Fusion Method Combining Low-Level | visual | Features and Parameter-Adaptive Dual-Channel Pulse-Coupled Neural Network, A |
Remote Sensor Design for | visual | Recognition With Convolutional Neural Networks |
Remote-Sensing Image Superresolution Based on | visual | Saliency Analysis and Unequal Reconstruction Networks |
Removal of | visual | Disruption Caused by Rain Using Cycle-Consistent Generative Adversarial Networks |
Remove Cosine Window From Correlation Filter-Based | visual | Trackers: When and How |
Removing Label Ambiguity in Learning-Based | visual | Saliency Estimation |
Renderable Neural Radiance Map for | visual | Navigation |
Renormalization for Initialization of Rolling Shutter | visual | -Inertial Odometry |
Rephrasing | visual | Questions by Specifying the Entropy of the Answer Distribution |
Replay-based Online Adaptation for Unsupervised Deep | visual | Odometry |
Representation and | visual | Recognition of Complex, Multi-Agent Actions Using Belief Networks |
Representation for | visual | Information, A |
Representation Learning for | visual | Object Tracking by Masked Appearance Transfer |
Representation Learning on | visual | -Symbolic Graphs for Video Understanding |
Representation of Moving Rigid Objects Based on | visual | Observations |
Representation of Similarity as a Goal of Early | visual | Processing |
Representation of | visual | Scenes |
Representation Space: An Approach to the Integration of | visual | Information |
Representational strategy in the | visual | cortex |
Representing 3D Structures for | visual | Recognition |
Representing and Computing | visual | Information |
Representing and Recognizing the | visual | Appearance of Materials using Three-dimensional Textons |
Representing and recognizing | visual | dynamic events with support vector machines |
Representing and Using Functional Definitions for | visual | Recognition |
Representing Knowledge of the | visual | World |
Representing Relative | visual | Attributes with a Reference-Point-Based Decision Model |
Representing | visual | appearance by video Brownian covariance descriptor for human action recognition |
Representing | visual | Information |
Representing | visual | Information: A Computational Approach |
Requirements for Future Collision Avoidance Systems in | visual | Flight: A Human-Centered Approach |
Research of | visual | tracking based on prior knowledge |
Research on Cyberpunk Images in the | visual | Digital Media |
Research on fatigue detection based on | visual | features |
Research on Feature Extraction Method of Indoor | visual | Positioning Image Based on Area Division of Foreground and Background |
Research on Mixed Reality | visual | Augmentation Method for Teleoperation Interactive System |
Research on pornographic images recognition method based on | visual | words in a compressed domain |
Research on the Influence of Icon Shape Complexity and Composition on | visual | Search Based on Military Geographic Intelligence System |
Research on Traditional Tangka Image Classification Based on | visual | Features, A |
Research on Traffic Accident Risk Prediction Method Based on Spatial and | visual | Semantics |
Research On | visual | Analysis Methods Of Terrorism Events |
Residual Graph Attention Network and Expression-Respect Data Augmentation Aided | visual | Grounding |
Resolution limits on | visual | speech recognition |
Resolution reduction by growth of zones for | visual | prosthesis |
Resolvability Ellipsoid for | visual | Servoing, The |
Resolving Copycat Problems in | visual | Imitation Learning via Residual Action Prediction |
Resolving | visual | Uncertainty and Occlusion through Probabilistic Reasoning |
Resolving Zero-Shot and Fact-Based | visual | Question Answering via Enhanced Fact Retrieval |
Resource management for wireless | visual | sensor networks based on individual video characteristics |
Resource-Aware Coverage and Task Assignment in | visual | Sensor Networks |
Response Time Analysis for Explainability of | visual | Processing in CNNs |
Results on | visual | Road Recognition for Road Vehicle Guidance |
Rethink Cross-Modal Fusion in Weakly-Supervised Audio- | visual | Video Parsing |
Rethinking 360° Image | visual | Attention Modelling with Unsupervised Learning |
Rethinking Class-Balanced Methods for Long-Tailed | visual | Recognition From a Domain Adaptation Perspective |
Rethinking Data Augmentation for Robust | visual | Question Answering |
Rethinking Long-Tailed | visual | Recognition with Dynamic Probability Smoothing and Frequency Weighted Focusing |
Rethinking | visual | Geo-localization for Large-Scale Applications |
Rethinking Zero-Shot Learning: A Conditional | visual | Classification Perspective |
Retina-Like | visual | Image Reconstruction via Spiking Neural Model |
Retina-Like | visual | Sensor for Fast Tracking and Navigation Robots |
Retinex based | visual | identicalness detection for videos corrupted by imaging noise |
RETRACTED: Camera network analysis for | visual | surveillance in electric industrial context |
RETRACTED: Image quality tendency modeling by fusing multiple | visual | cues |
Retraction: Deep network for | visual | saliency prediction by encoding image composition |
Retrieval Augmented Classification for Long-Tail | visual | Recognition |
Retrieving Images by Similarity of | visual | Appearance |
Reveal: Retrieval-Augmented | visual | -Language Pre-Training with Multi-Source Multimodal Knowledge Memory |
Revealing Smooth Structure of | visual | Data by Permutation on Manifolds |
REVERIE: Remote Embodied | visual | Referring Expression in Real Indoor Environments |
Reverse Variational Autoencoder for | visual | Attribute Manipulation and Anomaly Detection |
Reversible | visual | transformation via exploring the correlations within color images |
review of monocular | visual | odometry, A |
review of recent advances in | visual | speech decoding, A |
Review of | visual | Saliency Detection With Comprehensive Information |
Review of | visual | Simultaneous Localization and Mapping Based on Deep Learning |
Review on on-line | visual | ferrographic analysis system for marine diesel engine |
REVISE: A Tool for Measuring and Mitigating Bias in | visual | Datasets |
ReVISE: Self-Supervised Speech Resynthesis with | visual | Input for Universal and Generalized Speech Regeneration |
Revisiting correlation-based filters for low-resolution and long-term | visual | tracking |
Revisiting Jump-Diffusion Process for | visual | Tracking: A Reinforcement Learning Approach |
Revisiting Metric Learning for SPD Matrix Based | visual | Representation |
Revisiting Problem in Simultaneous Localization and Mapping: A Survey on | visual | Loop Closure Detection, The |
Revisiting Robust | visual | Tracking Using Pixel-Wise Posteriors |
Revisiting Self-Supervised | visual | Representation Learning |
Revisiting | visual | attention identification based on eye tracking data analytics |
Revisiting | visual | Odometry for Real-Time Performance |
Revisiting | visual | Question Answering Baselines |
Revisiting Weakly Supervised Pre-Training of | visual | Perception Models |
REVT: Robust and Efficient | visual | Tracking by Region-Convolutional Regression Network |
RFID-assisted | visual | multiple object tracking without using visual appearance and motion |
RFID-assisted | visual | multiple object tracking without using visual appearance and motion |
RGB-D Based Multi-attribute People Search in Intelligent | visual | Surveillance |
RGFF Representational Model: A System for the Automatically Learned Partitioning of | visual | Pattern in Digital Images, The |
Right to Talk: An Audio- | visual | Transformer Approach, The |
Rigid Registration of Renal Perfusion Images Using a Neurobiology-Based | visual | Saliency Model |
RILS: Masked | visual | Reconstruction in Language Semantic Space |
Risk-Aware Pairwise Rank Learning Approach for | visual | Discomfort Prediction of Stereoscopic 3D, A |
RL-CAM: | visual | Explanations for Convolutional Networks using Reinforcement Learning |
RMLVQA: A Margin Loss Approach For | visual | Question Answering with Language Biases |
RNGC-VIWO: Robust Neural Gyroscope Calibration Aided | visual | -Inertial-Wheel Odometry for Autonomous Vehicle |
Road Constrained Monocular | visual | Localization Using Gaussian-Gaussian Cloud Model |
Road sign detection based on | visual | saliency and shape analysis |
road sign recognition system based on dynamic | visual | model, A |
Roadmap to the Integration of Early | visual | Modules, A |
Robot Command Interface Using an Audio- | visual | Speech Recognition System |
Robot Path Generation Method for a Welding System Based on Pseudo Stereo | visual | Servo Control |
Robot Pose Estimation Optimized | visual | SLAM Algorithm Based on CO-HDC Instance Segmentation Network for Dynamic Scenes, A |
Robot Rover | visual | Navigation |
Robot Self-Location Using | visual | Reasoning Relative to a Single Target Object |
Robot | visual | servoing with iterative learning control |
Robot-arm Pick and Place Behavior Programming System Using | visual | Perception |
Robotic Control with Partial | visual | Information |
Robust 3D Face Recognition Using Learned | visual | Codebook |
robust 3D | visual | saliency computation model for human fixation prediction of stereoscopic videos, A |
Robust abandoned object detection integrating wide area | visual | surveillance and social context |
Robust and Accurate | visual | Echo Cancelation in a Full-duplex Projector-Camera System |
Robust and Real-Time | visual | Tracking Based on Complementary Learners |
Robust and Scalable | visual | Category and Action Recognition System Using Kernel Discriminant Analysis With Spectral Regression, A |
Robust Asymptotically Stable | visual | Servoing of Planar Robots |
Robust Attribute-Based | visual | Recognition Using Discriminative Latent Representation |
Robust Audio- | visual | Instance Discrimination |
Robust Audio- | visual | Mandarin Speech Recognition Based On Adaptive Decision Fusion And Tone Features |
Robust Audio- | visual | Speech Recognition Based on Hybrid Fusion |
Robust Audio- | visual | Speech Recognition Based on Late Integration |
Robust Audio- | visual | Speech Recognition Under Noisy Audio-Video Conditions |
Robust Auxiliary Particle Filter with an Adaptive Appearance Model for | visual | Tracking |
Robust Collision Perception | visual | Neural Network With Specific Selectivity to Darker Objects, A |
Robust common | visual | pattern discovery using graph matching |
Robust Density Comparison for | visual | Tracking |
Robust Direct | visual | Localisation using Normalised Information Distance |
Robust Explanations for | visual | Question Answering |
Robust Face Frontalization For | visual | Speech Recognition* |
Robust face pose classification method based on geometry-preserving | visual | phrase |
Robust Fastener Detection for Autonomous | visual | Railway Track Inspection |
Robust Filter-Based | visual | Navigation Solution with Miscalibrated Bi-Monocular or Stereo Cameras |
Robust Fine-Grained | visual | Recognition With Neighbor-Attention Label Correction |
Robust GNSS-denied localization for UAV using particle filter and | visual | odometry |
Robust Hybrid | visual | Servoing of Omnidirectional Mobile Manipulator With Kinematic Uncertainties Using a Single Camera |
Robust Image Analysis With Sparse Representation on Quantized | visual | Features |
Robust Image and Video Dehazing with | visual | Artifact Suppression via Gradient Residual Minimization |
Robust Image Enhancement Technique for Improving Image | visual | Quality in Shadowed Scenes, A |
Robust image hashing with | visual | attention model and invariant moments |
Robust image watermarking using dual tree complex wavelet transform based on Human | visual | System |
Robust image-based crack detection in concrete structure using multi-scale enhancement and | visual | features |
Robust Image-Sequence-Based Framework for | visual | Place Recognition in Changing Environments, A |
Robust Infrared Maritime Target Detection Based on | visual | Attention and Spatiotemporal Filtering |
Robust Iris Localization and Tracking based on Constrained | visual | Fitting |
Robust large scale monocular | visual | SLAM |
Robust Loop Closure Detection Integrating | visual | -Spatial-Semantic Information via Topological Graphs and CNN Features |
Robust loop closures for scene reconstruction by combining odometry and | visual | correspondences |
Robust Map Alignment for Cooperative | visual | SLAM |
Robust markers for | visual | navigation using Reed-Solomon codes |
robust measure for | visual | correspondence, A |
Robust metric for the evaluation of | visual | saliency algorithms |
Robust monocular | visual | odometry for road vehicles using uncertain perspective projection |
Robust multi-feature | visual | tracking via multi-task kernel-based sparse learning |
Robust Multi-Model | visual | Tracking With Distractor-Aware Template-Coupled Correlation Filters Joint Learning |
Robust multi-source adaptation | visual | classification using supervised low-rank representation |
Robust multiple-vehicle tracking via adaptive integration of multiple | visual | features |
Robust Multispectral | visual | -Inertial Navigation With Visual Odometry Failure Recovery |
Robust Multispectral | visual | -Inertial Navigation With Visual Odometry Failure Recovery |
Robust Object Modeling for | visual | Tracking |
Robust Object Recognition via | visual | Pathway Feedback |
Robust object tracking using correspondence voting for smart surveillance | visual | sensing nodes |
Robust occlusion-aware part-based | visual | tracking with object scale adaptation |
Robust Online Appearance Models for | visual | Tracking |
Robust Online Learned Spatio-Temporal Context Model for | visual | Tracking |
Robust online multi-target | visual | tracking using a HISP filter with discriminative deep appearance learning |
Robust Online | visual | Tracking with a Single Convolutional Neural Network |
Robust part-based | visual | tracking via adaptive collaborative modelling |
Robust PCA-Based | visual | Tracking by Adaptively Maximizing the Matching Residual Likelihood |
Robust Person-Independent | visual | Sign Language Recognition |
Robust Perturbation for | visual | Explanation: Cross-Checking Mask Optimization to Avoid Class Distortion |
Robust Physical-World Attacks on Deep Learning | visual | Classification |
Robust Physics-Based Analysis of Thermal and | visual | Imagery |
Robust Real-Time 3D Object Tracking with Interfering Background | visual | Projections |
Robust real-time ship detection and tracking for | visual | surveillance of cage aquaculture |
Robust Real-Time | visual | Object Tracking via Multi-Scale Fully Convolutional Siamese Networks |
Robust Real-Time | visual | Odometry with a Single Camera and an IMU |
Robust Real-Time | visual | SLAM Using Scale Prediction and Exemplar Based Feature Description |
Robust Real-Time | visual | Tracking using a 2D-3D Model-based Approach |
Robust Real-Time | visual | Tracking Using Pixel-Wise Posteriors |
Robust Region-of-Interest Determination Based on User Attention Model Through | visual | Rhythm Analysis |
Robust registration of aerial and close-range photogrammetric point clouds using | visual | context features and scale consistency |
Robust Saliency-Aware Distillation for Few-Shot Fine-Grained | visual | Recognition |
Robust Scale Adaptive and Real-Time | visual | Tracking with Correlation Filters |
Robust Sensor Fusion: Analysis and Application to Audio- | visual | Speech Recognition |
Robust Subjective | visual | Property Prediction from Crowdsourced Pairwise Labels |
Robust tracking using | visual | cue integration for mobile mixed images |
Robust traffic lights detection on mobile devices for pedestrians with | visual | impairment |
Robust TV News Story Identification Via | visual | Characteristics of Anchorperson Scenes |
Robust UAV | visual | Teach and Repeat Using Only Sparse Semantic Object Features |
Robust validation of | visual | Focus of Attention using adaptive fusion of head and eye gaze patterns |
Robust | visual | analysis for planogram compliance problem |
Robust | visual | Behavior Recognition |
Robust | visual | Cooperative Tracking Using Constrained Adaptive Sparse Representations and Sparse Classifier Grids |
Robust | visual | data segmentation: Sampling from distribution of model parameters |
Robust | visual | domain adaptation with low-rank reconstruction |
Robust | visual | features for the multimodal identification of unregistered speakers in TV talk-shows |
Robust | visual | Identifier for Cropped Natural Photos |
Robust | visual | Knowledge Transfer via Extreme Learning Machine-Based Domain Adaptation |
Robust | visual | Method for Assessing the Relative Performance of Edge Detection Algorithms, A |
Robust | visual | Object Tracking Using Multi-Mode Anisotropic Mean Shift and Particle Filters |
Robust | visual | Object Tracking Via Adaptive Attribute-Aware Discriminative Correlation Filters |
Robust | visual | Object Tracking via Sparse Representation and Reconstruction |
Robust | visual | Object Tracking with Spatiotemporal Regularisation and Discriminative Occlusion Deformation |
Robust | visual | Object Tracking with Two-Stream Residual Convolutional Networks |
Robust | visual | odometry estimation of road vehicle from dominant surfaces for large-scale mapping |
Robust | visual | Odometry Using Uncertainty Models |
Robust | visual | Place Recognition with Graph Kernels |
Robust | visual | question answering via semantic cross modal augmentation |
Robust | visual | Recognition of Color Images |
Robust | visual | Servoing Based on Relative Orientation |
Robust | visual | Servoing in 3-D Reaching Tasks |
Robust | visual | Servoing of Robot Manipulators with Neuro Compensation |
Robust | visual | servoing using global features based on random process |
Robust | visual | similarity retrieval in single model face databases |
Robust | visual | speakingness detection using bi-level HMM |
Robust | visual | Tracker with a Coupled-Classifier Based on Multiple Representative Appearance Models, A |
Robust | visual | Tracking and Vehicle Classification via Sparse Representation |
Robust | visual | Tracking based on Adversarial Unlabeled Instance Generation with Label Smoothing Loss Regularization |
Robust | visual | Tracking Based on an Effective Appearance Model |
Robust | visual | Tracking Based on Incremental Tensor Subspace Learning |
Robust | visual | tracking based on modified mayfly optimization algorithm |
Robust | visual | Tracking Based on Multi-channel Compressive Features |
Robust | visual | tracking based on product sparse coding |
Robust | visual | tracking based on simplified biologically inspired features |
Robust | visual | tracking based on watershed regions |
Robust | visual | Tracking by Coupling 2D Motion and 3D Pose Estimation |
Robust | visual | tracking by embedding combination and weighted-gradient optimization |
Robust | visual | Tracking by Exploiting the Historical Tracker Snapshots |
Robust | visual | Tracking by Integrating Lucas-Kanade into Mean-Shift |
Robust | visual | Tracking by Integrating Multiple Cues Based on Co-Inference Learning |
Robust | visual | Tracking by Segmentation |
Robust | visual | Tracking for Multiple Targets |
Robust | visual | Tracking in Low-Resolution Sequence |
robust | visual | tracking method via local feature extraction and saliency detection, A |
Robust | visual | Tracking Revisited: From Correlation Filter to Template Matching |
Robust | visual | Tracking Using an Adaptive Coupled-Layer Visual Model |
Robust | visual | Tracking Using an Adaptive Coupled-Layer Visual Model |
Robust | visual | tracking using autoregressive hidden Markov Model |
Robust | visual | Tracking Using Case-Based Reasoning with Confidence |
Robust | visual | Tracking Using Dynamic Classifier Selection with Sparse Representation of Label Noise |
Robust | visual | Tracking Using Exemplar-Based Detectors |
Robust | visual | Tracking Using Hierarchical Vision Transformer with Shifted Windows Multi-Head Self-Attention |
Robust | visual | tracking using joint scale-spatial correlation filters |
Robust | visual | Tracking Using L1 Minimization |
Robust | visual | Tracking Using Local Sparse Appearance Model and K-Selection |
Robust | visual | Tracking Using Multi-Frame Multi-Feature Joint Modeling |
Robust | visual | Tracking Using Oblique Random Forests |
Robust | visual | Tracking Using Sparse Discriminative Graph Embedding |
Robust | visual | Tracking Using Structurally Random Projection and Weighted Least Squares |
Robust | visual | Tracking Using Structure-Preserving Sparse Learning |
Robust | visual | tracking using template anchors |
Robust | visual | Tracking Using the Time-Reversibility Constraint |
Robust | visual | Tracking Via An Imbalance-Elimination Mechanism |
Robust | visual | tracking via augmented kernel SVM |
Robust | visual | Tracking via Basis Matching |
Robust | visual | tracking via CAMShift and structural local sparse appearance model |
Robust | visual | tracking via co-trained Kernelized correlation filters |
Robust | visual | Tracking Via Consistent Low-Rank Sparse Learning |
Robust | visual | tracking via constrained correlation filter coding |
Robust | visual | Tracking via Constrained Multi-Kernel Correlation Filters |
Robust | visual | tracking via context objects computing |
Robust | visual | Tracking via Convolutional Networks Without Training |
Robust | visual | Tracking via Coupled Randomness |
Robust | visual | Tracking via Dirac-Weighted Cascading Correlation Filters |
Robust | visual | tracking via discriminative sequential ranking |
Robust | visual | tracking via efficient manifold ranking with low-dimensional compressive features |
Robust | visual | Tracking via Exclusive Context Modeling |
Robust | visual | Tracking via Hierarchical Convolutional Features |
Robust | visual | Tracking via Hierarchical Particle Filter and Ensemble Deep Features |
Robust | visual | Tracking via Least Soft-Threshold Squares |
Robust | visual | tracking via modified Harris hawks optimization |
Robust | visual | tracking via multi-feature response maps fusion using a collaborative local-global layer visual model |
Robust | visual | tracking via multi-feature response maps fusion using a collaborative local-global layer visual model |
Robust | visual | Tracking via Multi-Scale Spatio-Temporal Context Learning |
Robust | visual | Tracking Via Multi-Task Sparse Learning |
Robust | visual | tracking via multi-view discriminant based sparse representation |
Robust | visual | Tracking via Multiple Kernel Boosting With Affinity Constraints |
Robust | visual | tracking via nonlocal regularized multi-view sparse representation |
Robust | visual | Tracking via Online Discriminative and Low-Rank Dictionary Learning |
Robust | visual | tracking via online multiple instance learning with Fisher information |
Robust | visual | Tracking via Pixel Classification and Integration |
Robust | visual | Tracking via Rank-Constrained Sparse Learning |
Robust | visual | tracking via ranking SVM |
Robust | visual | Tracking via Semiadaptive Weighted Convolutional Features |
Robust | visual | Tracking via Smooth Manifold Kernel Sparse Learning |
Robust | visual | Tracking via Sparse Representation Under Subclass Discriminant Constraint |
Robust | visual | Tracking via Sparsity-Induced Subspace Learning |
Robust | visual | tracking via spatio-temporal adaptive and channel selective correlation filters |
Robust | visual | Tracking via Structured Multi-Task Sparse Learning |
Robust | visual | tracking via transfer learning |
Robust | visual | tracking with adaptive initial configuration and likelihood landscape analysis |
Robust | visual | Tracking with Deep Convolutional Neural Network Based Object Proposals on PETS |
Robust | visual | tracking with discriminative sparse learning |
Robust | visual | Tracking with Double Bounding Box Model |
Robust | visual | Tracking with Dual Group Structure |
Robust | visual | Tracking With Multitask Joint Dictionary Learning |
Robust | visual | tracking with the cross-bin metric |
Robust | visual | Vocabulary Tracking Using Hierarchical Model Fusion |
Robust | visual | Voice Activity Detection Using Long Short-Term Memory Recurrent Neural Network |
Robust | visual | -Inertial Integrated Navigation System Aided by Online Sensor Model Adaption for Autonomous Ground Vehicles in Urban Areas |
Robust | visual | -Inertial Navigation System for Low Precision Sensors under Indoor and Outdoor Environments |
Robust | visual | -Inertial Odometry Based on a Kalman Filter and Factor Graph |
Robust watermarking scheme for color image based on quaternion-type moment invariants and | visual | cryptography |
Robustfusion: Human Volumetric Capture with Data-driven | visual | Cues Using a RGBD Camera |
Robustness and repeatability of saliency models subjected to | visual | degradations |
Robustness of Multiplexing Protocols for Audio- | visual | Services Over Wireless Networks |
Robustness of | visual | Explanations to Common Data Augmentation Methods |
ROI Pooled Correlation Filters for | visual | Tracking |
ROI-Based Medical Image Retrieval Using Human-Perception and MPEG-7 | visual | Descriptors |
role of features, algorithms and data in | visual | recognition, The |
Role of Fixation and | visual | Attention in Object Recognition, The |
Role of Fixation in | visual | -Motion Analysis, The |
Role of Implicit Context Information in Guiding | visual | -Spatial Attention, The |
Role of Knowledge in | visual | Shape Representation, The |
role of Saliency and Error Propagation in | visual | Object Recognition, The |
Role of Spatiotemporal Oriented Energy Features for Robust | visual | Tracking in Video Surveillance |
Role of Synchronic Causal Conditions in | visual | Knowledge Learning, The |
role of | visual | attention in the aesthetic appeal of consumer images: A preliminary study, The |
Role of | visual | Complexity in Affective Reactions to Webpages: Subjective, Eye Movement, and Cardiovascular Responses, The |
Rotation Adaptive | visual | Object Tracking with Motion Consistency |
Rotation and Direction Judgment from | visual | Images Head-Slaved in Two and Three Degrees-of-Freedom |
Rotation-aware correlation filters for robust | visual | tracking |
rotation-invariant bag of | visual | words model for symbols based ancient coin classification, A |
Rotation-Translation-Decoupled Solution for Robust and Efficient | visual | -Inertial Initialization, A |
Rover | visual | Obstacle Avoidance |
RSINet: Rotation-Scale Invariant Network for Online | visual | Tracking |
RSTNet: Captioning with Adaptive Attention on | visual | and Non-Visual Words |
RSTNet: Captioning with Adaptive Attention on | visual | and Non-Visual Words |
RSVQA: | visual | Question Answering for Remote Sensing Data |
RT-SLAM: A Generic and Real-Time | visual | SLAM Implementation |
RTSformer: A Robust Toroidal Transformer With Spatiotemporal Features for | visual | Tracking |
RTSVC: Real-time system for | visual | control of robots |
RUArt: A Novel Text-Centered Solution for Text-Based | visual | Question Answering |
Rule Based Approach for | visual | Pattern Inspection, A |
Rule Based Technique for Extraction of | visual | Attention Regions Based on Real-Time Clustering, A |
Rule-Based System for Verifying Engineering Specifications in Industrial | visual | Inspection Applications, A |
Runway to Realway: | visual | Analysis of Fashion |
RVIO: An Effective Localization Algorithm for Range-Aided | visual | -Inertial Odometry System |
S-VVAD: | visual | Voice Activity Detection by Motion Segmentation |
S2-aware network for | visual | recognition |
S2-VER: Semi-supervised | visual | Emotion Recognition |
SaccadeCam: Adaptive | visual | Attention for Monocular Depth Sensing |
SAILenv: Learning in Virtual | visual | Environments Made Simple |
SalGaze: Personalizing Gaze Estimation using | visual | Saliency |
Saliency Density Maximization for Efficient | visual | Objects Discovery |
Saliency Driven Perceptual Quality Metric for Omnidirectional | visual | Content |
Saliency Heat-Map as | visual | Attention for Autonomous Driving Using Generative Adversarial Network (GAN) |
Saliency prior context model for | visual | tracking |
Saliency selection for robust | visual | tracking |
Saliency Tubes: | visual | Explanations for Spatio-Temporal Convolutions |
Saliency-Based Search Mechanism for Overt and Covert Shifts of | visual | Attention |
Saliency-Based | visual | Representation for Compression |
Saliency-weighted graphs for efficient | visual | content description and their applications in real-time image retrieval systems |
Saliency4ASD: Challenge, dataset and tools for | visual | attention modeling for autism spectrum disorder |
Salient Feature Selection for CNN-Based | visual | Place Recognition |
Salient Motion Features for | visual | Attention Models |
Salient object detection based on global to local | visual | search guidance |
Salient Object Detection Based on | visual | Perceptual Saturation and Two-Stream Hybrid Networks |
Salient region detection and feature extraction in 3D | visual | data |
Salient target detection in hyperspectral image based on | visual | attention |
Salient traffic sign recognition based on sparse representation of | visual | perception |
Salypath: A Deep-Based Architecture for | visual | Attention Prediction |
SAM-Net: Semantic probabilistic and attention mechanisms of dynamic objects for self-supervised depth and camera pose estimation in | visual | odometry applications |
Sample and Pixel Weighting Strategies for Robust Incremental | visual | Tracking |
Sample selection for | visual | domain adaptation via sparse coding |
Sample-Specific Late Fusion for | visual | Category Recognition |
Sampling and Ontologically Pooling Web Images for | visual | Concept Learning |
SANet: Structure-Aware Network for | visual | Tracking |
SAR Image Segmentation Based on Hierarchical | visual | Semantic and Adaptive Neighborhood Multinomial Latent Model |
SAT: 2D Semantics Assisted Training for 3D | visual | Grounding |
SAVAM, | visual | Salience Dataset |
SAVE: A framework for semantic annotation of | visual | events |
SAVE: Spatial-Attention | visual | Exploration |
SC-SM CAM: An Efficient | visual | Interpretation of CNN for SAR Images Target Recognition |
Scalability Analysis of Audio- | visual | Person Identity Verification |
Scalable Aural- | visual | Environment for Security Event Monitoring, Analysis, and Response, A |
Scalable Bag of Selected Deep Features for | visual | Instance Retrieval |
Scalable Feature Extraction for | visual | Surveillance |
scalable image-based multi-camera | visual | surveillance system, A |
Scalable k-NN graph construction for | visual | descriptors |
Scalable Mobile | visual | Classification by Kernel Preserving Projection Over High-Dimensional Features |
scalable MPEG-4 wavelet-based | visual | texture compression system with optimized memory organization, A |
Scalable Object Discovery: A Hash-Based Approach to Clustering Co-occurring | visual | Words |
Scalable Shape Representation for Content-Based | visual | Data Compression |
Scalable Unsupervised Feature Merging Approach to Efficient Dimensionality Reduction of High-Dimensional | visual | Data, A |
Scalable Video Event Retrieval by | visual | State Binary Embedding |
Scalable | visual | assessment of cluster tendency for large data sets |
Scalable | visual | Instance Mining with Instance Graph |
Scalable Zero-Shot Learning via Binary | visual | -Semantic Embeddings |
Scale estimation-based | visual | tracking with optimized convolutional activation features |
Scale invariant 3D multi-person tracking using a base set of bundle adjusted | visual | landmarks |
Scale Invariant Keypoint Detector Based on | visual | and Geometrical Cues, A |
Scale Proportionate Histograms of Oriented Gradients for Object Detection in Co-Registered | visual | and Range Data |
Scale Recovery for Monocular | visual | Odometry Using Depth Estimated with Deep Convolutional Neural Fields |
Scale-Aware Monocular | visual | Odometry and Extrinsic Calibration Using Vehicle Kinematics |
Scale-Awareness of Light Field Camera Based | visual | Odometry |
Scale-Invariant | visual | Language Modeling for Object Categorization |
Scale-Space Filtering Approach for | visual | Feature-Extraction, A |
Scale-weighted dense bag of | visual | features for 3D model retrieval from a partial view 3D model |
SCaLE: Supervised and Cascaded Laplacian Eigenmaps for | visual | Object Recognition Based on Nearest Neighbors |
Scaling and Benchmarking Self-Supervised | visual | Representation Learning |
Scaling Local Self-Attention for Parameter Efficient | visual | Backbones |
Scanning the issue: Special issue on technology and tools for | visual | perception |
Scene and viewpoint based | visual | summarization for landmarks |
Scene Categorization by Introducing Contextual Information to the | visual | Words |
Scene categorization via contextual | visual | words |
Scene Change Detection Based on Audio- | visual | Analysis and Interaction |
Scene Classification Using Local Co-occurrence Feature in Subspace Obtained by KPCA of Local Blob | visual | Words |
Scene graph captioner: Image captioning based on structural | visual | representation |
Scene Graph Contextualization in | visual | Commonsense Reasoning |
Scene Graph Refinement Network for | visual | Question Answering |
Scene Interpretation: Unified Modeling of | visual | Context by Particle-Based Belief Propagation in Hierarchical Graphical Model |
Scene recognition by semantic | visual | words |
Scene Segmentation from | visual | Motion Using Global Optimization |
Scene Text | visual | Question Answering |
Scene-assisted Point-line Feature Based | visual | Slam Method For Autonomous Flight in Unknown Indoor Environments, A |
Scene-Aware Error Modeling of LiDAR/ | visual | Odometry for Fusion-Based Vehicle Localization |
Scene-Intuitive Agent for Remote Embodied | visual | Grounding |
Scene-pathy: Capturing the | visual | Selective Attention of People Towards Scene Elements |
Scene-Unified Image Translation For | visual | Localization |
Score-CAM: Score-Weighted | visual | Explanations for Convolutional Neural Networks |
Scotopic | visual | Recognition |
Scratching | visual | Transformer's Back with Uniform Attention |
SDCS-CF: Saliency-driven localization and cascade scale estimation for | visual | tracking |
SDE: A Novel Selective, Discriminative and Equalizing Feature Representation for | visual | Recognition |
SDV-LOAM: Semi-Direct | visual | -LiDAR Odometry and Mapping |
Search method and apparatus for locating digitally stored content, such as | visual | images, music and sounds, text, or software, in storage devices on a computer network |
Searching for Complex Human Activities with No | visual | Examples |
Searching the World's Herbaria: A System for | visual | Identification of Plant Species |
SeCo: Separating Unknown Musical | visual | Sounds with Consistency Guidance |
second generation | visual | Secret Sharing scheme for color images, A |
Second Order Enhanced Multi-Glimpse Attention in | visual | Question Answering |
Second-order Attention Guided Convolutional Activations for | visual | Recognition |
Second-Order Motion Perception in the Peripheral | visual | -Field |
secure fingerprint template generation mechanism using | visual | secret sharing with inverse halftoning, A |
Security Assessment of Selectively Encrypted | visual | Data: Iris Recognition on Protected Samples |
Security Enhancement of | visual | Hashes Through Key Dependent Wavelet Transformations |
Security-oriented picture-in-picture | visual | modifications |
See and Learn More: Dense Caption-Aware Representation for | visual | Question Answering |
See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal | visual | Data |
See the Silence: Improving | visual | -Only Voice Activity Detection by Optical Flow and RGB Fusion |
Seeing and Understanding: Representing the | visual | World |
Seeing into Darkness: Scotopic | visual | Recognition |
Seeing is Believing: Pedestrian Trajectory Forecasting Using | visual | Frustum of Attention |
Seeing Through Darkness: | visual | Localization at Night via Weakly Supervised Learning of Domain Invariant Features |
Seeing through the Human Reporting Bias: | visual | Classifiers from Noisy Human-Centric Labels |
Seeking Subjectivity in | visual | Emotion Distribution Learning |
Seemore: A View-Based Approach to 3-D Object Recognition Using Multiple | visual | Cues |
Seemore: Combining Color, Shape, and Texture Histogramming in a Neurally Inspired Approach to | visual | Object Recognition |
SEGA: Semantic Guided Attention on | visual | Prototype for Few-Shot Learning |
SegEQA: Video Segmentation Based | visual | Attention for Embodied Question Answering |
SegLoc: Learning Segmentation-Based Representations for Privacy-Preserving | visual | Localization |
Segment-Phrase Table for Semantic Segmentation, | visual | Entailment and Paraphrasing |
Segmentation of | visual | Motion by Minimizing Convex Non-Quadratic Functionals |
Segmentation-Based PolSAR Image Classification Using | visual | Features: RHLBP and Color Features |
Segmenting focused objects in complex | visual | images |
Segmenting Objects From Relational | visual | Data |
Segmenting | visual | Actions based on Spatio-Temporal Motion Patterns |
Selecting relevant | visual | features for speechreading |
Selection and Execution of Simple Actions via | visual | Attention and Direct Parameter Specification |
Selection of a best metric and evaluation of bottom-up | visual | saliency models |
Selection of Features and Evaluation of | visual | Measurements During Robotic Visual Servoing Tasks |
Selection of Features and Evaluation of | visual | Measurements During Robotic Visual Servoing Tasks |
Selection of Image Primitives for General-Purpose | visual | Processing |
Selection of local features for | visual | search |
Selection of relevant information to improve Image Classification using Bag of | visual | Words |
Selection/substitution of | visual | Features for Object Tracking |
Selective Attention for Identification Model: Simulating | visual | neglect |
Selective Attention-Based Method for | visual | Pattern Recognition with Application to Handwritten Digit Recognition and Face Recognition, A |
Selective Sensor Fusion for Neural | visual | -Inertial Odometry |
Selective | visual | attention enables learning and recognition of multiple objects in cluttered scenes |
Selective Weighted Late Fusion for | visual | Concept Recognition, A |
Selectively guiding | visual | concept discovery |
Self Calibrating | visual | Sensor Networks |
Self Occlusions and Graph Based Edge Measurement Schemes for | visual | Tracking Applications |
Self Supervision to Distillation for Long-Tailed | visual | Recognition |
Self-Adaptive Neural Module Transformer for | visual | Question Answering |
Self-Localization Based on | visual | Lane Marking Maps: An Accurate Low-Cost Approach for Autonomous Driving |
Self-Matching CAM: A Novel Accurate | visual | Explanation of CNNs for SAR Image Interpretation |
Self-Organized Integration of Adaptive | visual | Cues for Face Tracking |
Self-Organizing Approach to Background Subtraction for | visual | Surveillance Applications, A |
Self-Organizing Map Based User Interface for | visual | Surface Inspection |
Self-Organizing Neural Network that Learns to Detect and Represent | visual | Depth from Occlusion Events, A |
Self-Similarity Measure for Assessment of Image | visual | Quality |
Self-Supervised Contrastive Learning for Audio- | visual | Action Recognition |
Self-supervised cross-modal | visual | retrieval from brain activities |
Self-Supervised Deep | visual | Odometry Based on Geometric Attention Model |
Self-Supervised Deep | visual | Odometry With Online Adaptation |
Self-Supervised Depth Completion From Direct | visual | -LiDAR Odometry in Autonomous Driving |
Self-Supervised Domain Adaptation for | visual | Navigation with Global Map Consistency |
Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for | visual | -Audio Separation |
Self-Supervised Learning for | visual | Relationship Detection through Masked Bounding Box Reconstruction |
Self-supervised Learning of Audio- | visual | Objects from Video |
Self-Supervised Learning of Video-Induced | visual | Invariances |
Self-Supervised Learning of | visual | Features through Embedding Images into Text Topic Spaces |
Self-supervised Learning of | visual | Graph Matching |
Self-supervised object detection from audio- | visual | correspondence |
Self-Supervised Pyramid Representation Learning for Multi-Label | visual | Analysis and Beyond |
Self-Supervised Representation Learning using | visual | Field Expansion on Digital Pathology |
Self-Supervised Solution for the Switch-Toggling | visual | Task, A |
Self-Supervised Variable Rate Image Compression using | visual | Attention |
Self-Supervised Video Forensics by Audio- | visual | Anomaly Detection |
Self-supervised | visual | Attribute Learning for Fashion Compatibility |
Self-Supervised | visual | Feature Learning With Deep Neural Networks: A Survey |
Self-Supervised | visual | Representations Learning by Contrastive Mask Prediction |
Self-supervised | visual | -LiDAR Odometry with Flip Consistency |
Self-Supervision-Augmented Deep Autoencoder for Unsupervised | visual | Anomaly Detection |
Self-taught learning of a deep invariant representation for | visual | tracking via temporal slowness principle |
Self-Training Approach for | visual | Tracking and Recognition of Complex Human Activity Patterns, A |
Semantic analysis of human | visual | attention in mobile eye tracking applications |
Semantic and edge-based | visual | odometry by joint minimizing semantic and edge distance error |
Semantic and Relation Modulation for Audio- | visual | Event Localization |
Semantic Audio- | visual | Navigation |
Semantic combination of textual and | visual | information in multimedia retrieval |
Semantic Compositional Networks for | visual | Captioning |
Semantic Curiosity for Active | visual | Learning |
Semantic Equivalent Adversarial Data Augmentation for | visual | Question Answering |
Semantic Event Fusion of Different | visual | Modality Concepts for Activity Recognition |
Semantic granularity metric learning for | visual | search |
Semantic grouping of | visual | features |
Semantic Hierarchies for | visual | Object Recognition |
Semantic Image Segmentation Based Cable Vibration Frequency | visual | Monitoring Using Modified Convolutional Neural Network with Pixel-wise Weighting Strategy |
Semantic Indexing of Multimedia Content Using | visual | , Audio, and Text Cues |
Semantic indexing of soccer audio- | visual | sequences: A multimodal approach based on controlled Markov chains |
Semantic indexing of sports program sequences by audio- | visual | analysis |
Semantic Jitter: Dense Supervision for | visual | Comparisons via Synthetic Images |
Semantic Loopback Detection Method Based on Instance Segmentation and | visual | SLAM in Autonomous Driving |
Semantic Match Consistency for Long-Term | visual | Localization |
Semantic Pose Verification for Outdoor | visual | Localization with Self-supervised Contrastive Learning |
Semantic saliency using k-TR theory of | visual | perception |
Semantic Scene Models for | visual | Localization under Large Viewpoint Changes |
Semantic Sparse Recoding of | visual | Content for Image Applications |
Semantic Texture Complexity Model for Feature Generation and Selection in | visual | SLAM |
Semantic Transfer from Head to Tail: Enlarging Tail Margin for Long-Tailed | visual | Recognition |
Semantic Transform: Weakly Supervised Semantic Inference for Relating | visual | Attributes |
Semantic Video Entity Linking Based on | visual | Content and Metadata |
Semantic video labeling by developmental | visual | agents |
Semantic | visual | Localization |
Semantic | visual | Templates: Linking visual Features to Semantics |
Semantic | visual | Templates: Linking visual Features to Semantics |
Semantic | visual | Understanding of Indoor Environments: From Structures to Opportunities for Action |
Semantic-associative | visual | content labelling and retrieval: A multimodal approach |
Semantic-Aware Modular Capsule Routing for | visual | Question Answering |
Semantic-aware spatial regularization correlation filter for | visual | tracking |
Semantic-aware | visual | attributes learning for zero-shot recognition |
Semantic-Aware | visual | Decomposition for Image Coding |
Semantic-guided de-attention with sharpened triplet marginal loss for | visual | place recognition |
Semantic-only | visual | Odometry based on dense class-level segmentation |
Semantic- | visual | concept relatedness and co-occurrences for image retrieval |
Semantically Grounded | visual | Embeddings for Zero-Shot Learning |
Semantically Guided | visual | Question Answering |
Semantics-Aware | visual | Object Tracking |
Semantics-based selection of everyday concepts in | visual | lifelogging |
Semantics-Consistent Feature Search for Self-Supervised | visual | Representation Learning |
Semantics-Guided Data Hallucination for Few-Shot | visual | Classification |
Semantics-Guided Representation Learning with Applications to | visual | Synthesis |
Semantics-Guided | visual | Simultaneous Localization and Mapping with U-Net for Complex Dynamic Indoor Environments, A |
Semi-Automatic Annotation For | visual | Object Tracking |
Semi-Automatic Annotation of Objects in | visual | -Thermal Video |
Semi-dense | visual | Odometry for a Monocular Camera |
Semi-independent Stereo | visual | Odometry for Different Field of View Cameras |
Semi-Reference Sonar Image Quality Assessment Based on Task and | visual | Perception |
Semi-Supervised and Unsupervised Deep | visual | Learning: A Survey |
Semi-supervised boosting using | visual | similarity learning |
Semi-supervised Domain Adaptation with Subspace Learning for | visual | recognition |
Semi-supervised Learning for Cross-Device | visual | Location Recognition |
Semi-Supervised Learning of | visual | Features by Non-Parametrically Predicting View Assignments with Support Samples |
Semi-Supervised Method for Surveillance-Based | visual | Location Recognition, A |
Semi-Supervised Tensor-Based Graph Embedding Learning and Its Application to | visual | Discriminant Tracking |
Semi-supervised | visual | recognition with constrained graph regularized non negative matrix factorization |
Semi-supervised | visual | Tracking of Marine Animals Using Autonomous Underwater Vehicles |
Semiautomatic | visual | -attention modeling and its application to video compression |
Semiconductio IC's: Integrated Testing and Algorithms for | visual | Inspection |
Sensible Scenes: | visual | Understanding of Complex Scenes Through Causal Analysis |
Sensing | visual | Attention by Sequential Patterns |
Sensing, predicting, and utilizing human | visual | attention |
Sensitivity Analysis of the Human | visual | System for Depth Cues in Stereoscopic 3-D Displays |
Sensor Planning for 3D | visual | Search with Task Constraints |
Sensor planning techniques and active | visual | inspection |
Sentiment Recognition for Short Annotated GIFs Using | visual | -Textual Fusion |
Sentimental | visual | Captioning using Multimodal Transformer |
SentiStory: A Multi-Layered Sentiment-Aware Generative Model for | visual | Storytelling |
Separating Self-Expression and | visual | Content in Hashtag Supervision |
Separating Skills and Concepts for Novel | visual | Question Answering |
Separation of Audio- | visual | Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli |
Separation of Audio- | visual | Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli |
SeqTR: A Simple Yet Universal Network for | visual | Grounding |
SeqTrack: Sequence to Sequence Learning for | visual | Object Tracking |
Sequential Adversarial Learning for Self-Supervised Deep | visual | Odometry |
Sequential Kernel Density Approximation and Its Application to Real-Time | visual | Tracking |
Sequential Mastery of Multiple | visual | Tasks: Networks Naturally Learn to Learn and Forget to Forget |
Sequential model-based segmentation and recognition of image structures driven by | visual | features and spatial relations |
Sequential Monte Carlo Approach to Anomaly Detection in Tracking | visual | Events, A |
Sequential Multiple LSB methods and real-time data hiding: variations for | visual | Cryptography ciphers |
Sequential Particle Generation for | visual | Tracking |
Sequential particle swarm optimization for | visual | tracking |
Sequential | visual | and semantic consistency for semi-supervised text recognition |
SERVOMATIC: A Modular System for Robust Positioning Using Stereo | visual | Servoing |
Set Descriptors for | visual | Evaluation of Human Corneal Endothelia |
set of | visual | feature descriptors and their combination in a low-level description scheme, A |
Seventh | visual | Object Tracking VOT2019 Challenge Results, The |
Severe Thunderstorm Detection by | visual | Learning Using Satellite Images |
SEWA DB: A Rich Database for Audio- | visual | Emotion and Sentiment Research in the Wild |
SgVA-CLIP: Semantic-Guided | visual | Adapting of Vision-Language Models for Few-Shot Image Classification |
Shap-CAM: | visual | Explanations for Convolutional Neural Networks Based on Shapley Value |
Shape Based People Detection for | visual | Surveillance Systems |
Shape Decomposition for | visual | Recognition: The Role of Transversality |
Shape from Textures: A Paradigm for Fusing Middle-Level | visual | Cues |
Shape recovery methods for | visual | inspection |
Shape Similarity Measure Based on Correspondence of | visual | Parts |
Shape Space Sampling Distributions and Their Impact on | visual | Tracking |
Shape-Adaptive Discrete Wavelet Transforms for Arbitrarily Shaped | visual | Object Coding |
Shape-from-Texture Algorithm Based on the Human | visual | Psychophysics, A |
Shapelearner: Towards Shape-based | visual | Knowledge Harvesting |
Shaping Deep Feature Space Towards Gaussian Mixture for | visual | Classification |
Shaping | visual | Representations With Attributes for Few-Shot Recognition |
Sharing more information in gray | visual | cryptography scheme |
Sharing multiple secrets in | visual | cryptography |
Sharing Multiple Secrets in XOR-Based | visual | Cryptography by Non-Monotonic Threshold Property |
Sharing | visual | Features for Animal Categorization: An Empirical Study |
Sharing | visual | Features for Multiclass and Multiview Object Detection |
Sharing | visual | Secrets Among Multiple Groups With Enhanced Performance |
Sharing | visual | Secrets in Single Image Random Dot Stereograms |
Shifting More Attention to | visual | Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding |
Shifting More Attention to | visual | Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding |
Ship Detection in Panchromatic Optical Remote Sensing Images Based on | visual | Saliency and Multi-Dimensional Feature Description |
Ship Object Detection of Remote Sensing Image Based on | visual | Attention |
Shot genre classification using compressed audio- | visual | features |
Show, Tell and Summarize: Dense Video Captioning Using | visual | Cue Aided Sentence Summarization |
Showcasing Deeply Supervised Multimodal Attentional Translation Embeddings: a Demo for | visual | Relationship Detection |
Shrinking large | visual | vocabularies using multi-label agglomerative information bottleneck |
Shuffle-Then-Assemble: Learning Object-Agnostic | visual | Relationship Features |
Siam R-CNN: | visual | Tracking by Re-Detection |
SiamCAN: Real-Time | visual | Tracking Based on Siamese Center-Aware Network |
SiamCAR: Siamese Fully Convolutional Classification and Regression for | visual | Tracking |
SiamCCF: Siamese | visual | tracking via cross-layer calibration fusion |
SiamCorners: Siamese Corner Networks for | visual | Tracking |
SiamDA: Dual attention Siamese network for real-time | visual | tracking |
Siamese Box Adaptive Network for | visual | Tracking |
Siamese Cascaded Region Proposal Networks for Real-Time | visual | Tracking |
Siamese Graph Attention Networks for robust | visual | object tracking |
Siamese Implicit Region Proposal Network With Compound Attention for | visual | Tracking |
Siamese networks with distractor-reduction method for long-term | visual | object tracking |
Siamese recurrent architecture for | visual | tracking |
Siamese self-supervised learning for fine-grained | visual | classification |
Siamese target estimation network with AIoU loss for real-time | visual | tracking |
Siamese | visual | Tracking with Dual-Pipeline Correlated Fusion Network |
Siamese | visual | tracking with multilayer feature fusion and corner distance IoU loss |
Siamese-Based Twin Attention Network for | visual | Tracking |
SiamON: Siamese Occlusion-Aware Network for | visual | Tracking |
SiamRank: A siamese based | visual | tracking network with ranking strategy |
SiamRPN++: Evolution of Siamese | visual | Tracking With Very Deep Networks |
SiamSampler: Video-Guided Sampling for Siamese | visual | Tracking |
SID4VAM: A Benchmark Dataset With Synthetic Images for | visual | Attention Modeling |
Sidekick Policy Learning for Active | visual | Exploration |
SIEVE: Search Images Effectively Through | visual | Elimination |
Sign language detection using 3D | visual | cues |
signature-based bag of | visual | words method for image indexing and search, A |
Significance of | visual | Limitations in Automated Pattern Recognition Applications |
Significant Pixel Watermarking Using Human | visual | System Model in Wavelet Domain |
Significant scene detection and frame filtering for a | visual | indexing system |
Significant scene detection and frame filtering for a | visual | indexing system using dynamic thresholds |
Silhouette-based multi-sensor smoke detection: Coverage analysis of moving object silhouettes in thermal and | visual | registered images |
Sim VQA: Exploring Simulated Environments for | visual | Question Answering |
Sim2Real Viewpoint Invariant | visual | Servoing by Recurrent Control |
SimGlim: Simplifying glimpse based active | visual | reconstruction |
Similar Manga Retrieval Using | visual | Vocabulary Based on Regions of Interest |
Similarity Fusion for | visual | Tracking |
Similarity Measure of the | visual | Features Using the Constrained Hierarchical Clustering for Content Based Image Retrieval |
Simple and effective | visual | question answering in a single modality |
Simple and Strong Baseline for Universal Targeted Attacks on Siamese | visual | Tracking, A |
Simple Baseline for Audio- | visual | Scene-Aware Dialog, A |
Simple contrastive learning in a self-supervised manner for robust | visual | question answering |
Simple Episodic Linear Probe Improves | visual | Recognition in the Wild, A |
Simple Signed-Distance Function Depth Calculation Applied to Measurement of the fMRI BOLD Hemodynamic Response Function in Human | visual | Cortex |
Simple solution for | visual | servoing of camera-in-hand robots in the 3d Cartesian space |
Simple Technique for Improving Camera Displacement Estimation in Eye-in-Hand | visual | Servoing, A |
Simple | visual | Words Selection Strategy for Pedestrian Detection, A |
Simple | visual | -Textual Baseline for Pedestrian Attribute Recognition, A |
Simple, Cheap, and Robust | visual | Navigation System, A |
Simulated Tearing: An Algorithm for Discontinuity-Preserving | visual | Surface Reconstruction |
Simulating vision through time: Hierarchical, sparse models of | visual | cortex for motion imagery |
Simulation Framework for a | visual | -Inertial Navigation System |
Simulation of Automated | visual | Inspection Systems for Specular Surfaces Quality Control |
Simulation of the Retina: A Tool for | visual | Prostheses |
Simultaneous and Sequential Reconstruction of | visual | Primitives with Bundle Adjustment |
Simultaneous Classification and | visual | Word Selection using Entropy-based Minimum Description Length |
Simultaneous Determination of Camera Pose and Intrinsic Parameters by | visual | Servoing |
Simultaneous | visual | Data Completion and Denoising Based on Tensor Rank and Total Variation Minimization and Its Primal-Dual Splitting Algorithm |
Simultaneous | visual | Recognition of Manipulation Actions and Manipulated Objects |
SimVODIS: Simultaneous | visual | Odometry, Object Detection, and Instance Segmentation |
Single Image Defogging Based on Illumination Decomposition for | visual | Maritime Surveillance |
Single Image Human Proxemics Estimation for | visual | Social Distancing |
Single-Camera and Inter-Camera Vehicle Tracking and 3D Speed Estimation Based on Fusion of | visual | and Semantic Features |
Single-frame super-resolution by a cortex based mechanism using high level | visual | features in natural images |
single-frame | visual | gyroscope, A |
Single-modal Incremental Terrain Clustering from Self-Supervised Audio- | visual | Feature Learning |
Single-Modal Video Analysis of Personality Traits using Low-Level | visual | Features |
Single-shot underwater image restoration: A | visual | quality-aware method based on light propagation model |
Singularities of the | visual | Mapping, The |
Singularities of the | visual | Motion Field: 3D Rotation or 3D Translation |
Singularity Avoidance in Uncalibrated | visual | Servoing |
SINT++: Robust | visual | Tracking via Adversarial Positive Instance Generation |
SIR-Net: Scene-Independent End-to-End Trainable | visual | Relocalizer |
SiRi: A Simple Selective Retraining Mechanism for Transformer-Based | visual | Grounding |
Situation Recognition: | visual | Semantic Role Labeling for Image Understanding |
Situational Fusion of | visual | Representation for Visual Navigation |
Situational Fusion of | visual | Representation for Visual Navigation |
Sixth | visual | Object Tracking VOT2018 Challenge Results, The |
Size and Position Invariance in the | visual | System |
Size Invariant | visual | Cryptography Schemes With Evolving Threshold Access Structures |
Size-invariant Detection of Marine Vessels From | visual | Time Series |
SKCS: New separable kernel family with compact support: Application to | visual | segmentation of handwritten data |
Sketch of a (Computational) Theory of | visual | Kinesthesis, A |
Sketch Retrieval Method for Full Color Image Database Query by | visual | Example, A |
Sketch-based image retrieval with deep | visual | semantic descriptor |
Sketching with Style: | visual | Search with Sketches and Aesthetic Context |
Skill-Based Hierarchical Reinforcement Learning for Target | visual | Navigation |
Slant-Tilt: The | visual | Encoding of Surface Orientation |
Slaving Head and Eye Movements for | visual | Telepresence |
Sleep Analysis for Elderly Care Using a Low-Resolution | visual | Sensor Network |
Slot Cars: 3D Modelling for Improved | visual | Traffic Analytics |
Slow | visual | Search in a Fast-Changing World |
smart sensor based | visual | landmarks detection for indoor robot navigation, A |
Smart Toothbrushes: Inertial Measurement Sensors Fusion with | visual | Tracking |
SMART-I2: Spatial Multi-user Audio- | visual | Real-time interactive interface, A broadcast application context |
SMART: Joint Sampling and Regression for | visual | Tracking |
SmartMonitor: An Approach to Simple, Intelligent and Affordable | visual | Surveillance System |
SmartOverlays: A | visual | Saliency Driven Label Placement for Intelligent Human-Computer Interfaces |
Smile detection in the wild with hierarchical | visual | feature |
Smile, Be Happy :) Emoji Embedding for | visual | Sentiment Analysis |
Smooth Incremental Learning of Correlation Filters for | visual | Tracking |
Snoopertext: A multiresolution system for text detection in complex | visual | scenes |
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based | visual | Question Answering |
Soccer players identification based on | visual | local features |
Soccer: Who Has the Ball? Generating | visual | Analytics and Player Statistics |
Social negative bootstrapping for | visual | categorization |
Social-oriented | visual | image search |
Soft assignment of | visual | words as Linear Coordinate Coding and optimisation of its reconstruction error |
Soft Mask Correlation Filter for | visual | Object Tracking |
Soft Measure of | visual | Token Occurrences for Object Categorization |
Soft Pseudo-labeling Semi-supervised Learning Applied to Fine-grained | visual | Classification |
Soft Transfer Learning via Gradient Diagnosis for | visual | Relationship Detection |
Soft-Switching Approach to Improve | visual | Quality of Colour Image Smoothing Filters, A |
Software Architecture for Distributed | visual | Tracking in a Global Vision Localization System, A |
Software Based Object Tracking with | visual | Feature Integration |
Software Laboratory for | visual | Inspection and Recognition, A |
SOLVER: Scene-Object Interrelated | visual | Emotion Reasoning Network |
Solving Rendering Issues in Realistic 3D Immersion for | visual | Rehabilitation |
Solving | visual | Madlibs with Multiple Cues |
Some Algorithms for Image Enhancement Incorporating Human | visual | Response |
Some Features of | visual | Form |
Some Informational Aspects of | visual | Perception |
Some Like It Hot: | visual | Guidance for Preference Prediction |
Some Observations on the Human | visual | Perception System and their Relevance to Computer Vision Research |
Something Something Video Database for Learning and Evaluating | visual | Common Sense, The |
SORT: Second-Order Response Transform for | visual | Recognition |
Sound and | visual | Representation Learning with Multiple Pretraining Tasks |
Sound to | visual | Scene Generation by Audio-to-Visual Latent Alignment |
Sound to | visual | Scene Generation by Audio-to-Visual Latent Alignment |
Sound2sight: Generating | visual | Dynamics from Sound and Context |
Soundspaces: Audio- | visual | Navigation in 3d Environments |
SOWP: Spatially Ordered and Weighted Patch Descriptor for | visual | Tracking |
Space-time | visual | effects as a post-production process |
Space-Variant Dynamic Neural Fields for | visual | Attention |
SPARE: Self-supervised part erasing for ultra-fine-grained | visual | categorization |
Spark: Spatial-aware Online Incremental Attack Against | visual | Tracking |
Sparse and Semi-supervised | visual | Mapping with the S^3GP |
Sparse and Structured | visual | Attention |
Sparse Bayesian Learning for Efficient | visual | Tracking |
Sparse coding based | visual | tracking: Review and experimental comparison |
Sparse concept coding for | visual | analysis |
Sparse Contextual Activation for Efficient | visual | Re-Ranking |
Sparse Embedding | visual | Attention Systems Combined with Edge Information |
Sparse Feature Learning for | visual | Tracking by Least Absolute Shrinkage and Selection Operator |
Sparse Gradient Pursuit for Robust | visual | Analysis |
Sparse Output Coding for Large-Scale | visual | Recognition |
Sparse Output Coding for Scalable | visual | Recognition |
Sparse representation based | visual | element analysis |
Sparse Representation Model Using the Complete Marginal Fisher Analysis Framework and Its Applications to | visual | Recognition, A |
Sparse representations of image gradient orientations for | visual | recognition and tracking |
Sparse Spatial Coding: A Novel Approach to | visual | Recognition |
Sparse-coded cross-domain adaptation from the | visual | to the brain domain |
Sparse-to-Dense Hypercolumn Matching for Long-Term | visual | Localization |
Sparsity preserving multiple canonical correlation analysis with | visual | emotion recognition to multi-feature fusion |
Sparsity-Driven Approach to Multi-Camera Tracking in | visual | Sensor Networks, A |
Sparsity-Driven Bandwidth-Efficient Decentralized Tracking in | visual | Sensor Networks |
Spatial and long-short temporal attention correlation filters for | visual | tracking |
Spatial and | visual | Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding |
Spatial arrangement of color in retrieval by | visual | similarity |
Spatial Coherence for | visual | Motion Analysis |
Spatial context for | visual | vocabulary construction |
Spatial Correlation Model for | visual | Information in Wireless Multimedia Sensor Networks, A |
Spatial dependence in the observation of | visual | contours |
Spatial Exploration Indicators in the Remote Assessment of | visual | Neglect |
Spatial extensions to bag of | visual | words |
Spatial Frequency Band Division in Human | visual | System Based-Watermarking |
Spatial histograms of soft pairwise similar patches to improve the bag-of- | visual | -words model |
Spatial Information and Features, | visual | Relationships |
Spatial Knowledge Distillation to Aid | visual | Reasoning |
Spatial Models for Wide-Area | visual | Surveillance: Computational Approaches and Spatial Building-Blocks |
Spatial Noise Shaping Based on Human | visual | Sensitivity and Its Application to Image Coding |
Spatial orientations of | visual | word pairs to improve Bag-of-Visual-Words model |
Spatial orientations of | visual | word pairs to improve Bag-of-Visual-Words model |
Spatial Perception by Object-Aware | visual | Scene Representation |
Spatial probabilistic distribution map-based two-channel 3D U-net for | visual | pathway segmentation |
Spatial Pyramid Pooling in Deep Convolutional Networks for | visual | Recognition |
Spatial Random Partition for Common | visual | Pattern Discovery |
Spatial selection for attentional | visual | tracking |
Spatial Sensitive GRAD-CAM: | visual | Explanations for Object Detection by Incorporating Spatial Sensitivity |
Spatial Similarity Measure of | visual | Phrases for Image Retrieval |
Spatial Statistics of | visual | Keypoints for Texture Recognition |
Spatial-DiscLDA for | visual | recognition |
Spatial-Semantic Image Search by | visual | Feature Synthesis |
Spatial-slice Feature Learning Using | visual | Transformer and Essential Slices Selection Module for Covid-19 Detection of Ct Scans in the Wild |
Spatial-Temporal Adaptive Feature Weighted Correlation Filter for | visual | Tracking |
Spatial- | visual | Label Propagation for Local Feature Classification |
Spatially Attentive Correlation Filters for | visual | Tracking |
Spatially Regularized Low Rank Tensor Optimization for | visual | Data Completion |
Spatially-Constrained Semantic Segmentation with Topological Maps and | visual | Embeddings |
Spatio-Temporal Auxiliary Particle Filtering With L_1-Norm-Based Appearance Model Learning for Robust | visual | Tracking |
Spatio-temporal context based recurrent | visual | attention model for lymph node detection |
Spatio-Temporal Dynamics and Semantic Attribute Enriched | visual | Encoding for Video Captioning |
Spatio-Temporal Model for Early | visual | Processing, A |
Spatio-Temporal Model of the Selective Human | visual | Attention, A |
Spatio-temporal modeling of | visual | attention for stereoscopic 3D video |
Spatio-temporal quality pooling adaptive to distortion distribution and | visual | attention |
Spatio-Temporal | visual | Analysis for Urban Traffic Characters Based on Video Surveillance Camera Data |
Spatio-Temporal VLAD Encoding of | visual | Events Using Temporal Ordering of the Mid-Level Deep Semantics |
Spatiotemporal Alignment of | visual | Signals on a Special Manifold |
Spatiotemporal Inseparability in Early | visual | Processing |
Spatiotemporal Oriented Energy Features for | visual | Tracking |
Spatiotemporal Registration for Event-based | visual | Odometry |
Spatiotemporal Representations for | visual | Navigation |
Spatiotemporal Salience via Centre-Surround Comparison of | visual | Spacetime Orientations |
spatiotemporal saliency model of | visual | attention based on maximum entropy, A |
Spatiotemporal salient points for | visual | recognition of human actions |
Spatiotemporal | visual | Considerations for Video Coding |
SPCNet: Scale Position Correlation Network for End-to-End | visual | Tracking |
Speaker and Digit Recognition by Audio- | visual | Lip Biometrics |
Speaker dependent video indexing based on audio- | visual | interaction |
Speaker Independent Audio- | visual | Speech Recognition |
Speaker Tracking Algorithm Based on Audio and | visual | Information Fusion Using Particle Filter, A |
Special Cane with | visual | Odometry for Real-time Indoor Navigation of Blind People |
Special edition on semi-supervised learning for | visual | content analysis and understanding |
Special Issue on Advances in Statistical Methods-based | visual | Quality Assessment |
Special Issue on Applied | visual | Inspection |
Special issue on European projects on | visual | representation systems and services |
Special Issue on Generating Realistic | visual | Data of Human Behavior |
Special Issue on Image Sequence Processing and Motion Analysis in | visual | Communication |
Special issue on Intelligent | visual | Surveillance |
Special Issue on Large-scale | visual | Sensor Networks: Architectures and Applications |
Special issue on spatial coherence for | visual | motion analysis |
Special issue on | visual | communication in the ubiquitous era |
Special issue on | visual | concept detection in the MIRFLICKR/ImageCLEF benchmark |
Special issue on | visual | form |
Special issue on | visual | information retrieval |
Special issue on | visual | search and augmented reality |
Special Issue on | visual | Surveillance |
Special Issue on | visual | Tracking |
Special Issue on | visual | -Perception: Guest Editorial |
Special Issue: Algorithm/Architecture Co-Exploration of | visual | Computing on Emerging Platforms |
Special Issue: Real-Time Data Hiding and | visual | Cryptography |
Special Section on the 1997 | visual | Communications and Image Processing Award Papers |
Special section on | visual | information processing |
Spectral attribute learning for | visual | regression |
Spectral histogram representations for | visual | modeling |
Spectral Unsupervised Domain Adaptation for | visual | Recognition |
SPECTRE: | visual | Speech-Informed Perceptual 3D Facial Expression Reconstruction from Videos |
Speech-assisted lip synchronization in audio- | visual | communications |
Speech-to-video synthesis using MPEG-4 compliant | visual | features |
Speech- | visual | Emotion Recognition by Fusing Shared and Specific Features |
Speech- | visual | Emotion Recognition via Modal Decomposition Learning |
Speech/Gesture Interface to a | visual | Computing Environment for Molecular Biologists |
Spherical Correlation of | visual | Representations for 3D Model Retrieval |
Spherical Image Processing for Accurate | visual | Odometry with Omnidirectional Cameras |
spherical representation for efficient | visual | loop closing, A |
SPIHT-Based Coding of the Shape and Texture of Arbitrarily Shaped | visual | Objects |
SPiKeS: Superpixel-Keypoints structure for robust | visual | tracking |
Spline Error Weighting for Robust | visual | -Inertial Fusion |
Spline Fusion: A continuous-time representation for | visual | -inertial fusion with application to rolling shutter cameras |
SplitNet: Sim2Sim and Task2Task Transfer for Embodied | visual | Navigation |
SPM-Tracker: Series-Parallel Matching for Real-Time | visual | Object Tracking |
Spoken Moments: Learning Joint Audio- | visual | Representations from Video Descriptions |
Spontaneous | visual | database for detecting learning-centered emotions during online learning |
Spoofing Detection of Civilian UAVs Using | visual | Odometry |
Spotting Audio- | visual | Inconsistencies (SAVI) in Manipulated Video |
Spread Spectrum | visual | Sensor Network Resource Management Using an End-to-End Cross-Layer Design |
Squeezed Bilinear Pooling for Fine-Grained | visual | Categorization |
SSBNet: Improving | visual | Recognition Efficiency by Adaptive Sampling |
SSPNet: Predicting | visual | Saliency Shifts |
SSSD: Speech Scene database by Smart Device for | visual | Speech Recognition |
Stabilizing | visual | Reinforcement Learning via Asymmetric Interactive Cooperation |
StableNet: Distinguishing the hard samples to overcome language priors in | visual | question answering |
Stacked squeeze-and-excitation recurrent residual network for | visual | -semantic matching |
StARformer: Transformer with State-Action-Reward Representations for | visual | Reinforcement Learning |
State-Based SHOSLIF for Indoor | visual | Navigation |
State-of-the-Art in Handling Occlusions for | visual | Object Tracking, The |
State-of-the-Art in | visual | Attention Modeling |
Static and Space-Time | visual | Saliency Detection by Self-Resemblance |
Statistical adaptive metric learning in | visual | action feature set recognition |
Statistical Analysis of | visual | Attentional Patterns for Video Surveillance |
Statistical and Geometrical Approaches to | visual | Motion Analysis |
Statistical evaluation of no-reference image | visual | quality metrics |
Statistical Image-Based Shape Model for | visual | Hull Reconstruction and 3D Structure Inference, A |
Statistical learning for effective | visual | information retrieval |
Statistical Learning of | visual | Feature Hierarchies |
Statistical modeling and conceptualization of | visual | patterns |
Statistical Modeling of Long-Range Drift in | visual | Odometry |
Statistical Modeling of | visual | Attention of Junior and Senior Anesthesiologists During the Induction of General Anesthesia in Real and Simulated Cases |
Statistical Models of | visual | Shape and Motion |
Statistical Richness of | visual | Phase Information: Update on Recognizing Persons by Iris Patterns |
Statistics of Second Order Multi-modal Feature Events and Their Exploitation in Biological and Artificial | visual | Systems |
STCT: Sequentially Training Convolutional Networks for | visual | Tracking |
Stereo and IMU assisted | visual | odometry on an OMAP3530 for small robots |
Stereo DSO: Large-Scale Direct Sparse | visual | Odometry with Stereo Cameras |
Stereo Image Quality Assessment Considering the Asymmetry of Statistical Information in Early | visual | Pathway |
Stereo image quality assessment considering the difference of statistical feature in early | visual | pathway |
Stereo Matching Based on | visual | Sensitive Information |
Stereo Tracking and Three-Point/One-Point Algorithms: A Robust Approach in | visual | Odometry |
Stereo | visual | Odometry Without Temporal Filtering |
stereo | visual | pattern image coding system, A |
Stereo | visual | Tracking Within Structured Environments for Measuring Vehicle Speed |
Stereoscopic 3D | visual | Discomfort Prediction: A Dynamic Accommodation and Vergence Interaction Model |
stereoscopic content analysis system with | visual | discomfort-aware, A |
Stereoscopic image quality assessment by analysing | visual | hierarchical structures and binocular effects |
Stereoscopic image quality assessment by learning non-negative matrix factorization-based color | visual | characteristics and considering binocular interactions |
Stereoscopic image quality assessment considering | visual | mechanism and multi-loss constraints |
Stereoscopic Video Quality Assessment Based on | visual | Attention and Just-Noticeable Difference Models |
Stereoscopic video shot clustering into semantic concepts based on | visual | and disparity information |
STGL: Spatial-Temporal Graph Representation and Learning for | visual | Tracking |
Still an Ineffective Method With Supertrials/ERPs: Comments on Decoding Brain Representations by Multimodal Learning of Neural Activity and | visual | Features |
Stimuli-Aware | visual | Emotion Analysis |
Stitching Dynamic Movement Primitives and Image-Based | visual | Servo Control |
STMTrack: Template-free | visual | Tracking with Space-time Memory Networks |
Stochastic Decorrelation Constraint Regularized Auto-Encoder for | visual | Recognition |
Stochastic Guided Search Model for Search Asymmetries in | visual | Search Tasks |
stochastic model of selective | visual | attention with a dynamic Bayesian network, A |
Stochastic refinement of the | visual | hull to satisfy photometric and silhouette consistency constraints |
Stopwords Detection in Bag-of- | visual | -Words: The Case of Retrieving Maya Hieroglyphs |
Story Segmentation in News Videos Using | visual | and Text Cues |
Straight to the Facts: Learning Knowledge Base Retrieval for Factual | visual | Question Answering |
Structural Analysis of | visual | Form on Packaging Graphics and Its Use in an Automated Design System, A |
Structural Correlation Filter for Robust | visual | Tracking |
Structural Pyramids for Representing and Locating Moving Obstacles in | visual | Guidance of Navigation |
Structural sparse representation-based semi-supervised learning and edge detection proposal for | visual | tracking |
Structural SVM for | visual | localization and continuous state estimation |
Structure alignment of attributes and | visual | features for cross-dataset person re-identification |
Structure Description of | visual | Information, A |
Structure Is a | visual | Class Invariant |
Structure of | visual | Spaces, The |
Structure-Aware Local Sparse Coding for | visual | Tracking |
Structure-Encoding Auxiliary Tasks for Improved | visual | Representation in Vision-and-Language Navigation |
Structured Attentions for | visual | Question Answering |
Structured Label Inference for | visual | Understanding |
Structured Semantic Representation for | visual | Question Answering |
Structured Siamese Network for Real-Time | visual | Tracking |
Structured Triplet Learning with POS-Tag Guided Attention for | visual | Question Answering |
Structured | visual | Feature Learning for Classification via Supervised Probabilistic Tensor Factorization |
Structured | visual | Search via Composition-aware Learning |
Structured | visual | Tracking with Dynamic Graph |
Structured-light based joint recognition using bottom-up and top-down combined | visual | processing |
Structuring | visual | Words in 3D for Arbitrary-View Object Localization |
StructVPR: Distill Structural Knowledge with Weighting Samples for | visual | Place Recognition |
Student-t Mixture Filter for Robust, Real-Time | visual | Tracking |
study of Bag-of- | visual | -Words representations for handwritten keyword spotting, A |
study of brain networks driven by steady-state | visual | evoked potentials, The |
Study of Subjective and Objective Quality Assessment of Audio- | visual | Signals |
Study of Top-Down | visual | Attention Model Based on Similarity Distance, A |
study of virtual | visual | servoing sensitivity in the context of image/GIS registration for urban environments, A |
Study of | visual | saliency detection via nonlocal anisotropic diffusion equation |
Study of | visual | Shape Perception, A |
Study on performance of MPEG-7 | visual | descriptors for deformable object retrieval |
Study on the Cardinality of Ordered Average Pooling in | visual | Recognition, A |
study on the distribution of social biases in self-supervised learning | visual | models, A |
study on the effect of camera motion on human | visual | attention, A |
Study on | visual | Attack to BPCS-Steganography and Countermeasure, A |
Studying the added value of | visual | attention in objective image quality metrics based on eye movement data |
STVGBert: A | visual | -linguistic Transformer based Framework for Spatio-temporal Video Grounding |
Style-Aware Mid-level Representation for Discovering | visual | Connections in Space and Time |
Style-Hallucinated Dual Consistency Learning: A Unified Framework for | visual | Domain Generalization |
StyleNet: Generating Attractive | visual | Captions with Styles |
Sub-word Level Lip Reading With | visual | Attention |
Subjective and Objective Audio- | visual | Quality Assessment for User Generated Content |
Subjective Evaluation of | visual | Quality and Simulator Sickness of Short 360° Videos: ITU-T Rec. P.919 |
Subjective Evaluation on | visual | Perceptibility of Embedding Complementary Patterns for Nonintrusive Projection-Based Augmented Reality |
Subjective Image Fidelity Metric Based on Bit Allocation of the Human | visual | -System in the DCT Domain |
Subjective Logic Based Hybrid Approach to Conditional Evidence Fusion for Forensic | visual | Surveillance |
Subjective | visual | Quality Assessment of Immersive 3D Media Compressed by Open-Source Static 3D Mesh Codecs |
Submodular Attribute Selection for | visual | Recognition |
Substantial improvement of stereo | visual | odometry by multi-path feature tracking |
Subsurface Structure Analysis Using Computational Interpretation and Learning: A | visual | Signal Processing Perspective |
Sufficient dimension reduction for | visual | sequence classification |
Summarization of | visual | Content in Instructional Videos |
Summarizing Long-Length Videos with GAN-Enhanced Audio/ | visual | Features |
Summarizing | visual | data using bidirectional similarity |
Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in | visual | Reasoning |
super-resolution based method to synthesize | visual | images from near infrared, A |
Super-resolution image | visual | quality assessment based on structure-texture features |
SuperDisco: Super-Class Discovery Improves | visual | Recognition for the Long-Tail |
Superimposing Thermal-Infrared Data on 3D Structure Reconstructed by RGB | visual | Odometry |
Supervised Approach in Background Modelling for | visual | Surveillance, A |
Supervised Kernel Descriptors for | visual | Recognition |
Supervised learning of Gaussian mixture models for | visual | vocabulary generation |
Supervised | visual | Vocabulary with Category Information |
Supplementary Material: AVA-ActiveSpeaker: An Audio- | visual | Dataset for Active Speaker Detection |
support system for | visual | ly impaired persons to understand three-dimensional visual information using acoustic interface, A |
Support Tools for | visual | Information Management |
Support Vector Machine Based Online Learning Approach for Automated | visual | Inspection, A |
Support Vector Machine-Based Dynamic Network for | visual | Speech Recognition Applications, A |
Support Vector Machines for | visual | Gender Classification |
Supporting Human-Robot Interaction Based on the Level of | visual | Focus of Attention |
Supporting Multitracking Performance With Novel | visual | , Auditory, and Tactile Displays |
Supporting | visual | quality assessment with machine learning |
Surprising Effectiveness of | visual | Odometry Techniques for Embodied PointGoal Navigation, The |
Surveillance System Integrating | visual | Telepresence, A |
Survey of Affect Recognition Methods: Audio, | visual | , and Spontaneous Expressions, A |
Survey of Automated | visual | Inspection, A |
Survey of compressed-domain features used in audio- | visual | indexing and analysis |
survey of datasets for | visual | tracking, A |
Survey of information theory in | visual | quality assessment |
survey of methods, datasets and evaluation metrics for | visual | question answering, A |
Survey of single-target | visual | tracking methods based on online learning |
survey on bias in | visual | datasets, A |
Survey on classifying human actions through | visual | sensors |
Survey on hardware implementations of | visual | object trackers |
Survey on Long-Tailed | visual | Recognition, A |
Survey on the Analysis and Modeling of | visual | Kinship: A Decade in the Making |
Survey on | visual | Analytics of Social Media Data, A |
Survey on | visual | Content-Based Video Indexing and Retrieval, A |
Survey on | visual | Navigation and Positioning for Autonomous UUVs, A |
survey on | visual | quality assessment methods for light fields, A |
Survey on | visual | sentiment analysis |
Survey on | visual | Surveillance of Object Motion and Behaviors, A |
survey on | visual | -Based Localization: On the benefit of heterogeneous data, A |
Susceptibility to | visual | Discomfort of 3-D Displays by Visual Performance Measures |
Susceptibility to | visual | Discomfort of 3-D Displays by Visual Performance Measures |
SVD based Kalman Particle Filter for Robust | visual | Tracking |
SVG-Loop: Semantic- | visual | -Geometric Information-Based Loop Closure Detection |
SVGC-AVA: 360-Degree Video Saliency Prediction With Spherical Vector-Based Graph Convolution and Audio- | visual | Attention |
SVM-KNN: Discriminative Nearest Neighbor Classification for | visual | Category Recognition |
SVO Pro: Semi-direct | visual | -Inertial Odometry and SLAM for Monocular, Stereo, and Wide Angle Cameras |
SwapMix: Diagnosing and Regularizing the Over-Reliance on | visual | Context in Visual Question Answering |
SwapMix: Diagnosing and Regularizing the Over-Reliance on | visual | Context in Visual Question Answering |
Switching particle filters for efficient real-time | visual | tracking |
Symbol Spotting Approach Based on the Vector Model and a | visual | Vocabulary, A |
Symbolic Description and | visual | Querying of Image Sequences Using Spatiotemporal Logic |
Symmetry in | visual | Symbol Sets |
Symmetry-aware Neural Architecture for Embodied | visual | Exploration |
Symmetry-aware Neural Architecture for Embodied | visual | Navigation |
Synchronization of Multiple Camera Videos Using Audio- | visual | Features |
Synchronized Audio- | visual | Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation |
syntactic framework for bitstream-level representation of audio- | visual | objects, A |
Syntax-based arithmetic video coding for very low bit rate | visual | telephony |
Synthesis of Silhouettes and | visual | Hull Reconstruction for Articulated Humans |
Synthesis of | visual | Modules from Examples: Learning Hyperacuity |
SynthVSR: Scaling Up | visual | Speech Recognition With Synthetic Supervision |
System and method for triphone-based unit selection for | visual | speech synthesis |
System and method of | visual | orientation |
System for the Automatic | visual | Inspection of Bare-Printed Circuit Boards, A |
System for Various | visual | Classification Tasks Based on Neural Networks, A |
Systematic Methods for Multivariate Data | visual | ization and Numerical Assessment of Class Separability and Overlap in Automated Visual Industrial Quality Control |
Systemic | visual | Structures: Design Solution for Complexities of Big Data Interfaces |
Systems and methods for the automated sensing of motion in a mobile robot using | visual | data |
t, k, n) XOR-based | visual | cryptography scheme with essential shadows |
Table Detection Method for Multipage PDF Documents via | visual | Seperators and Tabular Structures, A |
Tag refinement in an image folksonomy using | visual | similarity and tag co-occurrence statistics |
Tagged | visual | Cryptography |
Taking a Closer Look At | visual | Relation: Unbiased Video Scene Graph Generation With Decoupled Label Learning |
Taking | visual | motion prediction to new heightfields |
Tale of Two Classifiers: SNoW vs. SVM in | visual | Recognition, A |
Talking Face: Using Facial Feature Detection and Image Transformations for | visual | Speech |
Talking Head Generation with Probabilistic Audio-to- | visual | Diffusion Priors |
TAMPAR: | visual | Tampering Detection for Parcel Logistics in Postal Supply Chains |
Tangent Bundle Theory for | visual | Curve Completion, A |
TapTell: Interactive | visual | search for mobile task recommendation |
Target Acquisition Methodology for | visual | and Infrared Imaging Sensors |
Target Aware | visual | Object Tracking |
Target Model Estimation using Particle Filters for | visual | Servoing |
Target-Aware State Estimation for | visual | Tracking |
Target-Cognisant Siamese Network for Robust | visual | Object Tracking |
Target-Oriented Deformation of | visual | -Semantic Embedding Space |
Task Specific | visual | Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks |
Task-Aware Few-Shot | visual | Classification with Improved Self-Supervised Metric Learning |
Task-Dependent | visual | -Codebook Compression |
Task-Driven Learning of Spatial Combinations of | visual | Features |
Task-Oriented Generation of | visual | Sensing Strategies |
Task-Oriented Generation of | visual | Sensing Strategies in Assembly Tasks |
TAT: Targeted backdoor attacks against | visual | object tracking |
taxonomy of | visual | recognition, A |
TCD-TIMIT: An Audio- | visual | Corpus of Continuous Speech |
Teaching Categories to Human Learners with | visual | Explanations |
Technical Perspective: Lighting the Way to | visual | Privacy |
Technical Perspective: Progress in | visual | Categorization |
Technical Perspective: | visual | Reconstruction |
Teleimmersive Audio- | visual | Communication Using Commodity Hardware |
Telepresence teaching | visual | equipment |
Tell Me What You Like and I'll Tell You What You Are: Discriminating | visual | Preferences on Flickr Data |
Template Guided | visual | Inspection |
Template Matching Method Based on | visual | Feature Constraint and Structure Constraint |
template polynomial approach for image processing and | visual | recognition, A |
Temporal accumulation of oriented | visual | features |
Temporal and Cross-modal Attention for Audio- | visual | Zero-Shot Learning |
Temporal and | visual | Analysis-Based Approach to Commercial Detection in News Video, A |
Temporal causality for the analysis of | visual | events |
Temporal Cue Guided Video Highlight Detection with Low-Rank Audio- | visual | Fusion |
Temporal Integration Based | visual | Cryptography Scheme and Its Application |
Temporal Integration of | visual | Surface Reconstruction |
Temporal Knowledge Consistency for Unsupervised | visual | Representation Learning |
Temporal precedence in asynchronous | visual | indexing |
Temporal resolution vs. | visual | saliency in videos: Analysis of gaze patterns and evaluation of saliency models |
Temporal Restricted | visual | Tracking Via Reverse-Low-Rank Sparse Learning |
Temporally adaptive motion interpolation exploiting temporal masking in | visual | perception |
temporally consistent grid-based | visual | odometry framework for multi-core architectures, A |
Ten Years of Digital | visual | Restoration Systems |
Ten-fold Improvement in | visual | Odometry Using Landmark Matching |
Tensor Completion for Estimating Missing Values in | visual | Data |
Tensor error correction for corrupted values in | visual | data |
Tensorize, Factorize and Regularize: Robust | visual | Relationship Learning |
Tenth | visual | Object Tracking VOT2022 Challenge Results, The |
Test-Time Model Adaptation for | visual | Question Answering With Debiased Self-Supervisions |
Text Detection and Recognition on Traffic Panels From Street-Level Imagery Using | visual | Appearance |
Text to | visual | synthesis with appearance models |
text-based | visual | context modulation neural model for multimodal machine translation, A |
Text-guided | visual | representation learning for medical image retrieval systems |
Text-instance graph: Exploring the relational semantics for text-based | visual | question answering |
Text- | visual | Prompting for Efficient 2D Temporal Video Grounding |
TextManiA: Enriching | visual | Feature by Text-driven Manifold Augmentation |
TextPlace: | visual | Place Recognition and Topological Localization Through Reading Scene Texts |
TextSLAM: | visual | SLAM With Semantic Planar Text Features |
Textual Enhanced Adaptive Meta-Fusion for Few-Shot | visual | Recognition |
Textual | visual | Semantic Dataset for Text Spotting |
Textual- | visual | Reference-Aware Attention Network for Visual Dialog |
Textual- | visual | Reference-Aware Attention Network for Visual Dialog |
Textural Features Corresponding to | visual | Perception |
Texture Analysis and Classification Using a Human | visual | Model |
Texture Analysis by Bag-Of- | visual | -Words of Complex Networks |
Texture Edge-Detection by Modeling | visual | Cortical Channels |
Texture feature extraction via | visual | cortical channel modelling |
TGIF-QA: Toward Spatio-Temporal Reasoning in | visual | Question Answering |
Theme-Aware | visual | Attribute Reasoning for Image Aesthetics Assessment |
Theory and Practice of Coplanar Shadowgram Imaging for Acquiring | visual | Hulls of Intricate Objects, The |
Theory of the | visual | -Motion Coding in the Primary Visual-Cortex, A |
Theory of the | visual | -Motion Coding in the Primary Visual-Cortex, A |
Thermal and | visual | Information Fusion for Outdoor Scene Perception |
Thermal Infrared Video Benchmark for | visual | Analysis, A |
Thermal Infrared | visual | Object Tracking VOT-TIR2015 Challenge Results, The |
Thermal Infrared | visual | Object Tracking VOT-TIR2016 Challenge Results, The |
Thermal to | visual | Person Re-Identification Using Collaborative Metric Learning Based on Maximum Margin Matrix Factorization |
Thermo- | visual | feature fusion for object tracking using multiple spatiogram trackers |
Thinking Fast and Slow: Efficient Text-to- | visual | Retrieval with Transformers |
Thinning Noisy Binary Patterns Using Human | visual | Symmetry |
Thirteen Hard Cases in | visual | Tracking |
Three Guidelines of Online Learning for Large-Scale | visual | Recognition |
Three Processing Characteristics of | visual | Texture Segmentation |
Three-Color | visual | Response |
Three-Dimensional Reconstruction by Active Integration of | visual | Cues |
Three-Dimensional Speaker Localization: Audio-Refined | visual | Scaling Factor Estimation |
Three-Point Direct Stereo | visual | Odometry |
Three-systems theory of human | visual | motion perception: Review and Update |
Threshold | visual | Cryptographic Scheme With Meaningful Shares |
Threshold | visual | secret sharing with comprehensive properties based on random grids |
Thresholds of Vision of the Human | visual | System: Visual Adaptation for Monocular and Binocular Vision |
Thresholds of Vision of the Human | visual | System: Visual Adaptation for Monocular and Binocular Vision |
Through Hawks' Eyes: Synthetically Reconstructing the | visual | Field of a Bird in Flight |
TIF: Threshold Interception and Fusion for Compact and Fine-Grained | visual | Attribution |
Tightly integrated sensor fusion for robust | visual | tracking |
Tile-based image | visual | codeword extraction for efficient indexing and retrieval |
Time Shifted IMU Preintegration for Temporal Calibration in Incremental | visual | -Inertial Initialization |
Time Varying Metric Learning for | visual | tracking |
time-frequency convolutional neural network for the offline classification of steady-state | visual | evoked potential responses, A |
Time-to-Collision Estimation from Motion Based on Primate | visual | Processing |
Time-varying delay measurement of video capture-to-display components with application to | visual | servoing |
TimeClassifier: a | visual | analytic system for the classification of multi-dimensional time series data |
TimeCluster: dimension reduction applied to temporal data for | visual | analytics |
Tips and Tricks for | visual | Question Answering: Learnings from the 2017 Challenge |
Token Boosting for Robust Self-Supervised | visual | Transformer Pre-training |
Tone mapping based HDR compression: Does it affect | visual | experience? |
Top Rank Supervised Binary Coding for | visual | Search |
Top-down control of | visual | attention in object detection |
Top-down saliency detection driven by | visual | classification |
Top-Down Segmentation of Non-rigid | visual | Objects Using Derivative-Based Search on Sparse Manifolds |
Top-Down | visual | Attention Estimation Using Spatially Localized Activation Based on Linear Separability of Visual Features |
Top-Down | visual | Attention Estimation Using Spatially Localized Activation Based on Linear Separability of Visual Features |
Top-Down | visual | Attention for Efficient Rendering of Task Related Scenes |
Top-Down | visual | Attention from Analysis by Synthesis |
Top-down | visual | attention integrated particle filter for robust object tracking |
Top-Down | visual | Saliency Guided by Captions |
Top-Down | visual | Saliency via Joint CRF and Dictionary Learning |
Topic Tracking Across Broadcast News Videos with | visual | Duplicates and Semantic Concepts |
Topological Direction-Giving and | visual | Navigation in Large Environments |
Topological Maps for | visual | Navigation |
Topology and Language of Relationships in the | visual | Genome Dataset, The |
Topology of Digital Images: | visual | Pattern Discovery in Proximity Spaces |
Totems: Physical Objects for Verifying | visual | Integrity |
TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in | visual | Street Environments |
Tourist Attraction Recommendation Model Fusing Spatial, Temporal, and | visual | Embeddings for Flickr-Geotagged Photos, A |
Toward a Computational Theory of Early | visual | Processing in Reading |
Toward a higher-level | visual | representation for object-based image retrieval |
Toward a More Representative Monitoring of Land-Use and Land-Cover Dynamics: The Use of a Sample-Based Assessment through Augmented | visual | Interpretation Using Open Foris Collect Earth |
Toward a | visual | Concept Vocabulary for GAN Latent Space |
Toward Automatic Robot Programming: Learning Human Skill from | visual | Data |
Toward evaluation of | visual | navigation algorithms on RGB-D data from the first- and second-generation Kinect |
Toward Explainable 3D Grounded | visual | Question Answering: A New Benchmark and Strong Baseline |
Toward Improving The | visual | Characterization of Sport Activities With Abstracted Scene Graphs |
Toward kinship verification using | visual | attributes |
Toward Learning | visual | Discrimination Strategies |
Toward Multi-Granularity Decision-Making: Explicit | visual | Reasoning with Hierarchical Knowledge |
Toward Occlusion Handling in | visual | Tracking via Probabilistic Finite State Machines |
Toward Robust | visual | Object Tracking With Independent Target-Agnostic Detection and Effective Siamese Cross-Task Interaction |
Toward Sensing Emotions With Deep | visual | Analysis: A Long-Term Psychological Modeling Approach |
Toward Simultaneous | visual | Comfort and Depth Sensation Optimization for Stereoscopic 3-D Experience |
Toward Statistical Modeling of Saccadic Eye-Movement and | visual | Saliency |
Toward Storytelling From | visual | Lifelogging: An Overview |
Toward Unsupervised Realistic | visual | Question Answering |
Toward | visual | Behavior Markers of Suicidal Ideation |
Toward | visual | Distortion in Black-Box Attacks |
Toward | visual | microprocessors |
Toward | visual | Voice Activity Detection for Unconstrained Videos |
Towards a Better Match in Siamese Network Based | visual | Object Tracker |
Towards a Cloud Robotics Platform for Distributed | visual | SLAM |
Towards a Contextualized | visual | Analysis of Heterogeneous Manufacturing Data |
Towards a Hazard Perception Assistance System using | visual | Motion |
Towards a Model of Information Seeking by Integrating | visual | , Semantic and Memory Maps |
Towards a more discriminative and semantic | visual | vocabulary |
Towards a Real-time Framework for | visual | Monitoring Tasks |
Towards a Unified Compositional Model for | visual | Pattern Modeling |
Towards a Unified Framework for | visual | Compatibility Prediction |
Towards a Universal and Limited | visual | Vocabulary |
Towards a | visual | Perception System for Pipe Inspection: Monocular Visual Odometry |
Towards a | visual | Perception System for Pipe Inspection: Monocular Visual Odometry |
Towards a | visual | Privacy Advisor: Understanding and Predicting Privacy Risks in Images |
Towards a | visual | Sign Language dataset for home care services |
Towards a | visual | -hull based multi-agent surveillance system |
Towards Accurate | visual | and Natural Language-Based Vehicle Retrieval Systems |
Towards an Active | visual | Observer |
Towards an alternative GPS sensor in dense urban environment from | visual | memory |
Towards an Effective Web-Based Virtual Health Intervention: The Impact of Media Platform, | visual | Framing, and Race on Social Presence and Transportation Ratings |
Towards an End-to-End | visual | -to-Raw-Audio Generation With GAN |
Towards Audio- | visual | On-line Diarization Of Participants In Group Meetings |
Towards Audio- | visual | Saliency Prediction for Omnidirectional Video with Spatial Audio |
Towards Automated Understanding of Student-Tutor Interactions Using | visual | Deictic Gestures |
Towards Automated | visual | Assessment of Progress in Construction Projects |
Towards Automated | visual | Monitoring of Individual Gorillas in the Wild |
Towards automated wide area | visual | surveillance: tracking objects between spatially-separated, uncalibrated views |
Towards Automatic Parsing of Structured | visual | Content through the Use of Synthetic Data |
Towards Automatic | visual | Obstacle Avoidance |
Towards automating | visual | in-field monitoring of crop health |
Towards Benchmarking and Assessing | visual | Naturalness of Physical World Adversarial Attacks |
Towards complex | visual | surveillance algorithms on smart cameras |
Towards Computational Models of the | visual | Aesthetic Appeal of Consumer Videos |
Towards Context-Aware Interaction Recognition for | visual | Relationship Detection |
Towards cross-category knowledge propagation for learning | visual | concepts |
Towards Detection of Bus Driver Fatigue Based on Robust | visual | Analysis of Eye State |
Towards Direct Localization for | visual | Teach and Repeat |
Towards Disturbance-Free | visual | Mobile Manipulation |
Towards Edge-Precise Cloud and Shadow Detection on the GaoFen-1 Dataset: A | visual | , Comprehensive Investigation |
Towards Effective | visual | Representations for Partial-Label Learning |
Towards Efficient and Effective Self-supervised Learning of | visual | Representations |
Towards Efficient Front-End | visual | Sensing for Digital Retina: A Model-Centric Paradigm |
Towards End-to-end Learning of | visual | Inertial Odometry with an EKF |
Towards Estimating Bias in Stereo | visual | Odometry |
Towards Estimating the Upper Bound of | visual | -Speech Recognition: The Visual Lip-Reading Feasibility Database |
Towards Estimating the Upper Bound of | visual | -Speech Recognition: The Visual Lip-Reading Feasibility Database |
Towards estimation of human intent in assistive robotic teleoperation using kinaesthetic and | visual | feedback |
Towards explainable deep | visual | saliency models |
Towards Fairness in | visual | Recognition: Effective Strategies for Bias Mitigation |
Towards Fine-Grained Open Zero-Shot Learning: Inferring Unseen | visual | Features from Attributes |
Towards Fully Autonomous | visual | Navigation |
Towards Generalisable Video Moment Retrieval: | visual | -Dynamic Injection to Image-Text Pre-Training |
Towards Guided Underwater Survey Using Light | visual | Odometry |
Towards Improved Observation Models for | visual | Tracking: Selective Adaptation |
Towards Intercultural Affect Recognition: Audio- | visual | Affect Recognition in the Wild Across Six Cultures |
Towards Knowledge-Aware Video Captioning via Transitive | visual | Relationship Detection |
Towards Language-Guided | visual | Recognition via Dynamic Convolutions |
Towards Learning Robotic Reaching and Pointing: An Uncalibrated | visual | Servoing Approach |
Towards local | visual | modeling for image captioning |
Towards Managing | visual | Pollution: A 3D Isovist and Voxel Approach to Advertisement Billboard Visual Impact Assessment |
Towards Managing | visual | Pollution: A 3D Isovist and Voxel Approach to Advertisement Billboard Visual Impact Assessment |
Towards Modelling of | visual | Saliency in Point Clouds for Immersive Applications |
Towards optimal distortion-based | visual | privacy filters |
Towards plug-and-play | visual | surveillance: learning tracking models |
Towards Practical | visual | Servoing in Robotics |
Towards Privacy-Preserving | visual | Recognition via Adversarial Training: A Pilot Study |
Towards real-time 3-D monocular | visual | tracking of human limbs in unconstrained environments |
Towards Real-Time | visual | Simulation of Water Surfaces |
Towards Real-World | visual | Tracking With Temporal Contexts |
Towards Robust Multi-Cue Integration for | visual | Tracking |
Towards Semantic 3D City Modeling and | visual | Explorations |
Towards semantic embedding in | visual | vocabulary |
Towards Sequence-Level Training for | visual | Tracking |
Towards stratified model-based environmental | visual | perception for humanoid robots |
Towards Training-Free Refinement for Semantic Indexing of | visual | Media |
Towards Unconstrained Pointing Problem of | visual | Question Answering: A Retrieval-based Method |
Towards unsupervised attention object extraction by integrating | visual | attention and object growing |
Towards Unsupervised Discovery of | visual | Categories |
Towards Viewpoint-Invariant | visual | Recognition via Adversarial Training |
Towards | visual | Based Navigation with Power Line Detection |
Towards | visual | Feature Translation |
Towards | visual | Saliency Computation on 3D Graphical Contents for Interactive Visualization |
Towards | visual | Saliency Explanations of Face Verification |
Towards | visual | words to words |
Towards | visual | -Inertial SLAM for Mobile Augmented Reality |
Tracker Fusion for Robustness in | visual | Feature Tracking |
Tracker-Level Fusion for Robust Bayesian | visual | Tracking |
Tracking failure detection by imitating human | visual | perception |
Tracking Gaze and | visual | Focus of Attention of People Involved in Social Interaction |
Tracking in a complex | visual | environment |
Tracking Multiple | visual | Targets via Particle-Based Belief Propagation |
Tracking Nonstationary | visual | Appearances by Data-Driven Adaptation |
Tracking Objects with Adaptive Feature Patches for PTZ Camera | visual | Surveillance |
Tracking of Human Hands and Faces through Probabilistic Fusion of Multiple | visual | Cues |
Tracking of humans and estimation of body/head orientation from top-view single camera for | visual | focus of attention analysis |
Tracking Persons using Particle Filter Fusing | visual | and Wi-Fi Localizations for Widely Distributed Camera |
Tracking Spatially Distributed Features in KLT Algorithms for RGB-D | visual | Odometry |
Tracking techniques for | visual | servoing tasks |
Tracking the Active Speaker Based on a Joint Audio- | visual | Observation Model |
Tracking the | visual | Focus of Attention for a Varying Number of Wandering People |
Tracking | visual | and infrared objects using joint Riemannian manifold appearance and affine shape modeling |
Tracking | visual | Object As An Extended Target |
Tracking-DOSeqSLAM: A dynamic sequence-based | visual | place recognition paradigm |
TracKlinic: Diagnosis of Challenge Factors in | visual | Tracking |
Trading off salience and uncertainty in sampling a | visual | scene |
Traffic Displays for | visual | Flight Indicating Track and Priority Cues |
Traffic Sign Detection Based On Biologically | visual | Mechanism |
Training Hierarchical Feed-Forward | visual | Recognition Models Using Transfer Learning from Pseudo-Tasks |
Training sequential on-line boosting classifier for | visual | tracking |
Training-Free, Lightweight Global Image Descriptor for Long-Term | visual | Place Recognition Toward Autonomous Vehicles, A |
Trajectory Guided Robust | visual | Object Tracking With Selective Remedy |
Trajectory Predictor by Using Recurrent Neural Networks in | visual | Tracking |
Trajectory Scoring Tool for Local Anomaly Detection in Maritime Traffic Using | visual | Analytics, A |
Trajectory Series Analysis based Event Rule Induction for | visual | Surveillance |
Transductive | visual | Verb Sense Disambiguation |
Transfer Function Model of Physiological Mechanisms Underlying Temporal | visual | Discomfort Experienced When Viewing Stereoscopic 3D Images |
Transfer Learning Based | visual | Tracking with Gaussian Processes Regression |
Transfer Learning via Unsupervised Task Discovery for | visual | Question Answering |
Transfer learning-based discriminative correlation filter for | visual | tracking |
Transferable Decoding with | visual | Entities for Zero-Shot Image Captioning |
Transferable | visual | Words: Exploiting the Semantics of Anatomical Patterns for Self-Supervised Learning |
Transferring Vision-Language Models for | visual | Recognition: A Classifier Perspective |
Transferring | visual | Prior for Online Object Tracking |
Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge | visual | Question Answering |
Transformation Driven | visual | Reasoning |
Transformed ROIs for capturing | visual | transformations in videos |
Transformer Meets Tracker: Exploiting Temporal Context for Robust | visual | Tracking |
Transformer Sub-Patch Matching for High-Performance | visual | Object Tracking |
Transformer | visual | Tracker Based on Template Features Corresponding to Foreground Region |
Transformer-based Medical | visual | Question Answering Model, A |
Transformer-based | visual | object tracking via fine-coarse concatenated attention and cross concatenated MLP |
Transition of | visual | Attention Assessment in Stereoscopic Images With Evaluation of Subjective Visual Quality and Discomfort |
Transition of | visual | Attention Assessment in Stereoscopic Images With Evaluation of Subjective Visual Quality and Discomfort |
Transitional Adaptation of Pretrained Models for | visual | Storytelling |
Transitive Closure Based | visual | Words for Point Matching in Video Sequence |
Transitive Invariance for Self-Supervised | visual | Representation Learning |
Translating a | visual | LEGO Manual to a Machine-Executable Plan |
Translating Aerial Images Into Street-map-like Representations For | visual | Self-localization of Uavs |
Translating video into language by enhancing | visual | and language representations |
Translating | visual | Art Into Music |
Translingual | visual | speech synthesis |
Transparency by Design: Closing the Gap Between Performance and Interpretability in | visual | Reasoning |
Transputer-Based Automated | visual | Inspection System for Electronic Devices and PCBs, A |
TranstextNet: Transducing Text for Recognizing Unseen | visual | Relationships |
TransVG++: End-to-End | visual | Grounding With Language Conditioned Vision Transformer |
TransVG: End-to-End | visual | Grounding with Transformers |
TransVLAD: Multi-Scale Attention-Based Global Descriptors for | visual | Geo-Localization |
TRAR: Routing the Attention Spans in Transformer for | visual | Question Answering |
TravelBuddy: Interactive Travel Route Recommendation with a | visual | Scene Interface |
traverse inspection system for high precision | visual | on-loom fabric defect detection, A |
TRBACF: Learning temporal regularized correlation filters for high performance online | visual | object tracking |
Tree-Structured Model of | visual | Appearance Applied to Gaze Tracking, A |
Trends in automated | visual | inspection |
Tri-Tracking: Combining Three Independent Views for Robust | visual | Tracking |
Triangulate geometric constraint combined with | visual | -flow fusion network for accurate 6DoF pose estimation |
Trifocal Tensor-Based Adaptive | visual | Trajectory Tracking Control of Mobile Robots |
Trilinearity in | visual | Recognition by Alignment |
Trinocular | visual | odometry for divergent views with minimal overlap |
Triple attention network for sentimental | visual | question answering |
tritan Waldo would be easier to detect in the periphery than a red/green one: evidence from | visual | search, A |
TRRNET: Tiered Relation Reasoning for Compositional | visual | Question Answering |
TSGB: Target-Selective Gradient Backprop for Probing CNN | visual | Saliency |
TUMindoor: An extensive image and point cloud dataset for | visual | indoor localization and mapping |
Tutor-based learning of | visual | categories using different levels of supervision |
Tutorial on | visual | Servo Control, A |
TV program segmentation using text- | visual | analysis |
TVConv: Efficient Translation Variant Convolution for Layout-aware | visual | Processing |
Two Body Problem: Collaborative | visual | Task Completion |
Two Can Play This Game: | visual | Dialog with Discriminative Question Generation and Answering |
Two Causal Principles for Improving | visual | Dialog |
Two dimensional hashing for | visual | tracking |
Two Efficient Solutions for | visual | Odometry Using Directional Correspondence |
Two features combination with gated recurrent unit for | visual | speech recognition |
Two in One Image Secret Sharing Scheme (TiOISSS) for extended progressive | visual | cryptography using simple modular arithmetic operations |
Two novel real-time local | visual | features for omnidirectional vision |
Two-Dimensional Edge Detection Scheme for General | visual | Processing, A |
Two-Dimensional Mesh-Based | visual | -Object Representation for Interactive Synthetic/Natural Digital Video |
Two-Dimensional Optimal Velocity Model for Unidirectional Pedestrian Flow Based on Pedestrian's | visual | Hindrance Field, A |
Two-Level Adversarial | visual | -Semantic Coupling for Generalized Zero-shot Learning |
Two-Level Bimodal Association for Audio- | visual | Speech Recognition |
Two-Stage Approach for Fine-Grained | visual | Recognition via Confidence Ranking and Fusion, A |
Two-stage approach to extracting | visual | objects from paper documents |
Two-Stage Autoencoder for | visual | Anomaly Detection, A |
Two-stage aware attentional Siamese network for | visual | tracking |
Two-Stage Clustering Based 3D | visual | Saliency Model for Dynamic Scenarios, A |
Two-Stage Dynamic Model for | visual | Tracking, A |
Two-stage Multimodality Fusion for High-performance Text-based | visual | Question Answering |
two-step image stabilisation method for promoting | visual | quality in vision-enabled maritime surveillance systems, A |
Two-Stream Aural- | visual | Affect Analysis in the Wild |
Typicality-Based | visual | Search Reranking |
U-CAM: | visual | Explanation Using Uncertainty Based Class Activation Maps |
UAV anti-collision | visual | detection algorithm |
UAV Cinematography Constraints Imposed by | visual | Target Tracking |
UAV-Based Oblique Photogrammetry for Outdoor Data Acquisition and Offsite | visual | Inspection of Transmission Line |
UAV-based | visual | Remote Sensing for Automated Building Inspection |
UAVM: Towards Unifying Audio and | visual | Models |
UCT: Learning Unified Convolutional Networks for Real-Time | visual | Tracking |
UG^2: a Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic | visual | Recognition |
UHDTV Image Format for Better | visual | Experience |
Ultimate SLAM? Combining Events, Images, and IMU for Robust | visual | SLAM in HDR and High Speed Scenarios |
Ultra wide band audio | visual | PHY IEEE 802.15.3c for SPIHT-compressed image transmission |
Ultra-High Temporal Resolution | visual | Reconstruction From a Fovea-Like Spike Camera via Spiking Neuron Model |
Ultrasonic | visual | Sensor for Three-Dimensional Object Recognition Using Neural Networks, An |
Unbiased | visual | Question Answering by Leveraging Instrumental Variable |
Uncalibrated and Unmodeled Image-Based | visual | Servoing of Robot Manipulators Using Zeroing Neural Networks |
uncalibrated stereo | visual | servo system, An |
Uncalibrated | visual | Servoing |
Uncalibrated | visual | Servoing for a Planar Two Link Rigid-Flexible Manipulator Without Joint-Space-Velocity Measurement |
Uncalibrated | visual | servoing from projective reconstruction of control values |
Uncalibrated | visual | Servoing in 3D Workspace |
Uncalibrated | visual | Tasks via Linear Interaction |
Uncertainty Calibrated Markov Chain Monte Carlo Sampler for | visual | Tracking Based on Multi-shape Posterior |
Uncertainty modeling for efficient | visual | odometry via inertial sensors on mobile devices |
Uncertainty Quantification of Lucas Kanade Feature Track and Application to | visual | Odometry |
Uncertainty Relation for Resolution in Space, Spatial Frequency, and Orientation Optimized by 2D | visual | Cortical Filters |
Understanding and Diagnosing | visual | Tracking Systems |
Understanding and Improving | visual | Prompting: A Label-Mapping Perspective |
Understanding and Recreating | visual | Appearance Under Natural Illumination |
Understanding and | visual | izing Deep Visual Saliency Models |
Understanding Drivers' | visual | and Comprehension Loads in Traffic Violation Hotspots Leveraging Crowd-Based Driving Simulation |
Understanding Images of Graphical User Interfaces: A New Approach to Activity Recognition for | visual | Surveillance |
Understanding Interactions and Guiding | visual | Surveillance by Tracking Attention |
Understanding Knowledge Gaps in | visual | Question Answering: Implications for Gap Identification and Testing |
Understanding Road Scenes Using | visual | Cues and GPS Information |
Understanding Scenery Quality: A | visual | Attention Measure and Its Computational Model |
Understanding the User Perception in | visual | Lifelogging: A Pilot Study in Malaysian Context |
Understanding | visual | behaviour, Special Issue Introduction |
Understanding | visual | dictionaries via Maximum Mutual Information curves |
Understanding VQA for Negative Answers Through | visual | and Linguistic Inference |
Underwater Image Enhancement via Weighted Wavelet | visual | Perception Fusion |
Underwater Video Mosaics as | visual | Navigation Maps |
Unfalsified | visual | Servoing for Simultaneous Object Recognition and Pose Tracking |
Unicode Analogies: An Anti-Objectivist | visual | Reasoning Challenge |
Unification and Integration of | visual | Modules: An Extension of the Marr Paradigm |
Unified Approach for On-Road | visual | Night-Time Vehicle Light Detection, A |
Unified Approach to Facial Affect Analysis: the MAE-Face | visual | Representation, A |
Unified Approach to Model-Based and Model-Free | visual | Servoing, An |
Unified Audio- | visual | Saliency Model for Omnidirectional Videos With Spatial Audio |
Unified Bayesian Framework for Adaptive | visual | Tracking, A |
Unified Color and Contrast Age-Dependent | visual | Content Adaptation, A |
Unified Direct | visual | Tracking of Rigid and Deformable Surfaces Under Generic Illumination Changes in Grayscale and Color Images |
Unified discriminating feature analysis for | visual | category recognition |
Unified Framework for Image Retrieval Using Keyword and | visual | Features, A |
unified framework for local | visual | descriptors evaluation, A |
Unified Framework for Salient Structure Detection by Contour-Guided | visual | Search, A |
Unified Graph-Based Multicue Feature Fusion for Robust | visual | Tracking |
unified image retrieval framework on local | visual | and semantic concept-based feature spaces, A |
Unified Modeling of Nonhomogeneous 3D Objects for Thermal and | visual | Image Synthesis |
Unified Multisensory Perception: Weakly-supervised Audio- | visual | Video Parsing |
Unified Perspective on Computational Techniques for the Measurement of | visual | Motion, A |
Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented | visual | Dialogue |
Unified Rolling Shutter and Motion Blur Model for 3D | visual | Registration, A |
Unified | visual | Information Preservation Framework for Self-supervised Pre-Training in Medical Image Analysis, A |
Unified | visual | Relationship Detection with Vision and Language Models |
Unified | visual | -Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations |
UnifiedTT: | visual | tracking with unified transformer |
Uniform query formalization in mobile | visual | search: From standards to practice |
UniFormer: Unifying Convolution and Self-Attention for | visual | Recognition |
Unifying discriminative | visual | codebook generation with classifier training for object category recognition |
Unifying Temporal Context and Multi-Feature With Update-Pacing oFramework for | visual | Tracking |
Unifying Textual and | visual | Cues for Content-Based Image Retrieval on the World Wide Web |
Unifying | visual | Attribute Learning with Object Recognition in a Multiplicative Framework |
Unifying | visual | Contrastive Learning for Object Recognition from a Graph Perspective |
Unifying | visual | Perception by Dispersible Points Learning |
unique target representation and voting mechanism for | visual | tracking, A |
UniT3D: A Unified Transformer for 3D Dense Captioning and | visual | Grounding |
Uniting Keypoints: Local | visual | Information Fusion for Large-Scale Image Search |
Universal and Adapted Vocabularies for Generic | visual | Categorization |
universal update-pacing framework for | visual | tracking, A |
Universal, Transferable Adversarial Perturbations for | visual | Object Trackers |
University of Southern California, | visual | Processing Laboratory |
University of Surrey | visual | Concept Detection System at ImageCLEF@ICPR: Working Notes, The |
UniVIP: A Unified Framework for Self-Supervised | visual | Pre-training |
Unleashing Text-to-Image Diffusion Models for | visual | Perception |
Unlocking the Emotional World of | visual | Media: An Overview of the Science, Research, and Impact of Understanding Emotion |
Unmanned Aerial Vehicle | visual | Detection and Tracking using Deep Neural Networks: A Performance Benchmark |
Unpaired Image Captioning by Image-Level Weakly-Supervised | visual | Concept Recognition |
Unscented Kalman filter for | visual | curve tracking |
Unshuffling Data for Improved Generalization in | visual | Question Answering |
Unsupervised Accuracy Estimation of Deep | visual | Models using Domain-Adaptive Adversarial Perturbation without Source Samples |
Unsupervised Adversarial | visual | Level Domain Adaptation for Learning Video Object Detectors From Images |
Unsupervised Alignment of News Video and Text Using | visual | Patterns and Textual Concepts |
Unsupervised and optimized thermal image quality enhancement and | visual | surveillance applications |
Unsupervised and Supervised | visual | Codes with Restricted Boltzmann Machines |
Unsupervised Audio- | visual | Lecture Segmentation |
Unsupervised auxiliary | visual | words discovery for large-scale image object retrieval |
Unsupervised Change Detection from Remotely Sensed Images Based on Multi-Scale | visual | Saliency Coarse-to-Fine Fusion |
Unsupervised Collaborative Learning of Keyframe Detection and | visual | Odometry Towards Monocular Deep SLAM |
Unsupervised Cross-Modal Deep-Model Adaptation for Audio- | visual | Re-identification with Wearable Cameras |
Unsupervised deep hashing for large-scale | visual | search |
Unsupervised Deep | visual | -Inertial Odometry with Online Error Correction for RGB-D Imagery |
Unsupervised discovery of | visual | object class hierarchies |
unsupervised domain adaptation approach for cross-domain | visual | classification, An |
Unsupervised Extraction of | visual | Attention Objects in Color Images |
Unsupervised Fast | visual | Localization and Mapping with Slow Features |
Unsupervised Generation of Context-Relevant Training-Sets for | visual | Object Recognition Employing Multilinguality |
Unsupervised image saliency detection with Gestalt-laws guided optimization and | visual | attention based refinement |
Unsupervised image transformation for long wave infrared and | visual | image matching using two channel convolutional autoencoder network |
Unsupervised improvement of | visual | detectors using co-training |
Unsupervised Intuitive Physics from | visual | Observations |
Unsupervised Language Learning for Discovered | visual | Concepts |
Unsupervised Learning of Discriminative Attributes and | visual | Representations |
Unsupervised Learning of Discriminative Relative | visual | Attributes |
Unsupervised Learning of Monocular Depth Estimation and | visual | Odometry with Deep Feature Reconstruction |
Unsupervised Learning of | visual | Odometry Using Direct Motion Modeling |
Unsupervised Learning of | visual | Representations by Solving Jigsaw Puzzles |
Unsupervised Learning of | visual | Representations Using Videos |
Unsupervised Learning of | visual | Structure |
Unsupervised learning of | visual | taxonomies |
Unsupervised Monocular Estimation of Depth and | visual | Odometry Using Attention and Depth-Pose Consistency Loss |
Unsupervised Monocular | visual | Odometry Based on Confidence Evaluation |
Unsupervised online learning of | visual | focus of attention |
Unsupervised Part Learning for | visual | Recognition |
Unsupervised place discovery for | visual | place classification |
Unsupervised Real-Time Unusual Behavior Detection for Biometric-Assisted | visual | Surveillance |
Unsupervised Synthetic Acoustic Image Generation for Audio- | visual | Scene Understanding |
Unsupervised Vision-Language Parsing: Seamlessly Bridging | visual | Scene Graphs with Language Structures via Dependency Relationships |
Unsupervised | visual | alignment with similarity graphs |
Unsupervised | visual | Attention and Invariance for Reinforcement Learning |
Unsupervised | visual | Changepoint Detection Using Maximum Mean Discrepancy |
Unsupervised | visual | Domain Adaptation Using Subspace Alignment |
Unsupervised | visual | Domain Adaptation: A Deep Max-Margin Gaussian Process Approach |
Unsupervised | visual | feature learning with spike-timing-dependent plasticity: How far are we from traditional feature learning approaches? |
Unsupervised | visual | hull extraction in space, time and light domains |
Unsupervised | visual | Object Categorisation via Self-Organisation |
Unsupervised | visual | Object Categorisation with BoF and Spatial Matching |
Unsupervised | visual | Odometry and Action Integration for PointGoal Navigation in Indoor Environment |
Unsupervised | visual | Relationship Inference |
Unsupervised | visual | Representation Learning by Context Prediction |
Unsupervised | visual | Representation Learning by Graph-Based Consistent Constraints |
Unsupervised | visual | Representation Learning by Online Constrained K-Means |
Unsupervised | visual | Representation Learning by Synchronous Momentum Grouping |
Unsupervised | visual | Representation Learning by Tracking Patches in Video |
Unsupervised | visual | Representation Learning via Dual-Level Progressive Similar Instance Selection |
Unsupervised | visual | Representation Learning via Multi-Dimensional Relationship Alignment |
Unsupervised | visual | -Linguistic Reference Resolution in Instructional Videos |
Up-View | visual | -Based Indoor Positioning Method via Deep Learning, An |
Urban Mobility Analysis With Mobile Network Data: A | visual | Analytics Approach |
Urban Position Estimation from One Dimensional | visual | Cues |
Urban | visual | Localization of Block-Wise Monocular Images with Google Street Views |
URIE: Universal Image Enhancement for | visual | Recognition in the Wild |
USB: Ultrashort Binary Descriptor for Fast | visual | Matching and Retrieval |
Use case | visual | Bag-of-Words techniques for camera based identity document classification |
Use of Active Deformable Models in Model-Based Robotic | visual | Servoing, The |
Use of Affective | visual | Information for Summarization of Human-Centric Videos |
use of Audio- | visual | Description Profile in 3D video content description, The |
Use of Grouping in | visual | Object Recognition, The |
Use of Remote Sensing to Quantitatively Assess the | visual | Effect of Urban Landscape-A Case Study of Zhengzhou, China, The |
use of temporal, semantic and | visual | partitioning model for efficient near-duplicate keyframe detection in large scale news corpus, The |
Use of Tencent Street View Imagery for | visual | Perception of Streets |
use of | visual | search for knowledge gathering in image decision support, The |
User Evaluation of Map-Based | visual | Analytic Tools |
User Interaction for | visual | Lifelog Retrieval in a Virtual Environment |
User Programmable | visual | Inspection |
User-Centered | visual | Analytics Approach for Interactive and Explainable Energy Demand Analysis in Prosumer Scenarios |
User-centric QoE model of | visual | perception for mobile videos |
User-Friendly Random-Grid-Based | visual | Secret Sharing |
Users' Assessment of Orthoimage Photometric Quality for | visual | Interpretation of Agricultural Fields |
Using a pictorial dictionary as a high level user interface for | visual | information retrieval |
Using a | visual | Discrimination Model for the Detection of Compression Artifacts in Virtual Pathology Images |
Using Coarse Label Constraint for Fine-Grained | visual | Classification |
Using computational models to study texture representations in the human | visual | system |
Using content-based image retrieval to automatically assess day similarity in | visual | lifelogs |
Using Dense 3D Reconstruction for | visual | Odometry Based on Structure from Motion Techniques |
Using Designed Structure of | visual | Content to Understand Content-Browsing Behavior |
Using Discrimination Graphs to Represent | visual | Interpretations that Are Hypothetical and Ambiguous |
Using Discriminative Motion Context for Online | visual | Object Tracking |
Using Domain Knowledge in Low-Level | visual | Processing to Interpret Handwritten Music: An Experiment |
Using Earth Mover's Distance in the Bag-of- | visual | -Words Model for Mathematical Symbol Retrieval |
Using Expectation-Maximisation to Learn Dynamical Models from | visual | Data |
Using Eye Tracking to Detect the Effects of Clutter on | visual | Search in Real Time |
Using Eye Tracking to Explore the Guidance and Constancy of | visual | Variables in 3D Visualization |
Using High-Level | visual | Information for Color Constancy |
Using Human | visual | System modeling for bio-inspired low level image processing |
Using Illumination Estimated from Silhouettes to Carve Surface Details on | visual | Hull |
Using Image Sequences for Long-Term | visual | Localization |
Using Intermediate Objects to Improve the Efficiency of | visual | -Search |
Using natural class hierarchies in multi-class | visual | classification |
Using Occlusions to Aid Position Estimation for | visual | Motion Capture |
Using region semantics and | visual | context for scene classification |
Using relevance feedback to learn | visual | concepts from image instances |
Using Remote Vision: The Effects of Video Image Frame Rate on | visual | Object Recognition Performance |
Using Scene Graphs for Detecting | visual | Relationships |
Using Segmentation With Multi-Scale Selective Kernel for | visual | Object Tracking |
Using Singular Displacements for Uncalibrated Monocular | visual | Systems |
Using spherical moments for | visual | servoing from a special target with unique projection center cameras |
Using Stacked Sparse Auto-Encoder and Superpixel CRF for Long-Term | visual | Scene Understanding of UGVs |
Using vision based tracking to support real-time graphical instruction for students who have | visual | impairments |
Using | visual | Context and Region Semantics for High-Level Concept Detection |
Using | visual | Dictionary to Associate Semantic Objects in Region-Based Image Retrieval |
Using | visual | Exploratory Data Analysis to Facilitate Collaboration and Hypothesis Generation in Cross-Disciplinary Research |
Using | visual | features based on MPEG-7 and deep learning for movie recommendation |
Using | visual | Features for Anti-Spam Filtering |
Using | visual | Ozone Damage Scores and Spectroscopy to Quantify Soybean Responses to Background Ozone |
Using | visual | texture analysis to classify raw coal components |
Using weighted spatial relationships in retrieval by | visual | contents |
UTC: A Unified Transformer with Inter-Task Contrastive Learning for | visual | Dialog |
Utilising | visual | Attention Cues for Vehicle Detection and Tracking |
utility of MPEG-7 systems in audio- | visual | applications with multiple streams, The |
UU-Net: Reversible Face De-Identification for | visual | Surveillance Video Footage, The |
V-Doc: | visual | questions answers with Documents |
V-FIRST 2.0: Video Event Retrieval with Flexible Textual- | visual | Intermediary for VBS 2023 |
V-SlowFast Network for Efficient | visual | Sound Separation |
V2C: | visual | Voice Cloning |
V3Det: Vast Vocabulary | visual | Detection Dataset |
Vaisl: | visual | -aware Identification of Semantic Locations in Lifelog |
VAIT: A | visual | Analytics System for Metropolitan Transportation |
VALHALLA: | visual | multimodal-conditioned generation CVPR22 |
VALID: A New Practical Audio- | visual | Database, and Comparative Results |
Validating the | visual | Saliency Model |
Validation of Pedestrian Detectors by Classification of | visual | Detection Impairing Factors |
validation study of a fixed-based, medium fidelity driving simulator for human-machine interfaces | visual | distraction testing, A |
Value of | visual | Attention for COVID-19 Classification in CT Scans, The |
VaMoRs-P An Advanced Platform for | visual | Autonomous Road Vehicle Guidance |
VANT-GAN: Adversarial Learning for Discrepancy-Based | visual | Attribution in Medical Imaging |
Variable bit-rate coding based on human | visual | system |
Variable block and multi-level extended | visual | pattern image coding |
Variable Rate ROI Image Compression Optimized for | visual | Quality |
Variance reduction techniques in particle-based | visual | contour tracking |
Variational Autoencoded Regression: High Dimensional Regression of | visual | Data on Complex Manifold |
Variational Bayesian Inference for Audio- | visual | Tracking of Multiple Speakers |
Variational Bayesian inference for forward-backward | visual | tracking in stereo sequences |
Variational Causal Inference Network for Explanatory | visual | Question Answering |
Variational Context: Exploiting | visual | and Textual Context for Grounding Referring Expressions |
Variational inference for | visual | tracking |
Varying similarity metrics in | visual | information retrieval |
VC-VQA: | visual | Calibration Mechanism For Visual Question Answering |
VC-VQA: | visual | Calibration Mechanism For Visual Question Answering |
VCPSS: A two-in-one two-decoding-options image sharing method combining | visual | cryptography (VC) and polynomial-style sharing (PSS) approaches |
VCRNet: | visual | Compensation Restoration Network for No-Reference Image Quality Assessment |
VD-GR: Boosting | visual | Dialog with Cascaded Spatial-Temporal Multi-Modal GRaphs |
VD-PCR: Improving | visual | dialog with pronoun coreference resolution |
vector quantization scheme using prequantizers of human | visual | effects, A |
VEFNet: an Event-RGB Cross Modality Fusion Network for | visual | Place Recognition |
VegFru: A Domain-Specific Dataset for Fine-Grained | visual | Categorization |
Vehicle detection fusing 2D | visual | features |
Vehicle Lane Merge | visual | Benchmark |
Vehicle tracking using a human-vision-based model of | visual | similarity |
VehicleNet: Learning Robust | visual | Representation for Vehicle Re-Identification |
Verifying and Combining Different | visual | Cues into a 3D Model |
Verifying edges for | visual | inspection purposes |
Verriest Lecture: | visual | properties of metameric blacks beyond cone vision, The |
Very low bit-rate audio- | visual | applications |
VFDP: | visual | Analysis of Flight Delay and Propagation on a Geographical Map |
Vi2CLR: Video and Image for | visual | Contrastive Learning of Representation |
VIAL: a unified process for | visual | interactive labeling |
VIASEG: | visual | Information Assisted Lightweight Point Cloud Segmentation |
VIBIKNet: | visual | Bidirectional Kernelized Network for Visual Question Answering |
VIBIKNet: | visual | Bidirectional Kernelized Network for Visual Question Answering |
Vibro: Video Browsing with Semantic and | visual | Image Embeddings |
ViCo: Word Embeddings From | visual | Co-Occurrences |
VICTOR: | visual | incompatibility detection with transformers and fashion-specific contrastive pre-training |
Vid2Seq: Large-Scale Pretraining of a | visual | Language Model for Dense Video Captioning |
Video abstraction based on the | visual | attention model and online clustering |
Video action recognition based on | visual | rhythm representation |
Video Annotation for | visual | Tracking via Selection and Refinement |
Video clip recognition using joint audio- | visual | processing model |
Video co-summarization: Video summarization by | visual | co-occurrence |
Video concept detection by audio- | visual | grouplets |
Video content annotation using | visual | analysis and a large semantic knowledgebase |
Video copy detection using multiple | visual | cues and MPEG-7 descriptors |
Video Decolorization Using | visual | Proximity Coherence Optimization |
Video driven fire spread forecasting (f) using multi-modal LWIR and | visual | flame and smoke data |
Video Fingerprint Based on | visual | Digest and Local Fingerprints, A |
Video Google: Efficient | visual | Search of Videos |
Video Hashing Algorithm With Weighted Matching Based on | visual | Saliency |
Video instance search via spatial fusion of | visual | words and object proposals |
Video Local Pattern based Image Matching for | visual | Mapping |
Video Mirroring and Iconic Gestures: Enhancing Basic Videophones to Provide | visual | Coaching and Visual Control |
Video Mirroring and Iconic Gestures: Enhancing Basic Videophones to Provide | visual | Coaching and Visual Control |
Video Object Motion Segmentation for Intelligent | visual | Surveillance |
Video Object Segmentation-based | visual | Servo Control and Object Depth Estimation on a Mobile Robot |
Video Processing for Human Perceptual | visual | Quality-Oriented Video Coding |
Video quality assessment accounting for temporal | visual | masking of local flicker |
Video quality assessment using a statistical model of human | visual | speed perception |
Video Question Answering Using Clip-Guided | visual | -Text Attention |
Video retargeting: A | visual | -friendly dynamic programming approach |
Video Rewrite: Driving | visual | Speech with Audio |
Video Sequence Interpretation for | visual | Surveillance |
Video summarisation by deep | visual | and categorical diversity |
Video summarisation with | visual | and semantic cues |
Video Summarization with Minimal | visual | Content Redundancies |
Video summary generation by | visual | shielding compressed sensing coding and double-layer affinity propagation |
Video with Ground-Truth for Validation of | visual | Registration, Tracking and Navigation Algorithms |
VideoABC: A Real-World Video Dataset for Abductive | visual | Reasoning |
VideoQ: An Automated Content Based Video Search System Using | visual | Cues |
VideoQ: An Automatic Content-Based Video Search System Using | visual | Cues |
VideoXum: Cross-Modal | visual | and Textural Summarization of Videos |
VIDO: A Robust and Consistent Monocular | visual | -Inertial-Depth Odometry |
View Based | visual | Servoing Using Epipolar Geometry |
View selection for sketch-based 3D model retrieval using | visual | part shape description |
View Synthesis using Convex and | visual | Hulls |
View-based Location and Tracking of Body Parts for | visual | Interaction |
View-Dependent, Scalable Texture Streaming in 3-D QoS With MPEG-4 | visual | Texture Coding |
View: | visual | Information Extraction Widget for improving chart images accessibility |
Viewing Behavior Supported | visual | Saliency Predictor for 360 Degree Videos |
Viewpoint Invariant Dense Matching for | visual | Geolocalization |
Viewpoint Invariant Recovery of | visual | Surfaces from Sparse Data |
Viewpoint Selection for | visual | Search Tasks |
ViewRefer: Grasp the Multi-view Knowledge for 3D | visual | Grounding |
ViLEM: | visual | -Language Error Modeling for Image-Text Retrieval |
VIMO: Simultaneous | visual | Inertial Model-based Odometry and Force Estimation |
VINS-Dimc: A | visual | -Inertial Navigation System for Dynamic Environment Integrating Multiple Constraints |
VinVL: Revisiting | visual | Representations in Vision-Language Models |
Violent Video Recognition Based on Global-Local | visual | and Audio Contrastive Learning |
Violin Timbre Navigator: Real-Time | visual | Feedback of Violin Bowing Based on Audio Analysis and Machine Learning |
ViP-CNN: | visual | Phrase Guided Convolutional Neural Network |
ViP-DeepLab: Learning | visual | Perception with Depth-aware Video Panoptic Segmentation |
ViP3D: End-to-End | visual | Trajectory Prediction via 3D Agent Queries |
ViperGPT: | visual | Inference via Python Execution for Reasoning |
ViPR: | visual | -Odometry-aided Pose Regression for 6DoF Camera Localization |
VIRSBS project: | visual | intelligent recognition for secure banking services, The |
VirTex: Learning | visual | Representations from Textual Annotations |
Virtual audio system customization using | visual | matching of ear parameters |
Virtual exertions: A user interface combining | visual | information, kinesthetics and biofeedback for virtual object manipulation |
Virtual Post-its: | visual | Label Extraction, Attachment, and Tracking for Teleconferencing |
Virtual Talk: A Model-Based Virtual Phone Using a Layered Audio- | visual | Integration |
Virtual view quality assessment based on shift compensation and | visual | masking effect |
Virtual | visual | Hulls: Example-Based 3D Shape Estimation from a Single Silhouette |
Virtual | visual | Hulls: Example-Based 3D Shape Inference from Silhouettes |
Virtual-Goal-Guided RRT for | visual | Servoing of Mobile Robots With FOV Constraint |
Virtualized Reality: Being Mobile in a | visual | Scene |
ViS-HuD: Using | visual | Saliency to Improve Human Detection with Convolutional Neural Networks |
Vis2Rec: A Large-Scale | visual | Dataset for Visit Recommendation |
Visa: An Automatic Aware And | visual | Aids Mechanism For Improving The Correct Use Of Geospatial Data |
visClust: A | visual | clustering algorithm based on orthogonal projections |
VIsCUIT: | visual | Auditor for Bias in CNN Image Classifier |
VisDA: A Synthetic-to-Real Benchmark for | visual | Domain Adaptation |
ViSE: | visual | Search Engine Using Multiple Networked Cameras |
ViSeR: | visual | Self-Regularization |
Visible and infrared image registration in man-made environments employing hybrid | visual | features |
Visible-Infrared Image Fusion Based on Early | visual | Information Processing Mechanisms |
Vision and Text Transformer for Predicting Answerability on | visual | Question Answering |
Vision and | visual | Exploration for the Stanford Mobile Robot |
Vision Permutator: A Permutable MLP-Like Architecture for | visual | Recognition |
Vision Transformers are Parameter-Efficient Audio- | visual | Learners |
Vision-based Indoor Localization Via a | visual | Slam Approach |
Vision-Based Vehicle Localization Using a | visual | Street Map with Embedded SURF Scale |
Vision-Depth Landmarks and Inertial Fusion for Navigation in Degraded | visual | Environments |
Vision: A Computational Investigation into the Human Representation and Processing of | visual | Information |
Vision: Images, Signals and Neural Networks, Models of Neural Processing in | visual | Perception |
VISIT: An Efficient Computational Model of Human | visual | Attention |
VisKE: | visual | knowledge extraction and question answering by visual verification of relation phrases |
VisKE: | visual | knowledge extraction and question answering by visual verification of relation phrases |
VISTA: achieving cumulative VIsion through energy efficient Silhouette recognition of mobile Targets through collAboration of | visual | sensor nodes |
VISTA: | visual | Interpretation System for Technical Applications - Architecture and Use |
VISTO: | visual | storyboard for web video browsing |
VistrongerDet: Stronger | visual | Information for Object Detection in VisDrone Images |
| visual | 3D Modeling from Images |
| visual | 3D Reconstruction and Dynamic Simulation of Fruit Trees for Robotic Manipulation |
| visual | Abductive Reasoning |
| visual | Abstraction of Wildlife Footage using Gaussian Mixture Models |
| visual | abstraction of wildlife footage using gaussian mixture models and the minimum description length criterion |
| visual | Access to Optimization Problems in Strategic Environmental Assessment |
| visual | Acoustic Matching |
| visual | active memory perspective on integrated recognition systems, The |
| visual | Active Search Framework for Geospatial Exploration, A |
| visual | Acuity in Day for Night |
| visual | Adaptation and the Relative Nature of Perception |
| visual | adaptation of scale and imprecision in a noisy world |
| visual | adaptive tracking for monocular omnidirectional camera |
| visual | Aerial Navigation through Adaptive Prediction and Hyper-Space Image Matching |
| visual | aesthetic quality assessment with a regression model |
| visual | aesthetic understanding: Sample-specific aesthetic classification and deep activation map visualization |
| visual | Affordance and Function Understanding: A Survey |
| visual | Algorithms |
| visual | Alignment Constraint for Continuous Sign Language Recognition |
| visual | Ambiguity of a Moving Plane, The |
| visual | Analogies: A Framework for Defining Aspect Categorization |
| visual | Analyses of Music Download History: User Studies |
| visual | Analysis and Geo-localization of Large-Scale Imagery |
| visual | Analysis And Processing of Clusters Structures In Multidimensional Datasets |
| visual | Analysis Approach for Inferring Personal Job and Housing Locations Based on Public Bicycle Data, A |
| visual | Analysis Beyond Semantics |
| visual | Analysis for Nowcasting of Multidimensional Lightning Data |
| visual | Analysis of 3D Data by Isovalue Clustering |
| visual | Analysis of a Set of Function Values |
| visual | Analysis of Behaviour: From Pixels to Semantics |
| visual | Analysis of Eye State and Head Pose for Driver Alertness Monitoring |
| visual | Analysis of High DOF Articulated Objects with Application to Hand Tracking |
| visual | Analysis of Human Movement: A Survey, The |
| visual | Analysis of Humans: Looking at People |
| visual | analysis of image collections |
| visual | Analysis of Inconsistencies In Hydraulic Simulation Data |
| visual | Analysis of Land Use Characteristics Around Urban Rail Transit Stations |
| visual | Analysis of Place Connectedness by Public Transport |
| visual | analysis of retinal changes with optical coherence tomography |
| visual | Analysis of Sketches |
| visual | Analysis of Social Media Data |
| visual | Analysis of TerraSAR-X Backscatter Imagery for Archaeological Prospection |
| visual | Analysis of Texture in the Detection and Recognition of Objects |
| visual | analysis of the relationship between word concepts and geographical locations, A |
| visual | Analysis of the Use of Mixture Covariance Matrices in Face Recognition |
| visual | analysis of urban road traffic |
| visual | Analysis of Vessel Behaviour Based on Trajectory Data: A Case Study of the Yangtze River Estuary |
| visual | Analysis On The Production Line |
| visual | analysis/synthesis feedback loop for accurate face tracking, A |
| visual | analytics and rendering for tunnel crack analysis |
| visual | Analytics approach for considering uncertainty information |
| visual | Analytics Approach for Extracting Spatio-Temporal Urban Mobility Information from Mobile Network Traffic, A |
| visual | Analytics Approach to Exploration of Large Amounts of Movement Data, A |
| visual | analytics for built-up area understanding from metric resolution Earth observation data |
| visual | Analytics Infrastructures: From Data Management to Exploration |
| visual | Analytics of Mobility and Transportation: State of the Art and Further Research Directions |
| visual | Analytics of Movement |
| visual | Analytics of Political Networks From Face-Tracking of News Video |
| visual | Analytics Support for Intelligence Analysis |
| visual | Analytics Using Graph Sampling and Summarization on Multitouch Displays |
| visual | Analytics Web Platform for Detecting High Wind Energy Potential in Urban Environments by Employing OGC Standards |
| visual | Analytics Web Platform for Detecting High Wind Energy Potential in Urban Environments by Employing OGC Standards, A |
| visual | Analytics: Seeking the Unknown |
| visual | and Cognitive Interpretation of Heterogeneous Data |
| visual | and Conceptual Hierarchy: A Paradigm for Studies of Automated Generation of Recognition Strategies |
| visual | and Contextual Modeling for the Detection of Repeated Mild Traumatic Brain Injury |
| visual | and Human-Interpretable Feedback for Assisting Physical Activity |
| visual | and multimodal analysis of human spontaneous behaviour: Introduction to the Special Issue |
| visual | and Quantitative Comparison of Real and Simulated Biomedical Image Data |
| visual | and Quantitative Evaluation of Selected Image Combination Schemes in Ultrasound Spatial Compound Scanning |
| visual | and Semantic Knowledge Transfer for Large Scale Semi-Supervised Object Detection |
| visual | and semantic similarity in ImageNet |
| visual | and Spatial Context Fusion for Implicit Human Reconstruction |
| visual | And Statistical Analysis Of Digital Elevation Models Generated Using Idw Interpolator With Varying Powers |
| visual | and Textual Deep Feature Fusion for Document Image Classification |
| visual | and textual explainability for a biometric verification system based on piecewise facial attribute analysis |
| visual | and Textual Information Fusion Method for Chart Recognition |
| visual | and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond |
| visual | and Textual Sentiment Analysis of Brand-Related Social Media Pictures Using Deep Convolutional Neural Networks |
| visual | and Textual Sentiment Analysis of Daily News Social Media Images by Deep Learning |
| visual | and textual sentiment analysis using deep fusion convolutional neural networks |
| visual | animal biometrics: survey |
| visual | Anomaly and Novelty Detection |
| visual | Anomaly Detection via Partition Memory Bank Module and Error Estimation |
| visual | appearance based document classification methods: Performance evaluation and benchmarking |
| visual | appearance based document image classification |
| visual | appearance based person retrieval in unconstrained environment videos |
| visual | Appearance of Matte Surfaces |
| visual | approach for driver inattention detection, A |
| visual | approach for video geocoding using bag-of-scenes, A |
| visual | Approach to Measure Cloth-Body and Cloth-Cloth Friction, A |
| visual | Aspect: A Unified Content-Based Collaborative Filtering Model for Visual Document Recommendation |
| visual | Aspect: A Unified Content-Based Collaborative Filtering Model for Visual Document Recommendation |
| visual | Atoms: Pre-Training Vision Transformers with Sinusoidal Waves |
| visual | Attention Accelerated Vehicle Detection in Low-Altitude Airborne Video of Urban Environment |
| visual | Attention Algorithm Designed for Coupled Oscillator Acceleration, A |
| visual | attention analysis and prediction on human faces with mole |
| visual | Attention and Applications in Multimedia Technologies |
| visual | Attention and Recognition Differences Based on Expertise in a Map Reading and Memorability Study |
| visual | Attention Based Approach to Text Extraction, A |
| visual | attention based detection of signs of anthropogenic activities in satellite imagery |
| visual | Attention Based Image Quality Assessment |
| visual | attention based model for target detection in high resolution remote sensing images |
| visual | attention based on a joint perceptual space of color and brightness for improved video tracking |
| visual | attention based reference free perceptual quality metric, A |
| visual | attention based ROI maps from gaze tracking data |
| visual | attention based small object segmentation in natual images |
| visual | Attention Based Temporally Weighting Method for Video Hashing |
| visual | Attention Consistency for Human Attribute Recognition |
| visual | Attention Consistency Under Image Transforms for Multi-Label Image Classification |
| visual | Attention Control for Nuclear Power Plant Inspection |
| visual | Attention Driven by Auditory Cues |
| visual | attention estimator applied to image subject enhancement and colour and grey level compression, A |
| visual | attention focusing system using an active stereoscopic vision sensor, A |
| visual | attention for content based image retrieval |
| visual | attention for region of interest coding in JPEG 2000 |
| visual | attention guided bit allocation in video compression |
| visual | Attention Guided Multi-Scale Boundary Detection in Natural Images for Contour Grouping |
| visual | attention guided quality assessment of Tone-Mapped images using scene statistics |
| visual | Attention Guided Seed Selection for Color Image Segmentation |
| visual | attention guided video copy detection based on feature points matching with geometric-constraint measurement |
| visual | Attention in Extended Reality and Implications for Aviation Safety |
| visual | Attention in Objective Image Quality Assessment: Based on Eye-Tracking Data |
| visual | Attention in Quality Assessment |
| visual | attention inspired distant view and close-up view classification |
| visual | Attention Mechanisms |
| visual | Attention Model Based on Eye Tracking in 3D Scene Maps, A |
| visual | attention model for dynamic scenes based on motion features, A |
| visual | attention model for stereoscopic 3D images using monocular cues, A |
| visual | attention modeling based on short-term environmental adaption |
| visual | attention modeling for 3D video using neural networks |
| visual | Attention Modeling for Stereoscopic Video: A Benchmark and Computational Model |
| visual | Attention Network for Low-Dose CT |
| visual | attention on human face |
| visual | Attention on the Sphere |
| visual | attention prediction for Autism Spectrum Disorder with hierarchical semantic fusion |
| visual | attention prediction for images with leading line structure |
| visual | Attention Prediction for Stereoscopic Video by Multi-Module Fully Convolutional Network |
| visual | attention quality database for benchmarking performance evaluation metrics |
| visual | attention region determination for H.264 videos |
| visual | Attention Retargeting |
| visual | Attention Saccadic Models Learn to Emulate Gaze Patterns From Childhood to Adulthood |
| visual | Attention Using Game Theory |
| visual | Attention-Aware High Dynamic Range Quantization for HEVC Video Coding |
| visual | attention-based method to address the midas touch problem existing in gesture-based interaction, A |
| visual | Attention-Based Target Detection and Discrimination for High-Resolution SAR Images in Complex Scenes |
| visual | Attention-Driven Hyperspectral Image Classification |
| visual | Attention-Driven Spatial Pooling for Image Memorability |
| visual | Attention-Guided Approach to Monitoring of Medication Dispensing Using Multi-location Feature Saliency Patterns |
| visual | attention: Effects of blur |
| visual | Attribute Extraction Using Human Pose Estimation |
| visual | Autonomy via 2D Matching in Rendered 3D Models |
| visual | Aware Hierarchy Based Food Recognition |
| visual | Bandwidth Selection for Kernel Density Maps |
| visual | Behaviors for Docking |
| visual | Biofeedback and Game Adaptation in Relaxation Skill Transfer |
| visual | Blindspot Monitoring System for Safe Lane Changes, A |
| visual | BMI estimation from face images using a label distribution based method |
| visual | Bootstrapping for Unsupervised Symbol Grounding |
| visual | Camera Re-Localization From RGB and RGB-D Images Using DSAC |
| visual | Camera Re-Localization Using Graph Neural Networks and Relative Pose Supervision |
| visual | camera relocalization using both hand-crafted and learned features |
| visual | Capabilities in an Interactive Autonomous Robot |
| visual | capture and understanding of hand pointing actions in a 3-D environment |
| visual | Categorization of Children and Adult Walking Styles |
| visual | Categorization: How the Monkey Brain Does It |
| visual | Category Filter for Google Images, A |
| visual | category recognition using Spectral Regression and Kernel Discriminant Analysis |
| visual | Centrifuge: Model-Free Layered Video Representations, The |
| visual | change detection on tunnel linings |
| visual | Change Detection Using Multiscale Super Pixel |
| visual | Characterization of Paper Using Isomap and Local Binary Patterns |
| visual | Chirality |
| visual | Chirality Meets Freehand Sketches |
| visual | Classification by a Hierarchy of Extended Fragments |
| visual | Classification With Multikernel Shared Gaussian Process Latent Variable Model |
| visual | Classification With Multitask Joint Sparse Representation |
| visual | Cluster Grounding for Image Captioning |
| visual | cluster validity for prototype generator clustering models |
| visual | Clustering of Trademarks Using a Component-Based Matching Framework |
| visual | Clustering of Trademarks Using the Self-Organizing Map |
| visual | Code-Sentences: A New Video Representation Based on Image Descriptor Sequences |
| visual | Codebooks Survey for Video On-Line Processing |
| visual | Coin-Tracking: Tracking of Planar Double-Sided Objects |
| visual | Collision Avoidance by Segmentation |
| visual | Comfort Amelioration Technique for Stereoscopic Images: Disparity Remapping to Mitigate Global and Local Discomfort Causes |
| visual | comfort assessment for stereoscopic 3D images based on salient discomfort regions |
| visual | comfort assessment metric for stereoscopic images, A |
| visual | comfort assessment of stereoscopic images using deep visual and disparity features based on human attention |
| visual | comfort assessment of stereoscopic images using deep visual and disparity features based on human attention |
| visual | Comfort Enhancement for Stereoscopic Video Based on Binocular Fusion Characteristics |
| visual | comfort measurement for 2D/3D converted stereo video sequence |
| visual | Commonsense R-CNN |
| visual | Commonsense Representation Learning via Causal Inference |
| visual | Communication and Image Processing '91: Image Processing |
| visual | Communication at Very Low Data Rates |
| visual | Communication with UAS: Recognizing Gestures from an Airborne Platform |
| visual | Communication with UAV: Use Cases and Achievements |
| visual | communications |
| visual | Communications and Image Processing |
| visual | Communications and Image Processing '88 |
| visual | Communications and Image Processing '90 |
| visual | Communications and Image Processing '92 |
| visual | Communications and Image Processing '93 |
| visual | Communications and Image Processing '94 |
| visual | Communications and Image Processing '95 |
| visual | Communications and Image Processing '96 |
| visual | Communications and Image Processing '97 |
| visual | Communications and Image Processing '98 |
| visual | Communications and Image Processing 2001 |
| visual | Communications and Image Processing II |
| visual | Communications and Image Processing IV |
| visual | Communications and Image Processing: |
| visual | comparison based on linear regression model and linear discriminant analysis |
| visual | Comparison Based on Multi-class Classification Model |
| visual | Comparison of Images Using Multiple Kernel Learning for Ranking |
| visual | comparison of JPEG 2000 versus conventional JPEG |
| visual | Comparison of Shape Descriptors Using Multi-Dimensional Scaling, A |
| visual | Compass Based on Point and Line Features for UAV High-Altitude Orientation Estimation, A |
| visual | complexity analysis using deep intermediate-layer features |
| visual | complexity assessment of painting images |
| visual | Compliance: Task-Directed Visual Servo Control |
| visual | Compliance: Task-Directed Visual Servo Control |
| visual | Components of an Automated Inspection Task, The |
| visual | Compositional Learning for Human-object Interaction Detection |
| visual | Computation |
| visual | Computer, The |
| visual | computing education at UCSD |
| visual | Computing for Scattered Electromagnetic Fields |
| visual | computing resources distribution and balancing by multimodal cat swarm optimization |
| visual | Concept Detection and Annotation via Multiple Kernel Learning of Multiple Models |
| visual | Concept Recognition and Localization via Iterative Introspection |
| visual | Concepts for News Story Tracking: Analyzing and Exploiting the NIST TRECVID Video Annotation Experiment |
| visual | congruent ads for image search |
| visual | Conspicuity Index: Spatial Dissimilarity, Distance, and Central Bias |
| visual | content adaptation for low vision users in MPEG-21 framework |
| visual | Content Identification and Search |
| visual | content learning for visualizations memorability classification |
| visual | Content Processing and Representation |
| visual | contents adaptation for color vision deficiency |
| visual | Continual Learning |
| visual | contour tracking based on particle filters |
| visual | contour tracking based on sequential importance sampling/resampling algorithm |
| visual | Contour-based Detection Method for Orthodontic Archwire during Robotic Bending, A |
| visual | control of a multi-robot coupled system: Application to collision avoidance in human-robot interaction |
| visual | Control of Grasping and Manipulation Tasks |
| visual | Control of Wheeled Mobile Robots: Unifying Vision and Control in Generic Approaches |
| visual | Coreference Resolution in Visual Dialog Using Neural Module Networks |
| visual | Coreference Resolution in Visual Dialog Using Neural Module Networks |
| visual | Correlates of Fixation Selection: A Look at the Spatial Frequency Domain |
| visual | Correspondence Grouping via Local Consistent Neighborhood |
| visual | Correspondence Learning and Spatially Attentive Synthesis via Transformer for Exemplar-Based Anime Line Art Colorization |
| visual | correspondence using energy minimization and mutual information |
| visual | Correspondences for Unsupervised Domain Adaptation on Electron Microscopy Images |
| visual | Cortex as a General-Purpose Information-Processing Device |
| visual | Cortex Frontend: Integrating Lines, Edges, Keypoints, and Disparity |
| visual | cortex inspired features for object detection in X-ray images |
| visual | cortex on the GPU: Biologically inspired classifier and feature descriptor for rapid recognition |
| visual | creation of inhabited 3d environments: An ontology-based approach |
| visual | Cross-database Comparison of Metabolic Networks, A |
| visual | Cross-View Metric Localization with Dense Uncertainty Estimates |
| visual | Crowd Surveillance Through a Hydrodynamics Lens |
| visual | Cryptograms of Random Grids for General Access Structures |
| visual | Cryptography and Random Grids Schemes |
| visual | Cryptography and Secret Image Sharing |
| visual | Cryptography Based on Void-And-Cluster Halftoning Technique |
| visual | cryptography for color images |
| visual | cryptography for gray-level images by dithering techniques |
| visual | cryptography scheme by freeform optics based on optimal mass transport |
| visual | cryptography scheme for secret color images with color QR codes |
| visual | Cryptography Scheme with Autostereogram |
| visual | Cryptography Schemes Based in k -Linear Maps |
| visual | Cue Cluster Construction via Information Bottleneck Principle and Kernel Density Estimation |
| visual | cues-based deception detection using two-class neural network |
| visual | Curvature |
| visual | data association for real-time video tracking using genetic and estimation of distribution algorithms |
| visual | Data Compression for Multimedia Applications |
| visual | data denoising with a unified Schatten-p norm and lq norm regularized principal component pursuit |
| visual | Data Encryption for Privacy Enhancement in Surveillance Systems |
| visual | Data Exploration and Analysis III |
| visual | Data Exploration and Analysis IV |
| visual | Data Exploration and Analysis VI |
| visual | Data Exploration Framework for Complex Problem Solving Based on Extended Cognitive Fit Theory, A |
| visual | Data Fusion for Objects Localization by Active Vision |
| visual | Data Fusion: Application to Objects Localization and Exploration |
| visual | data navigators Collaboratories |
| visual | data of facial expressions for automatic pain detection |
| visual | Data Rate Gain for Wavelet Foveated Image Coding |
| visual | debugging for a pyramidal machine |
| visual | Dependency Transformers: Dependency Tree Emerges from Reversed Attention |
| visual | Deprojection: Probabilistic Recovery of Collapsed Dimensions |
| visual | Depth Guided Color Image Rain Streaks Removal Using Sparse Coding |
| visual | Depth Perception Based on Optical Blur |
| visual | description and recognition of mechanical tools with a silhouette-based approach |
| visual | Detail Augmented Mapping for Small Aerial Target Detection |
| visual | Detection and Association Tracking of Dim Small Ship Targets from Optical Image Sequences of Geostationary Satellite Using Multispectral Radiation Characteristics |
| visual | detection and species classification of orchid flowers |
| visual | detection in omnidirectional view sensors |
| visual | detection of 3D obstacles using gated images |
| visual | detection of defects in moulded plastic drippers |
| visual | Detection of Hexagonal Headed Bolts Using Method of Frames and Matching Pursuit |
| visual | Detection of Light Sources |
| visual | detection of lintel-occluded doors from a single image |
| visual | Detection of Motion |
| visual | Detection of Obstacles Assuming a Locally Planar Ground |
| visual | Dialog |
| visual | Dialog |
| visual | Dialog |
| visual | dictionary attack on Picture Passwords, A |
| visual | Dictionary Learning for Joint Object Categorization and Segmentation |
| visual | Digital Forest Model Based on a Remote Sensing Data and Forest Inventory Data |
| visual | Direction Estimation from a Monocular Image |
| visual | Discrimination of Stochastic Texture Fields |
| visual | Discrimination of Stochastic Texture Fields Based upon Their Second Order Statistics |
| visual | Discrimination of Textures with Identical Third-Order Statistics |
| visual | display methods for in computer-animated speech production models |
| visual | Distance Measures for Object Retrieval |
| visual | Distant Supervision for Scene Graph Generation |
| visual | Distortion Assessment With Emphasis on Spatially Transitional Regions |
| visual | Distortion Gauge Based on Discrimination of Noticeable Contrast Changes |
| visual | Distortions in 360° Videos |
| visual | DNA: Representing and Comparing Images Using Distributions of Neuron Activations |
| visual | documentation process of historic building refurbishment Improving energy efficiency by insulating wall cavity |
| visual | domain adaptation based on modified A-distance and sparse filtering |
| visual | Domain Adaptation for Monocular Depth Estimation on Resource-Constrained Hardware |
| visual | domain adaptation using weighted subspace alignment |
| visual | Domain Adaptation: A survey of recent advances |
| visual | Domain Bridge: A source-free domain adaptation for cross-domain few-shot learning |
| visual | Dynamic Scene Understanding Exploiting High-Level Spatio-Temporal Models |
| visual | Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks |
| visual | Echo Analysis |
| visual | echo cancellation in a projector-camera-whiteboard system |
| visual | Effects and Beyond |
| visual | Effects in Computer Games |
| visual | Effects of Turning Point and Travel Direction for Outdoor Navigation Using Head-Mounted Display |
| visual | Embedding Augmentation in Fourier Domain for Deep Metric Learning |
| visual | Embedding of Wavelet Transform Coefficients |
| visual | Encoding of Tilt from Optic Flow: Psychophysics and Computational Modelling |
| visual | enhancement of incised text |
| visual | enhancement of old documents with hyperspectral imaging |
| visual | Enhancement Using Constrained L0 Gradient Image Decomposition for Low Backlight Displays |
| visual | Entropy Gain for Wavelet Image Coding |
| visual | entropy: A new framework for quantifying visual information based on human perception |
| visual | entropy: A new framework for quantifying visual information based on human perception |
| visual | Estimation and Compression of Facial Motion Parameters: Elements of a 3D Model-Based Video Coding System |
| visual | Estimation of 3-D Line Segments from Motion: A Mobile Robot Vision System |
| visual | Estimation of Attentive Cues in HRI: The Case of Torso and Head Pose |
| visual | estimation of pointed targets for robot guidance via fusion of face pose and hand orientation |
| visual | Event Classification via Force Dynamics |
| visual | Event Detection |
| visual | Event Recognition in News Video using Kernel Methods with Multi-Level Temporal Alignment |
| visual | Event Recognition in Videos by Learning from Web Data |
| visual | Event-Based Egocentric Human Action Recognition |
| visual | Events Identification of Solids of Revolution from Perspective Views |
| visual | Evidence Accumulation in Radiograph Inspection |
| visual | Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving |
| visual | Explanation for Deep Metric Learning |
| visual | Explanation Generation Based on Lambda Attention Branch Networks |
| visual | explanation of black-box model: Similarity Difference and Uniqueness (SIDU) method |
| visual | Explanations for Exposing Potential Inconsistency of Deepfakes |
| visual | Explanations via Iterated Integrated Attributions |
| visual | exploration and functional document labeling |
| visual | Exploration of Stream Pattern Changes Using a Data-Driven Framework |
| visual | Exposes You: Pedestrian Trajectory Prediction Meets Visual Intention |
| visual | Exposes You: Pedestrian Trajectory Prediction Meets Visual Intention |
| visual | Exposure of Rock Outcrops in the Context of a Forest Disease Outbreak Simulation Based on a Canopy Height Model and Spectral Information Acquired by an Unmanned Aerial Vehicle |
| visual | Extent of an Object: Suppose We Know the Object Locations, The |
| visual | Extraction of Motion-based Information from Image Sequences |
| visual | Face Recognition Using Bag of Dense Derivative Depth Patterns |
| visual | Face Tracking: A Coarse-to-Fine Target State Estimation |
| visual | fatigue evaluation and enhancement for 2D-plus-depth video |
| visual | Fatigue Prediction for Stereoscopic Image |
| visual | Feature Attribution Using Wasserstein GANs |
| visual | feature coding for image classification integrating dictionary structure |
| visual | feature group matching for autonomous robot localization |
| visual | Feature Tracking with Automatic Motion Model Switching |
| visual | Features Extracting and Selecting for Lipreading |
| visual | Features for Image Quality Assessment with Reduced Reference |
| visual | features of intermdediate complexity and their use in classification |
| visual | features with semantic combination using Bayesian network for a more effective image retrieval |
| visual | feedback for virtual grasping |
| visual | Fidelity Criterion and Modeling |
| visual | filters for face recognition |
| visual | Fixation Patterns when Judging Image Quality: Effects of Distortion Type, Amount, and Subject Experience |
| visual | Flow and Direction of Locomotion |
| visual | Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets |
| visual | Focus of Attention Estimation With Unsupervised Incremental Learning |
| visual | Focus of Attention in Non-calibrated Environments using Gaze Estimation |
| visual | Focus of Attention Recognition in the Ambient Kitchen |
| visual | Focusing and Defocusing: An Essential Part of Pattern Recognition Process |
| visual | Font Pairing |
| visual | Footsteps Planning System for Exoskeleton Robots Under Complex Terrain, The |
| visual | Forecasting by Imitating Dynamics in Natural Sequences |
| visual | Form: Analysis and Recognition |
| visual | Framework for the Definition and Execution of Reverse Engineering Processes, A |
| visual | framing feedback for desktop video conferencing |
| visual | Gaze Estimation by Joint Head and Eye Information |
| visual | Genome |
| visual | Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations |
| visual | Gesture Character String Recognition by Classification-Based Segmentation with Stroke Deletion |
| visual | glare limits of HDR displays in medical imaging |
| visual | Goal-Directed Meta-Imitation Learning |
| visual | Golf Club Tracking for Enhanced Swing Analysis |
| visual | graph analysis for quality assessment of manually labelled documents image database |
| visual | Graph Memory with Unsupervised Representation for Visual Navigation |
| visual | Graph Memory with Unsupervised Representation for Visual Navigation |
| visual | graph mining for graph matching |
| visual | Graphs from Motion (VGfM): Scene Understanding with Object Geometry Reasoning |
| visual | grasp affordances from appearance-based cues |
| visual | Grounding in Video for Unsupervised Word Translation |
| visual | Grounding Via Accumulated Attention |
| visual | Grounding, Grounding Expressions |
| visual | Group Binary Signature for Video Copy Detection |
| visual | grouping and object recognition |
| visual | Growth Tracking for Automated Leaf Stage Monitoring Based on Image Sequence Analysis |
| visual | Guidance for a Spatial Discrepancy Problem of in Encountered-Type Haptic Display |
| visual | Guidance of a Pig Evisceration Robot Using Neural Networks |
| visual | guided navigation for image retrieval |
| visual | Gyroscope for Accurate Orientation Estimation |
| visual | Gyroscope: Combination of Deep Learning Features and Direct Alignment for Panoramic Stabilization |
| visual | Hand Gesture Recognition For Window System Control |
| visual | hand motion capture for guiding a dexterous hand |
| visual | Hand Posture Recognition in Monocular Image Sequences |
| visual | Hand Tracking Using Nonparametric Belief Propagation |
| visual | Haze Removal by a Unified Generative Adversarial Network |
| visual | hierarchical cluster structure: A refined co-association matrix based visual assessment of cluster tendency |
| visual | hierarchical cluster structure: A refined co-association matrix based visual assessment of cluster tendency |
| visual | Homing: Surfing on the Epipoles |
| visual | Horizontal Effect for Image Quality Assessment |
| visual | hull alignment and refinement across time: a 3D reconstruction algorithm combining shape-from-silhouette with stereo |
| visual | hull Concept for Silhouette-Based Image Understanding, The |
| visual | Hull Construction in the Presence of Partial Occlusion |
| visual | Hull Construction Using Adaptive Sampling |
| visual | Hull Construction, Alignment and Refinement Across Time |
| visual | Hull Construction, Alignment and Refinement for Human Kinematic Modeling, Motion Tracking and Rendering |
| visual | Hull Embossment by Graph Cuts |
| visual | Hull from Imprecise Polyhedral Scene |
| visual | Hull of Curved Objects, The |
| visual | Hull of Piecewise Smooth Objects, The |
| visual | Hull of Smooth Curved Objects, The |
| visual | Hull Of Solids Of Revolution, The |
| visual | Hull-Based Geometric Data Compression of a 3-D Object |
| visual | Human-Computer Interactions for Intelligent Vehicles and Intelligent Transportation Systems: The State of the Art and Future Directions |
| visual | Hyperacuity: Representation and Computation of High Precision Position Information |
| visual | Hyperacuity: Spatiotemporal Interpolation in Human Vision |
| visual | identification by signature tracking |
| visual | Identification of People by Computer |
| visual | illumination compensation for face images using light mapping matrix |
| visual | image noise eliminating system |
| visual | Image Processing RAM: Memory Architecture With 2-D Data Location Search and Data Consistency Management for a Multicore Object Recognition Processor |
| visual | Image Processor, A |
| visual | Image Retrieval by Elastic Matching of User Sketches |
| visual | Image Retrieval for Applications in Art and Art History |
| visual | Imaging of Invisible Hazardous Substances Using Bacterial Inspiration |
| visual | Importance and Distortion Guided Deep Image Quality Assessment Framework |
| visual | Importance- and Discomfort Region-Selective Low-Pass Filtering for Reducing Visual Discomfort in Stereoscopic Displays |
| visual | Importance- and Discomfort Region-Selective Low-Pass Filtering for Reducing Visual Discomfort in Stereoscopic Displays |
| visual | importance-based adaptive photon tracing |
| visual | Indexing of Large Scale Train-Borne Video for Rail Condition Perceiving |
| visual | Indoor Localization in Known Environments |
| visual | Inductive Priors for Data-Efficient Deep Learning |
| visual | Informatics: Bridging Research and Practice |
| visual | Information Encryption in Frequency Domain: Risk and Enhancement |
| visual | information exploited hybrid digital-analog scheme for wireless video multicast |
| visual | Information for Firearm Identification by Digital Holography |
| visual | Information Framework for Medical Family Tree Data (Genogram) |
| visual | information from anisotropic transformations |
| visual | information fusion for object-based video image segmentation using unsupervised Bayesian online learning |
| visual | Information Management |
| visual | Information Management System for the Interactive Retrieval of Faces, A |
| visual | information measurement with quality assessment |
| visual | Information Processing |
| visual | information processing apparatus |
| visual | Information Processing II |
| visual | Information Processing III |
| visual | Information Processing IV |
| visual | Information Processing V |
| visual | Information Processing VI |
| visual | Information Processing VII |
| visual | Information Processing: Artificial Intelligence and the Sensorium of Sight |
| visual | Information Processing: The Structure and Creation of Visual Representations |
| visual | Information Processing: The Structure and Creation of Visual Representations |
| visual | Information Retrieval |
| visual | Information Retrieval from Large Distributed Online Repositories |
| visual | Information Retrieval System for Radiology Reports and the Medical Literature, A |
| visual | information retrieval system via content-based approach |
| visual | information retrieval using synthesized imagery |
| visual | Information Retrieval: Future Directions and Grand Challenges |
| visual | Information-Systems: Guest Editors Introduction |
| visual | Inpainting Method Based on the Compressed Domain, A |
| visual | Input Amplification for Inspecting Specular Surfaces |
| visual | Input for Pen-Based Computers |
| visual | Inspection Automation |
| visual | Inspection for Breakage of Micro-milling Cutter |
| visual | Inspection for Fired Ceramic Tile's Surface Defects Using Wavelet Analysis |
| visual | Inspection in the Food Industry |
| visual | Inspection Method Based on Periodic Feature for Wheel Mark Defect on Wafer Backside, A |
| visual | Inspection Method for Subway Tunnel Cracks Based on Multi-Kernel Convolution Cascade Enhancement Learning |
| visual | inspection of a combustion process in a thermoelectric plant |
| visual | Inspection of Machined Metallic High-Precision Surfaces |
| visual | Inspection of Machined Parts |
| visual | Inspection of Metal Surfaces |
| visual | inspection of multivariate volume data based on multi-class noise sampling |
| visual | Inspection of Particle Boards for Quality Assessment |
| visual | inspection of sea bottom structures by an autonomous underwater vehicle |
| visual | inspection of workpiece quality |
| visual | Inspection System Based on Trinarized Broad-Edge and Gray-Scale Hybrid Matching, A |
| visual | Inspection System Design |
| visual | Inspection System for Accurate Positioning of Railway Fastener, A |
| visual | inspection system for the classification of solder joints |
| visual | Inspection Using Linear Features |
| visual | Inspection with Federated Learning |
| visual | Integration and Detection of Discontinuities: The Key Role of Intensity Edges |
| visual | Integration from Multiple Cameras |
| visual | Intelligence for Active and Assisted Living |
| visual | Intention Detection for Wheelchair Motion |
| visual | Interaction Including Biometrics Information for a Socially Assistive Robotic Platform |
| visual | Interaction Perceptual Network for Blind Image Quality Assessment |
| visual | interaction with lifelike characters |
| visual | Interactive Systems for End-User Development: A Model-Based Design Methodology |
| visual | Interestingness Prediction: A Benchmark Framework and Literature Review |
| visual | Interface for Conducting Virtual Orchestra |
| visual | Interface from Uncalibrated Cameras for Unknown Displays |
| visual | interpenetration tradeoffs in whole-hand virtual grasping |
| visual | interpretability analysis of Deep CNNs using an Adaptive Threshold method on Diabetic Retinopathy images |
| visual | Interpretation of Hand Gestures for Human-Computer Interaction: A Review |
| visual | Interpretation of Known Objects in Constrained Scenes |
| visual | interpretation of natural pointing gestures in 3D space for human-robot interaction |
| visual | Interpretation of Surface Contours, The |
| visual | Interpretation of the Motion of Objects in Space |
| visual | islands: intuitive browsing of visual search results |
| visual | islands: intuitive browsing of visual search results |
| visual | item verification for fraud prevention in retail self-checkout |
| visual | keyboard: Real-time feet tracking for the control of musical meta-instruments, The |
| visual | keyword recognition using hidden Markov models |
| visual | keywords labeling in soccer video |
| visual | Kinship Recognition of Families in the Wild |
| visual | Knowledge Tracing |
| visual | Landmark Based 3D Road Course Estimation with Black Box Variational Inference |
| visual | landmark framework for mobile robot navigation, A |
| visual | landmark learning |
| visual | landmark recognition from Internet photo collections: A large-scale evaluation |
| visual | Landmark-Based Localization for MAVs Using Incremental Feature Updates |
| visual | landmarks detection and recognition for mobile robot navigation |
| visual | lane analysis and higher-order tasks: a concise review |
| visual | Language Framework for Plant Modeling Using L-System |
| visual | Language Identification from Facial Landmarks |
| visual | Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images |
| visual | Leader-Following Approach With a T-D-R Framework for Quadruped Robots, A |
| visual | learning and classification of human epithelial type 2 cell images through spontaneous activity patterns |
| visual | Learning And Recognition Of 3-D Objects From Appearance |
| visual | learning and recognition of 3D objects using two-dimensional principal component analysis: A robust and an efficient approach |
| visual | learning and recognition of a probabilistic spatio-temporal model of cyclic human locomotion |
| visual | learning and recognition of sequential data manifolds with applications to human movement analysis |
| visual | Learning by Coevolutionary Feature Synthesis |
| visual | Learning by Imitation With Motor Representations |
| visual | Learning by Integrating Descriptive and Generative Methods |
| visual | Learning for Landmark Recognition |
| visual | Learning from Multiple Views |
| visual | Learning Given Sparse Data of Unknown Complexity |
| visual | Learning in Application of Integration |
| visual | Learning Object Models from Appearance |
| visual | learning of texture descriptors for facial expression recognition in thermal imagery |
| visual | Learning of Weight from Shape Using Support Vector Machines |
| visual | Learning With Limited Labels: Zero-Shot, Few-Shot, Any-Shot, and Cross-Domain Few-Shot Learning |
| visual | line estimation from a single image of two eyes |
| visual | Link Retrieval in a Database of Paintings |
| visual | Lip Activity Detection and Speaker Detection Using Mouth Region Intensities |
| visual | literacy: an overview |
| visual | Localisation and Individual Identification of Holstein Friesian Cattle via Deep Learning |
| visual | Localization and Target Perception Based on Panoptic Segmentation |
| visual | Localization Based on Place Recognition Using Multi-feature Combination (D-lambdaLBP++HOG) |
| visual | Localization by Learning Objects-Of-Interest Dense Match Regression |
| visual | localization by linear combination of image descriptors |
| visual | Localization for Autonomous Driving: Mapping the Accurated Location in the City Maze |
| visual | Localization for Mobile Platforms |
| visual | localization for mobile surveillance |
| visual | Localization in Changing Environments using Place Recognition Techniques |
| visual | Localization of Key Positions for Visually Impaired People |
| visual | Localization of the Tianwen-1 Lander Using Orbital, Descent and Rover Images |
| visual | Localization using Imperfect 3D Models from the Internet |
| visual | Localization Using Sequence Matching Based on Multi-feature Combination |
| visual | Localization Using Sparse Semantic 3D Map |
| visual | location search using symmelets |
| visual | Looming Navigation Cue: A Unified Approach, The |
| visual | loop closing using multi-resolution SIFT grids in metric-topological SLAM |
| visual | Madlibs: Fill in the Blank Description Generation and Question Answering |
| visual | manipulation relationship recognition in object-stacking scenes |
| visual | Map Making for a Mobile Robot |
| visual | map matching and localization using a global feature map |
| visual | Map-Based Localization for Intelligent Vehicles From Multi-View Site Matching |
| visual | Mapping and Multi-modal Localisation for Anywhere AR Authoring |
| visual | Mapping by Robot Rover |
| visual | Maritime Attention Using Multiple Low-Level Features and Naive Bayes Classification |
| visual | marker for precise pose estimation based on a microlens array, A |
| visual | masking and the design of magnetic resonance image acquisition |
| visual | masking phenomena with high dynamic range content |
| visual | Material Traits: Recognizing Per-Pixel Material Context |
| visual | measurement and tracking in laser hybrid welding |
| visual | measurement of pile movements for the foundation work using a high-speed line-scan camera |
| visual | media production |
| visual | Media Retrieval Using Transform-Based Layered Query Scheme |
| visual | Memorability for Robotic Interestingness via Unsupervised Online Learning |
| visual | Memory Maps for Mobile Robots |
| visual | Memory Structure for a Mobile Robot |
| visual | Method of Locating Faults in Printed Circuit Boards |
| visual | metrology with uncalibrated radial distorted images |
| visual | Micro-Pattern Propagation |
| visual | mining of time series using a tubular visualization |
| visual | Model Approach for Parsing Colonoscopy Videos, A |
| visual | model for optimizing the design of image processing algorithms, A |
| visual | Model Weighted Cosine Transform for Image Compression and Quality Assessment, A |
| visual | modeling of knowledge for decision-making, A |
| visual | Modeling with a Hand-Held Camera |
| visual | Modelling |
| visual | Monitoring and Surveillance of Wide-Area Outdoor Scenes |
| visual | Monitoring of Driver and Passenger Control Panel Interactions |
| visual | Monocular Obstacle Avoidance for Small Unmanned Vehicles |
| visual | Motif Discovery via First-Person Vision |
| visual | motion ambiguities of a plane in 2-D FS sonar motion sequences |
| visual | Motion Ambiguity |
| visual | Motion Analysis under Interceptive Behavior |
| visual | motion based behavior learning using hierarchical discriminant regression |
| visual | Motion Capturing for Kinematic Model Estimation of a Humanoid Robot |
| visual | Motion Correspondence by Region-Based Approaches |
| visual | Motion Estimation and Prediction: A Probabilistic Network Model for Temporal Coherence |
| visual | Motion Estimation for Tumbling Satellite Capture |
| visual | motion estimation from image contour tracking |
| visual | motion estimation from point features: unified view |
| visual | Motion Estimation Via Second Order Cone Programming |
| visual | Motion of Curves and Surfaces |
| visual | Motion of Curves and Surfaces, The |
| visual | motion pattern extraction and fusion for collision detection in complex dynamic scenes |
| visual | Motion Perception |
| visual | Movie Analytics |
| visual | multiple-object tracking for unknown clutter rate |
| visual | Multiple-Secret Sharing By Circle Random Grids |
| visual | Music Transcription of Clarinet Video Recordings Trained with Audio-Based Labelled Data |
| visual | Narratives: Large-scale Hierarchical Classification of Art-historical Images |
| visual | Navigation |
| visual | Navigation Aid for the Blind in Dynamic Environments |
| visual | Navigation and Obstacle Avoidance Control for Agricultural Robots via LiDAR and Camera |
| visual | Navigation for a Mobile Robot Using Landmarks |
| visual | Navigation for Mobile Devices |
| visual | Navigation in Perceptual Databases |
| visual | Navigation of an Autonomous Vehicle Using White Line Recognition |
| visual | Navigation of Mobile Robot Using Optical Flow and Visual Potential Field |
| visual | Navigation of Mobile Robot Using Optical Flow and Visual Potential Field |
| visual | Navigation of Uncalibrated Mobile Robots from Uncalibrated Stereo Pointers |
| visual | Navigation Perspective for Category-Level Object Pose Estimation, A |
| visual | Navigation System for Automonous Land Vehicles, A |
| visual | Navigation Using a Single Camera |
| visual | Navigation Using Fast Content-Based Retrieval |
| visual | Navigation Using Projection of Spatial Right-Angle In Indoor Environment |
| visual | Navigation with Spatial Attention |
| visual | Navigation: Constructing and Utilizing Simple Maps of and Indoor Environment |
| visual | Navigation: Flies, Bees, and UGV's |
| visual | Navigation: From Biological Systems to Unmanned Ground Vehicles |
| visual | Network Analysis of Dynamic Metabolic Pathways |
| visual | Neural Classifier, A |
| visual | Neural Network that Learns Perceptual Relationships, A |
| visual | Nonverbal Behavior Analysis: The Path Forward |
| visual | Object Categorization using Distance-Based Discriminant Analysis |
| visual | object classification by robots, using on-line, self-supervised learning |
| visual | Object Clustering via Mixed-Norm Regularization |
| visual | object detection by parts-based modeling using extended histogram of gradients |
| visual | Object Detection Using Cascades of Binary and One-Class Classifiers |
| visual | Object Detection with Deformable Part Models |
| visual | Object Interface Signifier of Museum Application for Large Display |
| visual | Object Localization in Image Collections |
| visual | Object Recognition Through One-Class Learning |
| visual | Object Recognition Using Deformable Models of Vehicles |
| visual | object recognition using local binary patterns and segment-based feature |
| visual | object recognition using probabilistic kernel subspace similarity |
| visual | object retrieval via block-based visual-pattern matching |
| visual | object retrieval via block-based visual-pattern matching |
| visual | object tracking based on adaptive Siamese and motion estimation network |
| visual | Object Tracking Based on Backward Model Validation |
| visual | Object Tracking Based on Combination of Local Description and Global Representation |
| visual | Object Tracking Based on Improved Convolutional Neural Network |
| visual | Object Tracking Based on Local Steering Kernels and Color Histograms |
| visual | Object Tracking Based on Mutual Learning Between Cohort Multiscale Feature-Fusion Networks With Weighted Loss |
| visual | object tracking benchmark for cell motility in time-lapse imaging, A |
| visual | object tracking by correlation filters and online learning |
| visual | Object Tracking by Hierarchical Attention Siamese Network |
| visual | Object Tracking by Structure Complexity Coefficients |
| visual | Object Tracking by Using Ranking Loss |
| visual | Object Tracking Challenge |
| visual | Object Tracking Challenges, VOT |
| visual | Object Tracking for the Extraction of Multiple Interacting Plant Root Systems |
| visual | Object Tracking for Unmanned Aerial Vehicles Based on the Template-Driven Siamese Network |
| visual | Object Tracking in Drone Images with Deep Reinforcement Learning |
| visual | Object Tracking in First Person Vision |
| visual | Object Tracking in Spherical 360° Videos: A Bridging Approach |
| visual | Object Tracking Performance Measures Revisited |
| visual | object tracking using adaptive correlation filters |
| visual | object tracking using sparse context-aware spatio-temporal correlation filter |
| visual | object tracking using spatial Context Information and Global tracking skills |
| visual | object tracking via a manifold regularized discriminative dual dictionary model |
| visual | object tracking via coefficients constrained exclusive group LASSO |
| visual | object tracking via collaborative correlation filters |
| visual | Object Tracking via Guessing and Matching |
| visual | object tracking via iterative ant particle filtering |
| visual | Object Tracking Via Multi-Stream Deep Similarity Learning Networks |
| visual | Object Tracking via One-Class SVM |
| visual | object tracking via online sparse instance learning |
| visual | object tracking via sample-based Adaptive Sparse Representation (AdaSR) |
| visual | object tracking via the local soft cosine similarity |
| visual | Object Tracking VOT2013 Challenge Results, The |
| visual | Object Tracking VOT2014 Challenge Results, The |
| visual | Object Tracking VOT2015 Challenge Results, The |
| visual | Object Tracking VOT2016 Challenge Results, The |
| visual | Object Tracking VOT2017 Challenge Results, The |
| visual | Object Tracking With Discriminative Filters and Siamese Networks: A Survey and Outlook |
| visual | object tracking with multi-scale superpixels and color-feature guided kernelized correlation filters |
| visual | object tracking with online learning on Riemannian manifolds by one-class support vector machines |
| visual | Object Tracking with Online Sample Selection Via LASSO Regularization |
| visual | Object Tracking With Partition Loss Schemes |
| visual | Object Tracking with Pyramid, Random Subspace Features |
| visual | object tracking: A survey |
| visual | Object Tracking: The Initialisation Problem |
| visual | object trapping |
| visual | object-action recognition: Inferring object affordances from human demonstration |
| visual | Object-Oriented Learning Meets Interaction: Discovery, Representations, and Applications |
| visual | objects tracking and identification based on reduced quaternion wavelet transform |
| visual | observation and analysis of animal and insect behavior |
| visual | Observation under Uncertainty as a Discrete Event Process |
| visual | Obstacle Detection for Automatically Guided Vehicles |
| visual | Occlusion and the Interpretation of Ambiguous Pictures |
| visual | odometry |
| visual | Odometry Algorithm Based on Deep Learning |
| visual | Odometry and 3D Point Clouds Under Low-Light Conditions |
| visual | Odometry and Computer Vision Applications Based on Location Clues |
| visual | odometry and map correlation |
| visual | Odometry based on Semantic Supervision |
| visual | odometry based on the Fourier transform using a monocular ground-facing camera |
| visual | Odometry by Multi-frame Feature Integration |
| visual | Odometry Drift Reduction Using SYBA Descriptor and Feature Transformation |
| visual | odometry driven online calibration for monocular LiDAR-camera systems |
| visual | Odometry for Indoor Mobile Robot by Recognizing Local Manhattan Structures |
| visual | Odometry for Non-overlapping Views Using Second-Order Cone Programming |
| visual | Odometry for Pixel Processor Arrays |
| visual | Odometry System Using Multiple Stereo Cameras and Inertial Measurement Unit |
| visual | Odometry Using 3-Dimensional Video Input |
| visual | odometry with a single-camera stereo omnidirectional system |
| visual | Odometry, Distance Measurments from Vision, Motion |
| visual | on-line learning in distributed camera networks |
| visual | Optimization Tools in JPEG 2000 |
| visual | Organization for Figure/Ground Separation |
| visual | Organization of Illusory Surfaces |
| visual | orientation in the sewer: adaptation to the environment |
| visual | Orientation Selectivity Based Structure Description |
| visual | Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning |
| visual | Overlay on OpenStreetMap Data to Support Spatial Exploration of Urban Environments |
| visual | Parsing with Query-Driven Global Graph Attention (QD-GGA): Preliminary Results for Handwritten Math Formula Recognition |
| visual | Path Prediction in Complex Scenes with Crowded Moving Objects |
| visual | Pattern Analysis in Histopathology Images Using Bag of Features |
| visual | pattern discovery for architecture image classification and product image search |
| visual | Pattern Discrimination |
| visual | pattern image sequence coding |
| visual | Pattern Recognition by Moment Invariants |
| visual | pattern recognition in the years ahead |
| visual | pedestrian recognition in weak classifier space using nonlinear parametric models |
| visual | Perception and Analysis as First Steps Toward Human-Robot Chess Playing |
| visual | perception and local features for foreground-background segmentation |
| visual | Perception Approach for Accurate Segmentation of Light Profiles, A |
| visual | Perception Based Algorithm for Fast Depth Intra Coding of 3D-HEVC |
| visual | perception based Lagrangian rate distortion optimization for video coding |
| visual | perception based on biological filtering of spatial information |
| visual | Perception by a Computer |
| visual | Perception by Computer |
| visual | Perception Driven Registration of Mammograms |
| visual | Perception for Manipulation and Imitation in Humanoid Robots |
| visual | Perception for Navigation in Human Environments: The JackRabbot Human Body Pose Dataset and Benchmark |
| visual | Perception Framework to Analyse Neonatal Pain in Face Images, A |
| visual | Perception in Familiar, Complex Tasks |
| visual | Perception of Affordances and Functional Visual Primitives for Scene Analysis |
| visual | Perception of Affordances and Functional Visual Primitives for Scene Analysis |
| visual | Perception of Biological Motion and a Model for Its Analysis |
| visual | perception of computer-generated stereoscopic pictures: Toward the impact of image resolution |
| visual | perception of obstacles and vehicles for platooning |
| visual | Perception of Property Rights in 3D |
| visual | Perception of Three-Dimensional Motion |
| visual | Perception Ranking of Chess Players |
| visual | perception sensitivity for achromatic noise and chromatic noise, The |
| visual | Perception Through Video Imagery |
| visual | Perception, Theory and Practice |
| visual | perception-based motion planning using road map |
| visual | Permutation Learning |
| visual | Person Understanding Through Multi-task and Multi-dataset Learning |
| visual | Persuasion: Inferring Communicative Intents of Images |
| visual | pertinent 2D-to-3D video conversion by multi-cue fusion |
| visual | Phraselet: Refining Spatial Constraints for Large Scale Image Search |
| visual | Phrases for Exemplar Face Detection |
| visual | Place Categorization in Indoor Environments |
| visual | Place Recognition Using Landmark Distribution Descriptors |
| visual | place recognition with CNNs: From global to partial |
| visual | Place Recognition with Repetitive Structures |
| visual | place recognition: A survey from deep learning perspective |
| visual | planes-based simultaneous localization and model refinement for augmented reality |
| visual | Players Detection and Tracking in Soccer Matches |
| visual | Positioning in Indoor Environments Using RGB-D Images and Improved Vector of Local Aggregated Descriptors |
| visual | Positioning System for Vehicle or Mobile Robot Navigation, A |
| visual | Potential: One Convex Polygon, The |
| visual | Prediction of Driver Behavior in Shared Road Areas |
| visual | Preference Prediction for Enhanced Images on Ultra-High-Definition Display |
| visual | Presence: Viewing Geometry Visual Information of UHD S3D Entertainment |
| visual | Presence: Viewing Geometry Visual Information of UHD S3D Entertainment |
| visual | presentation of information derived from a 3D image system |
| visual | Print Quality Evaluation Using Computational Features |
| visual | privacy behaviour recognition for social robots based on an improved generative adversarial network |
| visual | privacy-preserving level evaluation for multilayer compressed sensing model using contrast and salient structural features |
| visual | Processing for Autonomous Driving |
| visual | Programming Environment Based on Hypergraph Representations |
| visual | Programming Interface for an Image-Processing Environment |
| visual | Programming-based Interactive Analysis of Ancient Documents: The Case of Magical Signs in Jewish Manuscripts |
| visual | Programming: Compositional visual reasoning without training |
| visual | Programming: Compositional visual reasoning without training |
| visual | Prompt Multi-Modal Tracking |
| visual | Prompt Tuning |
| visual | Prompt Tuning for Generative Transfer Learning |
| visual | prosody: facial movements accompanying speech |
| visual | Prosthesis |
| visual | Protection of HEVC Video by Selective Encryption of CABAC Binstrings |
| visual | Psychophysics for Making Face Recognition Algorithms More Explainable |
| visual | Quality Assessment for Perceptually Encrypted Light Field Images |
| visual | Quality Assessment for Projected Content |
| visual | Quality Assessment for Super-Resolved Images: Database and Method |
| visual | quality assessment for web videos |
| visual | Quality Assessment of Urban Scenes with the Contemplative Landscape Model: Evidence from a Compact City Downtown Core |
| visual | Quality Enhancement in DCT-Domain Spatial Downscaling Transcoding Using Generalized DCT Decimation |
| visual | Quality Enhancement in Optoacoustic Tomography Using Active Contour Segmentation Priors |
| visual | Quality Evaluation for Semantic Segmentation: Subjective Assessment Database and Objective Assessment Measure |
| visual | quality evaluation method for telemedicine applications, A |
| visual | Quality Evaluation of Image Object Segmentation: Subjective Assessment and Objective Measure |
| visual | quality improvement techniques of HDPhoto/JPEG-XR |
| visual | quality indices and lowquality images |
| visual | Quality Inspection System Based on a Hierarchical 3D Pose Estimation Algorithm, A |
| visual | quality measures for Characterizing Planar robot grasps |
| visual | quality metric for perceptual video coding |
| visual | quality prediction on distorted stereoscopic images |
| visual | quasi-periodicity |
| visual | Query Answering by Entity-Attribute Graph Matching and Reasoning |
| visual | query compression with locality preserving projection on Grassmann manifold |
| visual | query expansion with or without geometry: Refining local descriptors by feature aggregation |
| visual | Query Specification and Interaction with Industrial Engineering Data |
| visual | Query Systems for Databases: A Survey |
| visual | Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning |
| visual | Querying by Color Perceptive Regions |
| visual | Question Answering as a Meta Learning Task |
| visual | Question Answering as Reading Comprehension |
| visual | question answering from another perspective: CLEVR mental rotation tests |
| visual | question answering in the medical domain based on deep learning approaches: A comprehensive study |
| visual | question answering model based on graph neural network and contextual attention |
| visual | Question Answering Model Based on Visual Relationship Detection |
| visual | Question Answering Model Based on Visual Relationship Detection |
| visual | Question Answering Network Merging High- and Low-Level Semantic Information, A |
| visual | Question Answering on 360° Images |
| visual | Question Answering on Image Sets |
| visual | question answering with attention transfer and a cross-modal gating mechanism |
| visual | Question Answering With Dense Inter- and Intra-Modality Interactions |
| visual | question answering with gated relation-aware auxiliary |
| visual | Question Answering with Memory-Augmented Networks |
| visual | Question Answering with Textual Representations for Images |
| visual | Question Answering, Datasets, Benchmarks, Surveys |
| visual | Question Answering, Query, VQA |
| visual | question answering: A survey of methods and datasets |
| visual | Question Answering: A Tutorial |
| visual | question answering: Datasets, algorithms, and future challenges |
| visual | question answering: Which investigated applications? |
| visual | Question Generation as Dual Task of Visual Question Answering |
| visual | Question Generation as Dual Task of Visual Question Answering |
| visual | Question Generation for Class Acquisition of Unknown Objects |
| visual | Question Generation Under Multi-granularity Cross-Modal Interaction |
| visual | Question Generation: The State of the Art |
| visual | Question Reasoning on General Dependency Tree |
| visual | Rating for Given Deployments of Graphical User Interface Elements Using Shadows Algorithm |
| visual | Re-Acquisition of Geographic Locations |
| visual | re-identification across large, distributed camera networks |
| visual | Re-ranking Through Greedy Selection and Rank Fusion |
| visual | Re-ranking with Natural Language Understanding for Text Spotting |
| visual | Reaction: Learning to Play Catch With Your Drone |
| visual | realism and presence in a virtual reality game |
| visual | Reasoning using Graph Convolutional Networks for Predicting Pedestrian Crossing Intention |
| visual | Reasoning with Multi-hop Feature Modulation |
| visual | Reasoning: From State to Transformation |
| visual | Recognition and Categorization on the Basis of Similarities to Multiple Class Prototypes |
| visual | recognition and detection under bounded computational resources |
| visual | recognition by counting instances: A multi-instance cardinality potential kernel |
| visual | Recognition by Exploiting Latent Social Links in Image Collections |
| visual | recognition by learning from web data: A weakly supervised domain generalization approach |
| visual | Recognition by Request |
| visual | Recognition for Medical Image |
| visual | Recognition from Spatial Correspondence and Perceptual Organization |
| visual | Recognition in RGB Images and Videos by Learning from RGB-D Data |
| visual | Recognition Method Based on Hybrid KPCA Network |
| visual | Recognition of Activities, Gestures, Facial Expressions and Speech: An Introduction and a Perspective |
| visual | Recognition of American Sign Language Using Hidden Markov Models |
| visual | Recognition of Arabic Handwriting: Challenges and New Directions |
| visual | recognition of fastening bolts for railroad maintenance |
| visual | Recognition of Manual Tasks Using Object Motion Trajectories |
| visual | recognition of multi-agent action |
| visual | Recognition of Multi-agent Action Using Binary Temporal Relations |
| visual | recognition of paper analytical device images for detection of falsified pharmaceuticals |
| visual | recognition of pointing gestures for human-robot interaction |
| visual | Recognition of Similar Gestures |
| visual | Recognition of Types of Structural Corridor Landmarks Using Vanishing Points Detection and Hidden Markov Models |
| visual | Recognition to Access and Analyze People Density and Flow Patterns in Indoor Environments |
| visual | Recognition Using Concurrent and Layered Parameter Networks |
| visual | Recognition Using Local Appearance |
| visual | Recognition Using Local Quantized Patterns |
| visual | recognition using mappings that replicate margins |
| visual | Recognition with Humans in the Loop |
| visual | Recognition-Driven Image Restoration for Multiple Degradation with Intrinsic Semantics Recovery |
| visual | Reconstruction |
| visual | Reconstruction and Registration of Curves and Surfaces |
| visual | reconstruction with adaptive and arbitrarily oriented meshes |
| visual | Reconstruction with Discontinuities Using Variational Methods |
| visual | Registration Method for a Low Cost Robot |
| visual | Relation Grounding in Videos |
| visual | Relationship Classification With Negative-Sample Mining |
| visual | Relationship Detection Using Joint Visual-Semantic Embedding |
| visual | Relationship Detection Using Joint Visual-Semantic Embedding |
| visual | Relationship Detection Using Part-and-Sum Transformers with Composite Queries |
| visual | Relationship Detection With A Deep Convolutional Relationship Network |
| visual | Relationship Detection with Internal and External Linguistic Knowledge Distillation |
| visual | Relationship Detection with Language Priors |
| visual | Relationship Detection: A Survey |
| visual | Relationship Embedding Network for Image Paragraph Generation |
| visual | Relationship Prediction via Label Clustering and Incorporation of Depth Information |
| visual | Relationships as Functions: Enabling Few-Shot Scene Graph Prediction |
| visual | Remote Monitoring and Control System for Rod Braking on Hot Rolling Mills |
| visual | Representation and Classification by Learning Group Sparse Deep Stacking Network |
| visual | representation decoding from human brain activity using machine learning: A baseline study |
| visual | Representation in the Determination of Saliency |
| visual | Representation-Guided Framework With Global Affinity for Weakly Supervised Salient Object Detection, A |
| visual | Reranking through Weakly Supervised Multi-graph Learning |
| visual | Reranking: From Objectives to Strategies |
| visual | Retrieval Based on Combination of Histograms of AC Block Patterns and Block Neighborhood |
| visual | Rhythm Detection and Its Applications in Interactive Multimedia |
| visual | Rhythm Prediction with Feature-Aligning Network |
| visual | rhythm-based plankton detection method for ballast water quality assessment |
| visual | rhythm-based time series analysis for phenology studies |
| visual | Road Following Without 3D Reconstruction |
| visual | robot guidance for an insertion task |
| visual | Robotic Object Grasping Through Combining RGB-D Data and 3D Meshes |
| visual | Room Rearrangement |
| visual | Routine for Eye Detection Using Hybrid Genetic Architectures |
| visual | Routines |
| visual | Routines for Autonomous Driving |
| visual | Routines for Real Time Monitoring of Vehicle Behavior |
| visual | Routines: Where bottom-Up and Top-down Processing Meet |
| visual | salience and priority estimation for locomotion using a deep convolutional neural network |
| visual | salience and stack extension based ghost removal for high-dynamic-range imaging |
| visual | Salience Learning via Low Rank Matrix Recovery |
| visual | Salience-Guided Mesh Decomposition |
| visual | Saliency Analysis for Common Region of Interest Detection in Multiple Remote Sensing Images |
| visual | saliency and categorisation of abstract images |
| visual | saliency as sequential eye fixation probability |
| visual | Saliency Based Active Learning for Prostate MRI Segmentation |
| visual | Saliency Based Aerial Video Summarization by Online Scene Classification |
| visual | saliency based global-local feature representation for skin cancer classification |
| visual | Saliency Based Object Tracking |
| visual | Saliency Based on Conditional Entropy |
| visual | saliency based on extended manifold ranking and third-order optimization refinement |
| visual | saliency based on multiscale deep features |
| visual | Saliency Based on Scale-Space Analysis in the Frequency Domain |
| visual | saliency based video hashing algorithm, A |
| visual | saliency by extended quantum cuts |
| visual | Saliency by Keypoints Distribution Analysis |
| visual | Saliency by Selective Contrast |
| visual | Saliency Computation |
| visual | saliency detection based on Bayesian model |
| visual | saliency detection based on homology similarity and an experimental evaluation |
| visual | Saliency Detection Based on Multiscale Deep CNN Features |
| visual | saliency detection based on mutual information in compressed domain |
| visual | saliency detection based on region descriptors and prior knowledge |
| visual | saliency detection by spatially weighted dissimilarity |
| visual | Saliency Detection for RGB-D Images with Generative Model |
| visual | Saliency Detection guided by Neural Signals |
| visual | saliency detection using feature activity weighted decorrelation cues |
| visual | Saliency Detection Using Group Lasso Regularization in Videos of Natural Scenes |
| visual | saliency detection using information divergence |
| visual | Saliency Detection Using Spatiotemporal Decomposition |
| visual | saliency detection using video decomposition |
| visual | Saliency Detection via Kernelized Subspace Ranking With Active Learning |
| visual | saliency detection via rank-sparsity decomposition |
| visual | Saliency Detection via Sparse Residual and Outlier Detection |
| visual | Saliency Detection via Sparsity Pursuit |
| visual | Saliency Detection With Free Energy Theory |
| visual | saliency detection: From space to frequency |
| visual | saliency estimation using support value transform |
| visual | saliency guided complex image retrieval |
| visual | saliency guided mode decision in video compression based on Laplace distribution of DCT coefficients |
| visual | saliency guided textured model simplification |
| visual | saliency guided video compression algorithm |
| visual | Saliency Improves Autonomous Visual Search |
| visual | Saliency Improves Autonomous Visual Search |
| visual | Saliency Map Based on Random Sub-window Means, A |
| visual | Saliency Models Based on Spectrum Processing |
| visual | saliency object detection using sparse learning |
| visual | Saliency Oriented Vehicle Scale Estimation |
| visual | Saliency Prediction Using a Mixture of Deep Neural Networks |
| visual | Saliency Transformer |
| visual | Saliency via Embedding Hierarchical Knowledge in a Deep Neural Network |
| visual | Saliency via Selecting and Reweighting Features in Hierarchical Fusion Network |
| visual | Saliency Weighting and Cross-Domain Manifold Ranking for Sketch-Based Image Retrieval |
| visual | Saliency with Statistical Priors |
| visual | saliency's modulatory effect on just noticeable distortion profile and its application in image watermarking |
| visual | saliency-based confidentiality metric for selective crypto-compressed JPEG images |
| visual | saliency-driven extraction framework of smoothly embedded entities in 3D point clouds of open terrain, A |
| visual | saliency: A manifold way of perception |
| visual | Sampling Processes Revisited: Replicating and Extending Senders (1983) Using Modern Eye-Tracking Equipment |
| visual | Scanpath Prediction Using IOR-ROI Recurrent Mixture Density Network |
| visual | Scene Graphs for Audio Source Separation |
| visual | Search and Visual Lobe Size |
| visual | Search and Visual Lobe Size |
| visual | search for an object in a 3D environment using a mobile robot |
| visual | Search for Normal Color and Dichromatic Observers Using a Unique Distracter Color |
| visual | search guided by an efficient top-down attention approach |
| visual | search in a SMASH system |
| visual | search in natural scenes explained by local color properties |
| visual | Search in Static and Dynamic Scenes Using Fine-Grain Top-Down Visual Attention |
| visual | Search in Static and Dynamic Scenes Using Fine-Grain Top-Down Visual Attention |
| visual | search over billions of aerial and satellite images |
| visual | search reranking via adaptive particle swarm optimization |
| visual | search: psychophysical models and practical applications |
| visual | secret sharing by random grids revisited |
| visual | secret sharing for multiple secrets |
| visual | secret sharing scheme for (k, n) threshold based on QR code with multiple decryptions |
| visual | secret sharing scheme with (n,n) threshold based on WeChat Mini Program codes |
| visual | Secret Sharing Scheme: Improving the Contrast of a Recovered Image Via Different Pixel Expansions |
| visual | Security Evaluation of Perceptually Encrypted Images Based on Image Importance |
| visual | segment tree creation for MPEG-7 Description Schemes |
| visual | self-localisation using automatic topology construction |
| visual | Semantic Complex Network for Web Images |
| visual | Semantic Context Encoding for Aerial Data Introspection and Domain Prediction |
| visual | Semantic Information Pursuit: A Survey |
| visual | Semantic Planning Using Deep Successor Representations |
| visual | Semantic Reasoning for Image-Text Matching |
| visual | Semantic Relatedness Dataset for Image Captioning |
| visual | Semantic Role Labeling for Video Understanding |
| visual | Semantic Search: Retrieving Videos via Complex Textual Queries |
| visual | Semantics: Extracting Visual Information from Text Accompanying Pictures |
| visual | Semantics: Extracting Visual Information from Text Accompanying Pictures |
| visual | Sensing and its Applications: Integration of Laser Sensors to Industrial Robots |
| visual | Sensing for Navigation and Driving |
| visual | Sensing in Electronic Truck Coupling |
| visual | Sensitivities to Small Color Differences in Daylight |
| visual | sensitivity model based stereo image watermarking scheme, A |
| visual | sensitivity-based low-bit-rate image compression algorithm |
| visual | Sensor Systems: Making Them Smaller, Faster, Smarter |
| visual | Sentences for Pose Retrieval Over Low-Resolution Cross-Media Dance Collections |
| visual | Sentiment Analysis by Leveraging Local Regions and Human Faces |
| visual | Sentiment Analysis With Social Relations-Guided Multiattention Networks |
| visual | Sentiment Evaluation |
| visual | Sentiment Prediction Based on Automatic Discovery of Affective Regions |
| visual | Sentiment Prediction Using Cross-Way Few-Shot Learning Based on Knowledge Distillation |
| visual | servo control of electromagnetic actuation for a family of microrobot devices |
| visual | servo control of uncalibrated robot system with dead-zone input |
| visual | Servo Tracking Control of Quadrotor with a Cable Suspended Load |
| visual | Servoing for Automatic and Uncalibrated Needle Placement for Percutaneous Procedures |
| visual | Servoing for Micro-Manipulation |
| visual | Servoing for Online Facilities |
| visual | Servoing for Patient Alignment in Proton-Therapy |
| visual | Servoing from 2-D Image Cues |
| visual | Servoing from Lines |
| visual | Servoing in ISAC, a Decentralized Robot System for Feeding the Disabled |
| visual | Servoing in Presence of Non-Rigid Motion |
| visual | Servoing in Robotics Scheme Using a Camera/Laser-Stripe Sensor |
| visual | Servoing in the Task-Function Framework: A Contour Following Task |
| visual | Servoing Invariant to Changes in Camera Intrinsic Parameters |
| visual | Servoing of a Flexible Aerial Refueling Boom With an Eye-in-Hand Camera |
| visual | Servoing of Constrained Mobile Robots Based on Model Predictive Control |
| visual | Servoing of Flexible-Link Manipulators by Considering Vibration Suppression Without Deformation Measurements |
| visual | Servoing of Legged Robots |
| visual | Servoing of Rigid-Link Flexible-Joint Manipulators in the Presence of Unknown Camera Parameters and Boundary Output |
| visual | Servoing of Wheeled Mobile Robots Under Dynamic Environment |
| visual | Servoing of Wheeled Mobile Robots Without Desired Images |
| visual | Servoing using Correlation Filters |
| visual | Servoing Using Eigenspace Method and Dynamic Calculation of Interaction Matrices |
| visual | servoing using triangulation with an omnidirectional multi-camera system |
| visual | Servoing via Advanced Numerical Methods |
| visual | Servoing with Hand-Eye Manipulator: Optimal-Control Approach |
| visual | Shape Computation |
| visual | Shapes of Silhouette Sets |
| visual | sign information extraction and identification by deformable models for intelligent vehicles |
| visual | Sign Language Recognition |
| visual | Sign Language Recognition Based on HMMs and Auto-regressive HMMs |
| visual | Signal Analysis: Focus on Texture Similarity |
| visual | Signal Reliability for Robust Audio-Visual Speaker Identification, A |
| visual | Signal Reliability for Robust Audio-Visual Speaker Identification, A |
| visual | Signature Verification Using Affine Arc-length |
| visual | Similarity, Judgemental Certainty and Stereo Correspondence |
| visual | simulation of a capsizing ship in stormy weather condition |
| visual | simulation of fire-flakes synchronized with flame |
| visual | simulation of interactive process of stand growth, structure and thinning |
| visual | simulation of turbulent fluids using MLS interpolation profiles |
| visual | simulation technology in formatting forest management plan at unit level based on WF, The |
| visual | Skeleton and Reparative Attention for Part-of-Speech image captioning system |
| visual | SLAM and Structure from Motion in Dynamic Environments: A Survey |
| visual | SLAM for Asteroid Relative Navigation |
| visual | SLAM for Automated Driving: Exploring the Applications of Deep Learning |
| visual | SLAM for Handheld Monocular Endoscope |
| visual | SLAM for robot navigation in healthcare facility |
| visual | SLAM Robust against Dynamic Objects Based on Hybrid Semantic-Geometry Information, A |
| visual | SLAM System for a Hexapod Robot, The |
| visual | SLAM System on Mobile Robot Supporting Localization Services to Visually Impaired People, A |
| visual | SLAM with an Omnidirectional Camera |
| visual | SLAM-based approach for calibration of distributed camera networks, A |
| visual | SLAM: Simultaneous Location and Mapping or Matching |
| visual | SLAM: Why filter? |
| visual | Smoke Detection |
| visual | Soccer Analytics: Understanding the Characteristics of Collective Team Movement Based on Feature-Driven Analysis and Abstraction |
| visual | Social Relationship Recognition |
| visual | Sound Source Separation with Partial Supervision Learning |
| visual | Space Distortion |
| visual | Space Geometry Derived from Occlusion Axioms |
| visual | Space Task Specification, Planning and Control |
| visual | space-time geometry: A tool for perception and the imagination |
| visual | Space: Mathematics, Engineering, and Science |
| visual | spatial-context based wildfire smoke sensor |
| visual | speaker authentication by ensemble learning over static and dynamic lip details |
| visual | speaker authentication with random prompt texts by a dual-task CNN framework |
| visual | Speaker Identification with Spatiotemporal Directional Features |
| visual | Speech Enhancement Without A Real Visual Stream |
| visual | Speech Enhancement Without A Real Visual Stream |
| visual | speech model based on fuzzy-neuro methods, A |
| visual | Speech Recognition by Recurrent Neural Networks |
| visual | Speech Recognition Method Using Translation, Scale and Rotation Invariant Features |
| visual | Speech Recognition Using Dynamic Features And Support Vector Machines |
| visual | Speech Recognition Using Motion Features and Hidden Markov Models |
| visual | Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System |
| visual | Speech Recognition Using Weighted Dynamic Time Warping |
| visual | Speech Recognition with Loosely Synchronized Feature Streams |
| visual | Speech Synthesis by Morphing Visemes |
| visual | Speech Synthesis Using a Variable-Order Switching Shared Gaussian Process Dynamical Model |
| visual | speech, a trajectory in viseme space |
| visual | Speech: A Physiological or Behavioural Biometric? |
| visual | State Recognition for a Target-reaching Task |
| visual | Statistics Cockpits for Information Gathering in the Policy-Making Process |
| visual | stem mapping and Geometric Tense coding for Augmented Visual Vocabulary |
| visual | stem mapping and Geometric Tense coding for Augmented Visual Vocabulary |
| visual | Stereoscopic Computation |
| visual | Strategies for Mobile Robot Navigation |
| visual | stress symptoms from stereoscopic television |
| visual | Structural Assessment and Anomaly Detection for High-Velocity Data Streams |
| visual | Structural Degradation Based Reduced-Reference Image Quality Assessment |
| visual | Structure Constraint for Transductive Zero-Shot Learning in the Wild |
| visual | Style Extraction from Chart Images for Chart Restyling |
| visual | summarization of landmarks via viewpoint modeling |
| visual | SuperTree: similarity-based multi-scale visualization, The |
| visual | Surface Interpolation: A Comparison of Two Methods |
| visual | surface reconstruction and boundary detection using stochastic models |
| visual | Surface Reconstruction Using Sparse Depth Data |
| visual | Surface Segmentation from Stereo |
| visual | Surveillance and Monitoring |
| visual | Surveillance and Monitoring of Human and Vehicular Activity |
| visual | Surveillance and Monitoring System Using an Omnidirectional Video Camera |
| visual | surveillance briefing system: Event-based video retrieval and summarization |
| visual | surveillance by dynamic visual attention method |
| visual | surveillance by dynamic visual attention method |
| visual | Surveillance for Aircraft Activity Monitoring |
| visual | Surveillance for Human Fall Detection in Healthcare IoT |
| visual | Surveillance for Moving Vehicles |
| visual | Surveillance in a Dynamic and Uncertain World |
| visual | Surveillance Monitoring and Watching |
| visual | Surveillance of Activities |
| visual | Surveillance Using Less ROIs of Multiple Non-calibrated Cameras |
| visual | synonyms for landmark image retrieval |
| visual | Synset: Towards a higher-level visual representation |
| visual | Synset: Towards a higher-level visual representation |
| visual | System Based on Artificial Retina for Motion Detection |
| visual | System for Hand Gesture Recognition in Human-Computer Interaction, A |
| visual | system for real time detection of goal events during soccer matches, A |
| visual | system using ray-based image sensors and electronic holography display toward ultra-realistic communication |
| visual | Target Detection and Tracking in UAV EO/IR Videos by Moving Background Subtraction |
| visual | Target Tracking Using a Low-Cost Methodology Based on Visual Words |
| visual | Target Tracking Using a Low-Cost Methodology Based on Visual Words |
| visual | Target Tracking using Improved and Computationally Efficient Particle Filtering |
| visual | Target Tracking with Active Head Rotation |
| visual | Target TRACTOR: Tracker and Detector |
| visual | Tempo Contrastive Learning for Few-Shot Action Recognition |
| visual | terrain mapping for Mars exploration |
| visual | Terrain Matching for a Mars Rover |
| visual | Text Correction |
| visual | Text Recognition Through Contextual Processing |
| visual | Texture: Accurate Material Appearance Measurement, Representation and Modeling |
| visual | textures as realizations of multivariate log-Gaussian Cox processes |
| visual | to Sound: Generating Natural Sound for Videos in the Wild |
| visual | Tools For Crowdsourcing Data Validation Within The Globeland30 Geoportal |
| visual | Tools for ROI Montage in an Image2Video Application |
| visual | Topic Network: Building better image representations for images in social media |
| visual | topological mapping and localisation using colour histograms |
| visual | Tour Based On Panaromic Images For Indoor Places In Campus |
| visual | Tracker Using Sequential Bayesian Learning: Discriminative, Generative, and Hybrid |
| visual | Tracking |
| visual | Tracking Algorithm Using Pixel-Pair Feature |
| visual | Tracking and Control using Lie Algebras |
| visual | Tracking and Depth Estimation of Mobile Robots Without Desired Velocity Information |
| visual | tracking and dynamic learning on the Grassmann manifold with inference from a Bayesian framework and state space models |
| visual | tracking and learning using speeded up robust features |
| visual | Tracking and Motion Determination Using the IMM Algorithm |
| visual | Tracking and Recognition Using Appearance-Adaptive Models in Particle Filters |
| visual | tracking and recognition using probabilistic appearance manifolds |
| visual | tracking and segmentation using Time-of-Flight sensor |
| visual | Tracking and Servoing System for Experiment of Optogenetic Control of Brain Activity |
| visual | Tracking Based on Cooperative Model |
| visual | tracking based on Distribution Fields and online weighted multiple instance learning |
| visual | Tracking Based on Dynamic Coupled Conditional Random Field Model |
| visual | tracking based on edge field with object proposal association |
| visual | tracking based on group sparsity learning |
| visual | Tracking Based on Log-Euclidean Riemannian Sparse Representation |
| visual | tracking based on object appearance and structure preserved local patches matching |
| visual | tracking based on online sparse feature learning |
| visual | tracking based on robust appearance model |
| visual | tracking based on semantic and similarity learning |
| visual | Tracking Based on the Adaptive Color Attention Tuned Sparse Generative Object Model |
| visual | tracking based on weighted subspace reconstruction error |
| visual | Tracking by Affine Kernel Fitting Using Color and Object Boundary |
| visual | Tracking by Combining the Structure-Aware Network and Spatial-Temporal Regression |
| visual | Tracking by Continuous Density Propagation in Sequential Bayesian Filtering Framework |
| visual | tracking by dynamic matching-classification network switching |
| visual | tracking by fusing multiple cues with context-sensitive reliabilities |
| visual | Tracking by Hypothesis Testing |
| visual | Tracking by Means of Deep Reinforcement Learning and an Expert Demonstrator |
| visual | tracking by proto-objects |
| visual | Tracking by Sampling in Part Space |
| visual | Tracking by Sampling Tree-Structured Graphical Models |
| visual | tracking by separability-maximum online boosting |
| visual | Tracking by Structurally Optimizing Pre-Trained CNN |
| visual | tracking by the combination of global detector and local image patch matching |
| visual | Tracking by Tridentalign and Context Embedding |
| visual | tracking decomposition |
| visual | Tracking Extensions for Accurate Target Recovery in Low Frame Rate Videos |
| visual | Tracking for Seamless 3D Interactions in Augmented Reality |
| visual | tracking for the recovery of multiple interacting plant root systems from X-ray CT images |
| visual | Tracking Framework for Intent Recognition in Videos, A |
| visual | tracking in camera-switching outdoor sport videos: Benchmark and baselines for skiing |
| visual | tracking in complex scenes through pixel-wise tri-modeling |
| visual | Tracking in Continuous Appearance Space via Sparse Coding |
| visual | Tracking in High-Dimensional State Space by Appearance-Guided Particle Filtering |
| visual | Tracking in Occlusion Environments by Autonomous Switching of Targets |
| visual | Tracking in the 21st Century |
| visual | Tracking in the Presence of Motion Blur |
| visual | tracking in video sequences based on biologically inspired mechanisms |
| visual | Tracking of a Moving Target by a Camera Mounted on a Robot: A Combination of Control and Vision |
| visual | tracking of deepwater animals using machine learning-controlled robotic underwater vehicles |
| visual | Tracking of Hand Posture with Occlusion Handling |
| visual | tracking of hands, faces and facial features of multiple persons |
| visual | Tracking of High DoF Articulated Structures: An Application to Human Hand Tracking |
| visual | Tracking of Jellyfish in Situ |
| visual | Tracking of Known Three-Dimensional Objects |
| visual | tracking of multiple interacting objects through Rao-Blackwellized Data Association Particle Filtering |
| visual | Tracking of Multiple Objects with Automatic Motion Model Switching |
| visual | tracking of non-rigid objects with partial occlusion through elastic structure of local patches and hierarchical diffusion |
| visual | tracking of numerous targets via multi-Bernoulli filtering of image data |
| visual | tracking of object silhouettes |
| visual | tracking of partially observable targets with suboptimal filtering |
| visual | tracking of resident space objects via an RFS-based multi-Bernoulli track-before-detect method |
| visual | Tracking of Self-Occluding Articulated Objects |
| visual | Tracking of Small Animals in Cluttered Natural Environments Using a Freely Moving Camera |
| visual | Tracking of Solid Objects Based on an Active Contour Model |
| visual | Tracking on Riemannian Space Using Updated Standard Deviation Based Model |
| visual | Tracking System for Sports Video Annotation in Unconstrained Environments, A |
| visual | Tracking Technique Suitable for Control of Convoys, A |
| visual | tracking tracker via object proposals and co-trained kernelized correlation filters |
| visual | Tracking Under Motion Blur |
| visual | Tracking Using a Pixelwise Spatiotemporal Oriented Energy Representation |
| visual | tracking using active appearance models |
| visual | Tracking Using Active Search for Color |
| visual | Tracking Using Attention-Modulated Disintegration and Integration |
| visual | Tracking Using Closed-Worlds |
| visual | tracking using compensated motion model for mobile cameras |
| visual | Tracking Using Depth Data |
| visual | tracking using high-order Monte Carlo Markov chain |
| visual | Tracking Using High-Order Particle Filtering |
| visual | tracking using IPCA and sparse representation |
| visual | tracking using learned linear subspaces |
| visual | tracking using Locality-constrained Linear Coding and saliency map for visible light and infrared image sequences |
| visual | tracking using locality-constrained linear coding under a particle filtering framework |
| visual | Tracking Using Multi-stage Random Simple Features |
| visual | Tracking Using Online Semi-supervised Learning |
| visual | Tracking Using Particle Filters with Gaussian Process Regression |
| visual | Tracking Using Pertinent Patch Selection and Masking |
| visual | Tracking Using Sequential Importance Sampling with a State Partition Technique |
| visual | Tracking Using Sparsity Induced Similarity |
| visual | tracking using spatially weighted likelihood of Gaussian mixtures |
| visual | tracking using spatio-temporally nonlocally regularized correlation filter |
| visual | Tracking Using Strong Classifier and Structural Local Sparse Descriptors |
| visual | tracking using the Earth Mover's Distance between Gaussian mixtures and Kalman filtering |
| visual | tracking using the harmony search algorithm |
| visual | tracking using transformer with a combination of convolution and attention |
| visual | tracking via adaptive multi-task feature learning with calibration and identification |
| visual | Tracking via Adaptive Spatially-Regularized Correlation Filters |
| visual | Tracking Via Adaptive Structural Local Sparse Appearance Model |
| visual | Tracking via Adaptive Tracker Selection with Multiple Features |
| visual | tracking via bag of features |
| visual | tracking via Boolean map representations |
| visual | Tracking via Coarse and Fine Structural Local Sparse Appearance Models |
| visual | Tracking via Constrained Incremental Non-negative Matrix Factorization |
| visual | tracking via context-aware local sparse appearance model |
| visual | Tracking via Discriminative Sparse Similarity Map |
| visual | Tracking via Dynamic Graph Learning |
| visual | Tracking via Dynamic Memory Networks |
| visual | tracking via dynamic weighting with pyramid-redetection based Siamese networks |
| visual | Tracking via Efficient Kernel Discriminant Subspace Learning |
| visual | tracking via ensemble autoencoder |
| visual | tracking via geometric particle filtering on the affine group with optimal importance functions |
| visual | tracking via Graph Regularized Kernel Correlation Filer and Multi-Memory Voting |
| visual | tracking via guided filter |
| visual | tracking via incremental Log-Euclidean Riemannian subspace learning |
| visual | tracking via incremental self-tuning particle filtering on the affine group |
| visual | Tracking via Joint Discriminative Appearance Learning |
| visual | Tracking Via Kernel Sparse Representation With Multikernel Fusion |
| visual | Tracking via Locality Sensitive Histograms |
| visual | Tracking via Locally Structured Gaussian Process Regression |
| visual | tracking via manifold regularized local structured sparse representation model |
| visual | Tracking Via Multi-Layer Factorized Correlation Filter |
| visual | Tracking via Nonlocal Similarity Learning |
| visual | Tracking via Nonnegative Multiple Coding |
| visual | Tracking via Nonnegative Regularization Multiple Locality Coding |
| visual | Tracking via Online Nonnegative Matrix Factorization |
| visual | tracking via orthogonal sparse coding |
| visual | Tracking via Patch-Based Absorbing Markov Chain |
| visual | Tracking via Probabilistic Hypergraph Ranking |
| visual | Tracking via Probability Continuous Outlier Model |
| visual | Tracking via Random Walks on Graph Model |
| visual | Tracking via Saliency Weighted Sparse Coding Appearance Model |
| visual | Tracking Via Siamese Network With Global Similarity |
| visual | Tracking via Sparse and Local Linear Coding |
| visual | Tracking Via Sparse Representation With Reliable Structure Constraint |
| visual | tracking via sparsity pattern learning |
| visual | Tracking via Spatially Aligned Correlation Filters Network |
| visual | tracking via structural patch-based dictionary pair learning |
| visual | Tracking via Structure Constrained Grouping |
| visual | Tracking via Subspace Learning: A Discriminative Approach |
| visual | Tracking via Supervised Similarity Matching |
| visual | Tracking via Temporally Smooth Sparse Coding |
| visual | Tracking Via Temporally-Regularized Context-Aware Correlation Filters |
| visual | tracking via weakly supervised learning from multiple imperfect oracles |
| visual | tracking via weakly supervised learning from multiple imperfect oracles 1 |
| visual | Tracking via Weighted Local Cosine Similarity |
| visual | tracking with a structured local model |
| visual | Tracking With Automatic Confident Region Extraction |
| visual | Tracking with Automatic Motion Model Switching |
| visual | Tracking with Breeding Fireflies using Brightness from Background-Foreground Information |
| visual | Tracking With Convolutional Random Vector Functional Link Network |
| visual | Tracking with Deformation Models |
| visual | Tracking with Dynamic Model Update and Results Fusion |
| visual | Tracking with Fully Convolutional Networks |
| visual | tracking with genetic algorithm augmented logistic regression |
| visual | Tracking With Group Motion Approach |
| visual | tracking with histograms and articulating blocks |
| visual | tracking with multiple Hough detectors |
| visual | Tracking With Multiview Trajectory Prediction |
| visual | tracking with online Multiple Instance Learning |
| visual | tracking with randomly projected ferns |
| visual | tracking with semi-supervised online weighted multiple instance learning |
| visual | tracking with sparse correlation filters |
| visual | Tracking With Spatio-Temporal Dempster-Shafer Information Fusion |
| visual | tracking with structured patch-based model |
| visual | Tracking with Temporal Contextual Attention |
| visual | tracking with tree-structured appearance model for online learning |
| visual | Tracking With Weighted Adaptive Local Sparse Appearance Model via Spatio-Temporal Context Learning |
| visual | Tracking: An Experimental Survey |
| visual | Traffic Knowledge Graph Generation from Scene Images |
| visual | trajectory analysis via Replicated Softmax-based models |
| visual | Trajectory Tracking of Wheeled Mobile Robots With Uncalibrated Camera Extrinsic Parameters |
| visual | Transformation Aided Contrastive Learning for Video-Based Kinship Verification |
| visual | Transformers with Primal Object Queries for Multi-Label Image Classification |
| visual | Transformers: Where Do Transformers Really Belong in Vision Models? |
| visual | Translation Embedding Network for Visual Relation Detection |
| visual | Translation Embedding Network for Visual Relation Detection |
| visual | Translator: Linking Perceptions And Natural-Language Descriptions |
| visual | Tree Convolutional Neural Network in Image Classification |
| visual | Tunnel Analysis for Visibility Prediction and Camera Planning |
| visual | Turing Test for Scene Reconstruction, The |
| visual | Two-Secret Sharing Schemes by Different Superimposition Positions |
| visual | Typo Correction by Collocative Optimization: A Case Study on Merchandize Images |
| visual | UAV Trajectory Plan System Based on Network Map |
| visual | Understanding of Complex Table Structures from Document Images |
| visual | understanding of dynamic hand gestures |
| visual | Understanding of Humans in Crowd Scene and the 1st Look Into Person Challenge |
| visual | Understanding of Subjective Attributes of Data |
| visual | Understanding via Multi-Feature Shared Learning With Global Consistency |
| visual | units and confusion modelling for automatic lip-reading |
| visual | Urban Perception with Deep Semantic-Aware Network |
| visual | User Interface for Map Information Retrieval Based on Semantic Significance, A |
| visual | Vehicle Detection and Tracking Based on the Sign Pattern |
| visual | Venture: investigations with images and videos for middle school education |
| visual | Verification of Hypotheses |
| visual | versus Textual Embedding for Video Retrieval |
| visual | Vibration Tomography: Estimating Interior Material Properties from Monocular Video |
| visual | Vibrometry: Estimating Material Properties from Small Motions in Video |
| visual | video evaluation association modeling based on chaotic pseudo-random multi-layer compressed sensing for visual privacy-protected keyframe extraction |
| visual | video evaluation association modeling based on chaotic pseudo-random multi-layer compressed sensing for visual privacy-protected keyframe extraction |
| visual | Violence Rating with Pairwise Comparison |
| visual | Vocabulary for Flower Classification, A |
| visual | Vocabulary Optimization with Spatial Context for Image Annotation and Classification |
| visual | Vocabulary Signature For 3d Object Retrieval and Partial Matching |
| visual | Vocabulary with a Semantic Twist |
| visual | voice activity detection based on spatiotemporal information and bag of words |
| visual | Voice Activity Detection in the Wild |
| visual | Voice Activity Detection Using Frontal versus Profile Views |
| visual | voice activity detection with optical flow |
| visual | vs internal attention mechanisms in deep neural networks for image classification and object detection |
| visual | wafer dies counting using geometrical characteristics |
| visual | Weather Temperature Prediction |
| visual | websearching using iconic queries |
| visual | Wildlife Monitoring |
| visual | Word Aggregation |
| visual | Word Ambiguity |
| visual | Word Booster: A Spatial Layout of Words Descriptor Exploiting Contour Cues, The |
| visual | word density-based nonlinear shape normalization method for handwritten Chinese character recognition |
| visual | word disambiguation by semantic contexts |
| visual | Word Embedding for Text Classification |
| visual | Word Pairs for Similar Image Search |
| visual | word proximity and linguistics for semantic video indexing and near-duplicate retrieval |
| visual | word spatial arrangement for image retrieval and classification |
| visual | words assignment on a graph via minimal mutual information loss |
| visual | Words Assignment Via Information-Theoretic Manifold Embedding |
| visual | words dictionaries and fusion techniques for searching people through textual and visual attributes |
| visual | words dictionaries and fusion techniques for searching people through textual and visual attributes |
| visual | Words for 3D Reconstruction and Pose Computation |
| visual | words for automated visual inspection of bulk materials |
| visual | words for automated visual inspection of bulk materials |
| visual | Words on Baggage X-Ray Images |
| visual | Workflow Recognition Using a Variational Bayesian Treatment of Multistream Fused Hidden Markov Models |
| visual | -attention GAN for interior sketch colourisation |
| visual | -Attention Model Using Earth Mover's Distance-Based Saliency Measurement and Nonlinear Feature Combination, A |
| visual | -Attention-Based Background Modeling for Detecting Infrequently Moving Objects |
| visual | -Based Driver Distraction Recognition and Detection Using Random Forest, A |
| visual | -Based ID Verification by Signature Tracking |
| visual | -Based Image Retrieval by Block Reallocation Considering Object Region |
| visual | -based Integrated Navigation System Applied to A Simulation Of Lunar Module Landing |
| visual | -Based Person Detection for Search-and-Rescue with UAS: Humans vs. Machine Learning Algorithm |
| visual | -based robotic control without joint velocities |
| visual | -Based Spatiotemporal Analysis for Nighttime Vehicle Braking Event Detection |
| visual | -Concept Search Solved? |
| visual | -Context Boosting for Eye Detection |
| visual | -Depth Matching Network: Deep RGB-D Domain Adaptation With Unequal Categories |
| visual | -Feedback Distortion in a Robotic Rehabilitation Environment |
| visual | -Geometric Scene Reconstruction from Image Streams |
| visual | -hull reconstruction from uncalibrated and unsynchronized video streams |
| visual | -inertial navigation with guaranteed convergence |
| visual | -Inertial Object Detection and Mapping |
| visual | -Inertial Odometry of Smartphone under Manhattan World |
| visual | -Inertial-Semantic Scene Representation for 3D Object Detection |
| visual | -Language Prompt Tuning with Knowledge-Guided Context Optimization |
| visual | -LiDAR Odometry Aided by Reduced IMU |
| visual | -Linguistic Methods for Receipt Field Recognition |
| visual | -Manual Distraction Detection Using Driving Performance Indicators With Naturalistic Driving Data |
| visual | -Model Based Spatial Tracking in the Presence of Occlusions |
| visual | -Motion Fixation Invariant, A |
| visual | -numeric approach to clustering and anomaly detection for trajectory data, A |
| visual | -Patch-Attention-Aware Saliency Detection |
| visual | -PSNR measure of image quality |
| visual | -Quality Guided Global Backlight Dimming for Video Display on Mobile Devices |
| visual | -Quality-Driven Learning for Underwater Vision Enhancement |
| visual | -relation Conscious Image Generation from Structured-text |
| visual | -Semantic Aligned Bidirectional Network for Zero-Shot Learning |
| visual | -Semantic Alignment Across Domains Using a Semi-Supervised Approach |
| visual | -Semantic Matching by Exploring High-Order Attention and Distraction |
| visual | -SLAM for first person vision and mobile robots, A |
| visual | -Tactile Fused Graph Learning for Object Clustering |
| visual | -Tactile Sensing for In-Hand Object Reconstruction |
| visual | -tag reader: image capture by cell phone camera |
| visual | -Textual Attentive Semantic Consistency for Medical Report Generation |
| visual | -Textual Capsule Routing for Text-Based Video Segmentation |
| visual | -Textual Cross-Modal Interaction Network for Radiology Report Generation |
| visual | -Textual Hybrid Sequence Matching for Joint Reasoning |
| visual | -Textual Image Understanding and Retrieval - Joint Workshop on Content-Based Image Retrieval, Video and Image Question Answering, Texture Analysis, Classification and Retrieval |
| visual | -Textual Joint Relevance Learning for Tag-Based Social Image Search |
| visual | -Textual Sentiment Analysis in Product Reviews |
| visual | -Texual Emotion Analysis With Deep Coupled Video and Danmu Neural Networks |
| visual | -to-EEG cross-modal knowledge distillation for continuous emotion recognition |
| visual | -to-speech conversion based on maximum likelihood estimation |
| visual | /Haptic Interface to Virtual Environment (WYSIWYF Display) and Its Application |
| visual | /Inertial/GNSS Integrated Navigation System under GNSS Spoofing Attack |
| visual | 7W visual question answering |
| visual | izing Unstructured Text Sequences Using Iterative Visual Clustering |
| visual | SLAM: Long Term Visual Localization, Visual Odometry and Geometric and Learning-Based SLAM |
| visual | SLAM: Long Term Visual Localization, Visual Odometry and Geometric and Learning-Based SLAM |
| visual | Voice: Audio-Visual Speech Separation with Cross-Modal Consistency |
Vitaa: | visual | -textual Attributes Alignment in Person Search by Natural Language |
VITAL: | visual | Tracking via Adversarial Learning |
VITAMIN-E: | visual | Tracking and MappINg With Extremely Dense Feature Points |
ViTVO: Vision Transformer based | visual | Odometry with Attention Supervision |
VizAssist: an interactive user assistant for | visual | data mining |
VizWiz Grand Challenge: Answering | visual | Questions from Blind People |
VizWiz-FewShot: Locating Objects in Images Taken by People with | visual | Impairments |
VizWiz-Priv: A Dataset for Recognizing the Presence and Purpose of Private | visual | Information in Images Taken by Blind People |
VL-LTR: Learning Class-wise | visual | -Linguistic Representation for Long-Tailed Visual Recognition |
VL-LTR: Learning Class-wise | visual | -Linguistic Representation for Long-Tailed Visual Recognition |
VL-SAT: | visual | -Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud |
VLC-BERT: | visual | Question Answering with Contextualized Commonsense Knowledge |
VLMAH: | visual | -Linguistic Modeling of Action History for Effective Action Anticipation |
VOCUS: A | visual | Attention System for Object Detection and Goal-Directed Search |
VOCVALC: | visual | Odometry and Computer Vision Applications Based on Location Clues - With a Focus on Mobile Platform Applications |
Voice-Bandwidth | visual | Communication Through Logmaps: The Telecortex |
VOLDOR: | visual | Odometry From Log-Logistic Dense Optical Flow Residuals |
VOLO: Vision Outlooker for | visual | Recognition |
VOLTER: | visual | Collaboration and Dual-Stream Fusion for Scene Text Recognition |
Voronoi-based image representation applied to binary | visual | cryptography |
Voting-Based Computational Framework for | visual | Motion Analysis and Interpretation, A |
Voting-based grouping and interpretation of | visual | motion |
VoViT: Low Latency Graph-Based Audio- | visual | Voice Separation Transformer |
Voxel Selection Framework in Multi-Voxel Pattern Analysis of fMRI Data for Prediction of Neural Response to | visual | Stimuli |
VPR-Bench: An Open-Source | visual | Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change |
VQA as a factoid question answering problem: A novel approach for knowledge-aware and explainable | visual | question answering |
VQA, | visual | Question Answering, Neural Networks |
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for | visual | Questions |
VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for | visual | Question Answering |
VQA-LOL: | visual | Question Answering Under the Lens of Logic |
VQA: | visual | Question Answering |
VQA: | visual | Question Answering |
VQA: | visual | Question Answering |
VQACL: A Novel | visual | Question Answering Continual Learning Setting |
VQAMix: Conditional Triplet Mixup for Medical | visual | Question Answering |
VQAPT: A New | visual | question answering model for personality traits in social media images |
VR Alpine Ski Training Augmentation using | visual | Cues of Leading Skier |
VR Interface for Browsing | visual | Spaces at VBS2021, A |
VRDFormer: End-to-End Video | visual | Relation Detection with Transformers |
VReBERT: A Simple and Flexible Transformer for | visual | Relationship Detection |
VS-Net: Voting with Segmentation for | visual | Localization |
Vs-star: A | visual | interpretation system for visual surveillance |
Vs-star: A | visual | interpretation system for visual surveillance |
VSB2-Net: | visual | -Semantic Bi-Branch Network for Zero-Shot Hashing |
VSE-fs: Fast Full-Sample | visual | Semantic Embedding |
VSI: A | visual | Saliency-Induced Index for Perceptual Image Quality Assessment |
VSNR: A Wavelet-Based | visual | Signal-to-Noise Ratio for Natural Images |
VSO: | visual | Semantic Odometry |
VSP-Fuse: Multifocus Image Fusion Model Using the Knowledge Transferred From | visual | Salience Priors |
VSR++: Improving | visual | Semantic Reasoning for Fine-Grained Image-Text Matching |
VSRN: | visual | -Semantic Relation Network for Video Visual Relation Inference |
VSRN: | visual | -Semantic Relation Network for Video Visual Relation Inference |
VSS-Net: | visual | Semantic Self-Mining Network for Video Summarization |
VTT: Long-term | visual | Tracking with Transformers |
W-Tree Indexing for Fast | visual | Word Generation |
WAEF: Weighted Aggregation with Enhancement Filter for | visual | Object Tracking |
Waffling around for Performance: | visual | Classification with Random Words and Broad Concepts |
Wasserstein approximate bayesian computation for | visual | tracking |
Watch and Act: Learning Robotic Manipulation From | visual | Demonstration |
Watch or Listen: Robust Audio- | visual | Speech Recognition with Visual Corruption Modeling and Reliability Scoring |
Watch or Listen: Robust Audio- | visual | Speech Recognition with Visual Corruption Modeling and Reliability Scoring |
Watch to Listen Clearly: | visual | Speech Enhancement Driven Multi-modality Speech Recognition |
Watermark Techniques Based on Human | visual | System, Perceptual Models |
Watermarking Algorithm Based on a Human | visual | Model |
Watermarking using multiple | visual | channels for perceptual color spaces |
Wave-ViT: Unifying Wavelet and Transformers for | visual | Representation Learning |
Wavelets and Human | visual | Perception in Image Compression |
wavelets based de-ringing technique for DCT based compressed | visual | data, A |
WavNet: | visual | saliency detection using Discrete Wavelet Convolutional Neural Network |
We are Humor Beings: Understanding and Predicting | visual | Humor |
Weak label for fast online | visual | tracking |
Weak-structure-aware | visual | object tracking with bottom-up and top-down context exploration |
Weak-supervised | visual | Geo-localization via Attention-based Knowledge Distillation |
Weakly Supervised Audio- | visual | Violence Detection |
Weakly Supervised Coupled Networks for | visual | Sentiment Analysis |
Weakly Supervised Learning of Part-Based Spatial Models for | visual | Object Recognition |
Weakly Supervised Learning of | visual | Models and Its Application to Content-Based Retrieval |
Weakly Supervised Relative Spatial Reasoning for | visual | Question Answering |
Weakly Supervised Scale-Invariant Learning of Models for | visual | Recognition |
Weakly Supervised | visual | Dictionary Learning by Harnessing Image Attributes |
Weakly Supervised | visual | Question Answer Generation |
Weakly Supervised | visual | Saliency Prediction |
Weakly Supervised | visual | Semantic Parsing |
Weakly-Supervised 3D Spatial Reasoning for Text-Based | visual | Question Answering |
Weakly-Supervised Cross-Domain Dictionary Learning for | visual | Recognition |
Weakly-Supervised Defect Segmentation Within | visual | Inspection Images of Liquid Crystal Displays in Array Process |
Weakly-Supervised Generation and Grounding of | visual | Descriptions with Conditional Generative Models |
Weakly-Supervised Learning of | visual | Relations |
Weakly-Supervised Semantic Segmentation with | visual | Words Learning and Hybrid Pooling |
Weakly-Supervised | visual | Grounding of Phrases with Linguistic Structures |
Weakly-Supervised | visual | Instrument-Playing Action Detection in Videos |
Wearable Computer Vision Systems for a Cortical | visual | Prosthesis |
Wearable Gaze Trackers: Mapping | visual | Attention in 3D |
WeatherNet: Recognising Weather and | visual | Conditions from Street-Level Images Using Deep Residual Learning |
Web and Personal Image Annotation by Mining Label Correlation With Relaxed | visual | Graph Embedding |
Web image concept annotation with better understanding of tags and | visual | features |
Web-Based | visual | and Analytical Geographical Information System for Oil and Gas Data, A |
Web-Scale Image Retrieval Using Compact Tensor Aggregation of | visual | Descriptors |
WEB-VC: | visual | Cryptography for Web Image |
Webcam-Based | visual | Gaze Estimation |
Webly Supervised Knowledge Embedding Model for | visual | Reasoning |
Webly-Supervised Fine-Grained | visual | Categorization via Deep Domain Adaptation |
Weed Mapping with UAS Imagery and a Bag of | visual | Words Based Image Classifier |
Weighted Average Precision: Adversarial Example Detection for | visual | Perception of Autonomous Vehicles |
Weighted bag of | visual | words for object recognition |
Weighted Bayesian Network for | visual | Tracking |
Weighted Component Hashing of Binary Aggregated Descriptors for Fast | visual | Search |
Weighted Dissociated Dipoles: An Extended | visual | Feature Set |
weighted full-reference image quality assessment based on | visual | saliency, A |
Weighted Local Bundle Adjustment and Application to Odometry and | visual | SLAM Fusion |
Weighted Part Context Learning for | visual | Tracking |
Weighted Pooling Based on | visual | Saliency for Image Classification |
Weighted Selection of Image Features for Resolved Rate | visual | Feedback Control |
Weighted sparse coding residual minimization for | visual | tracking |
Weighted | visual | secret sharing for general access structures based on random grids |
Weighting scheme for image retrieval based on bag-of- | visual | -words |
Weighting | visual | features with pseudo relevance feedback for CBIR |
What are the | visual | Features Underlying Human Versus Machine Vision? |
What are we looking for: Towards statistical modeling of saccadic eye movements and | visual | saliency |
What are we missing here? Brain imaging evidence for higher cognitive functions in primary | visual | cortex V1 |
What Are You Looking at?: Improving | visual | Gaze Estimation by Saliency |
What Can Projections of Flow Fields Tell Us About | visual | Motion |
What Do I See? Modeling Human | visual | Perception for Multi-person Tracking |
What does CLIP know about a red circle? | visual | prompt engineering for VLMs |
What Image Classifiers Really See: | visual | izing Bag-of-Visual Words Models |
What Is the Role of Independence for | visual | Recognition? |
What | visual | Attributes Characterize an Object Class? |
What we see is most likely to be what matters: | visual | attention and applications |
What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on | visual | Description Models and Metrics |
What's in a Question: Using | visual | Questions as a Form of Supervision |
What/Where to Look Next? Modeling Top-Down | visual | Attention in Complex Interactive Environments |
When Correlation Filters Meet Convolutional Neural Networks for | visual | Tracking |
When Does Contrastive | visual | Representation Learning Work? |
When standard RANSAC is not enough: Cross-media | visual | matching with hypothesis relevancy |
When to use what feature? SIFT, SURF, ORB, or A-KAZE features for monocular | visual | odometry |
When | visual | Disparity Generation Meets Semantic Segmentation: A Mutual Encouragement Approach |
Where is my Wallet? Modeling Object Proposal Sets for Egocentric | visual | Query Localization |
Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained | visual | Classification |
Where to Look: Focus Regions for | visual | Question Answering |
Where to Place: A Real-Time | visual | Saliency Based Label Placement for Augmented Reality Applications |
Which and How Many Regions to Gaze: Focus Discriminative Regions for Fine-Grained | visual | Categorization |
Which Has Better | visual | Quality: The Clear Blue Sky or a Blurry Animal? |
Which Looks Like Which: Exploring Inter-class Relationships in Fine-Grained | visual | Categorization |
Which Phoneme-to-Viseme Maps Best Improve | visual | -Only Computer Lip-Reading? |
Which semi-local | visual | masking model for wavelet based image quality metric? |
Who Let the Dogs Out? Modeling Dog Behavior from | visual | Data |
WHUVID: A Large-Scale Stereo-IMU Dataset for | visual | -Inertial Odometry and Autonomous Driving in Chinese Urban Scenarios |
Why Does a | visual | Question Have Different Answers? |
Why You Trust in | visual | Saliency |
Wideband enhancement of television images for people with | visual | impairments |
Wider or Deeper: Revisiting the ResNet Model for | visual | Recognition |
Will It Last? Learning Stable Features for Long-Term | visual | Localization |
With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of | visual | Representations |
Word Image Representation Based on | visual | Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents |
Word to Sentence | visual | Semantic Similarity for Caption Generation: Lessons Learned |
Word-HOGs: Word histogram of oriented gradients for mobile | visual | search |
Workflow Based Process | visual | Analyzer (ProVisZer) for Teaching and Learning, A |
Workpiece Orientation Correction with a Robot Arm Using | visual | Information |
Workshoop on Virtual/Augmented Reality for | visual | Artificial Intelligence |
Workshop and Challenges for New Frontiers in | visual | Language Reasoning: Compositionality, Prompts and Causality |
Workshop on Modeling, Simulation and | visual | Analysis of Large Crowds |
Workshop on Multi-View Modeling & Analysis of | visual | Scenes |
Workshop on | visual | Behaviors |
Workshop on | visual | Form |
Workshop on | visual | Motion |
Workshop on | visual | Surveillance |
Write a Classifier: Predicting | visual | Classifiers from Unstructured Text |
Writer Identification and Writer Retrieval Using the Fisher Vector on | visual | Vocabularies |
WSCNet: Weakly Supervised Coupled Networks for | visual | Sentiment Classification and Detection |
X Vision: Combining Image Warping and Geometric Constraints for Fast | visual | Tracking |
X-Learner: Learning Cross Sources and Tasks for Universal | visual | Representation |
X-ray Categorization and Retrieval on the Organ and Pathology Level, Using Patch-Based | visual | Words |
X-Ray Image Classification and Retrieval Using Ensemble Combination of | visual | Descriptors |
XDNet: A Few-Shot Meta-Learning Approach for Cross-Domain | visual | Inspection |
XOR-Based | visual | Cryptographic Schemes With Monotonously Increasing and Flawless Reconstruction Properties |
XOR-based | visual | cryptography scheme with essential shadows |
XVO: Generalized | visual | Odometry via Cross-Modal Self-Training |
Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based | visual | localization |
Yin and Yang: Balancing and Answering Binary | visual | Questions |
YORO - Lightweight End to End | visual | Grounding |
YouTube Movie Reviews: Sentiment Analysis in an Audio- | visual | Context |
ZebraRecognizer: Pedestrian crossing recognition for people with | visual | impairment or blindness |
Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic | visual | Navigation |
Zero shot learning based on class | visual | prototypes and semantic consistency |
Zero-Shot Keyword Spotting for | visual | Speech Recognition In-the-wild |
Zero-Shot Learning Using Synthesised Unseen | visual | Data with Diffusion Regularisation |
Zero-Shot Learning via Category-Specific | visual | -Semantic Mapping and Label Refinement |
Zero-Shot Learning via | visual | Abstraction |
Zero-Shot Point Cloud Segmentation by Semantic- | visual | Aware Synthesis |
Zero-Shot Recognition Using Dual | visual | -Semantic Mapping Paths |
Zero-shot semantic segmentation via spatial and multi-scale aware | visual | class embedding |
Zero-Shot | visual | Imitation |
Zero-Shot | visual | Recognition Using Semantics-Preserving Adversarial Embedding Networks |
Zero-Shot | visual | Recognition via Bidirectional Latent Embedding |
zero-watermark algorithm for multiple images based on | visual | cryptography and image fusion, A |
ZeroCap: Zero-Shot Image-to-Text Generation for | visual | -Semantic Arithmetic |
Zoom-and-Reasoning: Joint Foreground Zoom and | visual | -Semantic Reasoning Detection Network for Aerial Images |
Zoom-Net: Mining Deep Feature Interactions for | visual | Relationship Recognition |
9498 for visual