| _ | task | _ |
|---|---|---|
| 12-in-1: Multi- | task | Vision and Language Representation Learning |
| 3D descriptor to detect | task | -oriented grasping points in clothing, A |
| 3D Instance Segmentation via Multi- | task | Metric Learning |
| 3D Multi-Attention Guided Multi- | task | Learning Network for Automatic Gastric Tumor Segmentation and Lymph Node Classification |
| 3D Prior is All You Need: Cross- | task | Few-shot 2D Gaze Estimation |
| 3D referencing for remote | task | assistance in augmented reality |
| 4-DoF Tracking for Robot Fine Manipulation | task | s |
| 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition | task | s |
| AAGAN: Accuracy-Aware Generative Adversarial Network for Supervised | task | s |
| ABAW: Learning from Synthetic Data & Multi- | task | Learning Challenges |
| ABAW: Valence-Arousal Estimation, Expression Recognition, Action Unit Detection & Multi- | task | Learning Challenges |
| ABSNet: Aesthetics-Based Saliency Network Using Multi- | task | Convolutional Network |
| Abstraction for Correspondence Search Using | task | -Based Controls, An |
| Acceleration of multi- | task | cascaded convolutional networks |
| AccelIR: | task | -aware Image Compression for Accelerating Neural Restoration |
| Accuracy of MLP Based Data Visualization Used in Oil Prices Forecasting | task | |
| accurate and efficient multi- | task | brain tumour detection with segmented MRI images using auto-metric adolescent neural network, An |
| Accurate Segmentation of CT Male Pelvic Organs via Regression-Based Deformable Models and Multi- | task | Random Forests |
| Achievement-based Training Progress Balancing for Multi- | task | Learning |
| Acquiring Manipulation | task | s from Observation |
| Acronym Model Based Vision in the Intelligent | task | Automation Project |
| Action utility prediction and role | task | allocation in robot soccer system |
| Action-Affect-Gender Classification Using Multi- | task | Representation Learning |
| Actionet: An Interactive End-To-End Platform For | task | -Based Data Collection And Augmentation In 3D Environment |
| Active Learning with | task | Consistency and Diversity in Multi-Task Networks |
| Active Sensor Planning for Multiview Vision | task | s |
| Active | task | Cognition Method for Home Service Robot Using Multi-Graph Attention Fusion Mechanism, An |
| Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future | task | s |
| Active vision-based control schemes for autonomous navigation | task | s |
| Activity Recognition of Assembly | task | s Using Body-Worn Microphones and Accelerometers |
| ADA-AT/DT: An Adversarial Approach for Cross-Domain and Cross- | task | Knowledge Transfer |
| AdaMT-Net: An Adaptive Weight Learning Based Multi- | task | Learning Model For Scene Understanding |
| AdaMTL: Adaptive Input-dependent Inference for Efficient Multi- | task | Learning |
| AdaMV-MoE: Adaptive Multi- | task | Vision Mixture-of-Experts |
| Adapting JPEG XS gains and priorities to | task | s and contents |
| Adaptive Construction of the Virtual Debris Flow Disaster Environments Driven by Multilevel Visualization | task | |
| Adaptive Feature Aggregation in Deep Multi- | task | Convolutional Neural Networks |
| Adaptive Multi- | task | Learning for Few-shot Object Detection |
| Adaptive multi- | task | learning for fine-grained categorization |
| Adaptive Sampling of Motion Trajectories for Discrete | task | -Based Analysis and Synthesis of Gesture |
| Adaptive | task | Sampling for Meta-learning |
| Adaptive | task | -Aware Refining Network for Few-Shot Fine-Grained Image Classification |
| Adaptive | task | -Wise Message Passing for Multi-Task Learning: A Spatial Interaction Perspective |
| Adaptive Trajectory Planning Method of Autonomous Vehicles Integrating Multiple | task | s, An |
| Adaptive Weight Generator for Multi- | task | Image Recognition by Task Grouping Prompt |
| Adding New | task | s to a Single Network with Weight Transformations Using Binary Masks |
| Addressing Glaucoma Structure-Function Relationship: A Multi- | task | Learning Framework With Multi-Modal and Unpaired Data |
| Adjoint Rigid Transform Network: | task | -conditioned Alignment of 3D Shapes |
| Advances on Multimodal Remote Sensing Foundation Models for Earth Observation Downstream | task | s: A Survey |
| Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi- | task | Approaches |
| Adversarial Attacks of Vision | task | s in the Past 10 Years: A Survey |
| Adversarial Deep Multi- | task | Learning Using Semantically Orthogonal Spaces and Application to Facial Attributes Prediction |
| Adversarial Encoder-Multi- | task | -Decoder for Multi-Stage Processes |
| Adversarial Learning Guided | task | Relatedness Refinement for Multi-Task Deep Learning |
| Adversarial Structure Matching for Structured Prediction | task | s |
| ADVICE: Decision Support for Complex Geospatial Decision Making | task | s |
| Aerial path planning for 3D urban scene reconstruction with dual- | task | reconstructability learning and adaptive viewpoints selection |
| Affect Estimation in 3D Space Using Multi- | task | Active Learning for Regression |
| Affective Behavior Analysis Using Action Unit Relation Graph and Multi- | task | Cross Attention |
| Affective Expression Analysis in-the-wild using Multi- | task | Temporal Statistical Deep Learning Model |
| affective facial recognition | task | : The influence of cognitive styles and exposure times, The |
| Affine Coordinate Based Algorithm for Reprojecting the Human Face for Identification | task | s, An |
| AGIL: Learning Attention from Human for Visuomotor | task | s |
| AI-Generated Image Quality Assessment Based on | task | -Specific Prompt and Multi-Granularity Similarity |
| Aircraft-LBDet: Multi- | task | Aircraft Detection with Landmark and Bounding Box Detection |
| ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday | task | s |
| Algorithm-Dependent Generalization Bounds for Multi- | task | Learning |
| Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language | task | s |
| Aligning and Updating Cadaster Maps with Aerial Images by Multi- | task | , Multi-resolution Deep Learning |
| Aligning computational and human perceptions of image complexity: A dual- | task | framework for prediction and localization |
| All in Tokens: Unifying Output Space of Visual | task | s via Soft Token |
| All-in-One: Emotion, Sentiment and Intensity Prediction Using a Multi- | task | Ensemble Framework |
| AMFMER: A multimodal full transformer for unifying aesthetic assessment | task | s |
| AMMF: Attention-Based Multi-Phase Multi- | task | Fusion for Small Contour Object 3D Detection |
| Amodal Segmentation through Out-of- | task | and Out-of-Distribution Generalization with a Bayesian Model |
| Amyloid-beta Deposition Prediction With Large Language Model Driven and | task | -Oriented Learning of Brain Functional Networks |
| Analogical Augmentation and Significance Analysis for Online | task | -Free Continual Learning |
| Analysing inter-observer saliency variations in | task | -free viewing of natural images |
| Analysing the performance of Viola-Jones and multi- | task | convolution neural networks face detection algorithms using real-time video sequences |
| Analysis and Performance of Two Middle-Level Vision | task | s on a Fine-Grained SIMD Tree Machine, The |
| Analysis of lung scan imaging using deep multi- | task | learning structure for Covid-19 disease |
| Analysis of Observer Performance in Known-Location | task | s for Tomographic Image Reconstruction |
| Analysis of observer performance in unknown-location | task | s for tomographic image reconstruction |
| Analysis of Skill Improvement Process Based on Movement of Gaze and Hand in Assembly | task | |
| Analysis of | task | and Data Characteristic and the Collaborative Processing Method in Real-Time Visualization Pipeline of Urban 3DGIS, The |
| Analysis of | task | s and Features for Neuro-degenerative Disease Assessment by Handwriting, An |
| Analysis of Thermal Infrared and Visual Images for Industrial Inspection | task | s |
| Analysis on scalability and energy efficiency of HEVC decoding using | task | -based programming model |
| Analyzing Zero-shot Cross-lingual Transfer in Supervised NLP | task | s |
| ANAS: Asymptotic NAS for large-scale proxyless search and multi- | task | transfer learning |
| Anatomically Guided PET Image Reconstruction Using Conditional Weakly-Supervised Multi- | task | Learning Integrating Self-Attention |
| Anatomy-Aware Deep Unrolling for | task | -Oriented Acceleration of Multi-Contrast MRI |
| Anomaly Detection in Video via Self-Supervised and Multi- | task | Learning |
| Anomaly Detection via Learnable Pretext | task | |
| Ant colony optimisation for coloured travelling salesman problem by multi- | task | learning |
| Ante-Hoc Generation of | task | -Agnostic Interpretation Maps |
| ANTHROPOS-V: Benchmarking the Novel | task | of Crowd Volume Estimation |
| Apathy Classification by Exploiting | task | Relatedness |
| Application of an Information Gain Model in a Motor Learning Laparoscopic Surgery | task | |
| Application of Machine Learning for Disease Detection | task | s in Olive Trees Using Hyperspectral Data |
| Application of Panoramic Annular Lens for Motion Analysis | task | s: Surveillance and Smoke Detection |
| Application of | task | -based measures of image quality to optimization and evaluation of three-dimensional reconstruction-based compensation methods in myocardial perfusion SPECT |
| Application of | task | -specific metrics in JPEG2000 ROI compression |
| Application of the Fuzzy Artmap Neural-Network Model to Medical Pattern-Classification | task | s |
| Application of Three-Class ROC Analysis to | task | -Based Image Quality Assessment of Simultaneous Dual-Isotope Myocardial Perfusion SPECT (MPS) |
| Applications of non-metric vision to some visual guided | task | s |
| Applications of Non-Metric Vision to Some Visually-Guided Robotics | task | s |
| Approach to Investigate an Influence of Visual Angle Size on Emotional Activation During a Decision-making | task | , An |
| Approach to the Vision | task | s Involved in an Autonomous Crop Protection Vehicle, An |
| Approximating the Ideal Observer and Hotelling Observer for Binary Signal Detection | task | s by Use of Supervised Learning Methods |
| Approximating the Ideal Observer for Joint Signal Detection and Localization | task | s by use of Supervised Learning Methods |
| AR-PCA-HMM Approach for Sensorimotor | task | Classification in EEG-based Brain-Computer Interfaces |
| Arch-Graph: Acyclic Architecture Relation Predictor for | task | -Transferable Neural Architecture Search |
| Architecture and Automatic | task | Planning for Sensor-Based Robots |
| Are deep learning models robust to partial object occlusion in visual recognition | task | s? |
| Are metrics measuring what they should? An evaluation of Image Captioning | task | metrics |
| ARNOLD: A Benchmark for Language-Grounded | task | Learning With Continuous States in Realistic 3D Scenes |
| AS-Net: An attention-aware downsampling network for point clouds oriented to classification | task | s |
| ASHiTA: Automatic Scene-Grounded HIerarchical | task | Analysis |
| Assessing Similarities and Differences between Males and Females in Visual Behaviors in Spatial Orientation | task | s |
| Assessing the effects of dynamic luminance contrast noise masking on a color discrimination | task | |
| Assessing the Impact of Deep Neural Network-Based Image Denoising on Binary Signal Detection | task | s |
| Assessing the Role of Spatial Relations for the Object Recognition | task | |
| Assessment of NavVis VLX and BLK2GO SLAM Scanner Accuracy for Outdoor and Indoor Surveying | task | s |
| AssistGUI: | task | -Oriented PC Graphical User Interface Automation |
| Assisting Multimodal Named Entity Recognition by cross-modal auxiliary | task | s |
| AssistQ: Affordance-Centric Question-Driven | task | Completion for Egocentric Assistant |
| Associating Multi-Modal Brain Imaging Phenotypes and Genetic Risk Factors via a Dirty Multi- | task | Learning Method |
| Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision | task | s |
| Asynchronous Deep Reinforcement Learning for Collaborative | task | Computing and On-Demand Resource Allocation in Vehicular Edge Computing |
| Asynchronous Shuffled Frog-Leaping With Feasible Jaya Algorithm for Uncertain | task | Rescheduling Problem in UAV Emergency Networks, An |
| Attacking Attention of Foundation Models Disrupts Downstream | task | s |
| Attending Generalizability in Course of Deep Fake Detection by Exploring Multi- | task | Learning |
| Attention based multi- | task | interpretable graph convolutional network for Alzheimer's disease analysis |
| Attention-Aware Multi- | task | Convolutional Neural Networks |
| Attention-Based Multi- | task | Learning for Fine-Grained Image Classification |
| attention-enhanced cross- | task | network to analyse lung nodule attributes in CT images, An |
| Attentive Single- | task | ing of Multiple Tasks |
| Attentive | task | Interaction Network for Multi-Task Learning |
| Attentive texture similarity as a categorization | task | : Comparing texture synthesis models |
| Auto-context and its application to high-level vision | task | s |
| Auto-Context and Its Application to High-Level Vision | task | s and 3D Brain Image Segmentation |
| AutoLoss-Zero: Searching Loss Functions from Scratch for Generic | task | s |
| Automated camera layout to satisfy | task | -specific and floor plan-specific coverage requirements |
| Automated Cognitive Health Assessment Using Smart Home Monitoring of Complex | task | s |
| Automated measurement of children's facial expressions during problem solving | task | s |
| Automated Temporal Analysis of Gaze Following in a Visual Tracking | task | , The |
| Automatic 3D Bi-Ventricular Segmentation of Cardiac Images by a Shape-Refined Multi- | task | Deep Learning Approach |
| Automatic Detection of Amyotrophic Lateral Sclerosis (ALS) from Video-Based Analysis of Facial Movements: Speech and Non-Speech | task | s |
| Automatic Facial Attractiveness Prediction by Deep Multi- | task | Learning |
| Automatic Scoring of Multiple Semantic Attributes With Multi- | task | Feature Leverage: A Study on Pulmonary Nodules in CT Images |
| Automatic Sensor Placement from Vision | task | Requirements |
| Automatic Sensor Search and Positioning for Geometric | task | s |
| Automatically Generating Specification Properties From | task | Models for the Formal Verification of Human-Automation Interaction |
| Automatically Layer-Wise Searching Strategy for Channel Pruning Based on | task | -Driven Sparsity Optimization, An |
| AutoMF: Spatio-temporal Architecture Search for The Meteorological Forecasting | task | |
| Autonomous Generation of Service Strategy for Household | task | s: A Progressive Learning Method With A Priori Knowledge and Reinforcement Learning |
| Autonomous Planning Algorithm for Satellite Laser Ranging | task | s Based on Rolling Horizon Optimization Framework |
| AutoSegEdge: Searching for the edge device real-time semantic segmentation based on multi- | task | learning |
| Auxiliary | task | -Guided CycleGAN for Black-Box Model Domain Adaptation |
| Auxiliary | task | s and Exploration Enable ObjectGoal Navigation |
| Auxiliary | task | s Benefit 3D Skeleton-based Human Motion Prediction |
| Auxiliary | task | s for Efficient Learning of Point-Goal Navigation |
| Axial Sphere Loss: Encouraging Open-Space Risk Minimization in Face Identification | task | s |
| Backpack Full of Skills: Egocentric Video Understanding with Diverse | task | Perspectives, A |
| Balancing Shared and | task | -Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation |
| Bargaining Theoretic Approach to Quality-Fair System Resource Allocation for Multiple Decoding | task | s, A |
| Batch mode Adaptive Multiple Instance Learning for computer vision | task | s |
| Batch Model Consolidation: A Multi- | task | Model Consolidation Framework |
| Bayesian Classification of | task | -Oriented Actions Based on Stochastic Context-Free Grammar |
| Bayesian evaluation framework for subjectively annotated visual recognition | task | s, A |
| Behavior Recognition in Human Object Interactions with a | task | Model |
| Belief-Based | task | Offloading Algorithm in Vehicular Edge Computing, A |
| Benchmark and Evaluation of Surveillance | task | |
| BEST: Benchmark and Evaluation of Surveillance | task | |
| Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual | task | s |
| Beyond bottom-up: Incorporating | task | -dependent influences into a computational model of spatial attention |
| Beyond Counting: Comparisons of Density Maps for Crowd Analysis | task | s: Counting, Detection, and Tracking |
| Beyond Dataset Bias: Multi- | task | Unaligned Shared Knowledge Transfer |
| Beyond Image Super-Resolution for Image Recognition with | task | -Driven Perceptual Loss |
| Beyond Self-Attention: External Attention Using Two Linear Layers for Visual | task | s |
| Beyond Vision: A Multimodal Recurrent Attention Convolutional Neural Network for Unified Image Aesthetic Prediction | task | s |
| BGTracker: Cross- | task | Bidirectional Guidance Strategy for Multiple Object Tracking |
| Bi-level deep mutual learning assisted multi- | task | network for occluded person re-identification |
| Bi-level Learning of | task | -Specific Decoders for Joint Registration and One-Shot Medical Image Segmentation |
| Bi-spectral higher order statistics and time-frequency domain features for arithmetic | task | classification from EEG signals |
| BigSmall: Efficient Multi- | task | Learning for Disparate Spatial and Temporal Physiological Measurements |
| Billion-Scale Pretraining with Vision Transformers for Multi- | task | Visual Representations |
| Biologically inspired | task | oriented gist model for scene classification |
| Biologically Motivated and Computationally Tractable Model of Low and Mid-Level Vision | task | s, A |
| Bitat: Neural Network Binarization with | task | -dependent Aggregated Transformation |
| Blockchain and Learning-Based Secure and Intelligent | task | Offloading for Vehicular Fog Computing |
| Boosted multi- | task | learning for face verification with applications to web image and video search |
| Boosted Multi- | task | Model for Pedestrian Detection With Occlusion Handling, A |
| Boosting Camouflaged Object Detection with Dual- | task | Interactive Transformer |
| Boundary Error Analysis and Categorization in the TRECVID News Story Segmentation | task | |
| Box-Based Refinement for Weakly Supervised and Unsupervised Localization | task | s |
| Brain Image Segmentation for Ultrascale Neuron Reconstruction via an Adaptive Dual- | task | Learning Network |
| Brain-Computer Interface for Mental Arithmetic | task | from Single-Trial Near-Infrared Spectroscopy Brain Signals, A |
| BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi- | task | Dense Predictions |
| Bridging Cross- | task | Protocol Inconsistency for Distillation in Dense Object Detection |
| Bridging Morphology and Molecular Signatures: Multi- | task | Deep Learning for Multi-Omics Prediction from Histopathology |
| Brush, lasso, or magic wand? Picking the right tool for large-scale multiple object selection | task | s |
| BTG-Net++: Enhanced Bi-Directional | task | -Guided Network for Few-Shot Fine-Grained Image Classification |
| Building Reliable and Reusable Test Collections for Image Retrieval: The Wikipedia | task | at ImageCLEF |
| Built-Up Area Change Detection Using Multi- | task | Network with Object-Level Refinement |
| CafeBoost: Causal Feature Boost to Eliminate | task | -Induced Bias for Class Incremental Learning |
| CALTracker: Cross- | task | Association Learning for Multiple Object Tracking |
| CAMEL: Confidence-Aware Multi- | task | Ensemble Learning with Spatial Information for Retina OCT Image Classification and Segmentation |
| Can Categories and Attributes Be Learned in a Multi- | task | Way? |
| Can data placement be effective for Neural Networks classification | task | s? Introducing the Orthogonal Loss |
| Can Selfless Learning improve accuracy of a single classification | task | ? |
| Can We Characterize | task | s Without Labels or Features? |
| Capability-Oriented Decision-Making in Multi-UAV Deployment and | task | Allocation: A Hierarchical Game-Based Framework |
| Cascade framework for | task | -space synchronization of networked robots with uncertain kinematics and dynamics |
| Cascade of | task | s for facial expression analysis |
| case study in identifying acceptable bitrates for human face recognition | task | s, A |
| Category Contrast for Unsupervised Domain Adaptation in Visual | task | s |
| Causal Attention for Vision-Language | task | s |
| Causal Intervention Method for Domain Generalization with a Self-Supervised Auxiliary | task | , A |
| CDTFusion: Crossing Domain and | task | for Infrared and Visible Image Fusion |
| Cell tracking using deep neural networks with multi- | task | learning |
| Center-Focusing Multi- | task | CNN with Injected Features for Classification of Glioma Nuclear Images |
| CGMNet: Semantic Change Detection via a Change-Aware Guided Multi- | task | Network |
| ChangeMask: Deep multi- | task | encoder-transformer-decoder architecture for semantic change detection |
| Channel-Robust RF Fingerprint Identification Using Multi- | task | Learning and Receiver Collaboration |
| Channelized-ideal observer using Laguerre-Gauss channels in detection | task | s involving non-Gaussian distributed lumpy backgrounds and a Gaussian signal |
| Chimera: A Multi- | task | Recurrent Convolutional Neural Network for Forest Classification and Structural Estimation |
| CIE XYZ Net: Unprocessing Images for Low-Level Computer Vision | task | s |
| CKD: Cross- | task | Knowledge Distillation for Text-to-Image Synthesis |
| Class Incremental Learning for Image Classification With Out-of-Distribution | task | Identification |
| Class Incremental Learning With | task | -Selection |
| Classification images for simple detection and discrimination | task | s in correlated noise |
| Classification of fNIRS based brain hemodynamic response to mental arithmetic | task | s |
| Classification of Hyperspectral Image Based on | task | -Specific Learning Network |
| Classification | task | Assisted Segmentation Network for Breast Tumor Segmentation in Ultrasound Images |
| Classification-based Multi- | task | Learning for Efficient Pose Estimation Network |
| Classifier-specific intermediate representation for multimedia | task | s |
| CLEF 2005 Automatic Medical Image Annotation | task | , The |
| CLEval: Character-Level Evaluation for Text Detection and Recognition | task | s |
| Cloud Transformers: A Universal Approach To Point Cloud Processing | task | s |
| Cluster-based multi- | task | Sparse Representation for efficient face recognition |
| Clustered Multi- | task | Linear Discriminant Analysis for View Invariant Color-Depth Action Recognition |
| Clustered | task | -Aware Meta-Learning by Learning from Learning Paths |
| Clustering Quality Measures for Point Cloud Segmentation | task | s |
| CMT-CO: Contrastive Learning with Character Movement | task | for Handwritten Text Recognition |
| CNN-Based cascaded multi- | task | learning of high-level prior and density estimation for crowd counting |
| Co-attentive multi- | task | convolutional neural network for facial expression recognition |
| CO-Net++: A Cohesive Network for Multiple Point Cloud | task | s at Once With Two-Stage Feature Rectification |
| CO-Net: Learning Multiple Point Cloud | task | s at Once with A Cohesive Network |
| Co-segmentation inspired attention module for video-based computer vision | task | s |
| Co-Training Vision-Language Models for Remote Sensing Multi- | task | Learning |
| Coarse to Fine: Progressive and Multi- | task | Learning for Salient Object Detection |
| Coarse-to-fine pseudo supervision guided meta- | task | optimization for few-shot object classification |
| Coarse-to-Fine | task | -Driven Inpainting for Geoscience Images |
| CoDriver ETA: Combine Driver Information in Estimated Time of Arrival by Driving Style Learning Auxiliary | task | |
| Coevolutionary learning of neural network ensemble for complex classification | task | s |
| Cognition-Enabled Autonomous Robot Control for the Realization of Home Chore | task | Intelligence |
| Cognitive | task | Virtualization for Alzheimer's Diagnosis Using Realistic VR Simulation |
| Cognitive | task | s modelization and description in VR environment for Alzheimer's disease state identification |
| Cognitive Workload Impacts of Simulated Visibility Changes During Search and Surveillance | task | s Quantified by Functional Near Infrared Spectroscopy |
| Collective Sports: A multi- | task | dataset for collective activity recognition |
| Collimator Optimization for Detection and Quantitation | task | s: Application to Gallium-67 Imaging |
| Colorization as a Proxy | task | for Visual Understanding |
| Combined use of partial least squares regression and neural network for diagnosis | task | s |
| Combining Linear Dimensionality Reduction and Locality Preserving Projections with Feature Selection for Recognition | task | s |
| Combining Multiple Shape Matching Techniques with Application to Place Recognition | task | |
| Combining | task | Predictors via Enhancing Joint Predictability |
| Combining Vision-Language Models and Weak Supervision for Nuanced Vision Classification | task | s |
| Combining visual and acoustic features for audio classification | task | s |
| Comic MTL: optimized multi- | task | learning for comic book image analysis |
| Comic Speaker Prediction Based on Visual Relations and Natural Language Processing | task | s |
| Commentary Paper on An Object- and | task | -Oriented Architecture for Automated Video Surveillance in Distributed Sensor Networks |
| Comparative Performance Evaluation of Gray-Scale and Color Information for Face Recognition | task | s |
| Comparing indirect and direct touch in a stereoscopic interaction | task | |
| Comparing Objective and Subjective Metrics Between Physical and Virtual | task | s |
| Comparing state-of-the-art visual features on invariant object recognition | task | s |
| Comparison and Evaluation of Sonification Strategies for Guidance | task | s |
| Comparison of a two-handed interface to a wand interface and a mouse interface for fundamental 3D | task | s |
| Comparison of Channel Methods and Observer Models for the | task | -Based Assessment of Multi-Projection Imaging in the Presence of Structured Anatomical Noise |
| Comparison of Classifier Fusion Methods for Classification in Pattern Recognition | task | s |
| Comparison of Edge Detection Algorithms Using a Structure from Motion | task | |
| Comparison of Edge Detector Performance through Use in an Object Recognition | task | |
| Comparison of Edge Detectors Using an Object Recognition | task | |
| Comparison of Facial Alignment Techniques: With Test Results on Gender Classification | task | |
| Comparison of Feature Detectors with Passive and | task | -Based Visual Saliency, A |
| Comparison of PMD-Cameras and Stereo-Vision for the | task | of Surface Reconstruction using Patchlets, A |
| Complementary computing for visual | task | s: Meshing computer vision with human visual processing |
| Complexity Experts are | task | -Discriminative Learners for Any Image Restoration |
| Complexity of Perceptual Search | task | s, The |
| composite approach to evaluate two interaction techniques for a 3D pointing | task | , A |
| Composite Description Based on Salient Contours and Color Information for CBIR | task | s |
| Composite | task | ing: Understanding Images by Spatial Composition of Tasks |
| Compound Expression Recognition In-the-wild with AU-assisted Meta Multi- | task | Learning |
| comprehensive survey: Image deraining and stereo-matching | task | -driven performance analysis, A |
| ComPtr: Toward Diverse Bi-Source Dense Prediction | task | s via a Simple Yet General Complementary Transformer |
| Computational Methods for | task | -Directed Sensor Data Fusion and Planning |
| Computational Model of Focused Attention Meditation and Its Transfer to a Sustained Attention | task | , A |
| Computational Modeling of Age-Differences in a Visually Demanding Driving | task | : Vehicle Detection |
| Computing Swept Volumes for Sensor Planning | task | s |
| ConCon-Chi: Concept-Context Chimera Benchmark for Personalized Vision-Language | task | s |
| Conditional Channel Gated Networks for | task | -Aware Continual Learning |
| Conditional Multi- | task | learning for Plant Disease Identification |
| Conditional Mutual Information Based Feature Selection for Classification | task | |
| Conditional random fields versus template-matching in MT phrasing | task | s involving sparse training data |
| Confidence Intervals for Error Rates in 1:1 Matching | task | s: Critical Statistical Analysis and Recommendations |
| Conflicts between Likelihood and Knowledge Distillation in | task | Incremental Learning for 3D Object Detection |
| Connecting Image Denoising and High-Level Vision | task | s via Deep Learning |
| Constrained Probabilistic Mask Learning for | task | -specific Undersampled MRI Reconstruction |
| Constructing | task | visibility intervals for a surveillance system |
| Content Based Image Retrieval system using Wavelet Transformation and multiple input multiple | task | Deep Autoencoder |
| Content-Aware Image Color Editing with Auxiliary Color Restoration | task | s |
| Content-Dependency Reduction With Multi- | task | Learning In Blind Stitched Panoramic Image Quality Assessment |
| Context multi- | task | visual object tracking via guided filter |
| Context-based automatic reconstruction and texturing of 3D urban terrain for quick-response | task | s |
| Contextual Learning in the Selective Attention for Identification model (CL-SAIM): Modeling contextual cueing in visual search | task | s |
| Contextualising Implicit Representations for Semantic | task | s |
| Continual Action Assessment via | task | -Consistent Score-Discriminative Feature Distribution Modeling |
| Continual Learning Based on OOD Detection and | task | Masking |
| Continual Learning Survey: Defying Forgetting in Classification | task | s, A |
| Continual Object Detection via Prototypical | task | Correlation Guided Gating Mechanism |
| Contrasting Instructional Strategies Suited to a Detection | task | : Examining Differences in Subjective Workload |
| Controllable Dynamic Multi- | task | Architectures |
| Convex Formulation for Learning a Shared Predictive Structure from Multiple | task | s, A |
| Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross- | task | Interaction for Food Category and Ingredient Recognition |
| Convolutional Masked Image Modeling for Dense Prediction | task | s on Pathology Images |
| Convolutional Oriented Boundaries: From Image Segmentation to High-Level | task | s |
| Cooperative Dual- | task | Path Planning for Persistent Surveillance and Emergency Handling by Multiple Unmanned Ground Vehicles |
| Cooperative multi- | task | learning and reliability assessment for glioma segmentation and IDH genotyping |
| Cooperative | task | allocation method for multi-unmanned aerial vehicles based on the modified genetic algorithm |
| Cops-Ref: A New Dataset and | task | on Compositional Referring Expression Comprehension |
| Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied | task | s, A |
| CoTDet: Affordance Knowledge Prompting for | task | Driven Object Detection |
| Counting Challenging Crowds Robustly Using a Multi-Column Multi- | task | Convolutional Neural Network |
| Coupled Shift-Invariant Tensorial Spatial ICA Applied to Multi-Group Complex-Valued | task | -Related and Resting-State fMRI Data |
| Coupled-dynamic learning for vision and language: Exploring Interaction between different | task | s |
| Covid-MANet: Multi- | task | attention network for explainable diagnosis and severity assessment of COVID-19 from CXR images |
| CP-mtML: Coupled Projection Multi- | task | Metric Learning for Large Scale Face Retrieval |
| CPVF: Vectorization of Agricultural Cultivation Field Parcels via a Boundary-Parcel Multi- | task | Learning Network in Ultra-High-Resolution Remote Sensing Images |
| CrabNet: Fully | task | -Specific Feature Learning for One-Stage Object Detection |
| Crafting a multi- | task | CNN for viewpoint estimation |
| Creating Experts From the Crowd: Techniques for Finding Workers for Difficult | task | s |
| CRML-Net: Cross-Modal Reasoning and Multi- | task | Learning Network for tooth image segmentation |
| CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement | task | |
| Crop/Weed Field Image Dataset for the Evaluation of Computer Vision Based Precision Agriculture | task | s, A |
| Cross Dense Feature Learning With | task | Guidance for Few-Shot Classification |
| Cross Domain Lifelong Learning Based on | task | Similarity |
| Cross-Connected Networks for Multi- | task | Learning of Detection and Segmentation |
| Cross-Corpus Acoustic Emotion Recognition with Multi- | task | Learning: Seeking Common Ground While Preserving Differences |
| Cross-domain Few-shot Learning with | task | -specific Adapters |
| Cross-Domain Multi- | task | Learning for Object Detection and Saliency Estimation |
| Cross-Domain Self-Supervised Multi- | task | Feature Learning Using Synthetic Imagery |
| Cross-layer features in convolutional neural networks for generic classification | task | s |
| Cross-Lingual Text Image Recognition via Multi- | task | Sequence to Sequence Learning |
| Cross-Modal Data Augmentation for | task | s of Different Modalities |
| Cross-Modal Knowledge Transfer Without | task | -Relevant Source Data |
| Cross-Modal Pedestrian Behavior Prediction: A Dual- | task | Approach with Progressive Denoising Attention and CVAE |
| Cross-modality person re-identification via multi- | task | learning |
| Cross-Stitch Networks for Multi- | task | Learning |
| Cross- | task | Affinity Learning for Multitask Dense Scene Predictions |
| Cross- | task | and cross-domain SAR target recognition: A meta-transfer learning approach |
| Cross- | task | and Cross-Participant Classification of Cognitive Load in an Emergency Simulation Game |
| Cross- | task | and time-aware adversarial attack framework for perception of autonomous driving |
| Cross- | task | Attention Mechanism for Dense Multi-task Learning |
| Cross- | task | Crash Severity Analysis With Cost-Sensitive Transfer Graph Convolutional Network |
| Cross- | task | Inconsistency Based Active Learning (CTIAL) for Emotion Recognition |
| Cross- | task | Multimodal Reinforcement for Long Tail Next POI Recommendation |
| Cross- | task | Relation-Aware Consistency for Weakly Supervised Temporal Action Detection |
| Cross- | task | Transfer for Geotagged Audiovisual Aerial Scene Recognition |
| Cross- | task | Weakly Supervised Learning From Instructional Videos |
| CrossInfoNet: Multi- | task | Information Sharing Based Hand Pose Estimation |
| CTOD: Cross-Attentive | task | -Alignment for One-Stage Object Detection |
| CubiCasa5K: A Dataset and an Improved Multi- | task | Model for Floorplan Image Analysis |
| Curriculum Learning for Multi- | task | Classification of Visual Attributes |
| Curriculum learning of multiple | task | s |
| Curriculum learning of visual attribute clusters for multi- | task | classification |
| Curriculum-Based Asymmetric Multi- | task | Reinforcement Learning |
| Cutting through the clutter: | task | -relevant features for image matching |
| C^2MT: A Credible and Class-Aware Multi- | task | Transformer for SR-IQA |
| Data-Adaptive Weight-Ensembling for Multi- | task | Model Fusion |
| Data-driven bayesian-guided activation functions for multi- | task | pattern recognition |
| Data-Efficient and Robust | task | Selection for Meta-Learning |
| DATA: Domain-Aware and | task | -Aware Self-supervised Learning |
| Database Indexing, Specific | task | s |
| DATNet: Dense Auxiliary | task | s for Object Detection |
| DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross- | task | Interactions |
| DBA-PCGC: Dual-Domain Boundary Aware for | task | -Friendly Point Cloud Geometry Compression |
| Dealing with Cross- | task | Class Discrimination in Online Continual Learning |
| Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction | task | through Knowledge Distillation |
| Decentralized Motion Planning for Multiagent Collaboration Under Coupled LTL | task | Specifications |
| Deciding the Path: Leveraging Multi-Agent Systems for Solving Complex | task | s |
| Deconfounding the Effects of Resting State Activity on | task | Activation Detection in fMRI |
| Decouple-and-Sample: Protecting Sensitive Information in | task | Agnostic Data Release |
| Decouple-Then-Merge: Finetune Diffusion Models as Multi- | task | Learning |
| Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric | task | s |
| Decoupled Multi- | task | Learning with Cyclical Self-Regulation for Face Parsing |
| Decoupled Multi- | task | Network for Shadow Removal, A |
| Decoupled PROB: Decoupled Query Initialization | task | s and Objectness-Class Learning for Open World Object Detection |
| Decoupled self-supervised deep multi- | task | learning framework for subscriber portrait in smart meter |
| DecoupleNet: Domain-specific | task | decoupling network for low-light image enhancement |
| Decoupling Learning and Remembering: a Bilevel Memory Framework with Knowledge Projection for | task | -Incremental Learning |
| Decoupling multi- | task | causality for improved skin lesion segmentation and classification |
| Deep CNN-based classification of motor imagery | task | s from EEG signals using 2D wavelet transformed images of adaptively reconstructed signals from MVMD decomposed modes |
| Deep Co-Training with | task | Decomposition for Semi-Supervised Domain Adaptation |
| Deep Collaborative Multi- | task | Network: A Human Decision Process Inspired Model for Hierarchical Image Classification |
| Deep contrastive representation learning for supervised | task | s |
| Deep Convolutional-Shepard Interpolation Neural Networks for Image Classification | task | s |
| Deep Crisp Boundaries: From Boundaries to Higher-Level | task | s |
| Deep Elastic Networks With Model Selection for Multi- | task | Learning |
| Deep Execution Monitor for Robot Assistive | task | s |
| Deep Floor Plan Recognition Using a Multi- | task | Network With Room-Boundary-Guided Attention |
| Deep Learning for Multi- | task | Plant Phenotyping |
| Deep Learning Triplet Ordinal Relation Preserving Binary Code for Remote Sensing Image Retrieval | task | |
| Deep learning-driven diagnosis: A multi- | task | approach for segmenting stroke and Bell's palsy |
| Deep Low Light Image Enhancement via Multi- | task | Learning of Few Shot Exposure Imaging |
| Deep MANTA: A Coarse-to-Fine Many- | task | Network for Joint 2D and 3D Vehicle Analysis from Monocular Image |
| Deep Multi- | task | Attribute-Driven Ranking for Fine-Grained Sketch-Based Image Retrieval |
| Deep Multi- | task | Learning Based Fast Intra-Mode Decision for Versatile Video Coding |
| Deep multi- | task | learning for a geographically-regularized semantic segmentation of aerial images |
| Deep Multi- | task | Learning for Facial Expression Recognition and Synthesis Based on Selective Feature Sharing |
| Deep multi- | task | learning for gait-based biometrics |
| Deep Multi- | task | Learning for Joint Localization, Perception, and Prediction |
| Deep Multi- | task | Learning for Joint Prediction of Heterogeneous Face Attributes |
| Deep Multi- | task | Learning in Computer Vision |
| Deep Multi- | task | Learning to Recognise Subtle Facial Expressions of Mental States |
| Deep multi- | task | learning with relational attention for business success prediction |
| Deep Multi- | task | Multi-Label CNN for Effective Facial Attribute Classification |
| Deep multimodal learning for cross-modal retrieval: One model for all | task | s |
| Deep reinforcement learning and ant colony optimization supporting multi-UGV path planning and | task | assignment in 3D environments |
| Deep Reinforcement Learning-Based | task | Offloading for Vehicular Edge Computing With Flexible RSU-RSU Cooperation |
| Deep Reinforcement Learning-Based | task | Offloading With Collaborative Inference in UAV-Assisted Mobile Edge Computing Networks |
| Deep Reinforcement Learning-Based | task | Scheduling and Resource Allocation for Vehicular Edge Computing: A Survey |
| Deep tree-structured face: A unified representation for multi- | task | facial biometrics |
| Deep Virtual Networks for Memory Efficient Inference of Multiple | task | s |
| Deep visual unsupervised domain adaptation for classification | task | s: A survey |
| Deep-Learning Based GNSS Scene Recognition Method for Detailed Urban Static Positioning | task | via Low-Cost Receivers, A |
| DeepDNet: Deep Dense Network for Depth Completion | task | |
| DeepSaliency: Multi- | task | Deep Neural Network Model for Salient Object Detection |
| DeepShoe: An improved Multi- | task | View-invariant CNN for street-to-shop shoe retrieval |
| DeepSkeleton: Learning Multi- | task | Scale-Associated Deep Side Outputs for Object Skeleton Extraction in Natural Images |
| Defocus Image Deblurring Network With Defocus Map Estimation as Auxiliary | task | |
| Dense computing | task | analysis of multi-view matching method and GPU implementation |
| Dense Relational Image Captioning via Multi- | task | Triple-Stream Networks |
| DenseFormer-MoE: A Dense Transformer Foundation Model With Mixture of Experts for Multi- | task | Brain Image Analysis |
| Densely connected multidilated convolutional networks for dense prediction | task | s |
| Density-aware and background-aware network for crowd counting via multi- | task | learning |
| Density-Aware Multi- | task | Learning for Crowd Counting |
| Depict: Diffusion-enabled Permutation Importance for Image Classification | task | s |
| Descriptive and Prescriptive Languages for Mobility | task | s: Are They Different? |
| Design and Evaluation of a Haptic Computer-Assistant for Telemanipulation | task | s |
| Design Eye-Tracking Augmented Reality Headset to Reduce Cognitive Load in Repetitive Parcel Scanning | task | |
| Designing Extremely Memory-Efficient CNNs for On-device Vision and Audio | task | s |
| Designing Extremely Memory-efficient CNNs for On-device Vision | task | s |
| Designing stereo heads using | task | domain constraints |
| DETA: A Point-Based Tracker With Deformable Transformer and | task | -Aligned Learning |
| DETA: Denoised | task | Adaptation for Few-Shot Learning |
| Detecting Absence of Bone Wall in Jugular Bulb by Image Transformation Surrogate | task | s |
| Detecting Anatomical Landmarks From Limited Medical Imaging Data Using Two-Stage | task | -Oriented Deep Neural Networks |
| Detecting Drivers' Mirror-Checking Actions and Its Application to Maneuver and Secondary | task | Recognition |
| Detecting global ocean subsurface density change with high-resolution via dual- | task | densely-former |
| Detecting Stress During Real-World Driving | task | s Using Physiological Sensors |
| Detection of Atypical Elements by Transforming | task | to Supervised Form |
| Detection of Cognitive Binding During Ambiguous Figure | task | s by Wavelet Coherence Analysis of EEG Signals |
| Detection of Metadata Tampering Through Discrepancy Between Image Content and Metadata Using Multi- | task | Deep Learning |
| Detection of static objects for the | task | of video surveillance |
| Determination of Motion Breakpoints in a | task | Sequence from Human Hand Motion |
| Determining the trustworthiness of DNNs in classification | task | s using generalized feature-based confidence metric |
| DeTTO: Dependency-Aware Trustworthy | task | Offloading in Vehicular IoT |
| Developer-friendly segmentation using OpenVL, a high-level | task | -based abstraction |
| Developing | task | -Specific RBF Hand Gesture Recognition |
| Development and evaluation of a specialized | task | taxonomy for spatial planning: A map literacy experiment with topographic maps |
| Devil is in the | task | : Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection, The |
| DeViouS: a distributed environment for vision | task | s |
| DF-Net: Unsupervised Joint Learning of Depth and Flow Using Cross- | task | Consistency |
| Diagnostic Captioning by Cooperative | task | Interactions and Sample-Graph Consistency |
| Diff-Plugin: Revitalizing Details for Diffusion-Based Low-Level | task | s |
| Difficulty Estimation with Action Scores for Computer Vision | task | s |
| Diffusion-based Visual Anagram as Multi- | task | Learning |
| DiffusionMTL: Learning Multi- | task | Denoising Diffusion Model from Partially Annotated Data |
| Digital Twin-Based | task | -Driven Resource Management in Intelligent UAV Swarms |
| Digital Twin-Driven Vehicular | task | Offloading and IRS Configuration in the Internet of Vehicles |
| Discover-Then-Name: | task | -Agnostic Concept Bottlenecks via Automated Concept Discovery |
| Discriminative multi- | task | objects tracking with active feature selection and drift correction |
| Discriminative Multi- | task | Sparse Learning for Robust Visual Tracking Using Conditional Random Field |
| Discriminative Neural Variational Model for Unbalanced Classification | task | s in Knowledge Graph |
| Discriminative Training of Deep Fully Connected Continuous CRFs With | task | -Specific Loss |
| Disentangling | task | -Oriented Representations for Unsupervised Domain Adaptation |
| Disjoint Multi- | task | Learning Between Heterogeneous Human-Centric Tasks |
| Disposable Transfer Learning for Selective Source | task | Unlearning |
| Distance-Based Descriptors and Their Application in the | task | of Object Detection |
| Distilling Cross- | task | Knowledge via Relationship Matching |
| Distilling Facial Knowledge with Teacher- | task | s: Semantic-Segmentation-Features For Pose-Invariant Face-Recognition |
| Distilling from Similar | task | s for Transfer Learning on a Budget |
| Distilling Image Dehazing With Heterogeneous | task | Imitation |
| Distortion-Aware Multi- | task | Learning Framework for Fractional Interpolation in Video Coding, A |
| Distractor suppression Siamese network with | task | -aware attention for visual tracking |
| Distributed Algorithm for | task | Offloading in Vehicular Networks With Hybrid Fog/Cloud Computing, A |
| Distributed architectures and logical- | task | decomposition in multimedia surveillance systems |
| Distributed Collaborative Computing for | task | Completion Rate Maximization in Vehicular Edge Computing |
| Distributed Deadlock-Free | task | Offloading Algorithm for Integrated Communication-Sensing-Computing Satellites with Data-Dependent Constraints, A |
| Distributed Infrastructure Enabling Effective Integration of Earth Observation Information Resources for Collective Solution of Archiving, Searching, Processing and Eo Data Analyzing | task | s |
| Distributed Semantic Segmentation with Efficient Joint Source and | task | Decoding |
| Distributed | task | Assignment for Multiple Robots Under Limited Communication Range |
| Di | task | : Multi-Task Fine-Tuning with Diffeomorphic Transformations |
| DIVE: Inverting Conditional Diffusion Models for Discriminative | task | s |
| Diversified Dynamic Routing for Vision | task | s |
| Diversified Fisher kernel: encoding discrimination in Fisher features to compete deep neural models for visual classification | task | |
| Diversified | task | Augmentation with Redundancy Reduction for Cross-Domain Few-Shot Learning |
| diversified-equal loss for image translation | task | s, The |
| divide-and-conquer strategy for facial landmark detection using dual- | task | CNN architecture, A |
| Divide-and-conquer towards optimal adaptation of pre-trained model to medical | task | s |
| DJUHNet: A deep representation learning-based scheme for the | task | of joint image upsampling and hashing |
| DMITS: Dependency and Mobility-Aware Intelligent | task | Scheduling in Socially-Enabled VFC Based on Federated DRL Approach |
| Do Different Map Types Support Map Reading Equally? Comparing Choropleth, Graduated Symbols, and Isoline Maps for Map Use | task | s |
| Doc2graph: A | task | Agnostic Document Understanding Framework Based on Graph Neural Networks |
| DocRes: A Generalist Model Toward Unifying Document Image Restoration | task | s |
| Does gender make a difference to performing in-vehicle | task | s? |
| Does Robustness on ImageNet Transfer to Downstream | task | s? |
| Does the Brain Rest?: An Independent Component Analysis of Temporally Coherent Brain Networks at Rest and During a Cognitive | task | |
| Doing and Feeling: Relationships Between Moods, Productivity and | task | -Switching |
| Doing Versus Observing: Virtual Reality and 360-Degree Video for Training Manufacturing | task | s |
| Domain Adaptation Through | task | Distillation |
| Domain Aware Multi- | task | Pretraining of 3d Swin Transformer for T1-Weighted Brain MRI |
| Domain Generalization for Object Recognition with Multi- | task | Autoencoders |
| domain-of-influence based pricing strategy for | task | assignment in crowdsourcing package delivery, A |
| Don't just listen, use your imagination: Leveraging visual common sense for non-visual | task | s |
| DONNAv2: Lightweight Neural Architecture Search for Vision | task | s |
| Double- | task | Deep Q-Learning with Multiple Views |
| DR(eye)VE: A Dataset for Attention-Based | task | s with Applications to Autonomous and Assisted Driving |
| Driven to discussion: engaging drivers in conversation with a digital assistant as a countermeasure to passive | task | -related fatigue |
| Driver Gaze Area Prediction During IVIS Secondary | task | s Based on Multivariate Features of Spatial-Temporal Distribution |
| Driver Multi- | task | Emotion Recognition Network Based on Multi-Modal Facial Video Analysis |
| DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking | task | s |
| DT Assisted | task | Offloading for C-V2X Networks With Imperfect DT Prediction Conditions |
| Dual Domain Multi- | task | Model for Vehicle Re-Identification |
| Dual | task | Learning by Leveraging Both Dense Correspondence and Mis-Correspondence for Robust Change Detection With Imperfect Matches |
| Dual Transfer Learning for Event-based End- | task | Prediction via Pluggable Event to Image Translation |
| Dual-Channel Multi- | task | CNN for No-Reference Screen Content Image Quality Assessment |
| Dual-Domain Multi- | task | Learning-Based Domain Adaptation for Hyperspectral Image Classification |
| Dual- | task | ConvLSTM-UNet for Instance Segmentation of Weakly Annotated Microscopy Videos |
| Dual- | task | Integrated Network for Fast Pedestrian Detection in Crowded Scenes |
| Dual- | task | Mutual Learning With QPHFM Watermarking for Deepfake Detection |
| Dual- | task | Mutual Reinforcing Embedded Joint Video Paragraph Retrieval and Grounding |
| Dual- | task | Network for Terrace and Ridge Extraction: Automatic Terrace Extraction via Multi-Task Learning |
| dual- | task | region-boundary aware neural network for accurate pulmonary nodule segmentation, A |
| Dual- | task | Semantic Change Detection for Remote Sensing Images Using the Generative Change Field Module |
| Dual- | task | Supervised Network for SAR and Road Vector Image Matching |
| Dual- | task | Synergy-Driven Generalization Framework for Pancreatic Cancer Segmentation in CT Scans, A |
| Duality Diagram Similarity: A Generic Framework for Initialization Selection in | task | Transfer Learning |
| DXA-Net: Dual- | task | Cross-Lingual Alignment Network for Zero-Shot Cross-Lingual Spoken Language Understanding |
| Dynamic Context Removal: A General Training Strategy for Robust Models on Video Action Predictive | task | s |
| Dynamic Cross- | task | Representation Adaptation for Clinical Targets Co-Segmentation in CT Image-Guided Post-Prostatectomy Radiotherapy |
| Dynamic Delay-Sensitive Observation-Data-Processing | task | Offloading for Satellite Edge Computing: A Fully-Decentralized Approach |
| Dynamic Feature Interaction Framework for Multi- | task | Visual Perception, A |
| Dynamic Integration of | task | -Specific Adapters for Class Incremental Learning |
| Dynamic Neural Network for Multi- | task | Learning Searching across Diverse Network Topologies |
| Dynamic Resource Allocation for Cloud-Edge Collaboration Offloading in VEC Networks With Diverse | task | s |
| Dynamic | task | Assignment and Path Optimization for Multi-AUVs System |
| Dynamic | task | decomposition for decentralized object tracking in complex scenes |
| Dynamic | task | Decomposition for Probabilistic Tracking in Complex Scenes |
| Dynamic | task | Planning Method for Multi-Source Remote Sensing Satellite Cooperative Observation in Complex Scenarios |
| Dynamic | task | Prioritization for Multitask Learning |
| Dynamic topology and relevance learning SOM-based algorithm for image clustering | task | s |
| Dynamic-Net: Tuning the Objective Without Re-Training for Synthesis | task | s |
| DynaShare: | task | and Instance Conditioned Parameter Sharing for Multi-Task Learning |
| DynRefer: Delving into Region-level Multimodal | task | s via Dynamic Resolution |
| DYSON: Dynamic Feature Space Self-Organization for Online | task | -Free Class Incremental Learning |
| E-CNNet: Time-reassigned Multisynchrosqueezing transform-based deep learning framework for MI-BCI | task | classification |
| e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language | task | s |
| Each Performs Its Functions: | task | Decomposition and Feature Assignment for Audio-Visual Segmentation |
| Earth Observation | task | Representation Model Supporting Dynamic Demand for Flood Disaster Monitoring and Management, An |
| EarthNet2021: A large-scale dataset and challenge for Earth surface forecasting as a guided video prediction | task | |
| EarthVQANet: Multi- | task | visual question answering for remote sensing image understanding |
| EASUM: Enhancing Affective State Understanding through Joint Sentiment and Emotion Modeling for Multimodal | task | s |
| EdgeStereo: An Effective Multi- | task | Learning Network for Stereo Matching and Edge Detection |
| Editorial special issue on multiple- | task | learning for big data |
| EEG & Eye Tracking User Experiments for Spatial Memory | task | on Maps |
| EEG Correlates of Difficulty Levels in Dynamical Transitions of Simulated Flying and Mapping | task | s |
| EEG Method to Identify Image Preference With an Explicit/Implicit | task | Brain-Computer Interface, An |
| Efficient Multiple Loop Adjustment for Computer Vision | task | s |
| Effect of Color-Contrasting Shadows on a Dynamic 3-D Laparoscopic Surgical | task | , The |
| Effective extraction of ventricles and myocardium objects from cardiac magnetic resonance images with a multi- | task | learning U-Net |
| Effective Presentation Attack Detection Driven by Face Related | task | |
| Effective | task | Sampling Strategy Based on Category Generation for Fine-Grained Few-Shot Object Recognition, An |
| effectiveness of T5, GPT-2, and BERT on text-to-image generation | task | , The |
| Effectiveness of | task | -Level Parallelism for High-Level Vision, The |
| Effects of Algorithmic Transparency on User Experience and Physiological Responses in Affect-Aware | task | Adaptation |
| Effects of Augmented Reality on the Performance of Teleoperated Industrial Assembly | task | s in a Robotic Embodiment |
| Effects of Cast Shadows and Stereopsis on Performing Computer-Generated Spatial | task | s, The |
| Effects of Controlled Element Dynamics on Human Feedforward Behavior in Ramp-Tracking | task | s |
| Effects of Lead Time of Take-Over Request and Nondriving | task | s on Taking-Over Control of Automated Vehicles, The |
| Effects of Non-Driving Related | task | s During Self-Driving Mode |
| Effects of Preview on Human Control Behavior in Tracking | task | s With Various Controlled Elements |
| Effects of Preview Time in Manual Tracking | task | s |
| Effects of sound on visual realism perception and | task | performance |
| Effects of visual conflicts on 3D selection | task | performance in stereoscopic display environments |
| Efficient Action Detection in Untrimmed Videos via Multi- | task | Learning |
| Efficient adaptive density estimation per image pixel for the | task | of background subtraction |
| Efficient and Effective Transformer Decoder-based Framework for Multi- | task | Visual Grounding, An |
| Efficient and Effective Weight-Ensembling Mixture of Experts for Multi- | task | Model Merging |
| Efficient Anomaly Detection Using Self-Supervised Multi-Cue | task | s |
| Efficient Computation Sharing for Multi- | task | Visual Scene Understanding |
| Efficient Controllable Multi- | task | Architectures |
| efficient deep multi- | task | learning structure for COVID-19 disease, An |
| Efficient embedding of interprocessor communications in parallel implementations of intermediate level vision | task | s |
| Efficient Expansion and Gradient Based | task | Inference for Replay Free Incremental Learning |
| Efficient Feature Compression for the Object Tracking | task | |
| Efficient Graph-Based Spatio-Temporal Indexing Method for | task | -Oriented Multi-Modal Scene Data Organization, An |
| efficient multi-agent computational model for massively distribution of independent and heterogeneous | task | s, An |
| Efficient multi- | task | progressive learning for semantic segmentation and disparity estimation |
| Efficient Multilevel Eigensolvers with Applications to Data Analysis | task | s |
| Efficient Package Delivery | task | Assignment for Truck and High Capacity Drone |
| Efficient Partial | task | Offloading and Resource Allocation Scheme for Vehicular Edge Computing in a Dynamic Environment, An |
| Efficient Prediction of Model Transferability in Semantic Segmentation | task | s |
| Efficient Proposals: Scale Estimation for Object Proposals in Pedestrian Detection | task | s |
| Efficient Stitchable | task | Adaptation |
| efficient system for combining complementary kernels in complex visual categorization | task | s, An |
| Efficient | task | assignment in spatial crowdsourcing with worker and task privacy protection |
| Efficient | task | Grouping Through Sample-Wise Optimisation Landscape Analysis |
| Efficient | task | Implementation Modeling Framework with Multi-Stage Feature Selection and AutoML: A Case Study in Forest Fire Risk Prediction, An |
| Efficient | task | -Specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization |
| Efficient | task | -Specific Feature Re-Fusion for More Accurate Object Detection and Instance Segmentation |
| Efficient Transfer From Image-Based Large Multimodal Models to Video | task | s |
| Efficient Transfer Learning for Visual | task | s via Continuous Optimization of Prompts |
| Efficient use of parallelism in intermediate level vision | task | s |
| Efficiently utilizing complex-valued PolSAR image data via a multi- | task | deep learning framework |
| EGO-CH: Dataset and fundamental | task | s for visitors behavioral understanding using egocentric vision |
| Egocentric Video | task | Translation |
| EgoTV: Egocentric | task | Verification from Natural Language Task Descriptions |
| Elaborate multi- | task | subspace learning with discrete group constraint |
| ELBA: Learning by Asking for Embodied Visual Navigation and | task | Completion |
| Electromyography for Teleoperated | task | s in Weightlessness |
| Embedding | task | Structure for Action Detection |
| Embedding | task | s Into the Latent Space: Cross-Space Consistency for Multi-Dimensional Analysis in Echocardiography |
| Emerging Relation Network and | task | Embedding for Multi-Task Regression Problems |
| EmoComicNet: A multi- | task | model for comic emotion recognition |
| Emotion Recognition With Sequential Multi- | task | Learning Technique |
| Empathy Detection From Text, Audiovisual, Audio or Physiological Signals: A Systematic Review of | task | Formulations and Machine Learning Methods |
| empirical comparison of graph-based dimensionality reduction algorithms on facial expression recognition | task | s, An |
| Empirical Investigations on Benchmark | task | s for Automatic Image Annotation |
| EMT-NAS: Transferring architectural knowledge between | task | s from different datasets |
| Emu Edit: Precise Image Editing via Recognition and Generation | task | s |
| EMVP: An Edge-Assisted Multi- | task | Visual Perception System for Multi-Vehicle Scenarios |
| Encoding sparse and competitive structures among | task | s in multi-task learning |
| End-to-end interactive joint model: Clause-phrase multi- | task | learning for suicidal ideation cause extraction (SICE) in Chinese Weibo text |
| End-to-End Learned Scalable Multilayer Feature Compression for Machine Vision | task | s |
| End-to-end Multi-Modal Multi- | task | Vehicle Control for Self-Driving Cars with Visual Perceptions |
| End-to-End Multi- | task | Learning for Lung Nodule Segmentation and Diagnosis |
| End-to-End Multi- | task | Learning Model for Drivable Road Detection via Edge Refinement and Geometric Deformation, An |
| End-to-end Multi- | task | Learning of Missing Value Imputation and Forecasting in Time-Series Data |
| End-To-End Multi- | task | Learning With Attention |
| End-to-End Real-Time Obstacle Detection Network for Safe Self-Driving via Multi- | task | Learning |
| End-to-End | task | -Guided Refinement of Synthetic Images for Data Efficient Cerebral Microbleed Detection |
| EndoNet: A Deep Architecture for Recognition | task | s on Laparoscopic Videos |
| Energy Efficient | task | -Offloading for DT-Powered IRS-Aided Vehicular Communication Network Underlaying UAV |
| Energy-aware and robust | task | (re)assignment in embedded smart camera networks |
| Energy-efficient adaptive dependent | task | scheduling in cooperative vehicle-infrastructure system |
| Energy-Efficient Cooperative | task | Offloading in NOMA-Enabled Vehicular Fog Computing |
| Energy-Efficient Timely Truck Transportation for Geographically-Dispersed | task | s |
| Energy-transfer features and their application in the | task | of face detection |
| Engagement Detection with Multi- | task | Training in E-Learning Environments |
| EnGraf-Net: Multiple Granularity Branch Network with Fine-Coarse Graft Grained for Classification | task | |
| Enhance to read better: A Multi- | task | Adversarial Network for Handwritten Document Image Enhancement |
| Enhanced Multi- | task | Learning Architecture for Detecting Pedestrian at Far Distance |
| Enhanced representation and multi- | task | learning for image annotation |
| Enhanced | task | attention with adversarial learning for dynamic multi-task CNN |
| Enhancing abusive language detection: A domain-adapted approach leveraging BERT pre-training | task | s |
| Enhancing Cross- | task | Black-Box Transferability of Adversarial Examples With Dispersion Reduction |
| Enhancing Infrared Small Target Detection: A Saliency-Guided Multi- | task | Learning Approach |
| Enhancing Monocular Depth Estimation with Multi-Source Auxiliary | task | s |
| Enhancing Multi- | task | Learning with Attention Mechanisms |
| Enhancing | task | Offloading in IoV With a Two-Stage Algorithm Under Information Asymmetry |
| Enhancing Video Anomaly Understanding via Multi- | task | Instruction Tuning |
| Enriching visual feature representations for vision-language | task | s using spectral transforms |
| Enroll-to-Verify Approach for Cross- | task | Unseen Emotion Class Recognition, An |
| Ensemble methods for biclustering | task | s |
| Ensemble of Multi- | task | Learning Networks for Facial Expression Recognition In-the-wild with Learning from Synthetic Data |
| Ensemble Prototype Networks for Unsupervised Cross-Modal Hashing With Cross- | task | Consistency |
| Entropic Score to Rank Annotators for Crowdsourced Labeling | task | s, An |
| Equiangular Basis Vectors: A Novel Paradigm for Classification | task | s |
| Equivalence of Some Common Linear Feature Extraction Techniques for Appearance-Based Object Recognition | task | s |
| Error Detection in Egocentric Procedural | task | Videos |
| Error regulation strategies for Model Based visual servoing | task | s: Application to autonomous object grasping with Nao robot |
| Escape from Meadwyn 4: A cross-platform environment for collaborative navigation | task | s |
| Estimation receiver operating characteristic curve and ideal observers for combined detection/estimation | task | s |
| Evaluating Multi- | task | Learning for Multi-view Head-Pose Classification in Interactive Environments |
| Evaluating stereo vision and user tracking in mixed reality | task | s! |
| Evaluating synthetic pre-Training for handwriting processing | task | s |
| Evaluating Transferability in Retrieval | task | s: An Approach Using MMD and Kernel Methods |
| Evaluating Variable Autonomy for a Teleoperated Navigation | task | in a Habituated Environment |
| Evaluation of 3D virtual cursor offset techniques for navigation | task | s in a multi-display virtual environment |
| Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP | task | s |
| Evaluation of Edge Detection Algorithms Using a Structure from Motion | task | |
| Evaluation of Haptic and Visual Cues for Repulsive or Attractive Guidance in Nonholonomic Steering | task | s |
| Evaluation of the tactile detection response | task | in a laboratory test using a surrogate driving set-up |
| Evaluation of Time Series Distance Functions in the | task | of Detecting Remote Phenology Patterns |
| Evaluation of Visual Content Descriptors for Supporting Ad-Hoc Video Search | task | s at the Video Browser Showdown |
| Evaluation of Visual Speech Features for the | task | s of Speech and Speaker Recognition, An |
| EvDistill: Asynchronous Events to End- | task | Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge Distillation |
| Event-Based Visible and Infrared Fusion via Multi- | task | Collaboration |
| Evolution of Transferable and Self-Organized Communication Modules for Solving Multiple Swarm Robotics | task | s |
| ExaM: Unsupervised Concept-Based Representation Learning to Better Explain Models in Vision | task | s |
| Examination of Effectiveness of a Performed Procedural | task | Using Low-Cost Peripheral Devices in VR |
| Expanding Scope of the Stability Gap: Unveiling its Presence in Joint Incremental Learning of Homogeneous | task | s, The |
| Experiencing the Spirit of Place as a Design | task | : The Street of Hamra in the Heart of Beirut |
| Experimental Comparison of Non-Parametric Classifiers for Time-Constrained Classification | task | s, An |
| Experimental Results of Underwater Sound Speed Profile Inversion by Few-Shot Multi- | task | Learning |
| Expert Level Control of Ramp Metering Based on Multi- | task | Deep Reinforcement Learning |
| Expert Regularizers for | task | Specific Processing |
| Exploiting Proximity-Aware | task | s for Embodied Social Navigation |
| Exploiting Related and Unrelated | task | s for Hierarchical Metric Learning and Image Classification |
| Exploiting | task | and data parallelism for advanced video coding on hybrid CPU + GPU platforms |
| Exploring complementary information of self-supervised pretext | task | s for unsupervised video pre-training |
| Exploring Dual- | task | Correlation for Pose Guided Person Image Generation |
| Exploring Patch-wise Semantic Relation for Contrastive Learning in Image-to-Image Translation | task | s |
| Exploring Relational Context for Multi- | task | Dense Prediction |
| Exploring | task | Structure for Brain Tumor Segmentation From Multi-Modality MR Images |
| Exploring the diversity and invariance in yourself for visual pre-training | task | |
| Exploring Vision-Based Interfaces: How to Use Your Head in Dual Pointing | task | s |
| Exploring visual language models for driver gaze estimation: A | task | -based approach to debugging AI |
| Extracting discriminative features using | task | -oriented gaze maps measured from observers for personal attribute classification |
| Eye landmarks detection via two-level cascaded CNNs with multi- | task | learning |
| Eye Movement Analysis Algorithm for a Multielement Target Tracking | task | : Maximum Transition-Based Agglomerative Hierarchical Clustering, An |
| Eye-In-Hand Robotic | task | s in Uncalibrated Environments |
| EyeNet: A Multi- | task | Deep Network for Off-Axis Eye Gaze Estimation |
| Fabric Retrieval Based on Multi- | task | Learning |
| FaceHunter: A Multi- | task | Convolutional Neural Network Based Face Detector |
| Facial Action Unit Event Detection by Cascade of | task | s |
| Facial Action Unit Recognition in the Wild with Multi- | task | CNN Self-Training for the EmotioNet Challenge |
| Facial age estimation using Clustered Multi- | task | Support Vector Regression Machine |
| Facial Attributes Classification Using Multi- | task | Representation Learning |
| Facial Emotion Recognition with Noisy Multi- | task | Annotations |
| Facial event classification with | task | oriented dynamic Bayesian network |
| Facial expression recognition based on a multi- | task | global-local network |
| Facial Landmark Detection by Deep Multi- | task | Learning |
| Facial point detection using convolutional neural network transferred from a heterogeneous | task | |
| Facilitating Autonomous Driving | task | s With Large Language Models |
| Factors of Influence for Transfer Learning Across Diverse Appearance Domains and | task | Types |
| FADE: A | task | -Agnostic Upsampling Operator for Encoder-Decoder Architectures |
| FADE: Fusing the Assets of Decoder and Encoder for | task | -Agnostic Upsampling |
| Fair Federated Learning for Multi- | task | 6G NWDAF Network Anomaly Detection |
| Fair Representation: Guaranteeing Approximate Multiple Group Fairness for Unknown | task | s |
| FAME-ViL: Multi- | task | ing Vision-Language Model for Heterogeneous Fashion Tasks |
| FAR-AMTN: Attention Multi- | task | Network for Face Attribute Recognition |
| FArMARe: a Furniture-Aware Multi- | task | methodology for Recommending Apartments based on the user interests |
| Fast approximate kernel-based similarity search for image retrieval | task | |
| Fast Computation of Region Homogeneity with Application in a Surveillance | task | |
| Fast GraspNeXt: A Fast Self-Attention Neural Network Architecture for Multi- | task | Learning in Computer Vision Tasks for Robotic Grasping on the Edge |
| Fast Selection of Small and Precise Candidate Sets from Dictionaries for Text Correction | task | s |
| Fault Signal Perception of Nanofiber Sensor for 3D Human Motion Detection Using Multi- | task | Deep Learning |
| FCL-ViT: | task | -aware attention tuning for Continual Learning |
| Feature Extraction from Wavelet Coefficients for Pattern Recognition | task | s |
| Feature Learning for the Image Retrieval | task | |
| Feature Planning for Robust Execution of General Robot | task | s using Visual Servoing |
| Feature Selection for Multimedia Analysis by Sharing Information Among Multiple | task | s |
| Feature Selection in Regression | task | s Using Conditional Mutual Information |
| Feature-Guided Instance Mining and | task | -Aligned Focal Loss for Weakly Supervised Object Detection in Remote Sensing Images |
| Federated Learning Enabled Credit Priority | task | Processing for Transportation Big Data |
| Fedhca2: Towards Hetero-Client Federated Multi- | task | Learning |
| FedServ: Federated | task | Service in Fog-Enabled Internet of Vehicles |
| Few-Shot Class-Incremental Learning by Sampling Multi-Phase | task | s |
| Few-Shot Class-Incremental SAR Target Recognition Based on Dynamic | task | -Adaptive Classifier |
| Few-shot classification with | task | -adaptive semantic feature learning |
| Few-Shot High-Resolution Range Profile Ship Target Recognition Based on | task | -Specific Meta-Learning with Mixed Training and Meta Embedding |
| Few-Shot Image Classification Benchmarks are Too Far From Reality: Build Back Better with Semantic | task | Sampling |
| Few-Shot Learning of Compact Models via | task | -Specific Meta Distillation |
| Filter Distribution Templates in Convolutional Networks for Image Classification | task | s |
| Finding Non-uniform Quantization Schemes Using Multi- | task | Gaussian Processes |
| Finding Significant Points for a Handwritten Classification | task | |
| Finding | task | -Relevant Features for Few-Shot Learning by Category Traversal |
| Finding the Right Moment: Human-Assisted Trailer Creation via | task | Composition |
| Finding Visual | task | Vectors |
| Findings from shared | task | s on hate speech detection: Performance patterns for low-resource languages |
| Fine-Grain Batching-Based | task | Allocation Algorithm for Spatial Crowdsourcing, A |
| Fine-Grained Recognition in the Wild: A Multi- | task | Domain Adaptation Approach |
| FineRehab: A Multi-modality and Multi- | task | Dataset for Rehabilitation Analysis |
| First Attempt to Define Level of Details Based on Decision-making | task | s: Application to Underground Utility Network, A |
| Fish-Inspired | task | Allocation Algorithm for Multiple Unmanned Aerial Vehicles in Search and Rescue Missions |
| Fisher information and surrogate figures of merit for the | task | -based assessment of image quality |
| FishNet: Fish visual recognition with one stage multi- | task | learning |
| Flexible Clustered Multi- | task | Learning by Learning Representative Tasks |
| flexible ensemble-SVM for computer vision | task | s, A |
| FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation | task | s |
| FloorLevel-Net: Recognizing Floor-Level Lines With Height-Attention-Guided Multi- | task | Learning |
| Florence-2: Advancing a Unified Representation for a Variety of Vision | task | s |
| FMGNet: An efficient feature-multiplex group network for real-time vision | task | |
| Focus More on What? Guiding Multi- | task | Training for End-to-End Person Search |
| FocusFace: Multi- | task | Contrastive Learning for Masked Face Recognition |
| Forgetting to Remember: A Scalable Incremental Learning Framework for Cross- | task | Blind Image Quality Assessment |
| Formal Machine-Learning Approach to Generating Human-Machine Interfaces From | task | Models, A |
| Formulation of Parallel Image Processing | task | s |
| Forward Diffusion Guided Reconstruction as a Multi-Modal Multi- | task | Learning Scheme |
| FPGA Implementation of 3-Bit Quantized Multi- | task | CNN for Contour Detection and Disparity Estimation |
| Fragment-Based Knowledge Transfer for Multi- | task | Capacitated Vehicle Routing |
| Frame-Based System for Modelling and Executing Visual | task | s, A |
| framework and baseline results for the CLEF medical automatic annotation | task | , A |
| Framework for Implementing Multi-Sensor Robotic | task | s, A |
| Frank-Wolfe-based Multi- | task | Learning for Historical Document Restoration |
| Frequency domain | task | -adaptive network for restoring images with combined degradations |
| Frequency-Aware Feature Aggregation Network with Dual- | task | Consistency for RGB-T Salient Object Detection |
| From Easy to Difficult: A Self-Paced Multi- | task | Joint Sparse Representation Method |
| From Emotions to Action Units with Hidden and Semi-Hidden- | task | Learning |
| From Evaluation to Verification: Towards | task | -oriented Relevance Metrics for Pedestrian Detection in Safety-critical Domains |
| From Translation to Generative LLMs: Classification of Code-Mixed Affective | task | s |
| FSMT: Few-Shot Object Detection via Multi- | task | Decoupled |
| FULLER: Unified Multi-modality Multi- | task | 3D Perception via Multi-level Gradient Calibration |
| Fully Automated Multimodal MRI-Based Multi- | task | Learning for Glioma Segmentation and IDH Genotyping, A |
| Fully Automated Pipeline For Classification | task | s With An Application To Remote Sensing, A |
| Fully synthetic training for image restoration | task | s |
| Fully-Adaptive Feature Sharing in Multi- | task | Networks with Applications in Person Attribute Classification |
| Fusion Encoder with Multi- | task | Guidance for Cross-Modal Text-Image Retrieval in Remote Sensing, A |
| Fuzzy Color-Based Approach for Understanding Animated Movies Content in the Indexing | task | , A |
| g3D-LF: Generalizable 3D-Language Feature Fields for Embodied | task | s |
| GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream | task | s |
| Ganzzle: Reframing Jigsaw Puzzle Solving as a Retrieval | task | using a Generative Mental Image |
| Gaze estimation with semi-supervised eye landmark detection as an auxiliary | task | |
| General and | task | -oriented Video Segmentation |
| General End-to-End Method for Characterizing Neuropsychiatric Disorders using Free-Viewing Visual Scanning | task | s, A |
| Generalist YOLO: Towards Real-Time End-to-End Multi- | task | Visual Language Models |
| Generalized Face Anti-Spoofing via Multi- | task | Learning and One-Side Meta Triplet Loss |
| Generalized Fake Image Detection Method Based on Gated Hierarchical Multi- | task | Learning |
| generalized multi- | task | learning approach to stereo DSM filtering in urban areas, A |
| Generalized | task | -Driven Medical Image Quality Enhancement With Gradient Promotion |
| Generating an Interpretation Tree from a CAD Model for 3D-Object Recognition in Bin-Picking | task | s |
| Generating Erroneous Human Behavior From Strategic Knowledge in | task | Models and Evaluating Its Impact on System Safety With Model Checking |
| Generating Private Data Surrogates for Vision Related | task | s |
| Generating | task | -Oriented Interactions of Service Robots |
| Generating Visual Sensing Strategies in Assembly | task | s |
| Generative Adversarial Multi- | task | Learning for Face Sketch Synthesis and Recognition |
| Generative Adversarial Network With Robust Discriminator Through Multi- | task | Learning for Low-Dose CT Denoising |
| Generative Causality-Driven Network for Graph Multi- | task | Learning |
| Generic Object Crowd Tracking by Multi- | task | Learning |
| Genetic | task | s Planning in Image Processing: Towards a Minimization of Prior Information |
| GenSumm: A Joint Framework for Multi- | task | Tweet Classification and Summarization Using Sentiment Analysis and Generative Modelling |
| Geodesic-Aligned Gradient Projection for Continual | task | Learning |
| Geographic Named Entity Matching and Evaluation Recommendation Using Multi-Objective | task | s: A Study Integrating a Large Language Model (LLM) and Retrieval-Augmented Generation (RAG) |
| Geometric backtracking for combined | task | and motion planning in robotic systems |
| GKEAL: Gaussian Kernel Embedded Analytic Learning for Few-Shot Class Incremental | task | |
| Glacier Extraction from Cloudy Satellite Images Using a Multi- | task | Generative Adversarial Network Leveraging Transformer-Based Backbones |
| GLeaD: Improving GANs with A Generator-Leading | task | |
| global probabilistic framework for the foreground, background and shadow classification | task | , A |
| GMC: A general framework of multi-stage context learning and utilization for visual detection | task | s |
| GMSS: Graph-Based Multi- | task | Self-Supervised Learning for EEG Emotion Recognition |
| Goal-Driven Human Motion Synthesis in Diverse | task | s |
| Goals, | task | s, and Bonds: Toward the Computational Assessment of Therapist Versus Client Perception of Working Alliance |
| Going Beyond Multi- | task | Dense Prediction with Synergy Embedding Models |
| Good Data Augmentation Policy is not All You Need: A Multi- | task | Learning Perspective, A |
| Gradient-based class weighting for unsupervised domain adaptation in dense prediction visual | task | s |
| GradMix: Multi-source Transfer across Domains and | task | s |
| GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and | task | -Oriented Feature |
| Granularity-Aware Adaptation for Image Retrieval Over Multiple | task | s |
| Graph Convolutional Reinforcement Learning-Guided Joint Trajectory Optimization and | task | Offloading for Aerial Edge Computing |
| Graph-based neural network models with multiple self-supervised auxiliary | task | s |
| GridDehazeNet+: An Enhanced Multi-Scale Network with Intra- | task | Knowledge Transfer for Single Image Dehazing |
| Grouped Multi- | task | CNN for Facial Attribute Recognition |
| GROUSE: A | task | and Model Agnostic Wavelet-Driven Framework for Medical Imaging |
| GRS: Generating Robotic Simulation | task | s from Real-World Images |
| Guest Editorial: Learning from limited annotations for computer vision | task | s |
| Guiding a robot by visual feedback in assembling | task | s |
| HA-Net: Hierarchical Attention Network Based on Multi- | task | Learning for Ciliary Muscle Segmentation in AS-OCT |
| HAIC-NET: Semi-supervised OCTA vessel segmentation with self-supervised pretext | task | and dual consistency training |
| HammerDrive: A | task | -Aware Driving Visual Attention Model |
| Hand Image Understanding via Deep Multi- | task | Learning |
| Hand-Dorsa Vein Recognition Based on | task | -Specific Cross-Convolutional-Layer Pooling |
| HANDEY: A Robot | task | Planner |
| Handling Complex Events in Surveillance | task | s |
| Handling of | task | Hierarchies on the Nepomuk Social Semantic Desktop |
| Haptic Solution to Assist Visually Impaired in Mobility | task | s, A |
| HD-MTL: Hierarchical Deep Multi- | task | Learning for Large-Scale Visual Recognition |
| Height aware understanding of remote sensing images based on cross- | task | interaction |
| Helpful or Harmful: Inter- | task | Association in Continual Learning |
| Henet: Hybrid Encoding for End-to-end Multi- | task | 3D Perception from Multi-view Cameras |
| Heterogeneous Face Attribute Estimation: A Deep Multi- | task | Learning Approach |
| Heterogeneous face detection based on multi- | task | cascaded convolutional neural network |
| Heterogeneous Multi- | task | Learning for Human Pose Estimation with Deep Convolutional Neural Network |
| Heuristic Distributed | task | Allocation Method for Multivehicle Multitask Problems and Its Application to Search and Rescue Scenario, A |
| HF-UNet: Learning Hierarchically Inter- | task | Relevance in Multi-Task U-Net for Accurate Prostate Segmentation in CT Images |
| Hier-EgoPack: Hierarchical Egocentric Video Understanding With Diverse | task | Perspectives |
| Hierarchical Approach for Multi- | task | Logistic Regression, A |
| Hierarchical Clustering Multi- | task | Learning for Joint Human Action Grouping and Recognition |
| Hierarchical Conditional Semi-Paired Image-to-Image Translation for Multi- | task | Image Defect Correction on Shopping Websites |
| Hierarchical Diffusion Policy for Kinematics-Aware Multi- | task | Robotic Manipulation |
| Hierarchical Gaussian Processes model for multi- | task | learning |
| Hierarchical Knowledge Prompt Tuning for Multi- | task | Test-Time Adaptation |
| Hierarchical learning of multi- | task | sparse metrics for large-scale image classification |
| Hierarchical Modeling for | task | Recognition and Action Segmentation in Weakly-Labeled Instructional Videos |
| Hierarchical Multi- | task | Approach to Gastrointestinal Image Analysis, A |
| Hierarchical Multi- | task | Learning via Task Affinity Groupings |
| Hierarchical Multi- | task | Network For Race, Gender and Facial Attractiveness Recognition |
| Hierarchical Multi- | task | Restoration Network for Old Photo Enhancement |
| Hierarchical Prompt Learning for Multi- | task | Learning |
| Hierarchical Representation Network With Auxiliary | task | s for Video Captioning and Video Question Answering |
| Hierarchical Shared Encoder With | task | -Specific Transformer Layer Selection for Emotion-Cause Pair Extraction |
| Hierarchical | task | and Motion Planning in the Now |
| Hierarchical Vision Architecture for Robotic Manipulation | task | s, A |
| Hierarchical-Learning-Based | task | Assignment for Heterogeneous Multi-AUV-UG Collaborative System to Collect Data From Underwater Sensors |
| HierarQ: | task | -Aware Hierarchical Q-Former for Enhanced Video Understanding |
| Hiface: Hybrid | task | Learning for Face Reconstruction from Single Image |
| High Cognitive Load Assessment in Drivers Through Wireless Electroencephalography and the Validation of a Modified N-Back | task | |
| High-low level | task | combination for object detection in foggy weather conditions |
| Hinge-Wasserstein: Estimating Multimodal Aleatoric Uncertainty in Regression | task | s |
| HitFusion: Infrared and Visible Image Fusion for High-Level Vision | task | s Using Transformer |
| hmOS: An Extensible Platform for | task | -Oriented Human-Machine Computing |
| Holistic Evaluation of | task | View Format for Training a Simulated Robot-Assisted EOD Task, A |
| Hot-started NAS for | task | -specific Embedded Applications |
| How do image complexity, | task | demands and looking biases influence human gaze behavior? |
| How Far Can a 1-pixel Camera Go? Solving Vision | task | s Using Photoreceptors and Computationally Designed Visual Morphology |
| How interaction methods affect image segmentation: User experience in the | task | |
| How Much More Data Do I Need? Estimating Requirements for Downstream | task | s |
| How Optimal Depth Cue Integration Depends on the | task | |
| How Resource Demands of Nondriving-Related | task | s and Engagement Time Affect Drivers' Physiological Response and Takeover Performance in Conditional Automated Driving |
| How to involve structural modeling for cartographic object recognition | task | s in high-resolution satellite images? |
| How Trustworthy are Performance Evaluations for Basic Vision | task | s? |
| How Useful Is Self-Supervised Pretraining for Visual | task | s? |
| HSP-MFL: A High-level Semantic Property driven Multi- | task | Feature Learning Network for unsupervised person Re-ID |
| Htad: A Home- | task | s Activities Dataset with Wrist-accelerometer and Audio Features |
| HTD: Heterogeneous | task | Decoupling for Two-Stage Object Detection |
| Hulk: A Universal Knowledge Translator for Human-Centric | task | s |
| Human Comfort Index Estimation in Industrial Human-Robot Collaboration | task | |
| Human Visual System vs Convolution Neural Networks in food recognition | task | : An empirical comparison |
| Human-Robot Interaction During Virtual Reality Mediated Teleoperation: How Environment Information Affects Spatial | task | Performance and Operator Situation Awareness |
| Human-Robot Interaction Video Sequencing | task | (HRIVST) for Robot's Behavior Legibility |
| Hybrid Approach for Approximating the Ideal Observer for Joint Signal Detection and Estimation | task | s by Use of Supervised Learning and Markov-Chain Monte Carlo Methods, A |
| Hybrid Mapping for the Assistance of Teleoperated Grasping | task | s |
| Hybrid Neural Network System for Pattern Classification | task | s with Missing Features, A |
| Hybrid Single Input and Multiple Output Method For Compressing Features Towards Machine Vision | task | s |
| Hybrid | task | Cascade for Instance Segmentation |
| Hybrid | task | Cascade-Based Building Extraction Method in Remote Sensing Imagery |
| HyperCoil-Recon: A Hypernetwork-based Adaptive Coil Configuration | task | Switching Network for MRI Reconstruction |
| HyperCon: Image-To-Video Model Transfer for Video-To-Video Translation | task | s |
| HyperFace: A Deep Multi- | task | Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition |
| HyperGCN: Interpreting Hyperscanning EEG Signals for Common Multi- | task | Classification Using Graph Convolutional Networks |
| Hypergraph+: An Improved Hypergraph-Based | task | -Scheduling Algorithm for Massive Spatial Data Processing on Master-Slave Platforms |
| Hyperspectral image destriping and denoising from a | task | decomposition view |
| Hyperspectral Image Super-Resolution with RGB Image Super-Resolution as an Auxiliary | task | |
| Hyperspectral Target Detection via Adaptive Joint Sparse Representation and Multi- | task | Learning with Locality Information |
| HyperSTAR: | task | -Aware Hyperparameter Recommendation for Training and Compression |
| HyperSTAR: | task | -Aware Hyperparameters for Deep Networks |
| HyperTaFOR: | task | -Adaptive Few-Shot Open-Set Recognition with Spatial-Spectral Selective Transformer for Hyperspectral Imagery |
| Hypervolume Under the ROC Hypersurface of Near-Guessing and Near-Perfect Observers in N-Class Classification | task | s, The |
| I can't believe there's no images!: Learning Visual | task | s Using Only Language Supervision |
| Iconic Classification Scheme for Video-Based Traffic Sensor | task | s, An |
| Ideal observer analysis for | task | normalization of pattern classifier performance applied to EEG and fMRI data |
| Identification of Driver State for Lane-Keeping | task | s |
| Identifying and Mitigating Spurious Correlation in Multi- | task | Learning |
| Identifying Auxiliary or Adversarial | task | s Using Necessary Condition Analysis for Adversarial Multi-task Video Understanding |
| Identifying Patterns for Convolutional Neural Networks in Regression | task | s to Make Specific Predictions via Genetic Algorithms |
| Identifying Stable EEG Patterns in Manipulation | task | for Negative Emotion Recognition |
| Identifying Successful Motor | task | Completion via Motion-Based Performance Metrics |
| Identifying the best data-driven feature selection method for boosting reproducibility in classification | task | s |
| IEA Wind | task | 32: Wind Lidar Identifying and Mitigating Barriers to the Adoption of Wind Lidar |
| IIMT-net: Poly-1 weights balanced multi- | task | network for semantic segmentation and depth estimation using interactive information |
| Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language | task | s |
| Image Change Captioning by Learning from an Auxiliary | task | |
| Image compression based on | task | -specific information |
| Image Cosegmentation via Multi- | task | Learning |
| Image Pattern Similarity Index and Its Application to | task | -Specific Transfer Learning |
| Image Quality Feedback-based Adaptive Video Definition Improvement For The Space Manipulation | task | |
| Image Reflection Removal via Contextual Feature Fusion Pyramid and | task | -Driven Regularization |
| Image segmentation fusion using weakly supervised trace-norm multi- | task | learning method |
| image similarity descriptor for classification | task | s, An |
| Image Splicing Localization using a Multi- | task | Fully Convolutional Network (MFCN) |
| Image-Based Robot | task | Planning and Control Using a Compact Visual Representation |
| Image-to-Images Translation for Multi- | task | Organ Segmentation and Bone Suppression in Chest X-Ray Radiography |
| ImageCLEF Medical Retrieval | task | at ICPR 2010: Information Fusion to Combine Visual and Textual Information, The |
| ImageCLEF Medical Retrieval | task | at ICPR 2010: Information Fusion, The |
| ImageCLEF@ICPR Contest: Challenges, Methodologies and Results of the Photo Annotation | task | |
| Imitation of Assembly | task | s for Realizing Dexterous Manipulation |
| IMOFC: Identity-Level Metric Optimized Feature Compression for Identification | task | s |
| Impact of annotation dimensionality under variable | task | complexity in remote guidance |
| Impact of Foveated Rendering on Procedural | task | Training |
| Impact of Video Compression Artifacts on Fisheye Camera Visual Perception | task | s |
| Impact of Viewing Distance on | task | Performance and Its Properties |
| Impact on Reader Performance for Lesion-Detection/Localization | task | s of Anatomical Priors in SPECT Reconstruction |
| Implement contour following | task | of objects with unknown geometric models by using combination of two visual servoing techniques |
| Improve object detection via a multi-feature and multi- | task | CNN model |
| Improved ASD classification using dynamic functional connectivity and multi- | task | feature selection |
| Improved Gaussian Mixture Model for the | task | of Object Tracking |
| Improved high dynamic range imaging using multi-scale feature flows balanced between | task | -orientedness and accuracy |
| Improved Method for Human Activity Detection with High-Resolution Images by Fusing Pooling Enhancement and Multi- | task | Learning, An |
| Improved Model for Segmentation and Recognition of Fine-Grained Activities with Application to Surgical Training | task | s, An |
| Improved Noise and Attack Robustness for Semantic Segmentation by Using Multi- | task | Training with Self-Supervised Depth Estimation |
| Improving Beam Alignment Accuracy in mmWave Communication Systems With Auxiliary | task | s |
| Improving Bird's Eye View Semantic Segmentation by | task | Decomposition |
| Improving End-to-End Text Image Translation From the Auxiliary Text Translation | task | |
| Improving Few-Shot Learning Through Multi- | task | Representation Learning Theory |
| Improving Few-Shot Learning using Composite Rotation based Auxiliary | task | |
| Improving Fusion with Margin-Derived Confidence in Biometric Authentication | task | s |
| Improving Generalization Ability of Deep Neural Networks for Visual Recognition | task | s |
| Improving Model Accuracy for Imbalanced Image Classification | task | s by Adding a Final Batch Normalization Layer: An Empirical Study |
| Improving Multiple Machine Vision | task | s in the Compressed Domain |
| Improving Multiview Face Detection with Multi- | task | Deep Convolutional Neural Networks |
| Improving Performances of MSER Features in Matching and Retrieval | task | s |
| Improving radial lens distortion correction with multi- | task | learning |
| Improving Robotic Grasping on Monocular Images Via Multi- | task | Learning and Positional Loss |
| Improving Self-Supervised Learning for Out-Of-Distribution | task | via Auxiliary Classifier |
| Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi- | task | Learning |
| Improving texture description in remote sensing image multi-scale classification | task | s by using visual words |
| In Defense of the Learning Without Forgetting for | task | Incremental Learning |
| In-Camera Data Stream Processing System for Defect Detection in Web Inspection | task | s, An |
| Inconsistency-Based Multi- | task | Cooperative Learning for Emotion Recognition |
| Incorporating Human Contrast Sensitivity in Model Observers for Detection | task | s |
| Incorporating Lane Estimation as Context Source in Pedestrian Recognition | task | |
| Incorporating Self-attention Mechanism and Multi- | task | Learning into Scene Text Detection |
| Incorporating | task | Progress Knowledge for Subgoal Generation in Robotic Manipulation through Image Edits |
| Incremental Learning of | task | s From User Demonstrations, Past Experiences, and Vocal Comments |
| Incremental | task | Learning with Incremental Rank Updates |
| Independent Component Alignment for Multi- | task | Learning |
| Indoor Scene Recognition using | task | and Saliency-driven Feature Pooling |
| Inertial BSN-Based Characterization and Automatic UPDRS Evaluation of the Gait | task | of Parkinsonians |
| Inferring | task | s and Fluents in Videos by Learning Causal Relations |
| Influence of | task | and Scene Content on Subjective Video Quality |
| influence of the visualization | task | on the simulator sickness symptoms: A comparative SSQ study on 3DTV and 3D immersive glasses, The |
| Influencing Human Escape Maneuvers With Perceptual Cues in the Presence of a Visual | task | |
| Information Fusion in Visual- | task | Inference |
| Information-Theoretic Approach to Transferability in | task | Transfer Learning, An |
| Information-Theoretic Method to Automatic Shortcut Avoidance and Domain Generalization for Dense Prediction | task | s, An |
| Informationally Decentralized System Resource Management for Multiple Multimedia | task | s |
| InfraParis: A multi-modal and multi- | task | autonomous driving dataset |
| Infrared and Visible Image Fusion: From Data Compatibility to | task | Adaption |
| Infrared Cephalic-vein to Assist Blood Extraction | task | s: Automatic Projection And Recognition |
| Infrared Small Target Detection Based on Saliency Guided Multi- | task | Learning |
| Instance-Aware Multi- | task | Learning for Nuclei Segmentation |
| Instance-Aware Semantic Segmentation via Multi- | task | Network Cascades |
| Instance-Based Video Search via Multi- | task | Retrieval and Re-Ranking |
| Instruct-ReID: A Multi-Purpose Person Re-Identification | task | with Instructions |
| InstructDiffusion: A Generalist Modeling Interface for Vision | task | s |
| Integrating multi-camera tracking into a dynamic | task | allocation system for smart cameras |
| Integrating Sensing, | task | Planning, and Execution for Robotic Assembly |
| Integrating Vision and Touch for Object Recognition | task | s |
| Integration of Multi-Source Landslide Disaster Data Based on Flink Framework and APSO Load Balancing | task | Scheduling |
| Integration of Physical and Cognitive Human Models to Simulate Driving With a Secondary In-Vehicle | task | |
| Intelligent | task | Offloading for Heterogeneous V2X Communications |
| intelligent visual | task | system for lateral skull X-ray images, An |
| Inter- | task | Association Critic for Cross-Resolution Person Re-Identification |
| Interactions Between Threat and Executive Control in a Virtual Reality Stroop | task | |
| Interactive intelligent agents with creative minds: Experiments with mobile robots in cooperating | task | s by using machine learning |
| Intermediate-level vision | task | s on a memory array architecture |
| InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic | task | s |
| Interpretable Multi- | task | Prediction Neural Network for Autonomous Vehicles |
| Interpretable | task | -inspired Adaptive Filter Pruning for Neural Networks Under Multiple Constraints |
| Interpretation of Spatial Language in a Map Navigation | task | |
| Interpreting a Dynamic and Uncertain World: | task | -Based Control |
| Interpreting mechanisms of prediction for skin cancer diagnosis using multi- | task | learning |
| Intra- | task | Mutual Attention-Based Vision Transformer for Few-Shot Learning |
| Inversed Pyramid Network with Spatial-adapted and | task | -oriented Tuning for few-shot learning |
| Inverted Pyramid Multi- | task | Transformer for Dense Scene Understanding |
| Investigating Customization Strategies and Convergence Behaviors of | task | -Specific ADMM |
| Investigating EEG Microstate Analysis in Cognitive Software Engineering | task | s: A Systematic Mapping Study and Taxonomy |
| Investigating one-eyed and stereo cursors for 3D pointing | task | s |
| Investigating | task | -Driven Latent Feasibility for Nonconvex Image Modeling |
| Investigating the Best Performing | task | Conditions of a Multi-Tasking Learning Model in Healthcare Using Convolutional Neural Networks: Evidence from a Parkinson's Disease Database |
| Investigating Uncertainty Weighting for Multi- | task | Learning: Insights and Analytical Alternative |
| Investigation of Three Potential Stress Inducement | task | s During On-Road Driving |
| InvPT++: Inverted Pyramid Multi- | task | Transformer for Visual Scene Understanding |
| IOU-enhanced Attention for End-to-end | task | Specific Object Detection |
| iPCa-Former: A Multi- | task | Transformer Framework for Perceiving Incidental Prostate Cancer |
| Is Arabic text categorization a solved | task | ? |
| Is image super-resolution helpful for other vision | task | s? |
| Is Meta-Learning Always Necessary?: A Practical ML Framework Solving Novel | task | s at Large-scale Car Sharing Platform |
| Is overfeat useful for image-based surface defect classification | task | s? |
| ISDM at ImageCLEF 2010 Fusion | task | |
| Isogeometric finite-elements methods and variational reconstruction | task | s in vision: A perfect match |
| It's all about habits: Exploiting multi- | task | clustering for activities of daily living analysis |
| iTAML: An Incremental | task | -Agnostic Meta-learning Approach |
| Iterative Learning Distributed Model Predictive Control for Autonomous Vehicle Platoons With Applications to Repetitive | task | s |
| IUR-Net: A Multi-Stage Framework for Label Refinement | task | s in Noisy Remote Sensing Samples |
| Jack of All | task | s, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model |
| JARVIS-1: Open-World Multi- | task | Agents With Memory-Augmented Multimodal Language Models |
| Joint airport runway segmentation and line detection via multi- | task | learning for intelligent visual navigation |
| Joint Classification and Trajectory Regression of Online Handwriting using a Multi- | task | Learning Approach |
| Joint Head Pose Estimation with Multi- | task | Cascaded Convolutional Networks for Face Alignment |
| Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi- | task | Network |
| Joint Multi-Patch and Multi- | task | CNNs for Robust Face Recognition |
| Joint Multi- | task | CNN for Cross-Age Face Recognition, A |
| Joint Optimization of | task | Offloading and Resource Allocation for UAV-Assisted Edge Computing: A Stackelberg Bilayer Game Approach |
| Joint Partial Offloading and Resource Allocation for Vehicular Federated Learning | task | s |
| Joint Prediction of Sea Clutter Amplitude Distribution Based on a One-Dimensional Convolutional Neural Network with Multi- | task | Learning |
| Joint Scheduling of Causal Prompts and | task | s for Multi-Task Learning |
| Joint Service Placement and | task | Offloading in Vehicle-Edge-Cloud Collaborative Networks |
| Joint Sparse and Low-Rank Multi- | task | Learning with Extended Multi-Attribute Profile for Hyperspectral Target Detection |
| Joint Sparse Optical Flow Estimation and Keypoint Detection via Dual- | task | Imperative Learning |
| Joint Sparse Representation of Brain Activity Patterns in Multi- | task | fMRI Data |
| Joint spatial and temporal structure learning for | task | based control |
| Joint Spectrum Sharing and V2V/V2I | task | Offloading for Vehicular Edge Computing Networks Based on Coalition Formation Game |
| Joint | task | Offloading and Resource Allocation for Fog-Based Intelligent Transportation Systems: A UAV-Enabled Multi-Hop Collaboration Paradigm |
| Joint | task | Offloading and Resource Allocation for Vehicular Edge Computing Based on V2I and V2V Modes |
| Joint | task | Offloading, Resource Allocation, and Security Assurance for Mobile Edge Computing-Enabled UAV-Assisted VANETs |
| Joint | task | -Recursive Learning for RGB-D Scene Understanding |
| Joint | task | -Recursive Learning for Semantic Segmentation and Depth Estimation |
| Joint Video Summarization and Moment Localization by Cross- | task | Sample Transfer |
| Joint-Confidence-Guided Multi- | task | Learning for 3D Reconstruction and Understanding From Monocular Camera |
| Joint- | task | Regularization for Partially Labeled Multi-Task Learning |
| Jointly Recognizing Object Fluents and | task | s in Egocentric Videos |
| JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds With Multi- | task | Pointwise Networks and Multi-Value Conditional Random Fields |
| JSPNet: Learning joint semantic and instance segmentation of point clouds via feature self-similarity and cross- | task | probability |
| Knowing Where I Am: Exploiting Multi- | task | Learning for Multi-view Indoor Image-based Localization |
| Knowledge Base Driven | task | -Oriented Image Semantic Communication Scheme, A |
| Knowledge-Guided Multi- | task | Network for Remote Sensing Imagery |
| KPTFusion: Knowledge Prior-based | task | -Driven Multimodal Image Fusion |
| KrishnaCam: Using a longitudinal, single-person, egocentric dataset for scene understanding | task | s |
| KSM: Fast Multiple | task | Adaption via Kernel-wise Soft Mask Learning |
| L2,1-l1 regularized nonlinear multi- | task | representation learning based cognitive performance prediction of Alzheimer's disease |
| Language Adaptive Weight Generation for Multi- | task | Visual Grounding |
| Language Features Matter: Effective Language Representations for Vision-Language | task | s |
| Large Margin Multi-Modal Multi- | task | Feature Extraction for Image Classification |
| Large Vision-Language Models are Generalist Solvers For Pathology | task | s |
| Large-scale Annotated Mechanical Components Benchmark for Classification and Retrieval | task | s with Deep Neural Networks, A |
| Large-Scale Gaussian Process Inference with Generalized Histogram Intersection Kernels for Visual Recognition | task | s |
| Large-Scale Multiobjective Vehicle | task | Offloading Optimization Based on Cloud-Edge-End Collaboration for 6G Enabled Transport Systems |
| Large-Scale Rice Mapping Using Multi- | task | Spatiotemporal Deep Learning and Sentinel-1 SAR Time Series |
| Large-small model collaboration for medical visual question answering with | task | aware mixture of experts and relation knowledge distillation |
| Latent | task | Adaptation with Large-Scale Hierarchies |
| Latent-Space Scalability for Multi- | task | Collaborative Intelligence |
| LCASAFormer: Cross-attention enhanced backbone network for 3D point cloud | task | s |
| LD-Pruner: Efficient Pruning of Latent Diffusion Models using | task | -Agnostic Insights |
| Learn2Reg: Comprehensive Multi- | task | Medical Image Registration Challenge, Dataset and Evaluation in the Era of Deep Learning |
| Learned Hybrid Video Coding for Human Perception and Multiple Machine Vision | task | s |
| Learning a | task | -Specific Descriptor for Robust Matching of 3D Point Clouds |
| Learning Across | task | s and Domains |
| Learning Across | task | s for Zero-Shot Domain Adaptation From a Single Source Domain |
| Learning an Evolved Mixture Model for | task | -Free Continual Learning |
| Learning by Correction: Efficient Tuning | task | for Zero-Shot Generative Vision-Language Reasoning |
| Learning by Watching: Extracting Reusable | task | Knowledge from Visual Observation of Human Performance |
| Learning cross- | task | relations for panoptic driving perception |
| Learning Deep Features for Multiple Object Tracking by Using a Multi- | task | Learning Strategy |
| Learning deep features for | task | -independent EEG-based biometric verification |
| Learning Everyday Manipulation | task | s from Observation |
| Learning Frequency-Specific Quantization Scaling in VVC for Standard-Compliant | task | -Driven Image Coding |
| Learning from History: | task | -agnostic Model Contrastive Learning for Image Restoration |
| Learning From Visual Demonstrations via Replayed | task | -Contrastive Model-Agnostic Meta-Learning |
| Learning Geographically Distributed Data for Multiple | task | s Using Generative Adversarial Networks |
| Learning Good Features to Transfer Across | task | s and Domains |
| Learning Human Priors for | task | -Constrained Grasping |
| Learning Instance and | task | -Aware Dynamic Kernels for Few-Shot Learning |
| Learning Light Field Synthesis with Multi-Plane Images: Scene Encoding as a Recurrent Segmentation | task | |
| Learning Local Features by Jointly Semantic-Guided and | task | Rewards |
| Learning Multi- | task | Correlation Particle Filters for Visual Tracking |
| Learning Multi- | task | Target-Specific Correlation Filters for Robust Tracking |
| Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval | task | s |
| Learning Multiple Dense Prediction | task | s from Partially Annotated Data |
| Learning Multiple Pixelwise | task | s Based on Loss Scale Balancing |
| Learning multiple visual | task | s while discovering their structure |
| Learning On-Road Visual Control for Self-Driving Vehicles With Auxiliary | task | s |
| Learning open loop control of complex motor | task | s |
| Learning Optimized Low-Light Image Enhancement for Edge Vision | task | s |
| Learning robot | task | s with loops from experiences to enhance robot adaptability |
| Learning Scene Structure Guidance via Cross- | task | Knowledge Transfer for Single Depth Super-Resolution |
| Learning subsequential transducers for pattern recognition interpretation | task | s |
| Learning | task | -Optimal Registration Cost Functions for Localizing Cytoarchitecture and Function in the Cerebral Cortex |
| Learning | task | -Preferred Inference Routes for Gradient De-Conflict in Multi-Output DNNs |
| Learning | task | -Specific Generalized Convolutions in the Permutohedral Lattice |
| Learning | task | -Specific Object Recognition and Scene Understanding |
| Learning temporal structure for | task | based control |
| Learning to Forget for Meta-Learning via | task | -and-Layer-Wise Attenuation |
| Learning to Learn and Remember Super Long Multi-Domain | task | Sequence |
| Learning to Learn | task | -Adaptive Hyperparameters for Few-Shot Learning |
| Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown | task | s |
| Learning to Perform Visual | task | s from Human Demonstrations |
| Learning to Quantize Deep Networks by Optimizing Quantization Intervals With | task | Loss |
| Learning to Resize Images for Computer Vision | task | s |
| Learning to Scale Multilingual Representations for Vision-Language | task | s |
| Learning to segment clustered amoeboid cells from brightfield microscopy via multi- | task | learning with adaptive weight selection |
| Learning to Share Latent | task | s for Action Recognition |
| Learning to Transfer: Transferring Latent | task | Structures and Its Application to Person-Specific Facial Action Unit Detection |
| Learning to Translate Between Real World and Simulated 3D Sensors While Transferring | task | Models |
| Learning Trajectory-Word Alignments for Video-Language | task | s |
| Learning Two-Branch Neural Networks for Image-Text Matching | task | s |
| Learning with Privileged | task | s |
| Learning With Style: Continual Semantic Segmentation Across | task | s and Domains |
| Learning-Based Intent-Aware | task | Offloading for Air-Ground Integrated Vehicular Edge Computing |
| Lemma: A Multi-view Dataset for Learning Multi-agent Multi- | task | Activities |
| LeSkill: Structured Skill Learning for Long-Horizon Robotic Manipulation | task | s |
| Less is More: Efficient Model Merging with Binary | task | Switch |
| Less is More: Reducing | task | and Model Complexity for 3D Point Cloud Semantic Segmentation |
| Let me show you how it's done: Cross-modal knowledge distillation as pretext | task | for semantic segmentation |
| Let Them Choose What They Want: A Multi- | task | CNN Architecture Leveraging Mid-Level Deep Representations for Face Attribute Classification |
| Leveraging Auxiliary | task | s with Affinity Learning for Weakly Supervised Semantic Segmentation |
| Leveraging Crowdsourced Data for Creating Temporal Segmentation Ground Truths of Subjective | task | s |
| Leveraging Deep Reinforcement Learning for Reaching Robotic | task | s |
| Leveraging Heterogeneous Auxiliary | task | s to Assist Crowd Counting |
| Leveraging multiple | task | s to regularize fine-grained classification |
| Leveraging Pre-trained Multi- | task | Deep Models for Trustworthy Facial Analysis in Affective Behaviour Analysis in-the-Wild |
| Leveraging Road Area Semantic Segmentation with Auxiliary Steering | task | |
| Leveraging | task | -Specific Pre-Training to Reason across Images and Videos |
| Leveraging Vision Language Models for Specialized Agricultural | task | s |
| LiDAR-BEVMTN: Real-Time LiDAR Bird's-Eye View Multi- | task | Perception Network for Autonomous Driving |
| Lifelong CycleGAN for continual multi- | task | image restoration |
| Lifelong Learning for Text Steganalysis Based on Chronological | task | Sequence |
| Lifelong Learning of | task | -Parameter Relationships for Knowledge Transfer |
| Light Dual- | task | Neural Network for Haze Removal, A |
| Line Smoothing Method of Hand-Drawn Strokes Using Adaptive Moving Average for Illustration Tracing | task | s, A |
| Linear Multi- | task | Learning for Predicting Soil Properties Using Field Spectroscopy |
| Linguistically-aware attention for reducing the semantic gap in vision-language | task | s |
| LiSD: An Efficient Multi- | task | Learning Framework for Lidar Segmentation and Detection |
| Load Balancing Requirement In Parallel Implementations Of Image Feature Extraction | task | s |
| Local Knowledge and Professional Background Have a Minimal Impact on Volunteer Citizen Science Performance in a Land-Cover Classification | task | |
| Locality-Constrained Multi- | task | Joint Sparse Representation for Image Classification |
| Localize Me Anywhere, Anytime: A Multi- | task | Point-Retrieval Approach |
| Location-Aware Reliable | task | Cooperative-Computation Scheme Under Fog Computing-Based IoVs |
| Log-polar mapping template design: From | task | -level requirements to geometry parameters |
| LoME: LoRA-Driven Multimodal Extractor for RGB-X Vision | task | s |
| Low-Lightgan: Low-Light Enhancement Via Advanced Generative Adversarial Network With | task | -Driven Training |
| lower bound on the sample size needed to perform a significant frequent pattern mining | task | , A |
| LTA-PCS: Learnable | task | -Agnostic Point Cloud Sampling |
| m&m's: A Benchmark to Evaluate Tool-use for multi-step multi-modal | task | s |
| MA-MNN: Multi-flow attentive memristive neural network for multi- | task | image restoration |
| Macaron Attention: The Local Squeezing Global Attention Mechanism in Tracking | task | s |
| machine learning approach for image retrieval | task | s, A |
| Machine Vision for Industry: | task | s, Tools, and Techniques |
| MADRL-Based URLLC-Aware | task | Offloading for Air-Ground Vehicular Cooperative Computing Network |
| MAGIC of E-Health: A Gesture-Based Approach to Estimate Understanding and Performance in Remote Ultrasound | task | s, The |
| Managing Full Waveform LIDAR Data: A Challenging | task | for the Forthcoming Years |
| Managing Time-Sensitive IoT Applications via Dynamic Application | task | Distribution and Adaptation |
| Mancs: A Multi- | task | Attentional Network with Curriculum Sampling for Person Re-Identification |
| Manigaussian: Dynamic Gaussian Splatting for Multi- | task | Robotic Manipulation |
| Manipulative Hand Gesture Recognition Using | task | Knowledge for Human Computer Interaction |
| Many Hands Make Light Work: Transferring Knowledge from Auxiliary | task | s for Video-Text Retrieval |
| Many | task | Learning With Task Routing |
| Many- | task | Federated Learning: A New Problem Setting and A Simple Baseline |
| Mapping Computer-Vision-Related | task | s onto Reconfigurable Parallel-Processing Systems |
| MAS: Towards Resource-Efficient Federated Multiple- | task | Learning |
| Masked AutoDecoder is Effective Multi- | task | Vision Generalist |
| Matching the | task | to an Image Processing Architecture |
| MATIP: A dynamic hardware | task | integration platform for Multiprocessing Reconfigurable System on Chip |
| MATTE: Multi- | task | multi-scale attention |
| MAXGNR: A Dynamic Weight Strategy via Maximizing Gradient-to-noise Ratio for Multi- | task | Learning |
| Maximum Likelihood Method for Estimating Performance in a Rapid Serial Visual Presentation Target-Detection | task | , A |
| Mbnet: A Multi- | task | Deep Neural Network for Semantic Segmentation and Lumbar Vertebra Inspection on X-ray Images |
| MCLA | task | Offloading Framework for 5G-NR-V2X-Based Heterogeneous VECNs |
| MCMT-GAN: Multi- | task | Coherent Modality Transferable GAN for 3D Brain Image Synthesis |
| MDA: Multimodal Data Augmentation Framework for Boosting Performance on Sentiment/Emotion Classification | task | s |
| Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs | task | |
| Measuring the Effectiveness of | task | -Level Parallelism for High-Level Vision |
| Measuring the Quality of Annotations for a Subjective Crowdsourcing | task | |
| Mecformer: Multi- | task | Whole Slide Image Classification with Expert Consultation Network |
| Medrat: Unpaired Medical Report Generation via Auxiliary | task | s |
| MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation | task | using Discrete Visual Representations |
| memory-augmented multi- | task | collaborative framework for unsupervised traffic anomaly detection in driving videos, A |
| Meta-Analysis of Vibrotactile and Visual Information Displays for Improving | task | Performance, A |
| Meta-Learning with Less Forgetting on Large-Scale Non-Stationary | task | Distributions |
| Meta-Learning with | task | -Adaptive Loss Function for Few-Shot Learning |
| Meta-Learning With | task | -Adaptive Selection |
| Meta-sampler: Almost-Universal yet | task | -Oriented Sampling for Point Clouds |
| Meta-Transfer Learning Through Hard | task | s |
| Metaat: Active Testing for Label-efficient Evaluation of Dense Recognition | task | s |
| MetAdapt: Meta-learned | task | -adaptive architecture for few-shot classification |
| Method of Automatic Sensor Placement for Robot Vision in Inspection | task | s, A |
| Method to Analyze Correlations between Multiple Brain Imaging | task | s to Characterize Schizophrenia, A |
| Methodology for Evaluation of | task | -Performance in Robotic Systems: A Case-Study in Vision-Based Localization, A |
| Metric Learning for Multi-Output | task | s |
| Metric-Based Regularization and Temporal Ensemble for Multi- | task | Learning using Heterogeneous Unsupervised Tasks |
| Micrant: Towards Regression | task | Oriented Annotation Tool for Microscopic Images |
| MILA: Multi- | task | Learning from Videos via Efficient Inter-Frame Attention |
| Mimic In-Context Learning for Multimodal | task | s |
| Minimizing dataset bias: Discriminative multi- | task | sparse coding through shared subspace learning for image classification |
| Mining Novice User Activity with TRECVID Interactive Retrieval | task | s |
| mining scene understanding framework with limited labeled samples jointly driven by object-level spatial relationships and multi- | task | network, A |
| Mirror U-Net: Marrying Multimodal Fission with Multi- | task | Learning for Semantic Segmentation in Medical Imaging |
| Mitigating Bias in Gender, Age and Ethnicity Classification: A Multi- | task | Convolution Neural Network Approach |
| Mitigating Knowledge Discrepancies among Multiple Datasets for | task | -agnostic Unified Face Alignment |
| Mitigating Search Interference With | task | -Aware Nested Search |
| Mitigating | task | Interference in Multi-Task Learning via Explicit Task Routing with Non-Learnable Primitives |
| Mitigating | task | randomness in graph few-shot learning |
| Mixed Reality Training of Military | task | s: Comparison of Two Approaches Through Reactions from Subject Matter Experts |
| MLGNet: Multi- | task | Learning Network with Attention-Guided Mechanism for Segmenting Agricultural Fields |
| MLR-SNet: Transferable LR Schedules for Heterogeneous | task | s |
| MLVU: Benchmarking Multi- | task | Long Video Understanding |
| MMEARTH: Exploring Multi-modal Pretext | task | s for Geospatial Representation Learning |
| MMMNet: An End-to-End Multi- | task | Deep Convolution Neural Network With Multi-Scale and Multi-Hierarchy Fusion for Blind Image Quality Assessment |
| MMTL-UniAD: A Unified Framework for Multimodal and Multi- | task | Learning in Assistive Driving Perception |
| MO-EMT-NAS: Multi-objective Continuous Transfer of Architectural Knowledge Between | task | s from Different Datasets |
| MobileDeRainGAN: An Efficient Semi-Supervised Approach to Single Image Rain Removal for | task | -Driven Applications |
| Mobility and Deadline-Aware | task | Scheduling Mechanism for Vehicular Edge Computing |
| Mobility-Aware Multi-Hop | task | Offloading for Autonomous Driving in Vehicular Edge Computing and Networks |
| MOCAS: A Multimodal Dataset for Objective Cognitive Workload Assessment on Simultaneous | task | s |
| Mod-Squad: Designing Mixtures of Experts As Modular Multi- | task | Learners |
| Model Breadcrumbs: Scaling Multi- | task | Model Merging with Sparse Masks |
| model generation method for object recognition | task | by pictorial examples, A |
| Model-Based Analysis and Classification of Driver Distraction Under Secondary | task | s |
| Model-Protected Multi- | task | Learning |
| Modeling Inner- and Cross- | task | Contrastive Relations for Continual Image Classification |
| Modeling Multiple Normal Action Representations for Error Detection in Procedural | task | s |
| Modeling Prosodic Phrasing With Multi- | task | Learning in Tacotron-Based TTS |
| Modeling Sensor Confidence for Sensor Integration | task | s |
| Modeling Shared Control System Between Human Pilot and Autopilot for a Carrier-Based Aircraft Landing | task | |
| Modeling | task | fMRI Data Via Deep Convolutional Autoencoder |
| Modulation Module for Multi- | task | Learning with Applications in Image Retrieval, A |
| MOE-DIFFIR: | task | -customized Diffusion Priors for Universal Compressed Image Restoration |
| Monitoring An Assembly | task | By Perception Requests |
| Motion Understanding: | task | -Directed Attention and Representations that Link Perception with Action |
| Move and remove: Multi- | task | learning for building simplification in vector maps with a graph convolutional neural network |
| Movement-flow-based visual servoing and force control fusion for Manipulation | task | s in unstructured environments |
| MPM-Net: Multi- | task | interactive network with progressive multi-granularity learning for herbal medicine recognition |
| MQANet: Multi- | task | Quadruple Attention Network of Multi-Object Semantic Segmentation from Remote Sensing Images |
| MRLReID: Unconstrained Cross-Resolution Person Re-Identification With Multi- | task | Resolution Learning |
| MSNet: Multi- | task | self-supervised network for time series classification |
| MT-CooL: Multi- | task | Cooperative Learning via Flat Minima Searching |
| MT-emotieffnet for Multi- | task | Human Affective Behavior Analysis and Learning from Synthetic Data |
| MT-GN: Multi- | task | -Learning-Based Graph Residual Network for Tropical Cyclone Intensity Estimation |
| MT-ORL: Multi- | task | Occlusion Relationship Learning |
| MT-STNet: A Novel Multi- | task | Spatiotemporal Network for Highway Traffic Flow Prediction |
| MT-UNET: A Novel U-Net Based Multi- | task | Architecture For Visual Scene Understanding |
| MTADA: A Multi- | task | Adversarial Domain Adaptation Network for EEG-Based Cross-Subject Emotion Recognition |
| MTANet: Multi- | task | Attention Network for Automatic Medical Image Segmentation and Classification |
| MtArtGPT: A Multi- | task | Art Generation System With Pre-Trained Transformer |
| MTCNet: Multi- | task | collaboration network for rotation-invariance face detection |
| MTD-Net: A robust multi- | task | discriminative network for choroidal neovascularization segmentation |
| MTD-YOLO: A Multi-Scale Perception Framework with | task | Decoupling and Dynamic Alignment for UAV Small Object Detection |
| Mtevent: a Multi- | task | Event Camera Dataset for 6D Pose Estimation and Moving Object Detection |
| MTFormer: Multi- | task | Learning via Transformer and Cross-Task Reasoning |
| MTGLS: Multi- | task | Gaze Estimation with Limited Supervision |
| Mti-net: Multi-scale | task | Interaction Networks for Multi-task Learning |
| MTJND: Multi- | task | Deep Learning Framework for Improved JND Prediction |
| MTL-NAS: | task | -Agnostic Neural Architecture Search Towards General-Purpose Multi-Task Learning |
| MTLMetro: A Deep Multi- | task | Learning Model for Metro Passenger Demands Prediction |
| MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi- | task | Learning |
| MTLSegFormer: Multi- | task | Learning with Transformers for Semantic Segmentation in Precision Agriculture |
| MTMamba++: Enhancing Multi- | task | Dense Scene Understanding via Mamba-Based Decoders |
| MTMAMBA: Enhancing Multi- | task | Dense Scene Understanding by Mamba-based Decoders |
| MTMSN: Multi- | task | and Multi-Modal Sequence Network for Facial Action Unit and Expression Recognition |
| MTMVC: Semi-supervised 3D hand pose estimation using multi- | task | and multi-view consistency |
| Mtnas: Search Multi- | task | Networks for Autonomous Driving |
| MtpNet: Multi- | task | Panoptic Driving Perception Network |
| MTW-DETR: A multi- | task | collaborative optimization model for adverse weather object detection |
| Multi | task | -Based Facial Expression Synthesis with Supervision Learning and Feature Disentanglement of Image Style |
| multi-agent curiosity reward model for | task | -oriented dialogue systems, A |
| Multi-Agent Federated DRL Model for Vehicular | task | Offloading in WPT-Aided eROAD Environment, A |
| Multi-camera positioning to optimize | task | observability |
| Multi-Category Image Super-Resolution with Convolutional Neural Network and Multi- | task | Learning |
| Multi-Centroid | task | Descriptor for Dynamic Class Incremental Inference |
| Multi-dataset fusion for multi- | task | learning on face attribute recognition |
| Multi-Dataset, Multi | task | Learning of Egocentric Vision Tasks |
| Multi-Dimensional and Multi- | task | Facial Expression Recognition for Academic Outcomes Prediction |
| Multi-Domain and Multi- | task | Learning for Human Action Recognition |
| Multi-Expert Adaptive Selection: | task | -Balancing for All-in-One Image Restoration |
| Multi-Focus Image Fusion Algorithm Based on Multi- | task | Learning and PS-ViT |
| Multi-Instance Multi- | task | Learning for Joint Clinical Outcome and Genomic Profile Predictions From the Histopathological Images |
| Multi-Label Multi- | task | Deep Learning for Behavioral Coding |
| Multi-Label Nonlinear Matrix Completion With Transductive Multi- | task | Feature Selection for Joint MGMT and IDH1 Status Prediction of Patient With High-Grade Gliomas |
| Multi-Layer | task | Offloading Scheme in Fog Computing-Based VANETs With Optimized Completion Delay |
| Multi-level network Lasso for multi- | task | personalized learning |
| Multi-Local- | task | Learning with Global Regularization for Object Tracking |
| Multi-Mission Oriented Joint Optimization of | task | Assignment and Flight Path Planning for Heterogeneous UAV Cluster |
| Multi-Modal Interface for Road Planning | task | s Using Vision, Haptics and Sound, A |
| Multi-Modal Meta Multi- | task | Learning for Social Media Rumor Detection |
| Multi-Modality Multi- | task | Recurrent Neural Network for Online Action Detection |
| Multi-Objective Dependent | task | Scheduling, Resource Allocation, and Service Caching in Aerial-Ground Integrated MEC |
| Multi-Objective Modeling Method of Multi-Satellite Imaging | task | Planning for Large Regional Mapping, A |
| Multi-Objective Multi-Picking-Robot | task | Allocation: Mathematical Model and Discrete Artificial Bee Colony Algorithm |
| Multi-Organ Foundation Model for Universal Ultrasound Image Segmentation With | task | Prompt and Anatomical Prior |
| Multi-Satellite Imaging | task | Planning for Large Regional Coverage: A Heuristic Algorithm Based on Triple Grids Method |
| Multi-satellite Scheduling Approach For Dynamic Areal | task | s Triggered By Emergent Disasters |
| Multi-Scale Adaptive | task | Attention Network for Few-Shot Learning |
| Multi-Scale and Multi- | task | Deep Learning Framework for Automatic Road Extraction |
| Multi-scale feature fusion with | task | -specific data synthesis for pneumonia pathogen classification |
| Multi-scale Multi- | task | Distillation for Incremental 3d Medical Image Segmentation |
| Multi-scale | task | -aware structure graph modeling for few-shot image recognition |
| Multi-skill aware | task | assignment in real-time spatial crowdsourcing |
| Multi-Subset Selection for Keyword Extraction and Other Prototype Search | task | s Using Feature Selection Algorithms |
| Multi- | task | Adapters for On-Device Audio Inference |
| Multi- | task | Adversarial Network for Disentangled Feature Learning |
| Multi- | task | Affinity Propagation Based Natural Image Matting |
| Multi- | task | and multi-kernel Gaussian process dynamical systems |
| Multi- | task | B-Cos Networks for Near Real-Time Interpretable Person ReID |
| Multi- | task | based object tracking via a collaborative model |
| Multi- | task | Bayesian Compressive Sensing Exploiting Intra-Task Dependency |
| Multi- | task | Bayesian Deep Neural Net for Detecting Life-Threatening Infant Incidents From Head Images, A |
| Multi- | task | Cascaded Network for Prediction of Affect, Personality, Mood and Social Context Using EEG Signals, A |
| Multi- | task | Center-of-Pressure Metrics Estimation With Graph Convolutional Network |
| Multi- | task | cGAN for Simultaneous Spaceborne DSM Refinement and Roof-Type Classification |
| Multi- | task | Classification of Sewer Pipe Defects and Properties using a Cross-Task Graph Neural Network Decoder |
| Multi- | task | classification with sequential instances and tasks |
| Multi- | task | Clustering of Human Actions by Sharing Information |
| Multi- | task | clustering via domain adaptation |
| Multi- | task | CNN for Maritime Target Detection, A |
| Multi- | task | CNN for restoring corrupted fingerprint images |
| Multi- | task | CNN Model for Attribute Prediction |
| Multi- | task | co-clustering via nonnegative matrix factorization |
| Multi- | task | Collaboration Deep Learning Framework for Infrared Precipitation Estimation |
| Multi- | task | Collaborative Network for Joint Referring Expression Comprehension and Segmentation |
| Multi- | task | Collaborative Network for Light Field Salient Object Detection, A |
| Multi- | task | Comparator Framework for Kinship Verification, A |
| Multi- | task | Compositional Network for Visual Relationship Detection |
| Multi- | task | Consistency Enhancement Network for Semantic Change Detection in HR Remote Sensing Images and Application of Non-Agriculturalization, A |
| Multi- | task | Consistency for Active Learning |
| Multi- | task | Consistency-Preserving Adversarial Hashing for Cross-Modal Retrieval |
| Multi- | task | Contextual Atrous Residual Network for Brain Tumor Detection Segmentation, A |
| Multi- | task | contrastive learning for automatic CT and X-ray diagnosis of COVID-19 |
| Multi- | task | convolution neural network-based lifting scheme for image compression |
| Multi- | task | Convolution Operators with Object Detection for Visual Tracking |
| Multi- | task | Convolutional Neural Network for Blind Stereoscopic Image Quality Assessment Using Naturalness Analysis, A |
| Multi- | task | Convolutional Neural Network for Joint Iris Detection and Presentation Attack Detection, A |
| Multi- | task | Convolutional Neural Network for Patient Detection and Skin Segmentation in Continuous Non-Contact Vital Sign Monitoring |
| Multi- | task | Convolutional Neural Network for Pose-Invariant Face Recognition |
| Multi- | task | Convolutional Neural Network for Renal Tumor Segmentation and Classification Using Multi-Phasic CT Images, A |
| Multi- | task | Convolutional Neural Network Relative Radiometric Calibration Based on Temporal Information, A |
| multi- | task | convolutional neural network with spatial transform for parking space detection, A |
| Multi- | task | Correlation Particle Filter for Robust Object Tracking |
| Multi- | task | Curriculum Framework for Open-set Semi-supervised Learning |
| Multi- | task | Curriculum Transfer Deep Learning of Clothing Attributes |
| Multi- | task | Deep Dual Correlation Filters for Visual Tracking |
| Multi- | task | Deep Learning for Cerebrovascular Disease Classification and MRI-to-PET Translation |
| Multi- | task | Deep Learning for Image Segmentation Using Recursive Approximation Tasks |
| Multi- | task | Deep Learning for No-reference Screen Content Image Quality Assessment |
| Multi- | task | Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition |
| Multi- | task | Deep Learning Methods for Determining Railway Major Technical Standards |
| Multi- | task | deep learning with optical flow features for self-driving cars |
| Multi- | task | Deep Model With Margin Ranking Loss for Lung Nodule Analysis |
| Multi- | task | Deep Neural Network for Multi-Label Learning |
| Multi- | task | Deep Relative Attribute Learning for Visual Urban Perception |
| Multi- | task | deep visual-semantic embedding for video thumbnail selection |
| Multi- | task | Dense Prediction via Mixture of Low-Rank Experts |
| Multi- | task | Diffusion With Masked Measurements |
| Multi- | task | disagreement-reducing multimodal sentiment fusion network |
| Multi- | task | Distillation: Towards Mitigating the Negative Transfer in Multi-Task Learning |
| Multi- | task | Distributed Learning Using Vision Transformer With Random Patch Permutation |
| Multi- | task | Domain Adaptation for Language Grounding with 3d Objects |
| Multi- | task | driven explainable diagnosis of COVID-19 using chest X-ray images |
| Multi- | task | dynamic graph learning for brain disorder identification with functional MRI |
| Multi- | task | face analyses through adversarial learning |
| Multi- | task | Facial Activity Patterns Learning for micro-expression recognition using Joint Temporal Local Cube Binary Pattern |
| Multi- | task | feature learning-based improved supervised descent method for facial landmark detection |
| Multi- | task | few-shot learning with composed data augmentation for image classification |
| Multi- | task | Forest for Human Pose Estimation in Depth Images |
| Multi- | task | framework based on feature separation and reconstruction for cross-modal retrieval |
| Multi- | task | Framework for Car Detection From High-Resolution UAV Imagery Focusing on Road Regions, A |
| Multi- | task | fully convolutional network for tree species mapping in dense forests using small training hyperspectral data |
| multi- | task | fully deep convolutional neural network for contactless fingerprint minutiae extraction, A |
| Multi- | task | Fusion Deep Learning Model for Short-Term Intersection Operation Performance Forecasting |
| Multi- | task | Fusion for Improving Mammography Screening Data Classification |
| Multi- | task | Gaussian Process Regression-based Image Super Resolution |
| Multi- | task | Generative Adversarial Network for Detecting Small Objects in the Wild |
| Multi- | task | GLOH feature selection for human age estimation |
| Multi- | task | Guided No-Reference Omnidirectional Image Quality Assessment With Feature Interaction |
| Multi- | task | Hardwired Accelerator for Face Detection and Alignment, A |
| Multi- | task | Head Pose Estimation in-the-Wild |
| Multi- | task | hierarchical convolutional network for visual-semantic cross-modal retrieval |
| Multi- | task | Hypergraphs for Semi-supervised Learning using Earth Observations |
| Multi- | task | image classification via collaborative, hierarchical spike-and-slab priors |
| Multi- | task | image restoration network based on spatial aggregation attention and multi-feature fusion |
| Multi- | task | image set classification via joint representation with class-level sparsity and intra-task low-rankness |
| Multi- | task | Interaction Learning for Spatiospectral Image Super-Resolution |
| Multi- | task | Joint Sparse and Low-Rank Representation for the Scene Classification of High-Resolution Remote Sensing Image |
| Multi- | task | Knowledge Distillation for Eye Disease Prediction |
| Multi- | task | Layout Analysis of Handwritten Musical Scores |
| Multi- | task | Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment, A |
| Multi- | task | Learning Based Traditional Mongolian Words Recognition |
| Multi- | task | Learning based Video Anomaly Detection with Attention |
| Multi- | task | Learning By A Top-Down Control Network |
| Multi- | task | Learning by Maximizing Statistical Dependence |
| Multi- | task | Learning for 2D Phase Unwrapping in Fringe Projection, A |
| Multi- | task | Learning for Acoustic Event Detection Using Event and Frame Position Information |
| Multi- | task | Learning for Blind Source Separation |
| Multi- | task | Learning for Calorie Prediction on a Novel Large-Scale Recipe Dataset Enriched with Nutritional Information |
| Multi- | task | learning for captioning images with novel words |
| Multi- | task | Learning for Dense Prediction Tasks: A Survey |
| Multi- | task | learning for gait-based identity recognition and emotion recognition using attention enhanced temporal graph convolutional network |
| Multi- | task | Learning for Hierarchical Professional Gesture Recognition: State-Space Modeling for Task Temporal Dependencies |
| Multi- | task | Learning for Human Affect Prediction with Auditory-Visual Synchronized Representation |
| Multi- | task | Learning for Motion Analysis and Segmentation in 3D Echocardiography |
| Multi- | task | learning for natural language processing in the 2020s: Where are we going? |
| Multi- | task | learning for object keypoints detection and classification |
| Multi- | task | Learning for Ocean-Front Detection and Evolutionary Trend Recognition |
| Multi- | task | learning for one-class SVM with additional new features |
| Multi- | task | Learning for Predicting Parkinson's Disease Based on Medical Imaging Information |
| Multi- | task | Learning for Segmentation of Building Footprints with Deep Neural Networks |
| Multi- | task | Learning for Ship Trajectory Prediction and Motion Planning via Node Relationship Modeling |
| Multi- | task | learning for simultaneous script identification and keyword spotting in document images |
| Multi- | task | Learning for Simultaneous Video Generation and Remote Photoplethysmography Estimation |
| Multi- | task | Learning for Supervised and Unsupervised Classification of Grocery Images |
| Multi- | task | Learning for UAV Aerial Object Detection in Foggy Weather Condition |
| Multi- | task | learning for video anomaly detection |
| Multi- | task | Learning for Video Surveillance with Limited Data |
| multi- | task | learning framework for dual-polarization SAR imagery despeckling in temporal change detection scenarios, A |
| Multi- | task | Learning Framework for Emotion Recognition In-the-wild |
| Multi- | task | Learning Framework for Emotion Recognition Using 2D Continuous Space, A |
| Multi- | task | Learning Framework for Head Pose Estimation under Target Motion, A |
| Multi- | task | Learning Framework for Joint Disease Risk Prediction and Comorbidity Discovery, A |
| Multi- | task | Learning Framework for Motion Estimation and Dynamic Scene Deblurring |
| Multi- | task | Learning Framework with Enhanced Cross-Level Semantic Consistency for Multi-Level Land Cover Classification, A |
| multi- | task | learning method for extraction of newly constructed areas based on bi-temporal hyperspectral images, A |
| Multi- | task | Learning Model for V-PCC Geometry Compression Artifact Removal |
| Multi- | task | Learning Network for Medical Image Analysis Guided by Lesion Regions and Spatial Relationships of Tissues |
| Multi- | task | Learning Network With a Collision-Aware Graph Transformer for Traffic-Agents Trajectory Prediction, A |
| Multi- | task | Learning of Cascaded CNN for Facial Attribute Classification |
| Multi- | task | Learning of Classification and Generation for Set-structured Data |
| Multi- | task | Learning of Depth from Tele and Wide Stereo Image Pairs |
| Multi- | task | Learning of Emotion Recognition and Facial Action Unit Detection with Adaptively Weights Sharing Network |
| Multi- | task | Learning of Facial Landmarks and Expression |
| Multi- | task | Learning of Hierarchical Vision-Language Representation |
| Multi- | task | Learning of Object States and State-Modifying Actions From Web Videos |
| Multi- | task | Learning of Relative Height Estimation and Semantic Segmentation from Single Airborne RGB Images |
| multi- | task | learning strategy for unsupervised clustering via explicitly separating the commonality, A |
| Multi- | task | learning to rank for web search |
| Multi- | task | learning using GNet features and SVM classifier for signature identification |
| Multi- | task | Learning Using Multi-modal Encoder-Decoder Networks with Shared Skip Connections |
| Multi- | task | Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics |
| Multi- | task | learning using variational auto-encoder for sentiment classification |
| Multi- | task | Learning via Non-sparse Multiple Kernel Learning |
| Multi- | task | Learning via Scale Aware Feature Pyramid Networks and Effective Joint Head |
| Multi- | task | Learning with Attention for End-to-end Autonomous Driving |
| Multi- | task | Learning With Coarse Priors for Robust Part-Aware Person Re-Identification |
| Multi- | task | Learning with Compressible Features for Collaborative Intelligence |
| Multi- | task | learning with deformable convolution |
| Multi- | task | Learning with Future States for Vision-based Autonomous Driving |
| Multi- | task | Learning with Knowledge Distillation for Dense Prediction |
| Multi- | task | Learning with Low Rank Attribute Embedding for Multi-Camera Person Re-Identification |
| Multi- | task | Learning with Low Rank Attribute Embedding for Person Re-Identification |
| Multi- | task | Learning with Multi-Query Transformer for Dense Prediction |
| Multi- | task | learning with over-sampled time-series representation of a trajectory for traffic motion pattern recognition |
| Multi- | task | Learning, Multiple Tasks, Transfer Learning, Domain Adaption |
| Multi- | task | Learning-Enabled Automatic Vessel Draft Reading for Intelligent Maritime Surveillance |
| Multi- | task | linear discriminant analysis for multi-view action recognition |
| Multi- | task | low-rank affinity pursuit for image segmentation |
| Multi- | task | low-rank and sparse matrix recovery for human motion segmentation |
| Multi- | task | Lycium barbarum Recognition and Pedicel Localization Network Based on Improved YOLOv8 |
| Multi- | task | Matrix Factorized Graph Neural Network for Co-Prediction of Zone-Based and OD-Based Ride-Hailing Demand, A |
| Multi- | task | Mean Teacher for Semi-supervised Facial Affective Behavior Analysis, A |
| Multi- | task | Mean Teacher for Semi-Supervised Shadow Detection, A |
| Multi- | task | meta label correction for time series prediction |
| Multi- | task | Micro-expression Recognition Combining Deep and Handcrafted Features |
| Multi- | task | mid-level feature learning for micro-expression recognition |
| Multi- | task | Mixture-of-Experts Model for Underwater Target Localization and Recognition |
| Multi- | task | Model Based on Vision Task Level for Saliency Object Detection in Foggy Conditions |
| Multi- | task | Model for Comic Book Image Analysis |
| Multi- | task | Momentum Distillation for Multimodal Sentiment Analysis |
| Multi- | task | Multi-Domain Learning for Digital Staining and Classification of Leukocytes |
| Multi- | task | Multi-Modal Self-Supervised Learning for Facial Expression Recognition |
| Multi- | task | Multi-modal Semantic Hashing for Web Image Retrieval with Limited Supervision |
| Multi- | task | Multi-Sample Learning |
| Multi- | task | Multi-Sensor Fusion for 3D Object Detection |
| Multi- | task | Multi-Stage Transitional Training Framework for Neural Chat Translation, A |
| Multi- | task | Multi-view based Multi-objective Clustering Algorithm, A |
| Multi- | task | multimodal feature refinement for emotional speech animation |
| Multi- | task | multiple kernel machines for personalized pain recognition from functional near-infrared spectroscopy brain signals |
| Multi- | task | Network for Joint Specular Highlight Detection and Removal, A |
| multi- | task | network for speaker and command recognition in industrial environments, A |
| Multi- | task | Network Guided by Ultrasound Features for Predicting BRAFV600E Mutation Status in Thyroid Ultrasound Images |
| Multi- | task | Network with Distance-Mask-Boundary Consistency Constraints for Building Extraction from Aerial Images, A |
| Multi- | task | Network With Dynamic Segmentation for Sea Ice Classification in Arctic Shipping Route Optimization, A |
| Multi- | task | network with inter-task consistency learning for face parsing and facial expression recognition at real-time speed |
| Multi- | task | Neural Approach for Emotion Attribution, Classification, and Summarization, A |
| Multi- | task | Neural Network for Action Recognition with 3D Key-Points, A |
| Multi- | task | Nonnegative Matrix Factorization |
| Multi- | task | Occlusion Learning for Real-Time Visual Object Tracking |
| Multi- | task | OCTA image segmentation with innovative dimension compression |
| Multi- | task | Paired Masking With Alignment Modeling for Medical Vision-Language Pre-Training |
| Multi- | task | Pose-Invariant Face Recognition |
| Multi- | task | Probabilistic Regression with Overlap Maximization for Visual Tracking |
| Multi- | task | proximal support vector machine |
| Multi- | task | Rank Learning for Image Quality Assessment |
| Multi- | task | Rank Learning for Visual Saliency Estimation |
| Multi- | task | Recurrent Neural Network for Immediacy Prediction |
| Multi- | task | Relative Attribute Prediction by Incorporating Local Context and Global Style Information |
| Multi- | task | Scheduling Decision-Making Approach for SAEV Fleets: Integrating Stochastic Demand and Long-Term Benefit, A |
| Multi- | task | SE-Network for Image Splicing Localization |
| Multi- | task | Self-Supervised Object Detection via Recycling of Bounding Box Annotations |
| Multi- | task | Self-Supervised Visual Learning |
| Multi- | task | Self-Training for Learning General Representations |
| Multi- | task | Semantic Segmentation Network for Threat Detection in X-Ray Security Images, A |
| Multi- | task | Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition |
| Multi- | task | semi-supervised crowd counting via global to local self-correction |
| Multi- | task | Siamese Network for Retinal Artery/Vein Separation via Deep Convolution Along Vessel |
| Multi- | task | signal recovery by higher level hyper-parameter sharing |
| Multi- | task | Sparse Learning with Beta Process Prior for Action Recognition |
| Multi- | task | Spatiotemporal Neural Networks for Structured Surface Reconstruction |
| Multi- | task | Structure-Aware Context Modeling for Robust Keypoint-Based Object Tracking |
| Multi- | task | Supervised Compression Model for Split Computing, A |
| Multi- | task | Transfer Methods to Improve One-Shot Learning for Multimedia Event Detection |
| Multi- | task | Travel Route Planning With a Flexible Deep Learning Framework |
| Multi- | task | Vehicle Detection With Region-of-Interest Voting |
| Multi- | task | View Synthesis with Neural Radiance Fields |
| Multi- | task | visual food recognition by integrating an ontology supported with LLM |
| Multi- | task | Visual Perception for Object Detection and Semantic Segmentation in Intelligent Driving |
| Multi- | task | warped Gaussian process for personalized age estimation |
| Multi- | task | WaveRNN With an Integrated Architecture for Cross-Lingual Voice Conversion |
| Multi- | task | Y-Shaped Graph Neural Network for Point Cloud Learning in Autonomous Driving |
| Multi- | task | Zero-Shot Action Recognition with Prioritised Data Augmentation |
| Multi-view | task | -driven recognition in visual sensor networks |
| Multi-view visual speech recognition based on multi | task | learning |
| MultiFire20K: A semi-supervised enhanced large-scale UAV-based benchmark for advancing multi- | task | learning in fire monitoring |
| MULTIFLOW: Shifting Towards | task | -Agnostic Vision-Language Pruning |
| Multilevel Vision Based Spatial Reasoning for Robotic | task | s |
| MultiMAE Meets Earth Observation: Pre-Training Multi-Modal Multi- | task | Masked Autoencoders for Earth Observation Tasks |
| MultiMAE: Multi-modal Multi- | task | Masked Autoencoders |
| Multimodal and Multi- | task | Audio-Visual Vehicle Detection and Classification |
| Multimodal Assessment of Expertise in AR-Guided Psychomotor | task | s |
| Multimodal Deep Sparse Subspace Clustering for Multiple Stimuli-based Cognitive | task | |
| Multimodal Localization Distillation Method for 3D Single Object Tracking | task | in Intelligent Transportation Systems |
| Multimodal Personality Recognition in Collaborative Goal-Oriented | task | s |
| Multimodal | task | -Driven Dictionary Learning for Image Classification |
| MultiNet: Multi-Modal Multi- | task | Learning for Autonomous Driving |
| MultiPanoWise: holistic deep architecture for multi- | task | dense prediction from a single panoramic image |
| Multiple metric learning with query adaptive weights and multi- | task | re-weighting for person re-identification |
| Multiple object tracking based on multi- | task | learning with strip attention |
| Multiple Object Tracking of Drone Videos by a Temporal-Association Network with Separated- | task | s Structure |
| Multiple-Solution Optimization Strategy for Multirobot | task | Allocation |
| Multiscale | task | -Decoupled Oriented SAR Ship Detection Network Based on Size-Aware Balanced Strategy |
| Multisensor Interface to Improve the Learning Experience in Arc Welding Training | task | s, A |
| Multisubject | task | -Related fMRI Data Processing via a Two-Stage Generalized Canonical Correlation Analysis |
| Multivariate Analysis of Gaze Behavior and | task | Performance Within Interface Design Evaluation |
| Multiview Laplacian semisupervised feature selection by leveraging shared knowledge among multiple | task | s |
| Mutual Dual- | task | Generator With Adaptive Attention Fusion for Image Inpainting |
| Mutual Support of Data Modalities in the | task | of Sign Language Recognition |
| Mutually Guided Dual- | task | Network for Scene Text Detection |
| MVANet: Multi- | task | Guided Multi-View Attention Network for Chinese Food Recognition |
| MVP Sensor Planning System for Robotic Vision | task | s, The |
| Name Your Colour For the | task | : Artificially Discover Colour Naming via Colour Quantisation Transformer |
| NASOA: Towards Faster | task | -Oriented Online Fine-Tuning with a Zoo of Models |
| National-scale greenhouse mapping for high spatial resolution remote sensing imagery using a dense object dual- | task | deep learning framework: A case study of China |
| NDDR-CNN: Layerwise Feature Fusing in Multi- | task | CNNs by Neural Discriminative Dimensionality Reduction |
| NDDR-LCS: A Multi- | task | Learning Method for Classification of Carotid Plaques |
| Network Collaborative Pruning Method for Hyperspectral Image Classification Based on Evolutionary Multi- | task | Optimization |
| Network Generalization Prediction for Safety Critical | task | s in Novel Operating Domains |
| Neural Architecture Search for Dense Prediction | task | s in Computer Vision |
| Neural | task | Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration |
| Neural | task | Planning With AND-OR Graph Representations |
| Neural Weight Search for Scalable | task | Incremental Learning |
| NeuroGrasp: Real-Time EEG Classification of High-Level Motor Imagery | task | s Using a Dual-Stage Deep Learning Framework |
| Neuromorphic Architecture for Cortical Multilayer Integration of Early Visual | task | s, A |
| New Airborne Sensors and Platforms for Solving Specific | task | s In Remote Sensing |
| New and Fast Algorithm for Estimating the Perimeter of Objects for Industrial Vision | task | s, A |
| New approaches for colour histogram adaptation in face tracking | task | s |
| New Cloud-edge-terminal Resources Collaborative Scheduling Framework For Multi-level Visualization | task | s of Large-scale Spatio-temporal Data, A |
| new data complexity measure for multi-class imbalanced classification | task | s, A |
| New discretization of total variation functional for image processing | task | s |
| new framework with multiple | task | s for detecting and locating pain events in video, A |
| New Insights on Relieving | task | -Recency Bias for Online Class Incremental Learning |
| New Metrics for Evaluating Performance in Document Analysis | task | s: Application to the Table Case |
| New Results and Measurements Related to Some | task | s in Object-Oriented Dynamic Image-Coding Using GNN Universal Chips |
| New Sport Teams Logo Dataset for Detection | task | s, A |
| NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language | task | s |
| No Matter Where You Are: Flexible Graph-Guided Multi- | task | Learning for Multi-view Head Pose Classification under Target Motion |
| No-reference stereoscopic image quality assessment using a multi- | task | CNN and registered distortion representation |
| No-reference | task | performance prediction on distorted LWIR images |
| Noise Pattern Recognition of Airplanes Taking Off: | task | for a Monitoring System |
| Noise Self-Regression: A New Learning Paradigm to Enhance Low-Light Images Without | task | -Related Data |
| Non-Archimedean normalized fields in texture analysis | task | s |
| Non-generative Generalized Zero-shot Learning via | task | -correlated Disentanglement and Controllable Samples Synthesis |
| Novel Active Vision-Based Visual Threat Cue for Autonomous Navigation | task | s |
| Novel Contract Theory-Based Incentive Mechanism for Cooperative | task | -Offloading in Electrical Vehicular Networks, A |
| novel decoder based on Bayesian rules for | task | -driven object segmentation, A |
| Novel Formulations and Improved Differential Evolution Algorithm for Optimal Lane Reservation With | task | Merging |
| Novel k-Means Clustering Based | task | Decomposition Method for Distributed Vector-Based CA Models, A |
| novel learning approach to multiple | task | s based on boosting methodology, A |
| Novel Multi- | task | Learning for Motion Magnification |
| NSGA-II Algorithm for | task | Scheduling in UAV-Enabled MEC System, A |
| NurtureNet: A Multi- | task | Video-based Approach for Newborn Anthropometry |
| OakInk2: A Dataset of Bimanual Hands-Object Manipulation in Complex | task | Completion |
| Object Recognition for a Grasping | task | by a Mobile Manipulator |
| Object sequences: encoding categorical and spatial information for a yes/no visual question answering | task | |
| Object Tracking via Multi- | task | Gaussian-Laplacian Regression |
| Object- and | task | -Oriented Architecture for Automated Video Surveillance in Distributed Sensor Networks, An |
| Object-oriented Map Exploration and Construction Based on Auxiliary | task | Aided DRL |
| Objective Assessment of Sonographic Quality I: | task | Information |
| Objective Comparison Methodology of Edge Detection Algorithms Using a Structure from Motion | task | , An |
| Objective Function to Evaluate Performance of Human-Robot Collaboration in Target Recognition | task | s, An |
| Observation | task | Chain Representation Model for Disaster Process-Oriented Remote Sensing Satellite Sensor Planning: A Flood Water Monitoring Application, An |
| Observer Efficiency in Discrimination | task | s Simulating Malignant and Benign Breast Lesions Imaged With Ultrasound |
| Oculus Rift Versus HTC Vive: Usability Assessment from a Teleportation | task | |
| ODVISTA: An Omnidirectional Video Dataset for Super-Resolution and Quality Enhancement | task | s |
| OGMN: Occlusion-guided multi- | task | network for object detection in UAV images |
| Olympus: A Universal | task | Router for Computer Vision Tasks |
| Olympus: A Universal Task Router for Computer Vision | task | s |
| Omni-ID: Holistic Identity Representation Designed for Generative | task | s |
| Omnidata: A Scalable Pipeline for Making Multi- | task | Mid-Level Vision Datasets from 3D Scans |
| On multi- | task | learning for facial action unit detection |
| On realistic human motion simulation for virtual manipulation | task | s |
| On the Application of Massively Parallel SIMD Tree Machines to Certain Intermediate-Level Vision | task | s |
| On the correspondence between objects and events for the diagnosis of situations in visual surveillance | task | s |
| On the equivalence of local-mode finding, robust estimation and mean-shift analysis as used in early vision | task | s |
| On the Importance of Accurate Geometry Data for Dense 3D Vision | task | s |
| On the Robustness of Language Guidance for Low-Level Vision | task | s: Findings from Depth Estimation |
| On the suitability of different probability distributions for the | task | of image segmentation |
| On the Suitability of Reinforcement Fine-Tuning to Visual | task | s |
| On the Use of Different Classification Rules in an Editing | task | |
| On the use of independent | task | s for face recognition |
| On the Use of Pre-trained Neural Networks for Different Face Recognition | task | s |
| On the Use of Topological Constraints Within Object Recognition | task | s |
| One framework to rule them all: Unifying multimodal | task | s with LLM neural-tuning |
| One Metric to Measure Them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection | task | s |
| One Model for ALL: Low-Level | task | Interaction Is a Key to Task-Agnostic Image Fusion |
| One Model for ALL: Low-Level Task Interaction Is a Key to | task | -Agnostic Image Fusion |
| One-Pass Multi- | task | Networks With Cross-Task Guided Attention for Brain Tumor Segmentation |
| One-Pass Multi-Task Networks With Cross- | task | Guided Attention for Brain Tumor Segmentation |
| One-Shot Multiple Object Tracking in UAV Videos Using | task | -Specific Fine-Grained Features |
| One-Stage Anchor-Free Online Multiple Target Tracking With Deformable Local Attention and | task | -Aware Prediction |
| One-stage Multi- | task | Detector for 3D Cardiac MR Imaging |
| Online Anchor-Based Training for Image Classification | task | s |
| Online Class Incremental Learning on Stochastic Blurry | task | Boundary via Mask and Visual Prompt Tuning |
| Online Continual Learning on a Contaminated Data Stream with Blurry | task | Boundaries |
| Online Knowledge Distillation for Multi- | task | Learning |
| Online learning of | task | -driven object-based visual attention control |
| Online multi-modal | task | -driven dictionary learning and robust joint sparse representation for visual tracking |
| Online Multi- | task | Clustering for Human Motion Segmentation |
| Online multi- | task | learning for semantic concept detection in video |
| Online Multiple Object Tracking with Cross- | task | Synergy |
| Online Performance Prediction of Perception DNNs by Multi- | task | Learning With Depth Estimation |
| Online | task | -Free Continual Generative and Discriminative Learning via Dynamic Cluster Memory |
| Online | task | -Free Continual Learning via Dynamic Expansionable Memory Distribution |
| Online | task | -free continual learning via Expansible Vision Transformer |
| Online | task | -free Continual Learning with Dynamic Sparse Distributed Memory |
| Online-LoRA: | task | -Free Online Continual Learning via Low Rank Adaptation |
| OO-dMVMT: A Deep Multi-view Multi- | task | Classification Framework for Real-time 3D Hand Gesture Classification and Segmentation |
| Open Source Software Platform for Visualizing and Teaching Conservation | task | s in Architectural Heritage Environments, An |
| Open-World Multi- | task | Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction |
| OpenVL: A | task | -based abstraction for developer-friendly computer vision |
| OpenVL: Abstracting Vision | task | s Using a Segment-Based Language Model |
| Operational and Biomechanical Evaluation of a Wrist Exoskeleton Prototype for Assisting Meat-Cutting | task | s |
| Operator Choice Modeling for Collaborative UAV Visual Search | task | s |
| Optimal Arousal Identification and Classification for Affective Computing Using Physiological Signals: Virtual Reality Stroop | task | |
| Optimal Location Privacy Preserving and Service Quality Guaranteed | task | Allocation in Vehicle-Based Crowdsensing Networks |
| Optimal Pricing for Offloaded Hard- and Soft-Deadline | task | s in Edge Computing |
| Optimal | task | and Motion Planning and Execution for Multiagent Systems in Dynamic Environments |
| Optimal Transport of Diverse Unsupervised | task | s for Robust Learning from Noisy Few-shot Data |
| Optimizated CIELAB Colour Model For All-Analog Photoelectronic High Speed Vision- | task | Chip (ACCEL) by Creative Computing Approach, The |
| Optimization Algorithm of UAVs | task | Assignment and Path Planning Based on Dynamic Cluster Particle Swarm Optimization |
| Optimization and regularization of complex | task | decomposition for blind removal of multi-factor degradation |
| Optimization of Graph Convolutional Networks with Variational Graph Autoencoder Architecture for 3D Face Reconstruction | task | |
| Optimization of Industrial, Vision-Based, Intuitively Generated Robot Point-Allocating | task | s Using Genetic Algorithms |
| Optimization of Restricted ROC Surfaces in Three-Class Classification | task | s |
| optimization on pictogram identification for the road-sign recognition | task | using SVMs, An |
| Optimized Spatiotemporal Data Scheduling Based on Maximum Flow for Multilevel Visualization | task | s |
| Optimizing Dense Visual Predictions Through Multi- | task | Coherence and Prioritization |
| Optimizing Lidars for Wind Turbine Control Applications: Results from the IEA Wind | task | 32 Workshop |
| Optimizing Multi- | task | Network with Learned Prototypes for Weakly Supervised Semantic Segmentation |
| Optimizing Optics and Imaging for Pattern Recognition Based Screening | task | s |
| Optimizing | task | Offloading in VEC: A PDQKM Scheme Combining Deep Reinforcement Learning and Kuhn-Munkres Matching |
| Ordinal Multi- | task | Part Segmentation With Recurrent Prior Generation |
| Orthogonal channel attention-based multi- | task | learning for multi-view facial expression recognition |
| OSCAR: Object-Semantics Aligned Pre-Training for Vision-Language | task | s |
| OTCE: A Transferability Metric for Cross-Domain Cross- | task | Representations |
| Out-of-Distribution Detection: A | task | -Oriented Survey of Recent Advances |
| overhead-free region-based JPEG framework for | task | -driven image compression, An |
| Overt visual attention for free-viewing and quality assessment | task | s: Impact of the regions of interest on a video quality metric |
| Overview of the Photo Annotation | task | in ImageCLEF@ICPR |
| P-RoPE: A polar-based rotary position embedding for polar transformed images in rotation-invariant | task | s |
| PAC-Bayes Meta-Learning With Implicit | task | -Specific Posteriors |
| PackNet: Adding Multiple | task | s to a Single Network by Iterative Pruning |
| PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification | task | s |
| PAD-Net: Multi- | task | s Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing |
| Padding Investigations for CNNs in Scene Parsing | task | s |
| PAMTRI: Pose-Aware Multi- | task | Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data |
| Pano-SfMLearner: Self-Supervised Multi- | task | Learning of Depth and Semantics in Panoramic Videos |
| PAR Contest 2023: Pedestrian Attributes Recognition with Multi- | task | Learning |
| Parallel Implementations of Perceptual Grouping | task | s on Distributed Memory Machines |
| Parallel | task | -Prompts ICM: A Versatile Feature Codec for Machine Vision |
| PARFormer: Transformer-Based Multi- | task | Network for Pedestrian Attribute Recognition |
| Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for Challenging | task | s in 3D Mapping |
| Parking Space Status Inference Upon a Deep CNN and Multi- | task | Contrastive Network With Spatial Transform |
| PARSEL: A Multimodal Dataset for Modeling Decision-Making Processes Involved in Selecting Partners for Joint | task | s |
| Partially Shared Multi- | task | Convolutional Neural Network with Local Constraint for Face Attribute Learning |
| Partially Supervised Multi- | task | Network for Single-View Dietary Assessment |
| Parts-based multi- | task | sparse learning for visual tracking |
| Partstad: 2d-to-3d Part Segmentation | task | Adaptation |
| Patch-based Privacy Preserving Neural Network for Vision | task | s |
| Path Planning for Indoor Contact Inspection | task | s with Uavs |
| Pattern-Structure Diffusion for Multi- | task | Learning |
| Pay Attention! - Robustifying a Deep Visuomotor Policy Through | task | -Focused Visual Attention |
| PDFactor: Learning Tri-Perspective View Policy Diffusion Field for Multi- | task | Robotic Manipulation |
| PeaceGAN: A GAN-Based Multi- | task | Learning Method for SAR Target Image Generation with a Pose Estimator and an Auxiliary Classifier |
| PECTP: Parameter-Efficient Cross- | task | Prompts for Incremental Vision Transformer |
| Pedestrian and part position detection using a regression-based multiple | task | deep convolutional neural network |
| Pedestrian Crossing Intention Prediction Based on Cross-Modal Transformer and Uncertainty-Aware Multi- | task | Learning for Autonomous Driving |
| Pedestrian Detection Aided by Deep Learning Semantic | task | s |
| Perception planning for an exploration | task | of a 3D environment |
| Perception-Based Learning for Motion in Contact in | task | Planning |
| Perceptual Artifacts Localization for Image Synthesis | task | s |
| Perceptual Decoupling With Heterogeneous Auxiliary | task | s for Joint Low-Light Image Enhancement and Deblurring |
| Performance of visual search | task | s from various types of contour information |
| Person Re-Identification Over Camera Networks Using Multi- | task | Distance Metric Learning |
| Person Search by a Bi-Directional | task | -Consistent Learning Model |
| Personality-Assisted Multi- | task | Learning for Generic and Personalized Image Aesthetics Assessment |
| Personalized Model-Driven Adaptive | task | Facilitates Visuomotor Skill Learning Mediated by Promoting Flow Experience |
| Perspectives and Prospects on Transformer Architecture for Cross-Modal | task | s with Language and Vision |
| PETAH: Parameter Efficient | task | Adaptation for Hybrid Transformers |
| Petri-Net-Based Modeling of Human Operator's Planning for the Evaluation of | task | Performance Using the Example of Air Traffic Control |
| PHGC: Procedural Heterogeneous Graph Completion for Natural Language | task | Verification in Egocentric Videos |
| PhysMLE: Generalizable and Priors-Inclusive Multi- | task | Remote Physiological Measurement |
| Piggyback: Adapting a Single Network to Multiple | task | s by Learning to Mask Weights |
| Pixel shuffling is all you need: spatially aware convmixer for dense prediction | task | s |
| Place Theory as an Alternative Solution in Automatic Speech Recognition | task | s, The |
| PODD: A Dual- | task | Detection for Greenhouse Extraction Based on Deep Learning |
| Podnet: Pooled Outputs Distillation for Small- | task | s Incremental Learning |
| Point Proposal Based Instance Segmentation with Rectangular Masks for Robot Picking | task | |
| Poly-PC: A Polyhedral Network for Multiple Point Cloud | task | s at Once |
| Pose-Independent Facial Action Unit Intensity Regression Based on Multi- | task | Deep Transfer Learning |
| Pose-Robust Face Verification by Exploiting Competing | task | s |
| Pose-to-Pose: A New | task | and Benchmark for Human Pose Transition in Yoga |
| PoseBias: On Dataset Bias and | task | Difficulty - Is there an Optimal Camera Position for Facial Image Analysis? |
| Potential Game Based | task | Offloading in the High-Speed Railway With Reinforcement Learning |
| PPO2: Location Privacy-Oriented | task | Offloading to Edge Computing Using Reinforcement Learning for Intelligent Autonomous Transport Systems |
| Precompiling a Geometric Model into an Interpretation Tree for Object Recognition in Bin-Picking | task | s |
| Predicting Chemical Properties using Self-Attention Multi- | task | Learning based on SMILES Representation |
| Predicting Driver's Transition Time to a Secondary | task | Given an in-Vehicle Alert |
| Predicting Gaze in Egocentric Video by Learning | task | -Dependent Attention Transition |
| Predicting Human Postures for Manual Material Handling | task | s Using a Conditional Diffusion Model |
| Predicting Multiple Attributes via Relative Multi- | task | Learning |
| Predicting | task | -Driven Attention via Integrating Bottom-Up Stimulus and Top-Down Guidance |
| Predicting Taxi Demand Based on 3D Convolutional Neural Network and Multi- | task | Learning |
| Predicting the Performance in Decision-Making | task | s: From Individual Cues to Group Interaction |
| Predicting the Subjective Responses' Emotion in Dialogues with Multi- | task | Learning |
| Predicting Visual Focus of Attention From Intention in Remote Collaborative | task | s |
| Predicting Visual Political Bias Using Webly Supervised Data and an Auxiliary | task | |
| Predicting When and What to Explain From Multimodal Eye Tracking and | task | Signals |
| Prediction-Based and Locality-Aware | task | Scheduling for Parallelizing Video Transcoding Over Heterogeneous MapReduce Cluster |
| Predictive Model for Use of an Assistive Robotic Manipulator: Human Factors Versus Performance in Pick-and-Place/Retrieval | task | s, A |
| Preliminary Prosodic and Gestural Characteristics of Instructing Acts in Polish | task | -Oriented Dialogues |
| Principled Design of Image Representation: Towards Forensic | task | s, A |
| Principles Emerging from the Design of Visual Search Algorithms for Practical Inspection | task | s |
| Prior Aided Streaming Network for Multi- | task | Affective Analysis |
| Privacy-Aware Multiagent Deep Reinforcement Learning for | task | Offloading in VANET |
| Privileged multi- | task | learning for attribute-aware aesthetic assessment |
| Pro-Tuning: Unified Prompt Tuning for Vision | task | s |
| Proactive robot | task | sequencing through real-time hand motion prediction in human-robot collaboration |
| Probabilistic learning of | task | -specific visual attention |
| Probabilistic Multi- | task | Learning for Visual Saliency Estimation in Video |
| probabilistic representation for efficient large scale visual recognition | task | s, A |
| Probabilistic Vehicle Reconstruction Using a Multi- | task | CNN |
| Probing Neural Representations of Scene Perception in a Hippocampally Dependent | task | Using Artificial Neural Networks |
| Profit Maximization of Independent | task | Offloading in MEC-Enabled 5G Internet of Vehicles |
| Prognostic Framework for Robotic Manipulators Operating Under Dynamic | task | Severities |
| Programmable Motion Generation for Open-Set Motion Control | task | s |
| Programming intermediate level vision | task | s on parallel machines |
| Progress-Aware Online Action Segmentation for Egocentric Procedural | task | Videos |
| Progressive Multi- | task | Anti-Noise Learning and Distilling Frameworks for Fine-Grained Vehicle Recognition |
| Progressive Pretext | task | Learning for Human Trajectory Prediction |
| Projected Augmented Reality to Guide Manual Precision | task | s: An Alternative to Head Mounted Displays |
| Prompt Guided Transformer for Multi- | task | Dense Prediction |
| Prompt Prototype Learning Based on Ranking Instruction For Few-Shot Visual | task | s |
| PromptonomyViT: Multi- | task | Prompt Learning Improves Video Transformers using Synthetic Scene Data |
| Propagating Uncertainty Across Cascaded Medical Imaging | task | s for Improved Deep Learning Inference |
| Prototype-Based Methodology for the Statistical Analysis of Local Features in Stereotypical Handwriting | task | s |
| PSO based multi-robot | task | allocation, A |
| PT4AL: Using Self-supervised Pretext | task | s for Active Learning |
| Public Bus-Assisted | task | Offloading for UAVs |
| Public Life in Public Space (PLPS): A multi- | task | , multi-group video dataset for public life research |
| Puck localization and multi- | task | event recognition in broadcast hockey videos |
| PYRA: Parallel Yielding Re-activation for Training-inference Efficient | task | Adaptation |
| Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA | task | s? A: Self-Train on Unlabeled Images! |
| QFabric: Multi- | task | Change Detection Dataset |
| QoE-Based | task | Offloading With Deep Reinforcement Learning in Edge-Enabled Internet of Vehicles |
| QuadroNet: Multi- | task | Learning for Real-Time Semantic Depth Aware Instance Segmentation |
| Quality of Experience Comparison Between Binocular and Monocular Augmented Reality Display Under Various Occlusion Conditions for Manipulation | task | s with Virtual Instructions |
| Quality-Oriented | task | Allocation and Scheduling in Transcoding Servers With Heterogeneous Processors |
| Quantifying | task | Priority for Multi-Task Optimization |
| Quantifying Task Priority for Multi- | task | Optimization |
| Queueing Model Based Intelligent Human-Machine | task | Allocator, A |
| Queuing Network Modeling of Driver Lateral Control With or Without a Cognitive Distraction | task | |
| Rainbow UDA: Combining Domain Adaptive Models for Semantic Segmentation | task | s |
| Raw High-Definition Radar for Multi- | task | Learning |
| Reactive Model Correction: Mitigating Harm to | task | -Relevant Features via Conditional Bias Suppression |
| Real-Time Active Vision and Computer Interfaces Exploiting Human Actions and Object Context for Recognition | task | s |
| Real-Time Human Pose Estimation via Cascaded Neural Networks Embedded with Multi- | task | Learning |
| Real-Time Multi- | task | Environmental Perception System for Traffic Safety Empowered by Edge Artificial Intelligence |
| Real-Time Multi- | task | Single Shot Face Detector, A |
| Real-Time Segmentation and Recognition of Surgical | task | s in Cataract Surgery Videos |
| Real-Time | task | Recognition in Cataract Surgery Videos Using Adaptive Spatiotemporal Polynomials |
| Real-Time Visual Sensing for | task | Planning in a Field Navigation Vehicle |
| Real-Valued Negative Selection (RNS) for Classification | task | |
| Recent Trends in | task | and Motion Planning for Robotics: A Survey |
| RECISTSurv: Hybrid Multi- | task | Transformer for Hepatocellular Carcinoma Response and Survival Evaluation |
| Recognizing Assembly | task | s using Face-Contact Relations |
| Recognizing Daily Activities from First-Person Videos with Multi- | task | Clustering |
| Reconfigurable computing: design methodology and hardware | task | s scheduling for real-time image processing |
| Recovery Effect of Different Virtual Natural Environments on Stress in Short-term Isolation | task | s |
| Rectification-Based Knowledge Retention for | task | Incremental Learning |
| Recurrent Assistance: Cross-Dataset Training of LSTMs on Kitchen | task | s |
| Recurrent Variational Network: A Deep Learning Inverse Problem Solver applied to the | task | of Accelerated MRI Reconstruction |
| Redirected touching: The effect of warping space on | task | performance |
| Reducing Power Consumption and Latency of Autonomous Vehicles With Efficient | task | and Path Assignment in the V2X-MEC Based on Nash Equilibrium |
| Reducing the Minimal Fleet Size by Delaying Individual | task | s |
| Reference Tracking Optimization With Obstacle Avoidance via | task | Prioritization for Automated Driving |
| Refinement Method for Single-Stage Object Detection Based on Progressive Decoupled | task | Alignment, A |
| Reflective Field for Pixel-Level | task | s |
| Region Attentive Action Unit Intensity Estimation With Uncertainty Weighted Multi- | task | Learning |
| Region-aware Distribution Contrast: A Novel Approach to Multi- | task | Partially Supervised Learning |
| Regularized uncertainty-based multi- | task | learning model for food analysis |
| Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level | task | |
| Reinforcement Learning Based Edge-End Collaboration for Multi- | task | Scheduling in 6G Enabled Intelligent Autonomous Transport Systems |
| Reinforcement Learning via Auxiliary | task | Distillation |
| Reinforcement-Learning-Based Energy-Efficient Framework for Multi- | task | Video Analytics Pipeline, A |
| Rejection strategies and confidence measures for a k-nn classifier in an ocr | task | |
| Relating Deep Neural Network Representations to EEG-fMRI Spatiotemporal Dynamics in a Perceptual Decision-Making | task | |
| relation of susceptibility levels of hypnosis and different mental | task | s, The |
| Relational Experience Replay: Continual Learning by Adaptively Tuning | task | -Wise Relationship |
| Relational Future Captioning Model for Explaining Likely Collisions in Daily | task | s |
| Relational Temporal Graph Reasoning for Dual- | task | Dialogue Language Understanding |
| Relevance of a Feed-Forward Model of Visual Attention for Goal-Oriented and Free-Viewing | task | s |
| Relevance-aware Question Generation in Non- | task | -oriented Dialogue Systems |
| Reliable | task | -Constrained Band Selection Method for Hyperspectral Anomaly Detection |
| Reparameterizing Convolutions for Incremental Multi- | task | Learning Without Task Interference |
| Reparameterizing Convolutions for Incremental Multi-Task Learning Without | task | Interference |
| Representation learning with deep sparse auto-encoder for multi- | task | learning |
| Representation Similarity Analysis for Efficient | task | Taxonomy and Transfer Learning |
| REPVF: A Unified Vector Fields Representation for Multi- | task | 3d Perception |
| Research on Distributed Collaborative | task | Planning and Countermeasure Strategies for Satellites Based on Game Theory Driven Approach |
| Research on Multi- | task | Semantic Segmentation Based on Attention and Feature Fusion Method |
| Research on Semantic Communication Based on Balancing of | task | Distortion |
| Research on | task | -Driven Dual-Light Image Fusion and Enhancement Method under Low Illumination |
| Residual multi- | task | learning for facial landmark localization and expression recognition |
| Residual-based Language Models are Free Boosters for Biomedical Imaging | task | s |
| Resource Allocation Strategy in Internet of Vehicles Based on Multi- | task | Federated Learning and Incentive Mechanism, A |
| Resource prediction and quality control for parallel execution of heterogeneous medical imaging | task | s |
| Resource-Aware Coverage and | task | Assignment in Visual Sensor Networks |
| Resource-aware sensor selection and | task | assignment |
| Rethinking Low-Rank Adaptation in Vision: Exploring Head-Level Responsiveness across Diverse | task | s |
| Rethinking | task | and Metrics of Instance Segmentation on 3D Point Clouds |
| Rethinking | task | -Incremental Learning Baselines |
| Rethinking Token Reduction with Parameter-Efficient Fine-Tuning in ViT for Pixel-Level | task | s |
| Retinal Image Analysis Oriented to the Clinical | task | |
| Retinal vessel segmentation based on | task | -driven generative adversarial network |
| Reusing the | task | -specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation |
| revenge of BiSeNet: Efficient Multi- | task | Image Segmentation, The |
| Reverse Image Segmentation: A High-Level Solution to a Low-Level | task | |
| Revisiting Sequence-to-Sequence Video Object Segmentation with Multi- | task | Loss and Skip-Memory |
| Revisiting Unsupervised Meta-Learning via the Characteristics of Few-Shot | task | s |
| Revitalizing Regression | task | s Through Modern Training Procedures: Applications in Medical Image Analysis for Covid-19 Infection Percentage Estimation |
| Riesz Feature Representation: Scale Equivariant Scattering Network for Classification | task | s |
| RL-RC-DoT: A Block-level RL agent for | task | -Aware Video Compression |
| RM2Occ: Re-Projection Multi- | task | Multi-Sensor Fusion for Autonomous Driving 3D Object Detection and Occupancy Perception |
| Robot System That Observes and Replicates Grasping | task | s, A |
| Robot | task | Programming by Human Demonstration |
| robot that keeps it simple: Hello robot wants to reinvent how autonomous machines perform | task | s at home, A |
| Robots in Retirement Homes: Person Search and | task | Planning for a Group of Residents by a Team of Assistive Robots |
| Robust Learning Through Cross- | task | Consistency |
| Robust multi-feature visual tracking via multi- | task | kernel-based sparse learning |
| Robust Multi-Focus Image Fusion Using Multi- | task | Sparse Representation and Spatial Context |
| Robust Multi- | task | Learning With Flexible Manifold Constraint |
| Robust Multispectral Reconstruction Network from RGB Images Trained by Diverse Satellite Data and Application in Classification and Detection | task | s, A |
| Robust Object Detection for UAVs in Foggy Environments with Spatial-Edge Fusion and Dynamic | task | Alignment |
| Robust object tracking via multi- | task | based collaborative model |
| Robust object tracking via multi- | task | dynamic sparse model |
| Robust Semi-Supervised 3D Medical Image Segmentation With Diverse Joint- | task | Learning and Decoupled Inter-Student Learning |
| Robust Variant-Parameter Double Integral Multi-Layer Neural Dynamics for Tracking | task | s of Quadrotors in Unbounded Noisy Environments |
| Robust Visual Servoing in 3-D Reaching | task | s |
| Robust Visual Tracking Via Multi- | task | Sparse Learning |
| Robust Visual Tracking via Structured Multi- | task | Sparse Learning |
| Robustly tracking objects via multi- | task | kernel dynamic sparse model |
| Robustness of functional connectivity metrics for EEG-based personal identification over | task | -induced intra-class and inter-class variations |
| Role of Preprocessing for Word Representation Learning in Affective | task | s, The |
| Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection | task | |
| Rotating your face using multi- | task | deep neural network |
| RSNet: The Search for Remote Sensing Deep Neural Networks in Recognition | task | s |
| RTMDet-MGG: A Multi- | task | Model With Global Guidance |
| Rule-Based Multi- | task | Deep Learning for Highly Efficient Rice Lodging Segmentation |
| Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement | task | |
| SAMCT: Segment Any CT Allowing Labor-Free | task | -Indicator Prompts |
| SAR Image Recognition with Monogenic Scale Selection-Based Weighted Multi- | task | Joint Sparse Representation |
| SAREval: A Multi-Dimensional and Multi- | task | Benchmark for Evaluating Visual Language Models on SAR Image Understanding |
| SATHUR: Self Augmenting | task | Hallucinal Unified Representation for Generalized Class Incremental Learning |
| Sayette Group Formation | task | (GFT) Spontaneous Facial Expression Database |
| Scalable classifiers for Internet vision | task | s |
| Scaling Properties of Diffusion Models For Perceptual | task | s |
| Scaling up Image Segmentation across Data and | task | s |
| Scaling Up Personalized Image Aesthetic Assessment via | task | Vector Customization |
| ScatSimCLR: Self-Supervised Contrastive Learning with Pretext | task | Regularization for Small-Scale Datasets |
| Scenario-Free Autonomous Driving With Multi- | task | Offline-to-Online Reinforcement Learning |
| Scene Memory Transformer for Embodied Agents in Long-Horizon | task | s |
| Scene-Dependent Intention Recognition for | task | Communication with Reduced Human-Robot Interaction |
| Scenes-Objects-Actions: A Multi- | task | , Multi-label Video Dataset |
| SciOL and MuLMS-Img: Introducing A Large-Scale Multimodal Scientific Dataset and Models for Image-Text | task | s in the Scientific Domain |
| SDC - Stacked Dilated Convolution: A Unified Descriptor Network for Dense Matching | task | s |
| SDCINet: A novel cross- | task | integration network for segmentation and detection of damaged/changed building targets with optical remote sensing imagery |
| SDMT: Spatial Dependence Multi- | task | Transformer Network for 3D Knee MRI Segmentation and Landmark Localization |
| Seabed-Net: A multi- | task | network for joint bathymetry estimation and seabed classification from remote sensing imagery in shallow waters |
| Searching Architecture and Precision for U-net based Image Restoration | task | s |
| Searching attentive | task | s with document analysis evidences and Dempster-Shafer theory |
| Searching for Robustness: Loss Learning for Noisy Classification | task | s |
| SEG-ESRGAN: A Multi- | task | Network for Super-Resolution and Semantic Segmentation of Remote Sensing Images |
| SegAnyPath: A Foundation Model for Multi-Resolution Stain-Variant and Multi- | task | Pathology Image Segmentation |
| SegIns: A simple extension to instance discrimination | task | for better localization learning |
| Segmentation of Multiple Structures in Chest Radiographs Using Multi- | task | Fully Convolutional Networks |
| Selecting Useful Knowledge from Previous | task | s for Future Learning in a Single Network |
| Selection of Features and Evaluation of Visual Measurements During Robotic Visual Servoing | task | s |
| Selection of Relevant Electrodes Based on Temporal Similarity for Classification of Motor Imagery | task | s |
| Self-Evolved Dynamic Expansion Model for | task | -Free Continual Learning |
| Self-organizing Maps for Motor | task | s Recognition from Electrical Brain Signals |
| Self-Paced Hard | task | -Example Mining for Few-Shot Classification |
| Self-Supervise Reinforcement Learning Method for Vacant Parking Space Detection Based on | task | Consistency and Corrupted Rewards |
| Self-supervised assisted multi- | task | learning network for one-shot defect segmentation with fake defect generation |
| Self-Supervised Knowledge Transfer via Loosely Supervised Auxiliary | task | s |
| Self-Supervised Material and Texture Representation Learning for Remote Sensing | task | s |
| Self-supervised multi- | task | learning for medical image analysis |
| Self-Supervised Multi- | task | Pretraining Improves Image Aesthetic Assessment |
| Self-supervised Multi- | task | Procedure Learning from Instructional Videos |
| Self-Supervised Solution for the Switch-Toggling Visual | task | , A |
| Self-training and multi- | task | learning for limited data: Evaluation study on object detection |
| Semantic Bottleneck for Computer Vision | task | s |
| Semantic Fisher Scores for | task | Transfer: Using Objects to Classify Scenes |
| Semantic Image Segmentation with | task | -Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform |
| Semantic Path Planning for Indoor Navigation | task | s Using Multi-View Context and Prior Knowledge |
| Semantic Segmentation and Change Detection By Multi- | task | U-Net |
| Semantic segmentation from remote sensor data and the exploitation of latent learning for classification of auxiliary | task | s |
| Semantic Segmentation of Airborne Images and Corresponding Digital Surface Models - Additional Input Data Or Additional | task | ? |
| Semantic Segmentation via Multi- | task | , Multi-domain Learning |
| Semantical video coding: Instill static-dynamic clues into structured bitstream for AI | task | s |
| Semantics-guided multi- | task | genetic programming for multi-output regression |
| SemanticSugarBeets: A Multi- | task | Framework and Dataset for Inspecting Harvest and Storage Characteristics of Sugar Beets |
| Semi-automatic Pipeline for Large-scale Dataset Annotation | task | : A DMD Application |
| Semi-Reference Sonar Image Quality Assessment Based on | task | and Visual Perception |
| Semi-Supervised Crowd Counting via Multi- | task | Pseudo-Label Self-Correction Strategy |
| Semi-Supervised Crowd Counting via Self-training on Surrogate | task | s |
| Semi-Supervised Depth Estimation by Multi- | task | Learning |
| Semi-supervised extensions of multi- | task | tree ensembles |
| Semi-supervised feature selection with exploiting shared information among multiple | task | s |
| Semi-Supervised Image Classification With Self-Paced Cross- | task | Networks |
| Semi-supervised Multi- | task | Learning for Semantics and Depth |
| Semi-Supervised Unpaired Medical Image Segmentation Through | task | -Affinity Consistency |
| Semi-Weakly-Supervised Learning of Complex Actions from Instructional | task | Videos |
| Semisupervised Hyperspectral Classification Using | task | -Driven Dictionary Learning With Laplacian Regularization |
| Sensor Planning for 3D Visual Search with | task | Constraints |
| Sentiment Similarity-oriented Attention Model with Multi- | task | Learning for Text-based Emotion Recognition, A |
| Sequential Cross Attention Based Multi- | task | Learning |
| Sequential Mastery of Multiple Visual | task | s: Networks Naturally Learn to Learn and Forget to Forget |
| Set-Supervised Action Learning in Procedural | task | Videos via Pairwise Order Consistency |
| Severity grading of psoriatic plaques using deep CNN based multi- | task | learning |
| SF-Net: A Multi- | task | Model for Brain Tumor Segmentation in Multimodal MRI via Image Fusion |
| SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion for 3D Scene Graph Alignment and its Downstream | task | s |
| SGW-Based Multi- | task | Learning in Vision Tasks |
| Shannon information for joint estimation/detection | task | s and complex imaging systems |
| Share-Aware Joint Model Deployment and | task | Offloading for Multi-Task Inference |
| Sharing Decoders: Network Fission for Multi- | task | Pixel Prediction |
| SHIFT: A Synthetic Driving Dataset for Continuous Multi- | task | Domain Adaptation |
| Ship Classification Based on MSHOG Feature and | task | -Driven Dictionary Learning with Structured Incoherent Constraints in SAR Images |
| Ship Classification in SAR Imagery by Shallow CNN Pre-Trained on | task | -Specific Dataset with Feature Refinement |
| Ship Detection for PolSAR Images via | task | -Driven Discriminative Dictionary Learning |
| Ship Identification and Characterization in Sentinel-1 SAR Images with Multi- | task | Deep Learning |
| Short-Term Prediction of Passenger Demand in Multi-Zone Level: Temporal Convolutional Neural Network With Multi- | task | Learning |
| ShuffleCount: | task | -Specific Knowledge Distillation for Crowd Counting |
| SIMBAR: Single Image-Based Scene Relighting For Effective Data Augmentation For Automated Driving Vision | task | s |
| Simple Solvers for Large Quadratic Programming | task | s |
| Simple Yet Robust Nonlinear Function for Low-Light Image Enhancement | task | , A |
| Simulating | task | -Free Continual Learning Streams From Existing Datasets |
| Simultaneous Deep Transfer Across Domains and | task | s |
| Simultaneous Detection of Multiple Facial Action Units via Hierarchical | task | Structure Learning |
| Simultaneous Estimation of Dish Locations and Calories with Multi- | task | Learning |
| Simultaneous estimation of food categories and calories with multi- | task | CNN |
| Simultaneous estimation of image quality and distortion via multi- | task | convolutional neural networks |
| SINC: Self-Supervised In-Context Learning for Vision-Language | task | s |
| Single Architecture and Multiple | task | deep Neural Network for Altered Fingerprint Analysis |
| Single Image to Semantic BIM: Domain-Adapted 3D Reconstruction and Annotations via Multi- | task | Deep Learning |
| Single Satellite Imagery Simultaneous Super-Resolution and Colorization Using Multi- | task | Deep Neural Networks |
| Single-Image HDR Reconstruction with | task | -specific Network based on Channel Adaptive RDN |
| Single-Loss Multi- | task | Learning For Improving Semantic Segmentation Using Super-Resolution |
| Skeletonization and 3D graph approach for thin objects recognition in pick and place | task | s |
| Skeletons and Asynchronous RPC for Embedded Data and | task | Parallel Image Processing |
| Skelevision: Towards Adversarial Resiliency of Person Tracking with Multi- | task | Learning |
| SketchTransfer: A Challenging New | task | for Exploring Detail-Invariance and the Abstractions Learned by Deep Networks |
| SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based | task | Execution |
| SkyEyeGPT: Unifying remote sensing vision-language | task | s via instruction tuning with large language model |
| SMNet: Symmetric Multi- | task | Network for Semantic Change Detection in Remote Sensing Images Based on CNN and Transformer |
| Smoothness of Surgical Tool Tip Motion Correlates to Skill in Endovascular | task | s |
| SNP-S3: Shared Network Pre-Training and Significant Semantic Strengthening for Various Video-Text | task | s |
| SOD-MTGAN: Small Object Detection via Multi- | task | Generative Adversarial Network |
| Soft-Landing Strategy for Alleviating the | task | Discrepancy Problem in Temporal Action Localization Tasks |
| Software Designs of Image Processing | task | s With Incremental Refinement of Computation |
| Solving Motion Planning | task | s with a Scalable Generative Model |
| Solving the same-different | task | with convolutional neural networks |
| SOTVerse: A User-Defined | task | Space of Single Object Tracking |
| Sound and Visual Representation Learning with Multiple Pretraining | task | s |
| Space-Time Periodic | task | Model for Recommendation of Remote Sensing Images, A |
| Sparse Bayesian multi- | task | learning for predicting cognitive outcomes from neuroimaging measures in Alzheimer's disease |
| Sparse multi- | task | regression and feature selection to identify brain imaging predictors for memory performance |
| Sparse shared structure based multi- | task | learning for MRI based cognitive performance prediction of Alzheimer's disease |
| Sparse U-PDP: A Unified Multi- | task | Framework for Panoptic Driving Perception |
| Spatial and temporal beliefs for mistake detection in assembly | task | s |
| Spatial Augmented Reality user interface techniques for room size modeling | task | s |
| Spatial Downscaling of Satellite Sea Surface Wind with Soft-Sharing Multi- | task | Learning |
| Spatial Focus Attention for Fine-Grained Skeleton-Based Action | task | s |
| Spatial-temporal multi- | task | learning for salient region detection |
| SpatialFlow: Bridging All | task | s for Panoptic Segmentation |
| Spatially-Correlative Loss for Various Image Translation | task | s, The |
| Spatio-Temporal EV | task | Offloading, Energy, and Traffic Management for 6G Communication-Power-Transportation Coupling Network |
| Spatio-temporal representation for face authentication by using multi- | task | learning with human attributes |
| Spatiotemporal Typhoon Damage Assessment: A Multi- | task | Learning Method for Location Extraction and Damage Identification from Social Media Texts |
| Speaker-aware Multi- | task | Learning for automatic speech recognition |
| Special Issue Retraction: Jointly network image processing: multi- | task | image semantic segmentation of indoor scene based on CNN |
| Spectral Reconstruction and Disparity from Spatio-Spectrally Coded Light Fields via Multi- | task | Deep Learning |
| Split n merge net: A dynamic masking network for multi- | task | attention |
| Split to Learn: Gradient Split for Multi- | task | Human Image Analysis |
| Spot the Difference: A Novel | task | for Embodied Agents in Changing Environments |
| SPOT: An efficient training-free | task | similarity quantification method for continual learning |
| SpotNet: Self-Attention Multi- | task | Network for Object Detection |
| SRKTDN: Applying Super Resolution Method to Dehazing | task | |
| SSL++: Improving Self-Supervised Learning by Mitigating the Proxy | task | -Specificity Problem |
| SSMTL++: Revisiting self-supervised multi- | task | learning for video anomaly detection |
| Stackelberg Game-Based Multi-Agent Algorithm for Resource Allocation and | task | Offloading in MEC-Enabled C-ITS |
| STiL: Semi-supervised Tabular-Image Learning for Comprehensive | task | -Relevant Information Exploration in Multimodal Classification |
| Stochastic Filter Groups for Multi- | task | CNNs: Learning Specialist and Generalist Convolution Kernels |
| Stochastic Guided Search Model for Search Asymmetries in Visual Search | task | s |
| Stochastic Quality Metric for Optimal Control of Active Camera Network Configurations for 3D Computer Vision | task | s, A |
| Stochastic | task | Scheduling in UAV-Based Intelligent On-Demand Meal Delivery System |
| Stratified Multi- | task | Learning for Robust Spotting of Scene Texts |
| Strong-Help-Weak: An Online Multi- | task | Inference Learning Approach for Robust Advanced Driver Assistance Systems |
| Structure Synchronized Dynamic Event-Triggered Control for Marine Ranching AMVs via the Multi- | task | Switching Guidance |
| Structure-Encoding Auxiliary | task | s for Improved Visual Representation in Vision-and-Language Navigation |
| Structured and weighted multi- | task | low rank tracker |
| Study of Improving the Accuracy of Convolutional Neural Networks in Face Recognition | task | s, The |
| Study of Multi- | task | and Region-Wise Deep Learning for Food Ingredient Recognition, A |
| study on a target detection model for autonomous driving | task | s, A |
| Superquadric Indoor Scene Representation for Orientation and Navigation | task | s |
| SuperTickets: Drawing | task | -Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning |
| Supervised Learning of Detection and Classification | task | s with Uncertain Training Data |
| Supervision of Perception | task | s for Autonomous Systems: The OCAPI Approach |
| Supporting Operators in Process Control | task | s: Benefits of Interactive 3-D Visualization |
| Supporting Virtual Collaboration in Spatial Design | task | s: Are Surrogate or Natural Gestures More Effective? |
| Surgical motion | task | performance in a hand eye colocated digital stereo microscope |
| Survey of Computation Offloading With | task | Types, A |
| Survey of Computer Vision Techniques for Forest Characterization and Carbon Monitoring | task | s, A |
| Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A | task | -Oriented Perspective, A |
| SwinFace: A Multi- | task | Transformer for Face Recognition, Expression Recognition, Age Estimation and Attribute Estimation |
| Switch Diffusion Transformer: Synergizing Denoising | task | s with Sparse Mixture-of-experts |
| Switcher-HNet: A switchable hierarchical network for tree species classification from forest stand to individual tree | task | s |
| SYENet: A Simple Yet Effective Network for Multiple Low-Level Vision | task | s with Real-time Performance on Mobile Device |
| Symbolic Representation for Any-to-Any Generative | task | s |
| Synergistic Semantic Segmentation and Height Estimation for Monocular Remote Sensing Images via Cross- | task | Interaction |
| Syn | task | Net: A synergistic multi-task network for joint segmentation and classification of small anatomical structures in ultrasound imaging |
| syntax-directed program that performs a three-dimensional perceptual | task | , A |
| Synthesis of complex-valued InSAR data with a multi- | task | convolutional neural network |
| Synthesize High-Quality Multi-Contrast Magnetic Resonance Imaging From Multi-Echo Acquisition Using Multi- | task | Deep Generative Model |
| System for Various Visual Classification | task | s Based on Neural Networks, A |
| Systematic Literature Review on Multi-Robot | task | Allocation, A |
| T2Net: Synthetic-to-Realistic Translation for Solving Single-Image Depth Estimation | task | s |
| Ta-Adapter: Enhancing few-shot CLIP with | task | -aware encoders |
| TAAN: | task | -Aware Attention Network for Few-shot Classification |
| TAB: Text-Align Anomaly Backbone Model for Industrial Inspection | task | s |
| TACLE: | task | and Class-Aware Exemplar-Free Semi-Supervised Class Incremental Learning |
| TaCOS: | task | -Specific Camera Optimization with Simulation |
| TAD-Graph: Enhancing Whole Slide Image Analysis via | task | -Aware Subgraph Disentanglement |
| TADFormer: | task | -Adaptive Dynamic TransFormer for Efficient Multi-Task Learning |
| TAE-Net: | task | -Adaptive Embedding Network for Few-Shot Remote Sensing Scene Classification |
| TAE: | task | -aware Expandable Representation for Long Tail Class Incremental Learning |
| TAFE-Net: | task | -Aware Feature Embeddings for Low Shot Learning |
| TAFL: | task | -Agnostic Feature Learner for Efficient Adaptation to Unseen Clinical Tasks Based on Whole-Slide Histopathological Images |
| Tafssl: | task | -adaptive Feature Sub-space Learning for Few-shot Classification |
| Take a prior from other | task | s for severe blur removal |
| TAL EmotioNet Challenge 2020 Rethinking the Model Chosen Problem in Multi- | task | Learning |
| TAME: | task | Agnostic Continual Learning using Multiple Experts |
| TAML-Adapter: Enhancing Adapter Tuning Through | task | -Agnostic Meta-Learning for Low-Resource Automatic Speech Recognition |
| TAMM: A | task | -Adaptive Multi-Modal Fusion Network for Facial-Related Health Assessments on 3D Facial Images |
| TANGO: Training-free Embodied AI Agents for Open-world | task | s |
| TAO-Net: | task | -Adaptive Operation Network for Image Restoration and Enhancement |
| TAPE: | task | -Agnostic Prior Embedding for Image Restoration |
| TapFace: A | task | -oriented facial privacy protection framework |
| TapTell: Interactive visual search for mobile | task | recommendation |
| | task | Adaptive Network for Image Restoration with Combined Degradation Factors |
| | task | Adaptive Parameter Sharing for Multi-Task Learning |
| | task | adaptive siamese neural networks for open-set recognition of encrypted network traffic with bidirectional dropout |
| | task | Agnostic and Post-hoc Unseen Distribution Detection |
| | task | Agnostic Meta-Learning for Few-Shot Learning |
| | task | Agnostic Restoration of Natural Video Dynamics |
| | task | Agnostic Robust Learning on Corrupt Outputs by Correlation-Guided Mixture Density Networks |
| | task | Allocation and Scalability Evaluation for Real-Time Multimedia Processing in a Cluster Environment |
| | task | and Environment-Sensitive Tracking |
| | task | Assignment Algorithm for Multiple Aerial Vehicles to Attack Targets With Dynamic Values, A |
| | task | Augmentation-Based Meta-Learning Segmentation Method for Retinopathy |
| | task | Bias in Contrastive Vision-Language Models |
| | task | Category Space for User-Centric Comparative Multimedia Search Evaluations, A |
| | task | complexity analysis and QoS management for mapping dynamic video-processing tasks on a multi-core platform |
| | task | Configuration Impacts Annotation Quality and Model Training Performance in Crowdsourced Image Segmentation |
| | task | Decomposing and Cell Comparing Method for Cervical Lesion Cell Detection, A |
| | task | decomposition and modular single-hidden-layer perceptron classifiers for multi-class learning problems |
| | task | Decomposition and Synchronization for Semantic Biomedical Image Segmentation |
| | task | Decoupled Framework for Reference-based Super-Resolution |
| | task | dependent deep LDA pruning of neural networks |
| | task | differentiation: Constructing robust branches for precise object detection |
| | task | Difficulty Aware Parameter Allocation and Regularization for Lifelong Learning |
| | task | Discrepancy Maximization for Fine-grained Few-Shot Classification |
| | task | Driven 3D Object Recognition System Using Bayesian Networks, A |
| | task | Driven Perceptual Organization for Extraction of Rooftop Polygons |
| | task | Encoding With Distribution Calibration for Few-Shot Learning |
| | task | Frames in Visuo-Motor Coordination |
| | task | Frames: Primitives for Sensory-Motor Coordination |
| | task | Grouping for Multilingual Text Recognition |
| | task | interleaving and orientation estimation for high-precision oriented object detection in aerial images |
| | task | Is Worth One Word: Learning with Task Prompts for High-quality Versatile Image Inpainting, A |
| | task | Load Estimation from Multimodal Head-Worn Sensors Using Event Sequence Features |
| | task | model of lower body motion for a humanoid robot to imitate human dances |
| | task | Navigator: Decomposing Complex Tasks for Multimodal Large Language Models |
| | task | Offloading and Resource Allocation in Vehicular Cooperative Perception With Integrated Sensing, Communication, and Computation |
| | task | Offloading Based on the Fusion of Model- and Data-Driven Intelligence for Vehicular Edge Computing Networks |
| | task | oriented facial behavior recognition with selective sensing |
| | task | Oriented Vision |
| | task | Planner for Sensor-Based Inspection and Manipulation Robots, A |
| | task | Planning and Action Coordination in Integrated Sensor-Based Robots |
| | task | Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment |
| | task | Programming: Learning Data Efficient Behavior Representations |
| | task | Recognition and Person Identification in Cyclic Dance Sequences with Multi Factor Tensor Analysis |
| | task | Relation Networks |
| | task | Relevant Relaxation Network for Visuo-Motor Systems |
| | task | Residual for Tuning Vision-Language Models |
| | task | Scheduling in Large Camera Networks |
| | task | Scheduling of Real-Time Traffic Information Processing Based on Digital Twins |
| | task | selection in spatial crowdsourcing from worker's perspective |
| | task | Singular Vectors: Reducing Task Interference in Model Merging |
| | task | Specific Factors for Video Characterization |
| | task | Specific Local Region Matching |
| | task | Specific Networks for Identity and Face Variation |
| | task | Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks |
| | task | Switching Network for Multi-task Learning |
| | task | weighting based on particle filter in deep multi-task learning with a view to uncertainty and performance |
| | task | -Adapted Learnable Embedded Quantization for Scalable Human-Machine Image Compression |
| | task | -Adaptive Attention for Image Captioning |
| | task | -Adaptive Embedding Learning with Dynamic Kernel Fusion for Few-Shot Remote Sensing Scene Classification |
| | task | -Adaptive Feature Disentanglement and Hallucination for Few-Shot Classification |
| | task | -Adaptive Feature Matching Loss for Image Deblurring |
| | task | -Adaptive Feature Reweighting for Few Shot Classification |
| | task | -adaptive Few-shot Learning on Sphere Manifold |
| | task | -Adaptive Negative Envision for Few-Shot Open-Set Recognition |
| | task | -Adaptive Saliency Guidance for Exemplar-Free Class Incremental Learning |
| | task | -Agnostic Attacks Against Vision Foundation Models |
| | task | -Agnostic Continual Learning Using Base-Child Classifiers |
| | task | -Agnostic Guided Feature Expansion for Class-Incremental Learning |
| | task | -Agnostic Open-Set Prototype for Few-Shot Open-Set Recognition |
| | task | -Agnostic Vision Transformer for Distributed Learning of Image Processing |
| | task | -Aligned Part-Aware Panoptic Segmentation Through Joint Object-Part Representations |
| | task | -Assisted Domain Adaptation with Anchor Tasks |
| | task | -Aware Adaptive Learning for Cross-Domain Few-Shot Learning |
| | task | -Aware Attention Model for Clothing Attribute Prediction |
| | task | -Aware Attentional Dynamic Alignment for Few-Shot Compressed Video Classification |
| | task | -Aware Clustering for Prompting Vision-Language Models |
| | task | -aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding |
| | task | -Aware Dual-Representation Network for Few-Shot Action Recognition |
| | task | -Aware Encoder Control for Deep Video Compression |
| | task | -Aware Few-Shot Visual Classification with Improved Self-Supervised Metric Learning |
| | task | -Aware Graph Convolutional Network for Active Learning |
| | task | -Aware Image Downscaling |
| | task | -aware Part Mining Network for Few-Shot Learning |
| | task | -aware Quantization Network for JPEG Image Compression |
| | task | -Aware Variational Adversarial Active Learning |
| | task | -Aware Weakly Supervised Object Localization With Transformer |
| | task | -balanced distillation for object detection |
| | task | -Based Approach to Adaptive and Multimodality Imaging, A |
| | task | -based control of articulated human pose detection for OpenVL |
| | task | -based evaluation of skin detection for communication and perceptual interfaces |
| | task | -based Focal Loss for Adversarially Robust Meta-Learning |
| | task | -Based Modeling of a 5k Ultra-High-Resolution Medical Imaging System for Digital Breast Tomosynthesis |
| | task | -based parameter isolation for foreground segmentation without catastrophic forgetting using multi-scale region and edges fusion network |
| | task | -Based User Evaluation of Content-Based Image Database Browsing Systems |
| | task | -Conditioned Adaptation of Visual Features in Multi-Task Policy Learning |
| | task | -conditioned Domain Adaptation for Pedestrian Detection in Thermal Imagery |
| | task | -Conditioned Ensemble of Expert Models for Continuous Learning |
| | task | -Container Matching Game for Computation Offloading in Vehicular Edge Computing and Networks |
| | task | -Customized Mixture of Adapters for General Image Fusion |
| | task | -dependent multi-task multiple kernel learning for facial action unit detection |
| | task | -dependent saliency estimation from trajectories of agents in video sequences |
| | task | -Dependent Visual-Codebook Compression |
| | task | -Directed Computation of Quantitative Decisions from Sensor Data |
| | task | -directed evaluation of image segmentation methods |
| | task | -Directed Sensor Fusion and Planning: A Computational Approach |
| | task | -Discriminative Domain Alignment for Unsupervised Domain Adaptation |
| | task | -Distributionally Robust Data-Free Meta-Learning |
| | task | -Driven Biometric Authentication of Users in Virtual Reality (VR) Environments |
| | task | -driven camera operations for robotic exploration |
| | task | -Driven Controllable Scenario Generation Framework Based on AOG |
| | task | -Driven Dictionary Learning |
| | task | -Driven Dictionary Learning for Hyperspectral Image Classification With Structured Sparsity Constraints |
| | task | -Driven Disaster Data Link Approach, A |
| | task | -Driven Dynamic Fusion: Reducing Ambiguity in Video Description |
| | task | -Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection |
| | task | -Driven Feature Pooling for Image Classification |
| | task | -driven Image Fusion with Learnable Fusion Loss |
| | task | -Driven Image Retrieval Using Geographic Information |
| | task | -driven intelligent workspace system to provide guidance feedback, A |
| | task | -Driven Invertible Projection Matrix Learning Algorithm for Hyperspectral Compressed Sensing, A |
| | task | -Driven Learning of Spatial Combinations of Visual Features |
| | task | -Driven Modular Networks for Zero-Shot Compositional Learning |
| | task | -Driven Optimization of Fluence Field and Regularization for Model-Based Iterative Reconstruction in Computed Tomography |
| | task | -Driven Progressive Part Localization for Fine-Grained Object Recognition |
| | task | -driven progressive part localization for fine-grained recognition |
| | task | -Driven Saliency Detection on Music Video |
| | task | -Driven Self-Supervised BI-Channel Networks Learning for Diagnosis of Breast Cancers with Mammography |
| | task | -Driven Semantic Coding via Reinforcement Learning |
| | task | -driven Uncertainty Quantification in Inverse Problems via Conformal Prediction |
| | task | -Driven Underwater Image Enhancement via Hierarchical Semantic Refinement |
| | task | -Driven Video Compression for Humans and Machines: Framework Design and Optimization |
| | task | -Driven Wavelets Using Constrained Empirical Risk Minimization |
| | task | -Driven Webpage Saliency |
| | task | -Feature Collaborative Learning with Application to Personalized Attribute Prediction |
| | task | -Free Continual Learning |
| | task | -Generic Hierarchical Human Motion Prior using VAEs |
| | task | -Guided, Implicitly-Searched and Meta-Initialized Deep Model for Image Fusion, A |
| | task | -Independent Robotic Uncalibrated Hand-Eye Coordination Based on the Extended State Observer |
| | task | -Informed Meta-Learning for Remote Sensing |
| | task | -Level Contrastiveness for Cross-Domain Few-Shot Learning |
| | task | -Level Planning of Pick-and-Place Robot Motions |
| | task | -Oriented Approach for Cost-Sensitive Recognition, A |
| | task | -oriented camera assignment in a video network |
| | task | -Oriented Channel Attention for Fine-Grained Few-Shot Classification |
| | task | -Oriented Compact Representation of 3D Point Clouds via A Matrix Optimization-Driven Network |
| | task | -Oriented Convex Bilevel Optimization With Latent Feasibility |
| | task | -Oriented Evaluation of Super-Resolution Techniques |
| | task | -Oriented Feature Hallucination for Few-Shot Image Classification |
| | task | -Oriented Generation of Visual Sensing Strategies |
| | task | -Oriented Generation of Visual Sensing Strategies in Assembly Tasks |
| | task | -Oriented Hand Motion Retargeting for Dexterous Manipulation Imitation |
| | task | -Oriented Human-Object Interactions Generation with Implicit Neural Representations |
| | task | -Oriented Knowledge Base for Geospatial Problem-Solving, A |
| | task | -Oriented Multi-Modal Mutual Learning for Vision-Language Models |
| | task | -Oriented Multi-Modal Question Answering For Collaborative Applications |
| | task | -Oriented Network Design for Visual Tracking and Motion Filtering of Needle Tip Under 2D Ultrasound |
| | task | -Oriented Network for Image Dehazing |
| | task | -Oriented Object Tracking in Large Distributed Camera Networks |
| | task | -Oriented Spatial Graph Structure Learning Method for Traffic Forecasting, A |
| | task | -Oriented Tool Manipulation With Robotic Dexterous Hands: A Knowledge Graph Approach From Fingers to Functionality |
| | task | -Oriented Vision with Multiple Bayes Nets |
| | task | -Oriented Visualization Approaches for Landscape and Urban Change Analysis |
| | task | -related population characteristics in handwriting analysis |
| | task | -relevant object detection and tracking |
| | task | -Risk Consistency Object Detection Framework Based on Deep Reinforcement Learning, A |
| | task | -Sensitive Efficient Feature Extraction Network for Oriented Object Detection in Remote Sensing Images |
| | task | -Specific Contour Tracker for Ultrasound, A |
| | task | -specific contrastive learning for few-shot remote sensing image scene classification |
| | task | -specific dependency-based word embedding methods |
| | task | -Specific Electroencephlogram Analysis: A Novel ICA and Dynamic Multi-Stage Clustering Approach for Neural Signal Processing |
| | task | -specific evaluation of three-dimensional image interpolation techniques, A |
| | task | -Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification |
| | task | -Specific Gesture Analysis in Real-Time Using Interpolated Views |
| | task | -Specific Gradient Adaptation for Few-Shot One-Class Classification |
| | task | -Specific Image Partitioning |
| | task | -Specific Importance-Awareness Matters: On Targeted Attacks Against Object Detection |
| | task | -specific Inconsistency Alignment for Domain Adaptive Object Detection |
| | task | -specific information for imaging system analysis |
| | task | -Specific Loss for Robust Instance Segmentation With Noisy Class Labels |
| | task | -Specific Normalization for Continual Learning of Blind Image Quality Models |
| | task | -specific Novel Object Characterization |
| | task | -Specific Performance Evaluation of UGVs: Case Studies at the IVFC |
| | task | -Specific Spatiotemporal Context-Aware Decoupling for Occluded Video Object Detection |
| | task | -Specific Utility in a General Bayes net Vision System |
| | task | -Switchable Pre-Processor for Image Compression for Multiple Machine Vision Tasks |
| | task | -to-Instance Prompt Learning for Vision-Language Models at Test Time |
| | task | 2Box: Box Embeddings for Modeling Asymmetric Task Relationships |
| | task | 2Vec: Task Embedding for Meta-Learning |
| | task | Expert: Dynamically Assembling Multi-Task Representations with Memorial Mixture-of-Experts |
| | task | ing Teams: Supervisory Control and Task Management of Autonomous Unmanned Systems |
| | task | ology: Utilizing Task Relations at Scale |
| | task | onomy: Disentangling Task Transfer Learning |
| | task | s Integrated Networks: Joint Detection and Retrieval for Image Search |
| | task | s of the Crowd: A Typology of Tasks in Geographic Information Crowdsourcing and a Case Study in Humanitarian Mapping, The |
| task | s of the Crowd: A Typology of Tasks in Geographic Information Crowdsourcing and a Case Study in Humanitarian Mapping, The |
| TASTE-Rob: Advancing Video Generation of | task | -Oriented Hand-Object Interaction for Generalizable Robotic Manipulation |
| Taxi Demand Prediction Using Parallel Multi- | task | Learning Model |
| TDViT: Temporal Dilated Video Transformer for Dense Video | task | s |
| Teaching a Robot to Perform | task | through Imitation and On-line Feedback |
| Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding | task | s |
| Temporal Action Proposal Generation via Multi- | task | Feature Learning |
| Temporal Interpolation as an Unsupervised Pretraining | task | for Optical Flow Estimation |
| Temporal Segmentation of | task | s from Human Hand Motion |
| Tensor Multi- | task | Learning for Person Re-Identification |
| Ternary Feature Masks: zero-forgetting for | task | -incremental learning |
| Test-Time Fine-Tuning of Image Compression Models for Multi- | task | Adaptability |
| Testing Some Improvements of the Fukunaga and Narendra's Fast Nearest Neighbour Search Algorithm in a Spelling | task | |
| TextContourNet: A Flexible and Effective Framework for Improving Scene Text Detection Architecture With a Multi- | task | Cascade |
| TFUT: | task | fusion upward transformer model for multi-task learning on dense prediction |
| TFUT: | task | fusion upward transformer model for multi-task learning on dense prediction |
| Three Birds with One Stone: Multi- | task | Temporal Action Detection via Recycling Temporal Annotations |
| Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction | task | s |
| Three-Dimensional Magnetotelluric Forward Modeling Using Multi- | task | Deep Learning with Branch Point Selection |
| TIAR-SAR: An Oriented SAR Ship Detector Combining a | task | Interaction Head Architecture with Composite Angle Regression |
| Tibetan Thangka data set and relative | task | s, A |
| TimberVision: A Multi- | task | Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations |
| Time-aware and | task | -transferable adversarial attack for perception of autonomous vehicles |
| TL;DW? Summarizing Instructional Videos with | task | Relevance and Cross-Modal Saliency |
| TLCO: Topological Link-Aware | task | Co-Offloading Method for Joint V2V and V2I System |
| To Complete or to Estimate, That is the Question: A Multi- | task | Approach to Depth Completion and Monocular Depth Estimation |
| Token Cropr: Faster ViTs for Quite a Few | task | s |
| TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through | task | Tokenization |
| TOOD: | task | -aligned One-stage Object Detection |
| TOODIB: | task | -aligned one-stage object detection with interactions between branches |
| Top-Down Visual Attention for Efficient Rendering of | task | Related Scenes |
| TOPLight: Lightweight Neural Networks with | task | -Oriented Pretraining for Visible-Infrared Recognition |
| Torch-Points3D: A Modular Multi- | task | Framework for Reproducible Deep Learning on 3D Point Clouds |
| Toward Automatic Robot Instruction from Perception: Temporal Segmentation of | task | s from Human Hand Motion |
| Toward correlating and solving abstract | task | s using convolutional neural networks |
| Toward Edge-Efficient Dense Predictions with Synergistic Multi- | task | Neural Architecture Search |
| Toward Multi- | task | Generalization in Autonomous Navigation: A Human-in-the-Loop Adversarial Reinforcement Learning With Diffusion Policy |
| Toward Robust Visual Object Tracking With Independent Target-Agnostic Detection and Effective Siamese Cross- | task | Interaction |
| Toward | task | -Independent Optimal Adaptive Control of a Hip Exoskeleton for Locomotion Assistance in Neurorehabilitation |
| Towards a Real-time Framework for Visual Monitoring | task | s |
| Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language | task | s via Semantic Grounding |
| Towards an Assembly Plan from Observation, Part I: | task | Recognition with Polyhedral Objects |
| Towards an Assembly Plan from Observation: | task | Recognition with Polyhedral Objects |
| Towards Consistent Multi- | task | Learning: Unlocking the Potential of Task-Specific Parameters |
| Towards Consistent Multi- | task | Learning: Unlocking the Potential of Task-Specific Parameters |
| Towards General Purpose Vision Systems: An End-to-End | task | -Agnostic Vision-Language Architecture |
| Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language | task | s |
| Towards scheduling hard real-time image processing | task | s on a single GPU |
| Towards | task | Sampler Learning for Meta-Learning |
| Towards | task | -Generic Image Compression: A Study of Semantics-Oriented Metrics |
| Towards Ultra Low Latency Spiking Neural Networks for Vision and Sequential | task | s Using Temporal Pruning |
| Towards Universal Dataset Distillation via | task | -Driven Diffusion |
| Towards Weakly-Supervised Text Spotting using a Multi- | task | Transformer |
| Tracking Dynamic Flow: Decoding Flow Fluctuations Through Performance in a Fine Motor Control | task | |
| Tracking techniques for visual servoing | task | s |
| Tracking via Robust Multi- | task | Multi-View Joint Sparse Representation |
| Traffic Prediction With Missing Data: A Multi- | task | Learning Approach |
| Traffic Sign Recognition Using a Multi- | task | Convolutional Neural Network |
| Traffic Sign Recognition via Multi-Modal Tree-Structure Embedded Multi- | task | Learning |
| Training a | task | -Specific Image Reconstruction Loss |
| Training for | task | Specific Keypoint Detection |
| Training Hierarchical Feed-Forward Visual Recognition Models Using Transfer Learning from Pseudo- | task | s |
| Training Neural Networks on RAW and HDR Images for Restoration | task | s |
| Training of Multiple and Mixed | task | s with a Single Network Using Feature Modulation |
| Trajectory representation learning for Multi- | task | NMRDP planning |
| Transfer Learning from Other | task | s, Other Classes |
| Transfer learning in computer vision | task | s: Remember where you come from |
| Transfer Learning via Unsupervised | task | Discovery for Visual Question Answering |
| Transferability and Hardness of Supervised Classification | task | s |
| Transferring and Adapting Source Knowledge ( | task | ) in Computer Vision (CV) |
| Transformer with | task | Selection for Continual Learning |
| Transition in Different Critical Situations: How Non-Driving Related | task | s Affect Drivers' Physiological Response and Takeover Behavior After Partial Automation Silent Failures |
| TransNAS-Bench-101: Improving transferability and Generalizability of Cross- | task | Neural Architecture Search |
| TripleS: Mitigating multi- | task | learning conflicts for semantic change detection in high-resolution remote sensing imagery |
| TRNR: | task | -Driven Image Rain and Noise Removal With a Few Images Based on Patch Analysis |
| TSP-Transformer: | task | -Specific Prompts Boosted Transformer for Holistic Scene Understanding |
| TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization | task | s |
| TWM: A framework for creating highly compressible videos targeted to computer vision | task | s |
| Two Body Problem: Collaborative Visual | task | Completion |
| Two by Two: Learning Multi- | task | Pairwise Objects Assembly for Generalizable Robot Manipulation |
| Two-aspect Information Interaction Model for ABAW4 Multi- | task | Challenge |
| Two-dimensional cellular automata of radius one for density classification | task | |
| Two-Level Attention with Multi- | task | Learning for Facial Emotion Estimation |
| Two-level attention with two-stage multi- | task | learning for facial emotion recognition |
| Two-Level Multi- | task | Metric Learning with Application to Multi-Classification |
| Two-Stream Multi- | task | Network for Fashion Recognition |
| Two-stream person re-identification with multi- | task | deep neural networks |
| U-Tell: Unsupervised | task | Expert Lifelong Learning |
| UlcerMTL: Multi- | task | Learning for Classification and Segmentation of Diabetic Foot Ulcers |
| UM-Adapt: Unsupervised Multi- | task | Adaptation Using Adversarial Cross-Task Distillation |
| UM-Adapt: Unsupervised Multi- | task | Adaptation Using Adversarial Cross-Task Distillation |
| UMD-Net: A Unified Multi- | task | Assistive Driving Network Based on Multimodal Fusion |
| Uncalibrated Visual | task | s via Linear Interaction |
| Uncertain Facial Expression Recognition via Multi- | task | Assisted Correction |
| Uncovering Communities of Pipelines in the | task | -FMRI Analytical Space |
| Uncrewed Aerial Vehicle (UAV)-Based High-Throughput Phenotyping of Maize Silage Yield and Nutritive Values Using Multi-Sensory Feature Fusion and Multi- | task | Learning with Attention Mechanism |
| Understanding and Predicting Temporal Visual Attention Influenced by Dynamic Highlights in Monitoring | task | |
| Understanding Multi- | task | Activities from Single-Task Videos |
| Understanding Multi- | task | Activities from Single-Task Videos |
| Understanding tools: | task | -oriented object modeling, learning and recognition |
| UNET-Based Multi- | task | Architecture for Brain Lesion Segmentation |
| Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language | task | s |
| Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language | task | s |
| Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot | task | s |
| UniAV: Unified Audio-Visual Perception for Multi- | task | Video Event Localization |
| UniDCP: Unifying Multiple Medical Vision-Language | task | s via Dynamic Cross-Modal Learnable Prompts |
| Unified Image Compression Method for Human Perception and Multiple Vision | task | s, A |
| Unified Multi- | task | Learning Architecture for Fast and Accurate Pedestrian Detection, A |
| UniFormaly: Towards | task | -agnostic unified framework for visual anomaly detection |
| Unifying Top-Down Views by | task | -Specific Domain Adaptation |
| Unimodal Multi- | task | Fusion for Emotional Mimicry Intensity Prediction |
| UniNet: A Unified Scene Understanding Network and Exploring Multi- | task | Relationships through the Lens of Adversarial Attacks |
| UniRestore: Unified Perceptual and | task | -Oriented Image Restoration Model Using Diffusion Prior |
| Universal Representations: A Unified Look at Multiple | task | and Domain Learning |
| Unleash the Black Magic in Age: A Multi- | task | Deep Neural Network Approach for Cross-Age Face Verification |
| UnLoc: A Unified Framework for Video Localization | task | s |
| Unsupervised Domain Adaptation From Axial to Short-Axis Multi-Slice Cardiac MR Images by Incorporating Pretrained | task | Networks |
| Unsupervised domain adaptation using eigenanalysis in kernel space for categorisation | task | s |
| Unsupervised Encoder-decoder Model for Anomaly Prediction | task | |
| Unsupervised Image Restoration With Quality- | task | -Perception Loss |
| Unsupervised Image Style Embeddings for Retrieval and Recognition | task | s |
| Unsupervised Instance Segmentation in Microscopy Images via Panoptic Domain Adaptation and | task | Re-Weighting |
| Unsupervised learning of multi- | task | deep variational model |
| Unsupervised manifold learning through reciprocal kNN graph and Connected Components for image retrieval | task | s |
| Unsupervised manifold learning using Reciprocal kNN Graphs in image re-ranking and rank aggregation | task | s |
| Unsupervised Multi- | task | Domain Adaptation |
| Unsupervised Multi- | task | Feature Learning on Point Clouds |
| Unsupervised Multi- | task | Learning for 3D Subtomogram Image Alignment, Clustering and Segmentation |
| Unsupervised Multi- | task | Learning with Hierarchical Data Structure |
| Unsupervised Pre-training for Temporal Action Localization | task | s |
| Unsupervised Reinforcement Learning for Multi- | task | Autonomous Driving: Expanding Skills and Cultivating Curiosity |
| Unsupervised Variational Translator for Bridging Image Restoration and High-level Vision | task | s |
| Unveiling Groups of Related | task | s in Multi-Task Learning |
| Unveiling Groups of Related | task | s in Multi-Task Learning |
| Upper-Limb Posture Definition During Grasping with | task | and Environment Constraints |
| Use of BCI Systems in the Analysis of EEG Signals for Motor and Speech Imagery | task | ?: A SLR |
| Use of Sub-Ensembles and Multi-Template Observers to Evaluate Detection | task | Performance for Data That are Not Multivariate Normal |
| Using a Binary Diffractive Optical Element to Increase the Imaging System Depth of Field in UAV Remote Sensing | task | s |
| Using a semantic edge-aware multi- | task | neural network to delineate agricultural parcels from remote sensing images |
| Using difference features effectively: A multi- | task | network for exploring change areas and change moments in time series remote sensing images |
| Using Fourier/Mellin-based correlators and their fractional versions in navigational | task | s |
| Using Virtual Active Vision Tools to Improve Autonomous Driving | task | s |
| Using Wiimote for 2D and 3D Pointing | task | s: Gesture Performance Evaluation |
| Using Wireless EEG Signals to Assess Memory Workload in the n-Back | task | |
| UTC: A Unified Transformer with Inter- | task | Contrastive Learning for Visual Dialog |
| V2PNet: A Voxel-to-Point Network Framework for | task | -Oriented Point Cloud Sampling |
| Vacant Parking Space Detection based on | task | Consistency and Reinforcement Learning |
| Validation of Portable Mobile Mapping System for Inspection | task | s in Thermal and Fluid-Mechanical Facilities |
| Validity of Three-Class Hotelling Trace (3-HT) in Describing Three-Class | task | Performance: Comparison of Three-Class Volume Under ROC Surface (VUS) and 3-HT, The |
| Variable Fractional Network Evolutionary Game for Distributed Resilient | task | Allocation in Heterogeneous Multi-Robot Systems |
| Variable multi-scale attention fusion network and adaptive correcting gradient optimization for multi- | task | learning |
| VC-Dimension Analysis of Object Recognition | task | s |
| Vector Quantization Enhancement for Computer Vision | task | s |
| Vehicle Detection in UAV Images via Background Suppression Pyramid Network and Multi-Scale | task | Adaptive Decoupled Head |
| Vehicular | task | Offloading and Job Scheduling Method Based on Cloud-Edge Computing |
| Veritatem Dies Aperit - Temporally Consistent Depth Prediction Enabled by a Multi- | task | Geometric and Semantic Scene Understanding Approach |
| VersatileGaussian: Real-time Neural Rendering for Versatile | task | s Using Gaussian Splatting |
| Vertebroplasty Performance on Simulator for 19 Surgeons Using Hierarchical | task | Analysis |
| VHS to HDTV Video Translation Using Multi- | task | Adversarial Learning |
| Vibrotactile Reaction Time | task | to Measure Cognitive Performance in Virtual and Real Environments, A |
| Video Anomaly Detection via self-supervised and spatio-temporal proxy | task | s learning |
| Video Anomaly Detection via Sequentially Learning Multiple Pretext | task | s |
| Video Completion in Digital Stabilization | task | Using Pseudo-panoramic Technique |
| Video Enhancement with | task | -Oriented Flow |
| Video | task | Decathlon: Unifying Image and Video Tasks in Autonomous Driving |
| Video | task | Decathlon: Unifying Image and Video Tasks in Autonomous Driving |
| Video Visualization and Visual Analytics: A | task | -Based and Application-Driven Investigation |
| Video-Analytics | task | -Aware Quad-Tree Partitioning and Quantization for HEVC |
| Viewpoint Selection for Visual Search | task | s |
| Viewport-Based CNN: A Multi- | task | Approach for Assessing 360° Video Quality |
| Vifa: An Efficient Visible and Infrared Image Fusion Architecture for Multi- | task | Applications via Continual Learning |
| Virtual Reality is Better Than Desktop for Training a Spatial Knowledge | task | , but Not for Everyone |
| Virtual | task | s but Real Gains: Improving Multi-Task Learning |
| Virtual | task | s but Real Gains: Improving Multi-Task Learning |
| Vision-Based Fuzzy Controllers for Navigation | task | s |
| Vision-Based Target Objects Recognition and Segmentation for Unmanned Systems | task | Allocation |
| Vision-Language Models for Vision | task | s: A Survey |
| Vision-Language Models Performing Zero-Shot | task | s Exhibit Disparities Between Gender Groups |
| Vision-Language Navigation With Self-Supervised Auxiliary Reasoning | task | s |
| Vision-to-Language | task | s Based on Attributes and Attention Mechanism |
| VisionHub: Learning | task | -Plugins for Efficient Universal Vision Model |
| VisionLLaMA: A Unified LLaMA Backbone for Vision | task | s |
| VisionScores - A System-Segmented Image Score Dataset for Deep Learning | task | s |
| Visual Compliance: | task | -Directed Visual Servo Control |
| Visual Components of an Automated Inspection | task | , The |
| Visual Control of Grasping and Manipulation | task | s |
| Visual Exemplar Driven | task | -Prompting for Unified Perception in Autonomous Driving |
| Visual lane analysis and higher-order | task | s: a concise review |
| Visual Objectification in Films: Towards a New AI | task | for Video Interpretation |
| Visual Perception in Familiar, Complex | task | s |
| Visual Person Understanding Through Multi- | task | and Multi-dataset Learning |
| Visual Question Answering as a Meta Learning | task | |
| Visual Question Generation as Dual | task | of Visual Question Answering |
| Visual Recognition of Manual | task | s Using Object Motion Trajectories |
| Visual robot guidance for an insertion | task | |
| Visual Servoing in the | task | -Function Framework: A Contour Following Task |
| Visual Servoing in the | task | -Function Framework: A Contour Following Task |
| Visual Space | task | Specification, Planning and Control |
| Visual speaker authentication with random prompt texts by a dual- | task | CNN framework |
| Visual State Recognition for a Target-reaching | task | |
| Visual tracking via adaptive multi- | task | feature learning with calibration and identification |
| Visual-Language Multi- | task | Blind Image Quality Assessment With Local Quality Weighting |
| visualization method for data domain changes in CNN networks and the optimization method for selecting thresholds in classification | task | s, A |
| Visually multimodal depression assessment based on key questions with weighted multi- | task | learning |
| Visually-Guided Audio Spatialization in Video with Geometry-Aware Multi- | task | Learning |
| VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language | task | s |
| VL2Lite: | task | -Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks |
| VLTP: Vision-Language Guided Token Pruning for | task | -Oriented Segmentation |
| Vulnerability-Aware | task | Scheduling for Edge Intelligence Empowered Trajectory Analysis in Intelligent Transportation Systems |
| Watching it in Dark: A Target-aware Representation Learning Framework for High-level Vision | task | s in Low Illumination |
| Water Flow Detection in a Handwashing | task | |
| Water level prediction from social media images with a multi- | task | ranking approach |
| WaterScenes: A Multi- | task | 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces |
| Weakly Supervised Actor-Action Segmentation via Robust Multi- | task | Ranking |
| Weakly Supervised Multi- | task | Ranking Framework for Actor-Action Semantic Segmentation, A |
| Weakly-Supervised Action Learning in Procedural | task | Videos via Process Knowledge Decomposition |
| WeakMCN: Multi- | task | Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation |
| wearable augmented reality system with haptic feedback and its performance in virtual assembly | task | s, A |
| Weather-degraded image semantic segmentation with multi- | task | knowledge distillation |
| Web User Interact | task | Recognition Based on Conditional Random Fields |
| What contextual and demographic factors predict drivers' decision to engage in secondary | task | s? |
| What Matters For Meta-Learning Vision Regression | task | s? |
| What Object Should I Use? - | task | Driven Object Detection |
| What Sketch Explainability Really Means for Downstream | task | s? |
| What | task | s can be Performed with an Uncalibrated Stereo Vision System? |
| When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi- | task | Learning Framework |
| When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi- | task | Learning Framework and a New Benchmark |
| When Deep Learning Meets Multi- | task | Learning in SAR ATR: Simultaneous Target Recognition and Segmentation |
| When Domain Generalization meets Generalized Category Discovery: An Adaptive | task | -Arithmetic Driven Approach |
| When Will We Arrive? A Novel Multi- | task | Spatio-Temporal Attention Network Based on Individual Preference for Estimating Travel Time |
| Where and Why are They Looking? Jointly Inferring Human Attention and Intentions in Complex | task | s |
| Why Is It Important to Consider Dust Aerosol in the Sevastopol and Black Sea Region during Remote Sensing | task | s? A Case Study |
| Wide Evaluation of ChatGPT on Affective Computing | task | s, A |
| WoodScape: A Multi- | task | , Multi-Camera Fisheye Dataset for Autonomous Driving |
| X-DETR: A Versatile Architecture for Instance-wise Vision-Language | task | s |
| X-Learner: Learning Cross Sources and | task | s for Universal Visual Representation |
| X2-VLM: All-in-One Pre-Trained Model for Vision-Language | task | s |
| X3KD: Knowledge Distillation Across Modalities, | task | s and Stages for Multi-Camera 3D Object Detection |
| YOLOPX: Anchor-free multi- | task | learning network for panoptic driving perception |
| You Only Learn One Query: Learning Unified Human Query for Single-stage Multi-person Multi- | task | Human-centric Perception |
| You-Do, I-Learn: Discovering | task | Relevant Objects and their Modes of Interaction from Multi-User Egocentric Video |
| Zero-Shot | task | Transfer |
| ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision | task | s |
| ZooKT: | task | -adaptive knowledge transfer of Model Zoo for few-shot learning |
2648 for task