Update Dates 2211

2211 * *BMVC
* *ICIP
* 2.5D visual relationship detection
* 2D Amodal Instance Segmentation Guided by 3D Shape Prior
* 2D GANs Meet Unsupervised Single-View 3D Reconstruction
* 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds
* 2HDED:Net for Joint Depth Estimation and Image Deblurring from a Single Out-of-Focus Image
* 3D Centroidnet: Nuclei Centroid Detection with Vector Flow Voting
* 3D Clothed Human Reconstruction in the Wild
* 3D Clues Guided Convolution for Depth Completion
* 3D CoMPaT: Composition of Materials on Parts of 3D Things
* 3D Compositional Zero-Shot Learning with DeCompositional Consensus
* 3D Crowd Counting via Geometric Attention-Guided Multi-view Fusion
* 3D End-to-End Boundary-Aware Networks for Pancreas Segmentation
* 3D Equivariant Graph Implicit Functions
* 3D Face Reconstruction with Dense Landmarks
* 3D Geometry Design via End-To-End Optimization for Land Seismic Acquisition
* 3D Head Pose Estimation Based on Graph Convolutional Network from A Single RGB Image
* 3D Human Motion Generation from the Text Via Gesture Action Classification and the Autoregressive Model
* 3D Human Pose Estimation Using Möbius Graph Convolutional Networks
* 3D Instances as 1D Kernels
* 3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal
* 3D Object Detection with a Self-supervised Lidar Scene Flow Backbone
* 3D Objects Reconstruction Using Frontal Images. An Example With Guitars
* 3d Particle Picking in Cryo-Electron Tomograms Using Instance Segmentation
* 3D Random Occlusion and Multi-layer Projection for Deep Multi-camera Pedestrian Localization
* 3D Residual Interpolation for Spike Camera Demosaicing
* 3D Room Layout Estimation from a Cubemap of Panorama Image via Deep Manhattan Hough Transform
* 3D Scene Inference from Transient Histograms
* 3D semantic segmentation based on spatial-aware convolution and shape completion for augmented reality applications
* 3D Shape Sequence of Human Comparison and Classification Using Current and Varifolds
* 3D Siamese Transformer Network for Single Object Tracking on Point Clouds
* 3D US-Based Evaluation and Optimization of Tumor Coverage for US-Guided Percutaneous Liver Thermal Ablation
* 3D-Aware Indoor Scene Synthesis with Depth Priors
* 3D-Aware Semantic-Guided Generative Model for Human Synthesis
* 3D-FM GAN: Towards 3D-Controllable Face Manipulation
* 3D-PL: Domain Adaptive Depth Estimation with 3D-Aware Pseudo-Labeling
* 3D-Selfcutmix: Self-Supervised Learning for 3D Point Cloud Analysis
* 3D-VDNet: Exploiting the vertical distribution characteristics of point clouds for 3D object detection and augmentation
* 3DCNN-Based Palpation Localization with Temporal Attention Module
* 3DCT Reconstruction from a Single X-Ray Projection Using Convolutional Neural Network
* 3DG-STFM: 3D Geometric Guided Student-Teacher Feature Matching
* 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding
* 6d Rotation Representation For Unconstrained Head Pose Estimation
* A-OKVQA: A Benchmark for Visual Question Answering Using World Knowledge
* Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning, The
* Ablation-CAM++: Grouped Recursive Visual Explanations for Deep Convolutional Networks
* Abstracting Sketches Through Simple Primitives
* Accelerated Level-Set Method for Inverse Scattering Problems, An
* Accelerating a Morphology-Preserving Adsorption Model by Deep Learning
* Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling
* Accurate and Robust Image Correspondence for Structure-From-Motion and its Application to Multi-View Stereo
* Accurate Detection of Proteins in Cryo-Electron Tomograms from Sparse Labels
* Accurate Estimation of Chlorophyll-a Concentration in the Coastal Areas of the Ebro Delta (NW Mediterranean) Using Sentinel-2 and Its Application in the Selection of Areas for Mussel Aquaculture
* Accurate Head Pose Estimation Based on Multi-Stage Regression
* Acknowledging the Unknown for Multi-Label Learning with Single Positive Labels
* AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection
* ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory-Efficient Medical Image Segmentation
* Action Quality Assessment with Temporal Parsing Transformer
* Action-Based Contrastive Learning for Trajectory Prediction
* ActionFormer: Localizing Moments of Actions with Transformers
* Active Audio-Visual Separation of Dynamic Sound Sources
* Active Label Correction Using Robust Parameter Update and Entropy Propagation
* Active Learning for Hyperspectral Image Classification via Hypergraph Neural Network
* Active Learning Strategies for Weakly-Supervised Object Detection
* Active Pointly-Supervised Instance Segmentation
* ActiveMatch: End-To-End Semi-Supervised Active Representation Learning
* ActiveNeRF: Learning Where to See with Uncertainty Estimation
* Actor-Centered Representations for Action Localization in Streaming Videos
* AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions
* AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation
* AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets
* AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition
* AdaNeRF: Adaptive Sampling for Real-Time Rendering of Neural Radiance Fields
* Adaptive Agent Transformer for Few-Shot Segmentation
* Adaptive Co-teaching for Unsupervised Monocular Depth Estimation
* Adaptive Compressive Sampling for Mid-Infrared Spectroscopic Imaging
* Adaptive Cross-Domain Learning for Generalizable Person Re-identification
* Adaptive Detail Injection-Based Feature Pyramid Network for Pan-Sharpening
* Adaptive Detection in Partially Homogeneous Environment with Limited Samples Based on Geometric Barycenters
* Adaptive Face Forgery Detection in Cross Domain
* Adaptive Feature Interpolation for Low-Shot Image Generation
* Adaptive Fine-Grained Sketch-Based Image Retrieval
* Adaptive Image Transformations for Transfer-Based Adversarial Attack
* Adaptive Local Implicit Image Function for Arbitrary-Scale Super-Resolution
* Adaptive Loop Filter with a CNN-Based Classification
* Adaptive Multi-Scale Progressive Probability Model for Lossless Image Compression
* Adaptive Patch Exiting for Scalable Single Image Super-Resolution
* Adaptive Proxy Anchor Loss for Deep Metric Learning
* Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
* Adaptive Robust Radar Target Detector Based on Gradient Test
* Adaptive Sensor Fault Accommodation for Vehicle Active Suspensions via Partial Measurement Information
* Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation
* Adaptive Token Sampling for Efficient Vision Transformers
* Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing
* Adaptive Warping Network for Transferable Adversarial Attacks
* Adaptive Weighted Losses With Distribution Approximation for Efficient Consistency-Based Semi-Supervised Learning
* Adding Non-Linear Context to Deep Networks
* Addressing Heterogeneity in Federated Learning via Distributional Transformation
* Advanced Motion Vector Difference Coding Beyond AV1
* AdvDO: Realistic Adversarial Attacks for Trajectory Prediction
* Adversarial Contrastive Learning via Asymmetric InfoNCE
* Adversarial Erasing Framework via Triplet with Gated Pyramid Pooling Layer for Weakly Supervised Semantic Segmentation
* Adversarial Evolving Neural Network for Longitudinal Knee Osteoarthritis Prediction
* Adversarial Examples for Good: Adversarial Examples Guided Imbalanced Learning
* Adversarial Feature Augmentation for Cross-domain Few-Shot Classification
* Adversarial Label Poisoning Attack on Graph Neural Networks via Label Propagation
* Adversarial Label-Poisoning Attacks and Defense for General Multi-Class Models Based on Synthetic Reduced Nearest Neighbor
* Adversarial Pairwise Reverse Attention for Camera Performance Imbalance in Person Re-Identification: New Dataset And Metrics
* Adversarial Partial Domain Adaptation by Cycle Inconsistency
* Adversarial Training of Anti-Distilled Neural Network with Semantic Regulation of Class Confidence
* Adversarial Training with Channel Attention Regularization
* Adversarially-Aware Robust Object Detector
* AEBSR: Active-Sampling and Energy-Based Single Image Super-Resolution
* Aesthetics and Cartography: Post-Critical Reflections on Deviance in and of Representations
* AF-SRNet: Quantitative Precipitation Forecasting Model Based on Attention Fusion Mechanism and Residual Spatiotemporal Feature Extraction
* Affine Correspondences Between Multi-Camera Systems for 6DOF Relative Pose Estimation
* Affine Transformation-Based Color Compression For Dynamic 3D Point Clouds
* AgeTransGAN for Facial Age Transformation with Rectified Performance Metrics
* Aggregated Context Network For Semantic Segmentation Of Aerial Images
* AgSAT: A Smart Irrigation Application for Field-Scale Daily Crop ET and Water Requirements Using Satellite Imagery
* AI-Based Compression: A New Unintended Counter Attack on JPEG-Related Image Forensic Detectors?
* AI4EO Hyperview: A Spectralnet3d and RNNplus Approach for Sustainable Soil Parameter Estimation on Hyperspectral Image Data
* AiATrack: Attention in Attention for Transformer Visual Tracking
* AirDet: Few-Shot Detection Without Fine-Tuning for Autonomous Exploration
* AIVC: Artificial Intelligence Based Video Codec
* Algebraic Optimization Approach To Image Registration, An
* AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction
* All You Need Is RAW: Defending Against Adversarial Attacks with Camera Image Pipelines
* Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks
* AlphaVC: High-Performance and Efficient Learned Video Compression
* AMixer: Adaptive Weight Mixing for Self-Attention Free Vision Transformers
* Analyses of the Impact of Soil Conditions and Soil Degradation on Vegetation Vitality and Crop Productivity Based on Airborne Hyperspectral VNIR-SWIR-TIR Data in a Semi-Arid Rainfed Agricultural Area (Camarena, Central Spain)
* Analysis and Experimentation on the ManTraNet Image Forgery Detector
* Analysis Method for Metric-Level Switching in Beat Tracking, An
* Analysis of Drought Characteristics Projections for the Tibetan Plateau Based on the GFDL-ESM2M Climate Model
* Analysis of Internal Angle Error of UAV LiDAR Based on Rotating Mirror Scanning
* Analysis of Long-Term Aerosol Optical Properties Combining AERONET Sunphotometer and Satellite-Based Observations in Hong Kong
* Analysis of Spatio-Temporal Dynamics of Chinese Inland Water Clarity at Multiple Spatial Scales between 1984 and 2018
* Analysis of the Evolution of the Relationship between the Urban Pattern and Economic Development in Guangdong Province Based on Coupled Multisource Data
* Analysis of the Spatial and Temporal Evolution Patterns of Grassland Health and Its Driving Factors in Xilingol
* Analysis of Video Quality Induced Spatio-Temporal Saliency Shifts
* Analytical Algorithm for Tensor Tomography From Projections Acquired About Three Axes, An
* Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing, The
* ANFIS-EKF-Based Single-Beacon Localization Algorithm for AUV
* Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance
* AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment
* Anisotropic Edge Detection in Catadioptric Images
* Anomalib: A Deep Learning Library for Anomaly Detection
* Anomaly Detection of Metro Station Tracks Based on Sequential Updatable Anomaly Detection Framework
* Anomaly Matters: An Anomaly-Oriented Model for Medical Visual Question Answering
* Anti-Interference From Noisy Labels: Mean-Teacher-Assisted Confident Learning for Medical Image Segmentation
* Anti-Neuron Watermarking: Protecting Personal Data Against Unauthorized Neural Networks
* Anti-retroactive Interference for Lifelong Learning
* Any-Resolution Training for High-Resolution Image Synthesis
* Applicability Assessment of Coherent Doppler Wind LiDAR for Monitoring during Dusty Weather at the Northern Edge of the Tibetan Plateau
* Application and Evaluation of Deep Neural Networks for Airborne Hyperspectral Remote Sensing Mineral Mapping: A Case Study of the Baiyanghe Uranium Deposit in Northwestern Xinjiang, China
* Application of Remote Sensing Techniques to Identification of Underwater Airplane Wreck in Shallow Water Environment: Case Study of the Baltic Sea, Poland
* Application of Social Network Analysis in the Economic Connection of Urban Agglomerations Based on Nighttime Lights Remote Sensing: A Case Study in the New Western Land-Sea Corridor, China
* Approximate Differentiable Rendering with Algebraic Surfaces
* Approximate Discrete Optimal Transport Plan with Auxiliary Measure Method
* Approximating Relu Networks by Single-Spike Computation
* ARAH: Animatable Volume Rendering of Articulated Human SDFs
* Are Vision Transformers Robust to Patch Perturbations?
* ARF: Artistic Radiance Fields
* Arg-Cnn: An Attention-Based Network for Plant Identification
* ARM: Any-Time Super-Resolution Method
* ART-SS: An Adaptive Rejection Technique for Semi-Supervised Restoration for Adverse Weather-Affected Images
* Artificial Intelligence-Based Learning Approaches for Remote Sensing
* ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer
* Assessing Nitrogen Variability at Early Stages of Maize Using Mobile Fluorescence Sensing
* Assessing the Accessibility of Swimming Pools in Nanjing by Walking and Cycling Using Baidu Maps
* Assessment of Fire Regimes and Post-Fire Evolution of Burned Areas with the Dynamic Time Warping Method on Time Series of Satellite Images: Setting the Methodological Framework in the Peloponnese, Greece
* Assessment of Image Manipulation Using Natural Language Description: Quantification of Manipulation Direction
* ASSISTER: Assistive Navigation via Conditional Instruction Generation
* AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant
* Asymmetric Relation Consistency Reasoning for Video Relation Grounding
* ATCA: An ARC Trajectory Based Model with Curvature Attention for Video Frame Interpolation
* Attaining Class-Level Forgetting in Pretrained Model Using Few Samples
* Attention Diversification for Domain Generalization
* Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines
* Attention-Based Neural Network For Ill-Exposed Image Correction
* Attention-Driven Graph Neural Network for Deep Face Super-Resolution
* Attribute Conditioned Fashion Image Captioning
* AU-Aware 3D Face Reconstruction through Personalized AU-Specific Blendshape Learning
* Audio-Driven Stylized Gesture Generation with Flow-Based Model
* Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment
* Audio-Visual Segmentation
* AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation
* Augmentation of rPPG Benchmark Datasets: Learning to Remove and Embed rPPG Signals via Double Cycle Consistent Learning from Unpaired Facial Videos
* Augmented Equivariant Attention Networks for Microscopy Image Transformation
* Augmenting Deep Classifiers with Polynomial Neural Networks
* Auth-Persons: A Dataset for Detecting Humans in Crowds from Aerial Views
* Authentication Of Copy Detection Patterns Under Machine Learning Attacks: A Supervised Approach
* Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation
* Auto-regressive Image Synthesis with Integrated Quantization
* AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling
* Autolv: Automatic Lecture Video Generator
* Automatic Check-Out via Prototype-Based Classifier Learning from Single-Product Exemplars
* Automatic Dataset Generation for Specific Object Detection
* Automatic Defect Segmentation by Unsupervised Anomaly Learning
* Automatic Dense Annotation of Large-Vocabulary Sign Language Videos
* Automatic Detection of Sentimentality from Facial Expressions
* Automatic Filtering and Classification of Low-Density Airborne Laser Scanner Clouds in Shrubland Environments
* Automatic Fuzzy Graph Construction For Interpretable Image Classification
* Automatic Illumination of Flat-Colored Drawings by 3D Augmentation of 2D Silhouettes
* Automatic Inspection of Cultural Monuments Using Deep and Tensor-Based Learning on Hyperspectral Imagery
* Automatic Laboratory Martian Rock and Mineral Classification Using Highly-Discriminative Representation Derived from Spectral Signatures
* Automatic Moving Pose Grading for Golf Swing in Sports
* Automating Detection of Papilledema in Pediatric Fundus Images with Explainable Machine Learning
* AutoMix: Unveiling the Power of Mixup for Stronger Classifiers
* Autonomous Generation of Service Strategy for Household Tasks: A Progressive Learning Method With A Priori Knowledge and Reinforcement Learning
* Autonomous Tracking and State Estimation With Generalized Group Lasso
* Autoregressive 3D Shape Generation via Canonical Mapping
* Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction
* AutoTransition: Learning to Recommend Video Transition Effects
* AV-GAZE: A Study on the Effectiveness of Audio Guided Visual Attention Estimation for Non-profilic Faces
* AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture
* AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing
* AVT: Au-Assisted Visual Transformer for Facial Expression Recognition
* Aware of the History: Trajectory Forecasting with the Local Behavior Data
* Azimuth-Elevation Direction Finding With a Pair of Acoustic Vector Sensors in the Presence of a Reflecting Boundary
* BA-Net: Bridge Attention for Deep Convolutional Neural Networks
* Back to Old Constraints to Jointly Supervise Learning Depth, Camera Motion and Optical Flow in a Monocular Video
* Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
* Background-Insensitive Scene Text Recognition with Text Semantic Segmentation
* Background-Tolerant Object Classification With Embedded Segmentation Mask For Infrared and Color Imagery
* Bag-Of-Features-Based Knowledge Distillation For Lightweight Convolutional Neural Networks
* Bagging Regional Classification Activation Maps for Weakly Supervised Object Localization
* Balanced Affinity Loss for Highly Imbalanced Baggage Threat Contour-Driven Instance Segmentation
* Balancing Between Forgetting and Acquisition in Incremental Subpopulation Learning
* Balancing Stability and Plasticity Through Advanced Null Space in Continual Learning
* Banana Mapping in Heterogenous Smallholder Farming Systems Using High-Resolution Remote Sensing Imagery and Machine Learning Models with Implications for Banana Bunchy Top Disease Surveillance
* Banding vs. Quality: perceptual impact and objective assessment
* Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT
* BANet: A Blur-Aware Attention Network for Dynamic Scene Deblurring
* Barycentric Defense
* BASQ: Branch-wise Activation-clipping Search Quantization for Sub-4-bit Neural Networks
* Batch Size Reconstruction-Distribution Trade-Off In Kernel Based Generative Autoencoders
* Batch-Efficient EigenDecomposition for Small and Medium Matrices
* BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation
* BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks
* Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning
* Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration
* BEAT: A Large-Scale Semantic and Emotional Multi-modal Dataset for Conversational Gestures Synthesis
* Benchmarking 3D Face De-Identification with Preserving Facial Attributes
* Benchmarking Different SfM-MVS Photogrammetric and iOS LiDAR Acquisition Methods for the Digital Preservation of a Short-Lived Excavation: A Case Study from an Area of Sinkhole Related Subsidence
* Benchmarking of Deep Architectures for Segmentation of Medical Images
* Benchmarking Omni-Vision Representation Through the Lens of Visual Realms
* Benchmarking Performance of Object Detection Under Image Distortions in an Uncontrolled Environment
* BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers
* Beyond Bjøntegaard: Limits of Video Compression Performance Comparisons
* Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs
* BGSNet: Bidirectional-Guided Semi-3D Network for Prediction of Hematoma Expansion
* Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation
* Bi-Directional Inter-Prediction For Geometry-Based Point Cloud Compression
* Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation
* Bi-level Feature Alignment for Versatile Image Translation and Manipulation
* Bi-Modal Compositional Network for Feature Disentanglement
* Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation
* Bi-Polar Mask for Joint Cell and Nuclei Instance Segmentation
* Biases Analysis and Calibration of ICESat-2/ATLAS Data Based on Crossover Adjustment Method
* BigColor: Colorization Using a Generative Color Prior for Natural Images
* Bilateral Normal Integration
* Bilevel Training Schemes in Imaging for Total Variation: Type Functionals with Convex Integrands
* BiN-Flow: Bidirectional Normalizing Flow for Robust Image Dehazing
* Bina-Rep Event Frames: A Simple and Effective Representation for Event-Based Cameras
* Binary Morphological Neural Network
* Biologically Plausible Illusionary Contrast Perception with Spiking Neural Networks
* BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-Aided Adversarial Learning
* Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach
* Black Ice Detection Method Based on 1-Dimensional CNN Using mmWave Sensor Backscattering, A
* Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack
* Black-Box Few-Shot Knowledge Distillation
* Blind Deconvolution Using the Sure-Blur Criterion and Linear PSF Expansions
* Blind Image Decomposition
* Blind Image Quality Assessment for Authentic Distortions by Intermediary Enhancement and Iterative Training
* Blind Quality of a 3D Reconstructed MESH
* Blind Video Quality Assessment via Space-Time Slice Statistics
* BlobGAN: Spatially Disentangled Scene Representations
* BLT: Bidirectional Layout Transformer for Controllable Layout Generation
* BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation
* BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking
* Boosting Event Stream Super-Resolution with a Recurrent Neural Network
* Boosting Supervised Dehazing Methods via Bi-level Patch Reweighting
* Boosting Supervised Learning in Small Data Regimes with Conditional GAN Augmentation
* Boosting the Performance of Weakly-Supervised 3d Human Pose Estimators With Pose Prior Regularizers
* Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks
* Bootstrapped Masked Autoencoders for Vision BERT Pretraining
* Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds
* Boundary Corrected Multi-Scale Fusion Network for Real-Time Semantic Segmentation
* Boundary-Area Enhanced Module for Instance Segmentation
* BoundaryFace: A Mining Framework with Noise Label Self-correction for Face Recognition
* Bounding Box Disparity: 3D Metrics for Object Detection with Full Degree of Freedom
* Box-Supervised Instance Segmentation with Level Set Evolution
* Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation using Bounding Boxes
* BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis
* Breadcrumbs: Adversarial Class-Balanced Sampling for Long-Tailed Recognition
* Break and Make: Interactive Structural Understanding Using LEGO Bricks
* Breaking down Polyblur: Fast Blind Correction of Small Anisotropic Blurs
* Breakpoint Dependent Scalable Coding of Optical Flow Volume
* Bregman Majorization-Minimization Framework for Pet Image Reconstruction, A
* Bregman Plug-And-Play Priors
* Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection
* Bridging the Domain Gap in Real World Super-Resolution
* Bridging the Domain Gap Towards Generalization in Automatic Colorization
* Bridging the Gap Between Image Coding for Machines and Humans
* Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions
* Bringing Rolling Shutter Images Alive with Dual Reversed Distortion
* BRIO-TA Dataset: Understanding Anomalous Assembly Process in Manufacturing, The
* BRNet: Exploring Comprehensive Features for Monocular Depth Estimation
* Broad Study of Pre-training for Domain Generalization and Adaptation, A
* Building Extraction and Floor Area Estimation at the Village Level in Rural China Via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet
* Building Inspection Toolkit: Unified Evaluation And Strong Baselines For Bridge Damage Recognition
* BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering
* Bureya Landslide Recent Evolution According to Spaceborne SAR Interferometry Data, The
* Burn After Reading: Online Adaptation for Cross-domain Streaming Data
* ByteTrack: Multi-object Tracking by Associating Every Detection Box
* BézierPalm: A Free Lunch for Palmprint Recognition
* C2FNet: A Coarse-to-Fine Network for Multi-View 3D Point Cloud Generation
* C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation
* CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation
* CACLA-Based Local Path Planner for Drones Navigating Unknown Indoor Corridors
* CADyQ: Content-Aware Dynamic Quantization for Image Super-Resolution
* Calculation Model for Ground Surface Temperature in High-Altitude Regions of the Qinghai-Tibet Plateau, China, A
* Calibration-Free Multi-view Crowd Counting
* Caltech Fish Counting Dataset: A Benchmark for Multiple-Object Tracking and Counting, The
* Camera Auto-calibration from the Steiner Conic of the Fundamental Matrix
* Camera Pose Auto-encoders for Improving Pose Regression
* Camera Pose Estimation and Localization with Active Audio Sensing
* Camera Self-Calibration: Deep Learning from Driving Scenes
* Can Data Assimilation Improve Short-Term Prediction of Land Surface Variables?
* Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding
* CANF-VC: Conditional Augmented Normalizing Flows for Video Compression
* Capturing, Reconstructing, and Simulating: The UrbanScene3D Dataset
* CAR: Class-Aware Regularizations for Semantic Segmentation
* Cartoon Explanations of Image Classifiers
* Case Study of a Calibration Problem in Acquired Hyperspectral Images
* Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks
* CATRE: Iterative Point Clouds Alignment for Category-Level Object Pose Refinement
* CAViT: Contextual Alignment Vision Transformer for Video Object Re-identification
* CBPT: A New Backbone for Enhancing Information Transmission of Vision Transformers
* CCL: Class-Wise Curriculum Learning for Class Imbalance Problems
* CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer
* CDANet: Channel Split Dual Attention Based CNN for Brain Tumor Classification In Mr Images
* CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
* CenterFormer: Center-Based Transformer for 3D Object Detection
* Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels
* Cervix Detection Driven Deep Learning Approach for Cow Heat Analysis from Endoscopic Images, A
* Chairs Can Be Stood On: Overcoming Object Bias in Human-Object Interaction Detection
* Challenges of Continuous Self-Supervised Learning, The
* Channel-Position Self-Attention with Query Refinement Skeleton Graph Neural Network in Human Pose Estimation
* Channel-Wise Bit Allocation for Deep Visual Feature Quantization
* Characteristics of Raindrop Size Distributions in Different Climatological Regions in South Korea, The
* Check and Link: Pairwise Lesion Correspondence Guides Mammogram Mass Detection
* Chinese Mandarin Lipreading using Cascaded Transformers with Multiple Intermediate Representations
* CHORE: Contact, Human and Object Reconstruction from a Single RGB Image
* ChunkyGAN: Real Image Inversion via Segments
* CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection
* CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-Scale Indoor Scene
* Class Activation Map Refinement via Semantic Affinity Exploration for Weakly Supervised Object Detection
* Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization
* Class-Agnostic Object Counting Robust to Intraclass Diversity
* Class-Agnostic Object Detection with Multi-modal Transformer
* Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer
* Class-Incremental Novel Class Discovery
* Class-Wise FM-NMS for Knowledge Distillation of Object Detection
* Classification of Terrestrial Laser Scanner Point Clouds: A Comparison of Methods for Landslide Monitoring from Mathematical Surface Approximation
* Classification-Regression for Chart Comprehension
* CLAST: Contrastive Learning for Arbitrary Style Transfer
* CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
* ClearPose: Large-scale Transparent Object Dataset and Benchmark
* CLID: A Chunk-Level Intent Detection Framework for Multiple Intent Spoken Language Understanding
* CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation
* CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes
* CLOSE: Curriculum Learning on the Sharing Extent Towards Better One-Shot NAS
* Closer Look at Invariances in Self-supervised Pre-training for 3D Vision, A
* Cloud 3D Dataset and Application-Specific Learned Image Compression in Cloud 3D, A
* Cluster-Based 3D Keypoint Detection for Category-Agnostic 6D Pose Tracking
* Clustering by Directly Disentangling Latent Space
* Clustering-Based Psychometric No-Reference Quality Model for Point Cloud Video
* CMA-CLIP: Cross-Modality Attention Clip for Text-Image Classification
* CMD: Self-supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation
* CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds
* CNN-Based Fast CU Partitioning Algorithm for VVC Intra Coding
* CNN-Based Local Tone Mapping in the Perceptual Quantization Domain
* CNN-Based Post-Processor for Perceptually-Optimized Immersive Media Compression, A
* Co-Clustering on Bipartite Graphs for Robust Model Fitting
* Co-Correcting: Combat Noisy Labels in Space Debris Detection
* Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data
* Coarse-To-Fine Incremental Few-Shot Learning
* Coarse-to-Fine Morphological Approach with Knowledge-Based Rules and Self-Adapting Correction for Lung Nodules Segmentation, A
* Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction
* CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving
* Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution, A
* CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
* COFENet: Co-Feature Neural Network Model for Fine-Grained Image Classification
* Cognitive Perspective on Subjective and Objective Diagnostic Image Quality Models, A
* CoGS: Controllable Generation and Search from Sketch and Style
* Collaborating Domain-Shared and Target-Specific Feature Clustering for Cross-domain 3D Action Recognition
* Collaborative Consistent Knowledge Distillation Framework for Remote Sensing Image Scene Classification Network
* Collaborative learning network for head pose estimation
* Color Alignment for Relative Color Constancy via Non-Standard References
* Color Constancy Beyond Standard Illuminants
* Color Image Restoration in the Low Photon-Count Regime Using Expectation Propagation
* ColorFormer: Image Colorization via Color Memory Assisted Hybrid-Attention Transformer
* Colorization for in situ Marine Plankton Images
* Combating Label Distribution Shift for Active Domain Adaptation
* Combining Internal and External Constraints for Unrolling Shutter in Videos
* Combining Non-Data-Adaptive Transforms for OCT Image Denoising by Iterative Basis Pursuit
* CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression Recognition
* Comparative Study of Graph Matching Algorithms in Computer Vision, A
* Comparative Study of Various Deep Learning Approaches to Shape Encoding of Planar Geospatial Objects, A
* Comparing Sentinel-2 and Landsat 8 for Burn Severity Mapping in Western North America
* Comparing Vector Fields Across Surfaces: Interest for Characterizing the Orientations of Cortical Folds
* Comparison of Accelerated Versions of the Iterative Gradient Method to Ameliorate the Spatial Resolution of Microwave Radiometer Products
* Comparison of Different Atmospheric Turbulence Simulation Methods for Image Restoration, A
* Comparison of Phase-based Sub-Pixel Motion Estimation Methods
* Comparison of Physical-Based Models to Measure Forest Resilience to Fire as a Function of Burn Severity
* Comparison of Regularization Methods for Near-Light-Source Perspective Shape-from-Shading, A
* Comparison of Three Convolution Neural Network Schemes to Retrieve Temperature and Humidity Profiles from the FY4A GIIRS Observations
* Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution
* Complementing Brightness Constancy with Deep Networks for Optical Flow Prediction
* Completely Self-supervised Crowd Counting via Distribution Matching
* CompNVS: Novel View Synthesis with Scene Completion
* Component-Based Transformation for Person Image Generation
* COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality
* Compositional Human-Scene Interaction Synthesis with Semantic Control
* Compositional Visual Generation with Composable Diffusion Models
* Compound Prototype Matching for Few-Shot Action Recognition
* Compression of User Generated Content Using Denoised References
* Computationally-Efficient Vision Transformer for Medical Image Semantic Segmentation Via Dual Pseudo-Label Supervision
* Computing and Assistive Technology Solutions for the Visually Impaired
* Computing Curvature, Mean Curvature and Weighted Mean Curvature
* ConCL: Concept Contrastive Learning for Dense Prediction Pre-training in Pathology Images
* Concurrent Subsidiary Supervision for Unsupervised Source-Free Domain Adaptation
* Conditional Reconstruction for Open-Set Semantic Segmentation
* Conditional RGB-T Fusion for Effective Crowd Counting
* Conditional Stroke Recovery for Fine-Grained Sketch-Based Image Retrieval
* Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification
* ConMatch: Semi-supervised Learning with Confidence-Guided Consistency Regularization
* Conmw Transformer: A General Vision Transformer Backbone With Merged-Window Attention
* Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search
* Constrained Mean Shift Using Distant yet Related Neighbors for Representation Learning
* Constructing Balance from Imbalance for Long-Tailed Image Recognition
* Content Adaptive Latents and Decoder for Neural Image Compression
* Content-Adaptive Neural Network Post-Processing Filter with NNR-Coded Weight-Updates
* Content-Oriented Learned Image Compression
* Context Relation Fusion Model for Visual Question Answering
* Context-Aware Hierarchical Transformer for Fine-Grained Video-Text Retrieval
* Context-Aware Streaming Perception in Dynamic Environments
* Context-Consistent Semantic Image Editing with Style-Preserved Modulation
* Context-Enhanced Stereo Transformer
* Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression
* Contextual and Cross-Modal Interaction for Multi-Modal Speech Emotion Recognition
* Contextual Text Block Detection Towards Scene Text Understanding
* Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
* Continual Contrastive Learning for Cross-Dataset Scene Classification
* Continual Learning in Vision Transformer
* Continual Referring Expression Comprehension via Dual Modular Memorization
* Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment
* Continual Variational Autoencoder Learning via Online Cooperative Memorization
* Continuous PDR and GNSS Fusing Algorithm for Smartphone Positioning, A
* Contrast-Phys: Unsupervised Video-Based Remote Physiological Measurement via Spatiotemporal Contrast
* Contrasting Quadratic Assignments for Set-Based Representation Learning
* Contrastive and Selective Hidden Embeddings for Medical Image Segmentation
* Contrastive Deep Supervision
* Contrastive Learning for Diverse Disentangled Foreground Generation
* Contrastive Learning for Online Semi-Supervised General Continual Learning
* Contrastive Monotonic Pixel-Level Modulation
* Contrastive Objective for Learning Disentangled Representations, A
* Contrastive Positive Mining for Unsupervised 3D Action Representation Learning
* Contrastive Prototypical Network with Wasserstein Confidence Penalty
* Contrastive Vicinal Space for Unsupervised Domain Adaptation
* Contrastive Vision-Language Pre-training with Limited Resources
* Contributions of Shape, Texture, and Color in Visual Recognition
* Controllable and Guided Face Synthesis for Unconstrained Face Recognition
* controllable face forgery framework to enrich face-privacy-protection datasets, A
* Controllable Shadow Generation Using Pixel Height Maps
* Controllable Video Generation Through Global and Local Motion Dynamics
* Convex Quadratic Programming for Slimming Convolutional Networks
* Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
* Convolutional neural networks for obstacle detection on the road and driving assistance
* Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A Benchmark
* Convolutional Neural Tree for Video-Based Facial Expression Recognition Embedding Emotion Wheel as Inductive Bias
* Convolutional Sparse Coding with Weighted L1 Norm for Phase Retrieval: Algorithm and Its Deep Unfolded Network
* COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts
* Cornerformer: Purifying Instances for Corner-Based Detectors
* Coronary Artery Centerline Tracking with the Morphological Skeleton Loss
* Correspondence Reweighted Translation Averaging
* CoSCL: Cooperation of Small Continual Learners is Stronger Than a Big One
* CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation
* Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation
* CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image
* COUCH: Towards Controllable Human-Chair Interactions
* Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification
* CoupleFace: Relation Matters for Face Recognition Distillation
* Coupling Attention and Convolution for Heuristic Network in Visual Dialog
* CoVisPose: Co-visibility Pose Transformer for Wide-Baseline Relative Pose Estimation in 360° Indoor Panoramas
* CP 2: Copy-Paste Contrastive Pretraining for Semantic Segmentation
* CPO: Change Robust Panorama to Point Cloud Localization
* CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
* CRAB: Certified Patch Robustness Against Poisoning-Based Backdoor Attacks
* CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection
* CRCNet: Few-Shot Segmentation with Cross-Reference and Region-Global Conditional Networks
* Creating 3D Gramian Angular Field Representations for Higher Performance Energy Data Classification
* Crop Classification and Representative Crop Rotation Identifying Using Statistical Features of Time-Series Sentinel-1 GRD Data
* Cross Attention Based Style Distribution for Controllable Person Image Synthesis
* Cross Domain Low-Dose CT Image Denoising With Semantic Information Alignment
* Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers
* Cross-Collaborative Fusion-Encoder Network for Robust RGB-Thermal Salient Object Detection
* Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations
* Cross-domain Ensemble Distillation for Domain Generalization
* Cross-Domain Few-Shot Semantic Segmentation
* Cross-modal 3D Shape Generation and Manipulation
* Cross-Modal Knowledge Transfer Without Task-Relevant Source Data
* Cross-Modal Prototype Driven Network for Radiology Report Generation
* Cross-Modality Image Registration Using a Training-Time Privileged Third Modality
* Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection
* Cross-Modality Transformer for Visible-Infrared Person Re-Identification
* Cross-Type Attribute Prediction For Point Cloud Compression
* Crowd-Powered Face Manipulation Detection: Fusing Human Examiner Decisions
* CryoAI: Amortized Inference of Poses for Ab Initio Reconstruction of 3D Molecular Volumes from Real Cryo-EM Images
* CSTNET: Enhancing Global-To-Local Interactions for Image Captioning
* CT2: Colorization Transformer via Color Tokens
* CTGAN: Cloud Transformer Generative Adversarial Network
* Cu-Net: Towards Continuous Multi-Class Contour Detection for Retinal Layer Segmentation In OCT Images
* Custom Structure Preservation in Face Aging
* CVIDS: A Collaborative Localization and Dense Mapping Framework for Multi-Agent Based Visual-Inertial SLAM
* CX-DaGAN: Domain Adaptation for Pneumonia Diagnosis on a Small Chest X-Ray Dataset
* CXR Segmentation by AdaIN-Based Domain Adaptation and Knowledge Distillation
* CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation
* CycDA: Unsupervised Cycle Domain Adaptation to Learn from Image to Video
* CyEDA: Cycle-Object Edge Consistency Domain Adaptation
* D 3 Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
* D&D: Learning Human Dynamics from Dynamic Camera
* D-CBRS: Accounting for Intra-Class Diversity in Continual Learning
* D2-TPred: Discontinuous Dependency for Trajectory Prediction Under Traffic Lights
* D2ADA: Dynamic Density-Aware Active Domain Adaptation for Semantic Segmentation
* D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution
* D2HNet: Joint Denoising and Deblurring with Hierarchical Network for Robust Night Image Restoration
* DACNN: Blind Image Quality Assessment via a Distortion-Aware Convolutional Neural Network
* DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks
* DARTS-PD: Differentiable Architecture Search with Path-Wise Weight Sharing Derivation
* DAS: Densely-Anchored Sampling for Deep Metric Learning
* DAT: Domain Adaptive Transformer for Domain Adaptive Semantic Segmentation
* Data Association Between Event Streams and Intensity Frames Under Diverse Baselines
* Data Efficient 3D Learner via Knowledge Transferred from 2D Model
* Data Fusion Method for Generating Hourly Seamless Land Surface Temperature from Himawari-8 AHI Data, A
* Data Invariants to Understand Unsupervised Out-of-Distribution Detection
* Data-Centric Approach for Improving Ambiguous Labels with Combined Semi-supervised Classification and Clustering, A
* Data-Driven Approach for Automated Integrated Circuit Segmentation of Scan Electron Microscopy Images, A
* Data-Free Backdoor Removal Based on Channel Lipschitzness
* Data-Free Neural Architecture Search via Recursive Label Calibration
* Database of Visual Color Differences of Modern Smartphone Photography, A
* Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility, A
* Dataset Generation Framework for Evaluating Megapixel Image Classifiers and Their Explanations, A
* DaViT: Dual Attention Vision Transformers
* DCAN: A Dual Cascade Attention Network for Fusing Pet and MRI Images
* DCCF: Deep Comprehensible Color Filter Learning Framework for High-Resolution Image Harmonization
* DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation
* DCT-Based Residual Network for NIR Image Colorization
* DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimationo
* Decomposing the Tangent of Occluding Boundaries According to Curvatures and Torsions
* Decouple-and-Sample: Protecting Sensitive Information in Task Agnostic Data Release
* Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness
* Decoupled Contrastive Learning
* DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation
* Deeblif: Deep Blind Light Field Image Quality Assessment by Extracting Angular and Spatial Information
* DEEMD: Drug Efficacy Estimation Against SARS-CoV-2 Based on Cell Morphology With Deep Multiple Instance Learning
* Deep 360° Optical Flow Estimation Based on Multi-projection Fusion
* Deep Active Learning for Cryo-Electron Tomography Classification
* Deep Bayesian Video Frame Interpolation
* Deep Convolutional K-Means Clustering
* Deep Ensemble Learning Approach to Lung CT Segmentation for Covid-19 Severity Assessment, A
* Deep Ensemble Learning by Diverse Knowledge Distillation for Fine-Grained Object Classification
* Deep Ensemble Learning Model Based on Covariance Pooling of Multi-Layer CNN Features
* Deep Feature Compression using Rate-Distortion Optimization Guided Autoencoder
* Deep Fourier-Based Exposure Correction Network with Spatial-Frequency Interaction
* Deep Hash Distillation for Image Retrieval
* Deep Image Debanding
* Deep Incremental Optical Flow Coding For Learned Video Compression
* Deep Kernel Representation for Image Reconstruction in PET
* Deep Learning Application for Deformation Prediction from Ground-Based InSAR, A
* Deep Learning Based EEG Analysis Using Video Analytics
* Deep Learning Based Method for Railway Overhead Wire Reconstruction from Airborne LiDAR Data, A
* Deep Learning Classification of Large-Scale Point Clouds: A Case Study on Cuneiform Tablets
* Deep Learning From Imaging Genetics for Schizophrenia Classification
* Deep Learning Meets Radiomics For End-To-End Brain Tumor MRI Analysis
* Deep Learning of Radiometrical and Geometrical SAR Distorsions for Image Modality translations
* Deep Learning: Based Dictionary Learning and Tomographic Image Reconstruction
* Deep Metric Learning-Based Semi-Supervised Regression with Alternate Learning
* Deep Moving-Camera Background Model, A
* Deep Neural Network-Based Noisy Pixel Estimation for Breast Ultrasound Segmentation
* Deep Partial Updating: Towards Communication Efficient Updating for On-Device Inference
* Deep Portrait Delighting
* Deep Radial Embedding for Visual Sequence Learning
* Deep Residual Networks with Common Linear Multi-Step and Advanced Numerical Schemes
* Deep Semantic Statistics Matching (D2SM) Denoising Network
* Deep Shape-from-Template: Single-image quasi-isometric deformable registration and reconstruction
* Deep Unfolding of Image Denoising by Quantum Interactive Patches
* Deep Unrolling of Diffusion Process with Morphological Laplacian and its Implementation with SIMD Instructions
* Deep Visual Place Recognition for Waterborne Domains
* Deep Weighted Consensus Dense Correspondence Confidence Maps for 3d Shape Registration
* Deep-Based Quality Assessment of Medical Images Through Domain Adaptation
* Deep-Learning-Based Electrical Noise Removal Enables High Spectral Optoacoustic Contrast in Deep Tissue
* Deeply Learned Structure-Aware Transmission for Image Haze Removal
* DeepMend: Learning Occupancy Functions to Represent Shape for Repair
* DeepPS2: Revisiting Photometric Stereo Using Two Differently Illuminated Images
* DeepSAR: Vessel Detection in SAR Imagery with Noisy Labels
* DeepShadow: Neural Shape from Shadow
* Defect Detection Method Based on BC-YOLO for Transmission Line Components in UAV Remote Sensing Images, A
* Defining Point Cloud Boundaries Using Pseudopotential Scalar Field Implicit Surfaces
* Defocus Deblur Microscopy via Head-to-Tail Cross-Scale Fusion
* Deformable Alignment And Scale-Adaptive Feature Extraction Network For Continuous-Scale Satellite Video Super-Resolution
* Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection
* Deforming Radiance Fields with Cages
* DeiT III: Revenge of the ViT
* DEKRV2: More Accurate or Fast than DEKR
* Delineating Fire-Hazardous Areas and Fire-Induced Patterns Based on Visible Infrared Imaging Radiometer Suite (VIIRS) Active Fires in Northeast China
* Delta Distillation for Efficient Video Processing
* DeltaGAN: Towards Diverse Few-Shot Image Generation with Sample-Specific Delta
* DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image
* Delving into Details: Synopsis-to-Detail Networks for Video Recognition
* Delving into Inter-Image Invariance for Unsupervised Visual Representations
* Delving into Universal Lesion Segmentation: Method, Dataset, and Benchmark
* DeMFI: Deep Joint Deblurring and Multi-frame Interpolation with Flow-Guided Attentive Correlation and Recursive Boosting
* Demystifying Unsupervised Semantic Correspondence Estimation
* Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
* Dense Gaussian Processes for Few-Shot Segmentation
* Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing, A
* Dense Siamese Network for Dense Unsupervised Learning
* Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection
* DenseHybrid: Hybrid Anomaly Detection for Dense Open-Set Recognition
* Densely Constrained Depth Estimator for Monocular 3D Object Detection
* Depth Field Networks For Generalizable Multi-View Scene Representation
* Depth is all you Need: Single-Stage Weakly Supervised Semantic Segmentation From Image-Level Supervision
* Depth Map Decomposition for Monocular Depth Estimation
* Depth-Cooperated Trimodal Network for Video Salient Object Detection
* Depthformer: Multiscale Vision Transformer for Monocular Depth Estimation with Global Local Information Fusion
* Design and Implementation of Geospatial Information Verification Middle Platform for Natural Resources Government Affairs, The
* Design of f-SCAN Acquisition Mode for Synthetic Aperture Radar
* Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping
* Detecting A Child's Stimming Behaviours for Autism Spectrum Disorder Diagnosis using Rgbpose-Slowfast Network
* Detecting and Recovering Sequential DeepFake Manipulation
* Detecting GAN-Generated Images by Orthogonal Training of Multiple CNNs
* Detecting Generated Images by Real Images
* Detecting Maritime Infrared Targets in Harsh Environment by Improved Visual Attention Model Preselector and Anti-Jitter Spatiotemporal Filter Discriminator
* Detecting Tampered Scene Text in the Wild
* Detecting Twenty-Thousand Classes Using Image-Level Supervision
* Detecting Wheat Heads from UAV Low-Altitude Remote Sensing Images Using Deep Learning Based on Transformer
* Detection of a Rare Multichannel Gaussian Signal via Higher Criticism
* Detection of Glass Insulators Using Deep Neural Networks Based on Optical Imaging
* Detection of the New Class of Hypersonic Targets under Emerging Hyperspectral Sample Streams: An Unsupervised Isolation Forest Solution
* Detection-Identification Balancing Margin Loss for One-Stage Multi-Object Tracking
* DetMatch: Two Teachers are Better than One for Joint 2D and 3D Semi-Supervised Object Detection
* Developing a Dual-Stream Deep-Learning Neural Network Model for Improving County-Level Winter Wheat Yield Estimates in China
* DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
* DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction
* DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
* DFNet: Enhance Absolute Pose Regression with Direct Feature Matching
* DH-AUG: DH Forward Kinematics Model Driven Augmentation for 3D Human Pose Estimation
* DHNet: Salient Object Detection With Dynamic Scale-Aware Learning and Hard-Sample Refinement
* Diagnosing Autism Spectrum Disorder Using Ensemble 3D-CNN: A Preliminary Study
* DICE: Leveraging Sparsification for Out-of-Distribution Detection
* DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection
* DIFAI: Diverse Facial Inpainting using StyleGAN Inversion
* DiffConv: Analyzing Irregular Point Clouds with an Irregular View
* Differentiable Raycasting for Self-Supervised Occupancy Forecasting
* Differentiable SAR Renderer and Image-Based Target Reconstruction
* Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images
* Differential Contrast Based Adaptive Quantization for Perceptual Quality Optimization in Image Coding
* Differential Invariants for SE(2)-Equivariant Networks
* Differential Pseudo-Image for Skeleton-Based Dynamic Gesture Recognition
* Difficulty-Aware Simulator for Open Set Recognition
* DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model
* DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras
* Digging into Radiance Grid for Real-Time View Synthesis with Detail Preservation
* Digital Data and Semantic Simulation: The Survey of the Ruins of the Convent of the Paolotti (12th Century A.D.)
* Dimensionality Reduction Techniques with Hydranet Framework for HSI Classification
* Direct Alignment of Narrow Field-of-View Hyperspectral Data and Full-View RGB Image
* Direct Handheld Burst Imaging to Simulated Defocus
* Direct Imaging Using Physics Informed Neural Networks
* Directed Ray Distance Functions for 3D Scene Reconstruction
* DisCo: Remedying Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning
* Discover and Mitigate Unknown Biases with Debiasing Alternate Networks
* Discovering Deformable Keypoint Pyramids
* Discovering Human-Object Interaction Concepts via Self-Compositional Learning
* Discovering Transferable Forensic Features for CNN-Generated Images Detection
* Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search
* Discrete Metric Learning for Fast Image Set Classification
* Discrete-Constrained Regression for Local Counting Models
* Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
* Discriminate Clearer To Rank Better: Image Cropping By Amplifying View-Wise Differences
* Discrimination of Rock Units in Karst Terrains Using Sentinel-2A Imagery
* Disentangled Capsule Routing for Fast Part-Object Relational Saliency
* Disentangled Differentiable Network Pruning
* Disentangled Sequential Autoencoder with Local Consistency for Infectious Keratitis Diagnosis
* Disentangling Architecture and Training for Optical Flow
* Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth
* Disentangling the Frequency Content in Optoacoustics
* DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation
* Dispense Mode for Inference to Accelerate Branchynet
* Distilling DETR-Like Detectors with Instance-Aware Feature
* Distilling Facial Knowledge with Teacher-Tasks: Semantic-Segmentation-Features For Pose-Invariant Face-Recognition
* Distilling Knowledge From Object Classification to Aesthetics Assessment
* Distilling Object Detectors with Global Knowledge
* Distilling the Undistillable: Learning from a Nasty Teacher
* DistPro: Searching a Fast Knowledge Distillation Process via Meta Optimization
* Distributed Radar Autofocus Imaging Using Deep Priors
* Distribution-Driven Predictor Screening For Point Cloud Attribute Compression
* Diurnal Variations in Different Precipitation Duration Events over the Yangtze River Delta Urban Agglomeration
* Diverse Generation from a Single Video Made Possible
* Diverse Generative Perturbations on Attention Space for Transferable Adversarial Attacks
* Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors
* Diverse Image Inpainting with Normalizing Flow
* Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection
* DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning
* DLME: Deep Local-Flatness Manifold Embedding
* DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment
* Document Layout Analysis Via Positional Encoding
* Document Shadow Removal with Foreground Detection Learning From Fully Synthetic Images
* DODA: Data-Oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation
* Does Physical Interpretability of Observation Map Improve Photometric Stereo Networks?
* Domain Adaptation for Unknown Image Distortions in Instance Segmentation
* Domain Adaptive Hand Keypoint and Pixel Localization in the Wild
* Domain Adaptive Person Search
* Domain Adaptive Video Segmentation via Temporal Pseudo Supervision
* Domain Generalization by Mutual-Information Regularization with Pre-trained Models
* Domain Invariant Masked Autoencoders for Self-supervised Learning from Multi-domains
* Domain Knowledge-Informed Self-supervised Representations for Workout Form Assessment
* Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects
* Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context
* DoodleFormer: Creative Sketch Drawing with Transformers
* Dose-Blind Denoising With Deep Learning in Cardiac Spect
* Dot-Product Based Global and Local Feature Fusion for Image Search
* Double-Stream Position Learning Transformer Network for Image Captioning
* Doubly Deformable Aggregation of Covariance Matrices for Few-Shot Segmentation
* Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation
* Downsampling Based Light Field Video Coding with Restoration Network Using Joint Spatio-Angular and Epipolar Information
* Downscaling SMAP Brightness Temperatures to 3 km Using CYGNSS Reflectivity Observations: Factors That Affect Spatial Heterogeneity
* DP-MHT-TBD: A Dynamic Programming and Multiple Hypothesis Testing-Based Infrared Dim Point Target Detection Algorithm
* DPNET: Dual-Path Network for Efficient Object Detection with Lightweight Self-Attention
* DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation
* DRBANET: A Lightweight Dual-Resolution Network for Semantic Segmentation with Boundary Auxiliary
* DRCNet: Dynamic Image Restoration Contrastive Network
* Dress Code: High-Resolution Multi-category Virtual Try-On
* Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation
* DRNet: Double Recalibration Network for Few-Shot Semantic Segmentation
* Drought Resistance of Vegetation and Its Change Characteristics before and after the Implementation of the Grain for Green Program on the Loess Plateau, China
* DSR: A Dual Subspace Re-Projection Network for Surface Anomaly Detection
* DSRGAN: Detail Prior-Assisted Perceptual Single Image Super-Resolution via Generative Adversarial Networks
* DSSNet: A Deep Sequential Sleep Network for Self-Supervised Representation Learning Based on Single-Channel EEG
* DTransGAN: Deblurring Transformer Based on Generative Adversarial Network
* DTT-Net: Dual-Domain Translation Transformer for Semi-Supervised Image Deraining
* Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation
* Dual Adversarial Attention Mechanism for Unsupervised Domain Adaptive Medical Image Segmentation
* Dual Contrastive Learning with Anatomical Auxiliary Supervision for Few-Shot Medical Image Segmentation
* Dual Path Cross-Scale Attention Network For Image Inpainting
* Dual Perspective Network for Audio-Visual Event Localization
* Dual-Domain Self-supervised Learning and Model Adaption for Deep Compressive Imaging
* Dual-Domain Update and Double-Group Optimization Network for Image Compressive Sensing
* Dual-ERP Representation for Object Detection in 360° Images
* Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
* Dual-Level Representation Enhancement on Characteristic and Context for Image-Text Retrieval
* Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval
* Dualfeat: Dual Feature Aggregation for Video Object Detection
* DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
* Dually Distribution Pulling Network for Cross-Resolution Person Reidentification
* DualPrompt: Complementary Prompting for Rehearsal-Free Continual Learning
* DuelGAN: A Duel Between Two Discriminators Stabilizes the GAN Training
* DVS-Voltmeter: Stochastic Process-Based Event Simulator for Dynamic Vision Sensors
* Dynamic 3D Scene Analysis by Point Cloud Accumulation
* Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks
* Dynamic Hybrid Model to Forecast the Spread of COVID-19 Using LSTM and Behavioral Models Under Uncertainty
* Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection
* Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting
* Dynamic Metric Learning with Cross-Level Concept Distillation
* Dynamic Monitoring of Environmental Quality in the Loess Plateau from 2000 to 2020 Using the Google Earth Engine Platform and the Remote Sensing Ecological Index
* Dynamic Multi-Reference Generative Prediction for Face Video Compression
* Dynamic Mutual Enhancement Network for Single Remote Sensing Image Dehazing
* Dynamic Selection Network For Rgb-D Salient Object Detection
* Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition
* Dynamic Template Update for Visual Object Tracking
* Dynamic Temporal Filtering in Video Models
* Dynamically Transformed Instance Normalization Network for Generalizable Person Re-Identification
* DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation
* E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs
* E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context
* EAGAN: Efficient Two-Stage Evolutionary Architecture Search for GANs
* EANET: Efficient Attention-Augmented Network for Real-Time Semantic Segmentation
* Early Monitoring of Cotton Verticillium Wilt by Leaf Multiple Symptom Characteristics
* Early Pedestrian Intent Prediction via Features Estimation
* Earthquake Location and Magnitude Estimation with Graph Neural Networks
* EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching
* EAutoDet: Efficient Architecture Search for Object Detection
* ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
* EclipSE: Efficient Long-Range Video Retrieval Using Sight and Sound
* ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement
* Ecological Policies Dominated the Ecological Restoration over the Core Regions of Kubuqi Desert in Recent Decades
* EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers
* Editable Indoor Lighting Estimation
* Editing Out-of-Domain GAN Inversion via Differential Activations
* Editorial for Special Issue Advances in Hyperspectral Data Exploitation
* Editorial to Special Issue Remote Sensing Image Denoising, Restoration and Reconstruction
* Effect of Space Objects on Ionospheric Observations: Perspective of SYISR, The
* Effect of Spatial and Temporal Occlusion on Word Level Sign Language Recognition, The
* Effective Fusion Method to Enhance the Robustness of CNN, An
* Effective Multimodal Encoding for Image Paragraph Captioning
* Effective Presentation Attack Detection Driven by Face Related Task
* Effective Tensor Completion via Element-Wise Weighted Low-Rank Tensor Train With Overlapping Ket Augmentation
* Effects of Human Disturbance on Riparian Wetland Landscape Pattern in a Coastal Region
* Efficient and Accurate Skeleton-Based Two-Person Interaction Recognition Using Inter-and Intra-Body Graphs
* Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution
* Efficient Clustering Using Alternating Minimization And A Projection-Gradient Method For Dimension Reduction
* Efficient CNN-Based Super Resolution Algorithms for MMwave Mobile Radar Imaging
* Efficient Deblurring Via High-Frequency and Low-Frequency Information Fusion
* Efficient Decoder-Free Object Detection with Transformers
* Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection
* Efficient End-To-End Image Compression Transformer, An
* Efficient Feature Compression for the Object Tracking Task
* Efficient Fine-Tuning of Deep Neural Networks with Effective Parameter Allocation
* Efficient Framework for Human Action Recognition Based on Graph Convolutional Networks, An
* Efficient Inference Of Image-Based Neural Network Models In Reconfigurable Systems With Pruning And Quantization
* Efficient Long-Range Attention Network for Image Super-Resolution
* Efficient Meta-Tuning for Content-Aware Neural Video Delivery
* Efficient Method to Compensate Receiver Clock Jumps in Real-Time Precise Point Positioning, An
* Efficient One Pass Self-distillation with Zipf's Label Smoothing
* Efficient One-Shot Sports Field Image Registration with Arbitrary Keypoint Segmentation
* Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency
* Efficient Person Clustering Algorithm for Open Checkout-free Groceries, An
* Efficient Point Cloud Analysis Using Hilbert Curve
* Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks
* Efficient Scalable 360-degree Video Compression Scheme using 3D Cuboid Partitioning
* Efficient Scheme of Multi-Hypothesis Motion Compensated Prediction for Video Coding Applications, An
* Efficient Self-Calibrated Convolution for Real-Time Image Super-Resolution
* Efficient Spatio-Temporal Pyramid Transformer for Action Detection, An
* Efficient Transformer with Locally Shared Attention for Video Quality Assessment
* Efficient Video Deblurring Guided by Motion Magnitude
* Efficient Video Enhancement Transformer
* Efficient Video Transformers with Spatial-Temporal Token Selection
* Egnet: A Novel Edge Guided Network for Instance Segmentation
* EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices
* Egocentric Activity Recognition and Localization on a 3D Map
* ELCD: Efficient Lunar Crater Detection Based on Attention Mechanisms and Multiscale Feature Fusion Networks from Digital Elevation Models
* Eldnet: Establishment and Refinement of Edge Likelihood Distributions for Camouflaged Object Detection
* Electromagnetic Signal Attenuation Characteristics in the Lunar Regolith Observed by the Lunar Regolith Penetrating Radar (LRPR) Onboard the Chang'E-5 Lander
* EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer
* Eliminating Gradient Conflict in Reference-based Line-Art Colorization
* Embedded Feature Whitening Approach to Deep Neural Network Optimization, An
* Embedding Contrastive Unsupervised Features to Cluster In- And Out-of-Distribution Noise in Corrupted Image Datasets
* Emcenet: Efficient Multi-Scale Context Exploration Network for Salient Object Detection
* Emotion Recognition for Multiple Context Awareness
* Emotion-aware Multi-view Contrastive Learning for Facial Emotion Recognition
* Empirical Approach for Optimising the Impact of a Preprocessor in a Transcoding Pipeline, An
* Empirical Examinations of Whether Rural Population Decline Improves the Rural Eco-Environmental Quality in a Chinese Context
* Encoder Enabled GAN-based Video Generators
* End-to-End Active Speaker Detection
* End-to-End Deep Multi-Score Model for No-Reference Stereoscopic Image Quality Assessment
* End-To-End Depth Map Compression Framework Via Rgb-To-Depth Structure Priors Learning
* End-to-End Graph-Constrained Vectorized Floorplan Generation with Panoptic Refinement
* End-to-End Radar HRRP Target Recognition Based on Integrated Denoising and Recognition Network
* End-to-End Spatial-Angular Light Field Super-Resolution Using Parallax Structure Preservation Strategy
* End-to-End Transformer Model for Crowd Localization, An
* End-to-End Visual Editing with a Generatively Pre-Trained Artist
* End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution
* Energy-Based Adversarial Example Detection for SAR Images
* Enhanced Accuracy and Robustness via Multi-teacher Adversarial Distillation
* Enhanced Deep Animation Video Interpolation
* Enhanced Dual-Level Representations for Facial Expression Recognition
* Enhanced Transferable Adversarial Attack of Scale-Invariant Methods, An
* Enhancing Deformable Convolution Based Video Frame Interpolation with Coarse-To-Fine 3D CNN
* Enhancing Multi-modal Features Using Local Self-attention for 3D Object Detection
* Enhancing Multi-View Stereo with Contrastive Matching and Weighted Focal Loss
* Enhancing Part Features via Contrastive Attention Module for Vehicle Re-identification
* Enhancing Underwater Image Using Degradation Adaptive Adversarial Network
* Ensemble Knowledge Guided Sub-network Search and Fine-Tuning for Filter Pruning
* Ensemble Learning Priors Driven Deep Unfolding for Scalable Video Snapshot Compressive Imaging
* Ensemble of Proximal Networks for Sparse Coding, An
* Ensemble Three-Dimensional Habitat Modeling of Indian Ocean Immature Albacore Tuna (Thunnus alalunga) Using Remote Sensing Data
* Entropy-Based Feature Extraction for Real-Time Semantic Segmentation
* Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation
* Entropy-Reduced Attention for Image Compression
* Entry-Flipped Transformer for Inference and Prediction of Participant Behavior
* Envelope Travel-Time Objective Function for Reducing Source-Velocity Trade-Offs in Wave-Equation Tomography, An
* Eogface: Deep Face Recognition via Extensional Logits
* Episode Difficulty Based Sampling Method for Few-Shot Classification
* Equivariance and Invariance Inductive Bias for Learning from Insufficient Data
* Equivariant Hypergraph Neural Networks
* ERA: Enhanced Rational Activations
* ERA: Expert Retrieval and Assembly for Early Action Prediction
* ERDN: Equivalent Receptive Field Deformable Network for Video Deblurring
* Error Compensation Framework for Flow-Guided Video Inpainting
* ESS: Learning Event-Based Semantic Segmentation from Still Images
* Estimating 3D body mesh without SMPL annotations via alternating successive convex approximation
* Estimating 3D Green Volume and Aboveground Biomass of Urban Forest Trees by UAV-Lidar
* Estimating Brain Age with Global and Local Dependencies
* Estimating PM2.5 Concentrations Using the Machine Learning RF-XGBoost Model in Guanzhong Urban Agglomeration, China
* Estimating Soil Organic Matter Content in Desert Areas Using In Situ Hyperspectral Data and Feature Variable Selection Algorithms in Southern Xinjiang, China
* Estimating Spatially-Varying Lighting in Urban Scenes with Disentangled Representation
* Estimation Of 3d Body Shape And Clothing Measurements From Frontal-And Side-View Images
* Estimation of Aboveground Biomass of Potatoes Based on Characteristic Variables Extracted from UAV Hyperspectral Imagery
* Estimation of Snow Depth from AMSR2 and MODIS Data based on Deep Residual Learning Network
* ETPS: Efficient Two-Pass Encoding Scheme for Adaptive Live Streaming
* EvAC3D: From Event-Based Apparent Contours to 3D Models via Continuous Visual Hulls
* Evaluating Trade-Off and Synergies of Ecosystem Services Values of a Representative Resources-Based Urban Ecosystem: A Coupled Modeling Framework Applied to Panzhihua City, China
* Evaluation of Aerosol Typing with Combination of Remote Sensing Techniques with In Situ Data during the PANACEA Campaigns in Thessaloniki Station, Greece
* Evaluation of Arctic Sea Ice Drift Products Based on FY-3, HY-2, AMSR2, and SSMIS Radiometer Data
* Evaluation of Automatically Generated Video Captions Using Vision and Language Models
* Evaluation of the Methods for Estimating Leaf Chlorophyll Content with SPAD Chlorophyll Meters
* Evaluation of the Spatial Representativeness of In Situ SIF Observations for the Validation of Medium-Resolution Satellite SIF Products
* Event Neural Networks
* Event-Based Fusion for Motion Deblurring with Cross-modal Attention
* Event-guided Deblurring of Unknown Exposure Time Videos
* Evolution of Irrigation Effects on Agricultural Drought Mitigation in North China, The
* Excavating RoI Attention for Underwater Object Detection
* Exemplar-Free Online Continual Learning
* Expanded Adaptive Scaling Normalization for End to End Image Compression
* Expanding Language-Image Pretrained Models for General Video Recognition
* Explainable AI (XAI) In Biomedical Signal and Image Processing: Promises and Challenges
* Explaining Deepfake Detection by Analysing Image Matching
* Explicit Image Caption Editing
* Explicit Model Size Control and Relaxation via Smooth Regularization for Mixed-Precision Quantization
* Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation
* Exploiting Doubly Adversarial Examples for Improving Adversarial Robustness
* Exploiting Multiperspective Driven Hierarchical Content-Aware Network for Finger Vein Verification
* Exploiting Spatial Sparsity for Event Cameras with Visual Transformers
* Exploiting the Local Parabolic Landscapes of Adversarial Losses to Accelerate Black-Box Adversarial Attack
* Exploiting Unlabeled Data with Vision and Language Models for Object Detection
* Explore Adversarial Attack via Black Box Variational Inference
* Explore Spatial and Channel Attention in Image Quality Assessment
* Exploring Active Learning for Semiconductor Defect Segmentation
* Exploring Disentangled Content Information for Face Forgery Detection
* Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
* Exploring Gradient-Based Multi-directional Controls in GANs
* Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification
* Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
* Exploring Lottery Ticket Hypothesis in Spiking Neural Networks
* Exploring Occlusion-Sensitive Deep Network for Single-View 3D Face Reconstruction
* Exploring Plain Vision Transformer Backbones for Object Detection
* Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection
* Exploring Segment-Level Semantics for Online Phase Recognition From Surgical Videos
* Exploring Spatial Network Structure of the Metropolitan Circle Based on Multi-Source Big Data: A Case Study of Hanghou Metropolitan Circle
* Exploring Structural Sparsity in Neural Image Compression
* Exploring the Association of Spatial Capital and Economic Diversity in the Tourist City of Surat Thani, Thailand
* Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks
* Exploring the Impacts of Data Source, Model Types and Spatial Scales on the Soil Organic Carbon Prediction: A Case Study in the Red Soil Hilly Region of Southern China
* Exploring the Solution Space of Linear Inverse Problems with GAN Latent Geometry
* Exposure-Aware Dynamic Weighted Learning for Single-Shot HDR Imaging
* Extinction Effect of Foliar Dust Retention on Urban Vegetation as Estimated by Atmospheric PM10 Concentration in Shenzhen, China
* Extract Free Dense Labels from CLIP
* ExtractEO, a Pipeline for Disaster Extent Mapping in the Context of Emergency Management
* Extracting Effective Subnetworks with Gumbel-Softmax
* Extraction of Urban Built-Up Areas Based on Data Fusion: A Case Study of Zhengzhou, China
* ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing
* F2RNET: A Full-Resolution Representation Network for Biomedical Image Segmentation
* Fabric Material Recovery from Video Using Multi-scale Geometric Auto-Encoder
* Face Photo Synthesis Via Intermediate Semantic Enhancement Generative Adversarial Network
* Face Recognition for Fisheye Images
* Face Reconstruction from Deep Facial Embeddings using a Convolutional Neural Network
* Face2Face-rho: Real-Time High-Resolution One-Shot Face Reenactment
* Facial Depth and Normal Estimation Using Single Dual-Pixel Camera
* Factorizing Knowledge in Neural Networks
* FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling
* FairGRAPE: Fairness-Aware GRAdient Pruning mEthod for Face Attribute Classification
* FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations
* Fake News in India: Scale, Diversity, Solution, and Opportunities
* FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs
* Faketracer: Exposing Deepfakes with Training Data Contamination
* Family of Onion Convolutions for Image Inpainting, The
* FAR: Fourier Aerial Video Recognition
* Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
* FashionViL: Fashion-Focused Vision-and-Language Representation Learning
* Fast Adaptive Self-Supervised Underwater Image Enhancement
* Fast and High Quality Image Denoising via Malleable Convolution
* Fast Convergent Ordered-Subsets Algorithm With Subiteration-Dependent Preconditioners for PET Image Reconstruction, A
* Fast Dejittering Approach for Line Scanning Microscopy, A
* Fast Fusion of Hyperspectral and Multispectral Images: A Tucker Approximation Approach
* Fast Knowledge Distillation Framework for Visual Recognition, A
* Fast Learning from Label Proportions with Small Bags
* Fast Semantic Image Segmentation for Autonomous Systems
* Fast Sky to Sky Interpolation for Radio Interferometric Imaging
* Fast Two-Step Blind Optical Aberration Correction
* Fast Two-View Motion Segmentation Using Christoffel Polynomials
* Fast Vehicle Detection and Tracking on Fisheye Traffic Monitoring Video Using CNN and Bounding Box Propagation
* Fast-MoCo: Boost Momentum-Based Contrastive Learning with Combinatorial Patches
* Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
* FAST-VQA: Efficient End-to-End Video Quality Assessment with Fragment Sampling
* Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection
* Fatigue Detection of Pilots' Brain Through Brains Cognitive Map and Multilayer Latent Incremental Learning Model
* FBNet: Feedback Network for Point Cloud Completion
* FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection
* FEAR: Fast, Efficient, Accurate and Robust Visual Tracker
* Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval
* Feature Space Disentangling Based on Spatial Attention for Makeup Transfer
* Feature Transformation Framework With Selective Pseudo-Labeling for 2D Image-Based 3D Shape Retrieval, A
* Feature-Ensemble-Based Crop Mapping for Multi-Temporal Sentinel-2 Data Using Oversampling Algorithms and Gray Wolf Optimizer Support Vector Machine
* Federated Self-supervised Learning for Video Understanding
* FedLTN: Federated Learning for Sparse and Personalized Lottery Ticket Networks
* FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation
* FedX: Unsupervised Federated Learning with Cross Knowledge Distillation
* Few Zero Level Set-Shot Learning of Shape Signed Distance Functions in Feature Space
* Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning
* Few-Shot Class-Incremental Learning for 3D Point Cloud Objects
* Few-Shot Class-Incremental Learning from an Open-Set Perspective
* Few-Shot Class-Incremental Learning via Entropy-Regularized Data-Free Replay
* Few-Shot Classification with Contrastive Learning
* Few-Shot End-to-End Object Detection via Constantly Concentrated Encoding Across Heads
* Few-Shot Image Generation with Mixup-Based Distance Learning
* Few-Shot Learning Network for Moving Object Detection Using Exemplar-Based Attention Map
* Few-Shot Object Counting and Detection
* Few-Shot Object Detection by Knowledge Distillation Using Bag-of-Visual-Words Representations
* Few-Shot Object Detection with Model Calibration
* Few-Shot Personalized Saliency Prediction with Similarity of Gaze Tendency Using Object-Based Structural Information
* Few-Shot Single-View 3D Reconstruction with Memory Prior Contrastive Network
* Few-Shot Video Object Detection
* FewGAN: Generating from the Joint Distribution of a Few Images
* FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds
* FILM: Frame Interpolation for Large Motion
* Filter Pruning via Feature Discrimination in Deep Neural Networks
* Filtered Convolution for Synthetic Aperture Radar Images Ship Detection
* FindIt: Generalized Localization with Natural Language Queries
* FindNet: Can You Find Me? Boundary-and-Texture Enhancement Network for Camouflaged Object Detection
* Fine-grained Data Distribution Alignment for Post-Training Quantization
* Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications
* Fine-Grained Fashion Representation Learning by Online Deep Clustering
* Fine-Grained Scene Graph Generation with Data Transfer
* Fine-Grained Visual Entailment
* Fine-Tune Your Classifier: Finding Correlations with Temperature
* Fingerprint Presentation Attack Detector Using Global-Local Model
* FingerprintNet: Synthesized Fingerprints for Generated Image Detection
* First Dedicated Balloon Catheter for Magnetic Particle Imaging
* FL0C: Fast L0 Cut Pursuit for Estimation of Piecewise Constant Functions
* FLEX: Extrinsic Parameters-free Multi-View 3D Human Motion Reconstruction
* Flexible-Rate Learned Hierarchical Bi-Directional Video Compression with Motion Refinement and Frame-Level Bit Allocation
* FloatingFusion: Depth from ToF and Image-Stabilized Stereo Cameras
* Flood Detection in Dual-Polarization SAR Images Based on Multi-Scale Deeplab Model
* Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization
* Flow-Guided Transformer for Video Inpainting
* Flow-Path Fitting from Images with Fourier Basis for River Health Assessment
* FlowFormer: A Transformer Architecture for Optical Flow
* Font Watermarking Network for Text Images
* For the Sake of Privacy: Skeleton-Based Salient Behavior Recognition
* Forensic License Plate Recognition with Compression-Informed Transformers
* Forgetful Active Learning with Switch Events: Efficient Sampling for Out-of-Distribution Data
* Forward Error Correction Applied to JPEG-XS Codestreams
* FOSTER: Feature Boosting and Compression for Class-Incremental Learning
* Frame-Type Sensitive RDO Control for Content-Adaptive Encoding
* Framework for Contrast Enhancement Algorithms Optimization, A
* Free-Viewpoint RGB-D Human Performance Capture and Rendering
* Frequency and Spatial Dual Guidance for Image Dehazing
* Frequency Domain Model Augmentation for Adversarial Attack
* Frequency-Relevant Residual Learning for Multi-Modal Image Denoising
* Frequency-Selective Geometry Upsampling of Point Clouds
* FrequencyLowCut Pooling: Plug and Play Against Catastrophic Overfitting
* From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution
* From Meadow to Map: Integrating Field Surveys and Interactive Visualizations for Invasive Species Management in a National Park
* From Video to Hyperspectral: Hyperspectral Image-Level Feature Extraction with Transfer Learning
* Frozen CLIP Models are Efficient Video Learners
* FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context
* Fully Convolutional and Feedforward Networks for The Semantic Segmentation of Remotely Sensed Images
* Fully convolutional online tracking
* Fully Shareable Scene Text Recognition Modeling for Horizontal and Vertical Writing
* Fully Trainable Gaussian Derivative Convolutional Layer
* Funque: Fusion of Unified Quality Evaluators
* FurryGAN: High Quality Foreground-Aware Image Synthesis
* Fusing Global and Local Features for Generalized AI-Synthesized Image Detection
* Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of Unseen Objects
* Fusion from Decomposition: A Self-Supervised Decomposition Approach for Image Fusion
* Fusion Temporal Color Constancy
* Fusion Temporal Color Constancy
* Fusion-Based Backlit Image Enhancement Using Multiple S-Type Transformations For Convex Combination Coefficients
* Fusioncount: Efficient Crowd Counting Via Multiscale Feature Fusion
* FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion
* Future Frame Extrapolation Using Future Cost Volume
* GaitEdge: Beyond Plain End-to-End Gait Recognition for Better Practicality
* Gaitpoint: A Gait Recognition Network Based on Point Cloud Analysis
* GAITTAKE: Gait Recognition by Temporal Attention and Keypoint-Guided Embedding
* GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
* GAMa: Cross-View Video Geo-Localization
* GAN Cocktail: Mixing GANs Without Dataset Access
* GAN with Multivariate Disentangling for Controllable Hair Editing
* Ganzzle: Reframing Jigsaw Puzzle Solving as a Retrieval Task using a Generative Mental Image
* Gated Convolutional Network for Metal Artifact Reduction in Computed Tomography Images
* Gaussian Activated Neural Radiance Fields for High Fidelity Reconstruction and Pose Estimation
* Gaussian Distributed Graph Constrained Multi-Modal Gaussian Process Latent Variable Model for Ordinal Labeled Data
* Gaussian Distribution-based Mode Selection for Intra Prediction of Spatial SHVC
* Gaussian Kernel-Based Cross Modal Network for Spatio-Temporal Video Grounding
* Gaussian Patch Mixture Model Guided Low-Rank Covariance Matrix Minimization for Image Denoising
* GCISG: Guided Causal Invariant Learning for Improved Syn-to-Real Generalization
* GCN-Based Multi-Modal Multi-Label Attribute Classification in Anime Illustration Using Domain-Specific Semantic Features
* GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval
* Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images
* General Dynamic Knowledge Distillation Method for Visual Analytics, A
* General Object Pose Transformation Network from Unpaired Data
* Generalizable Medical Image Segmentation via Random Amplitude Mixup and Domain-Specific Image Restoration
* Generalizable Patch-Based Neural Rendering
* Generalized and Robust Framework for Timestamp Supervision in Temporal Action Segmentation, A
* Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks
* Generalized Deep Internal Learning for Hyperspectral Image Super Resolution
* Generating Natural Images with Direct Patch Distributions Matching
* Generative Adversarial Network for Future Hand Segmentation from Egocentric Video
* Generative Domain Adaptation for Face Anti-Spoofing
* Generative Memory-Guided Semantic Reasoning Model for Image Inpainting
* Generative Meta-Adversarial Network for Unseen Object Navigation
* Generative Multiplane Images: Making a 2D GAN 3D-Aware
* Generative Negative Text Replay for Continual Vision-Language Pretraining
* Generative Subgraph Contrast for Self-Supervised Graph Representation Learning
* Generative-Model-Based Data Labeling for Deep Network Regression: Application to Seed Maturity Estimation from UAV Multispectral Images
* Generator Knows What Discriminator Should Learn in Unconditional GANs
* Genetic Algorithm Captured the Informative Bands for Partial Least Squares Regression Better on Retrieving Leaf Nitrogen from Hyperspectral Reflectance
* GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints
* Geodesic-Former: A Geodesic-Guided Few-Shot 3D Point Cloud Instance Segmenter
* Geometric Features Informed Multi-person Human-Object Interaction Recognition in Videos
* Geometric Representation Learning for Document Image Rectification
* Geometry Partitioning with Motion Vector Difference for Video Coding
* Geometry-Aware Single-Image Full-Body Human Relighting
* Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering
* GeoRefine: Self-supervised Online Depth Refinement for Accurate Dense Mapping
* Ghost-free High Dynamic Range Imaging with Context-Aware Transformer
* GigaDepth: Learning Depth from Structured Light with Branching Neural Networks
* GIMO: Gaze-Informed Human Motion Prediction in Context
* GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation
* GIS Based Procedural Modeling in 3D Urban Design
* GIS Pipeline to Produce GeoAI Datasets from Drone Overhead Imagery, A
* GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation
* Glacier Mass Loss Simulation Based on Remote Sensing Data: A Case Study of the Yala Glacier and the Qiyi Glacier in the Third Pole
* Glacier Motion Monitoring Using a Novel Deep Matching Network with SAR Intensity Images
* GLAMD: Global and Local Attention Mask Distillation for Object Detectors
* GLASS: Global to Local Attention for Scene-Text Spotting
* Global Spectral Filter Memory Network for Video Object Segmentation
* Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning
* GloVe Model for Urban Functional Area Identification Considering Nonlinear Spatial Relationships between Points of Interest, A
* GM-RF: An AV1 Intra-Frame Fast Decision Based on Random Forest
* GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
* Google Earth Engine for Informal Settlement Mapping: A Random Forest Classification Using Spectral and Textural Information
* Gpu-Accelerated Sift-Aided Source Identification of Stabilized Videos
* Grad-Cam Aware Supervised Attention for Visual Question Answering for Post-Disaster Damage Assessment
* GradAuto: Energy-Oriented Attack on Dynamic Neural Networks
* Gradient-Based Severity Labeling for Biomarker Classification in OCT
* Gradient-Based Uncertainty for Monocular Depth Estimation
* Granularity-Aware Adaptation for Image Retrieval Over Multiple Tasks
* Graph Autoencoder-Based Embedded Learning in Dynamic Brain Networks for Autism Spectrum Disorder Identification
* Graph Convolutional Networks and Manifold Ranking for Multimodal Video Retrieval
* Graph Filter-Based Fast Motion Matching for Inter Frame Coding of MPEG G-PCC
* Graph Neural Network for Cell Tracking in Microscopy Videos
* Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph
* Graph Refinement with Regression Prior for 3D Face Reconstruction
* Graph-Based Intercategory and Intermodality Network for Multilabel Classification and Melanoma Diagnosis of Skin Lesions in Dermoscopy and Clinical Images
* Graph-Constrained Contrastive Regularization for Semi-weakly Volumetric Segmentation
* Graph-Transformer for Whole Slide Image Classification, A
* GraphCSPN: Geometry-Aware Depth Completion via Dynamic GCNs
* GraphEx: Facial Action Unit Graph for Micro-Expression Classification
* GraphFit: Learning Multi-scale Graph-Convolutional Representation for Point Cloud Normal Estimation
* GraphVid: It only Takes a Few Nodes to Understand a Video
* Grasp'D: Differentiable Contact-Rich Grasp Synthesis for Multi-Fingered Hands
* GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training
* GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features
* Ground Positioning Method of Spaceborne SAR High-Resolution Sliding-Spot Mode Based on Antenna Pointing Vector
* Grounding Visual Representations with Texts for Domain Generalization
* Group-Wise Feature Enhancement-and-Fusion Network with Dual-Polarization Feature Enrichment for SAR Ship Detection, A
* GTCaR: Graph Transformer for Camera Re-localization
* Guest Editorial Introduction to the Special Issue on Biometrics Based Methods for Healthcare Applications
* Guided Sampling Based Feature Aggregation for Video Object Detection
* Gully Erosion Monitoring Based on Semi-Supervised Semantic Segmentation with Boundary-Guided Pseudo-Label Generation Strategy and Adaptive Loss Function
* Gyrovector Space Approach for Symmetric Positive Semi-definite Matrix Learning, A
* Haar Wavelet-Based Attention Network for Image Dehazing
* Hadamard-Coded Supervised Discrete Hashing on Complex and Quaternion Domain
* HairNet: Hairstyle Transfer with Pose Changes
* Half Wavelet Attention on M-Net+ for Low-Light Image Enhancement
* Halftoning with Multi-Agent Deep Reinforcement Learning
* Hallucinating Pose-Compatible Scenes
* Hardly Perceptible Trojan Attack Against Neural Networks with Bit Flips
* Hardware-Oriented Shallow Joint Demosaicing and Denoising
* Harmonizer: Learning to Perform White-Box Image and Video Harmonization
* HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator
* HCIL: Hierarchical Class Incremental Learning for Longline Fishing Visual Monitoring
* HDR-AGAN: Ghost-Free High Dynamic Range Imaging with Attention Guided Adversarial Network
* HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields
* HDR-TOF: HDR Time-of-Flight Imaging via Modulo Acquisition
* HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors
* Heart rate estimation in intense exercise videos
* Helpful or Harmful: Inter-task Association in Continual Learning
* HFF6D: Hierarchical Feature Fusion Network for Robust 6D Object Pose Tracking
* Hidden Conditional Adversarial Attacks
* Hiding Images Into Images with Real-World Robustness
* Hierarchical Average Precision Training for Pertinent Image Retrieval
* Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection
* Hierarchical Defect Detection Based On Reinforcement Learning
* Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation
* Hierarchical Feature Embedding for Visual Tracking
* Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting
* Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
* Hierarchical Motion Learning for Goal-Oriented Movements With Speed-Accuracy Tradeoff of a Musculoskeletal System
* Hierarchical Multi-Resolution Graph-Cuts for Water-Fat-Silicone Separation in Breast MRI
* Hierarchical Semantic Regularization of Latent Spaces in StyleGANs
* Hierarchical Semi-supervised Contrastive Learning for Contamination-Resistant Anomaly Detection
* Hierarchical Spatiotemporal Graph Regularized Discriminative Correlation Filter for Visual Object Tracking
* Hierarchical Training for Distributed Deep Learning Based on Multimedia Data over Band-Limited Networks
* Hierarchically Self-supervised Transformer for Human Skeleton Representation Learning
* High-Fidelity GAN Inversion with Padding Space
* High-Fidelity Image Inpainting with GAN Inversion
* High-Resolution NIR Prediction from RGB Images: Application to Plant Phenotyping
* High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions
* Higher-Order Recurrent Network with Space-Time Attention for Video Early Action Recognition
* Highly Accurate Dichotomous Image Segmentation
* HiPDERL: An Improved Implementation of the PDERL Viewshed Algorithm and Accuracy Analysis
* Histogram-Based Transformation Function Estimation for Low-Light Image Enhancement
* Historian: A Large-Scale Historical Film Dataset with Cinematographic Annotation
* History Dependent Significance Coding for Incremental Neural Network Compression
* HIVE: Evaluating the Human Interpretability of Visual Explanations
* HM: Hybrid Masking for Few-Shot Segmentation
* Hologram Super-Resolution Using Dual-Generator GAN
* Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
* Hourglass Attention Network for Image Inpainting
* Housekeep: Tidying Virtual Households Using Commonsense Reasoning
* How Accurate Is Passive Stereo For 3d Face Reconstruction?
* How Are Macro-Scale and Micro-Scale Built Environments Associated with Running Activity? The Application of Strava Data and Deep Learning in Inner London
* How Severe Is Benchmark-Sensitivity in Video Self-Supervised Learning?
* How Sound Affects Visual Attention in Omnidirectional Videos
* How Stable Are Transferability Metrics Evaluations?
* How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset?
* HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation
* HULC: 3D HUman Motion Capture with Pose Manifold SampLing and Dense Contact Guidance
* Human Perception Measurement by Electroencephalography for Facial Image Compression
* Human Trajectory Prediction via Neural Social Physics
* Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features
* Human-Centric Image Retrieval with Gaze-Based Image Captioning
* Humans Disagree With the IoU for Measuring Object Detector Localization Error
* HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling
* Hunting Group Clues with Transformers for Social Group Activity Recognition
* HVC-Net: Unifying Homography, Visibility, and Confidence Learning for Planar Object Tracking
* Hybrid Model-Based / Data-Driven Graph Transform for Image Coding
* Hybrid Warping Fusion for Video Frame Interpolation
* Hyper-LGNet: Coupling Local and Global Features for Hyperspectral Image Classification
* Hyper-Spectral Imaging for Overlapping Plastic Flakes Segmentation
* Hyperbolic Spatial Temporal Graph Convolutional Networks
* Hyperdeep: Comparison of AI-Based Methods for Predicting Chemical Components in Hyperspectral Images
* Hypergraph Convolutional Networks for Weakly-Supervised Semantic Segmentation
* Hyperspectral Reconstruction Using Auxiliary RGB Learning from a Snapshot Image
* Hyperspherical Learning in Multi-Label Classification
* Hyperview Challenge: Estimating Soil Parameters from Hyperspectral Images, The
* HyproGAN: Breaking the Dimensional Wall from Human to Anime
* I Saw: A Self-Attention Weighted Method for Explanation of Visual Transformers
* ICD: VHR-Oriented Interactive Change-Detection Algorithm
* ICIP 2022 Challenge on Parasitic Egg Detection and Classification in Microscopic Images: Dataset, Methods and Results
* ICIP 2022 Challenge: PEDCMI, TOOD Enhanced by Slicing-Aided Fine-Tuning and Inference
* IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors
* Identification of Paddy Varieties from Landsat 8 Satellite Image Data Using Spectral Unmixing Method in Indramayu Regency, Indonesia
* Identification of Widely Linear Systems Using Data-Dependent Superimposed Training
* Identifying Document Images with Glare Using Global and Localized Feature Fusion
* Identifying Hard Noise in Long-Tailed Sample Distribution
* Identifying optimised speaker identification model using hybrid GRU-CNN feature extraction technique
* Identity-Aware Hand Mesh Estimation and Personalization from RGB Images
* Identity-Guided Face Generation with Multi-Modal Contour Conditions
* Identity-Sensitive Knowledge Propagation for Cloth-Changing Person Re-Identification
* IGFormer: Interaction Graph Transformer for Skeleton-Based Human Interaction Recognition
* IHEM Loss: Intra-Class Hard Example Mining Loss for Robust Face Recognition
* IID-NORD: A Comprehensive Intrinsic Image Decomposition Dataset
* Illumination-Aware Style Transfer for Image Harmonization
* Image Coding for Machines with Omnipotent Feature Learning
* Image Compression Based on Importance Using Optimal Mass Transportation Map
* Image Data Augmentation with Unpaired Image-to-Image Camera Model Translation
* Image Deblurring Using Deep Multi-Scale Distortion Prior
* Image Enhancement for Improved Visibility of Digital Displays Under The Sunlight
* Image Fusion Transformer
* Image Inpainting with Cascaded Modulation GAN and Object-Aware Training
* Image Quantization Towards Data Reduction: Robustness Analysis for SLAM Methods On Embedded Platforms
* Image Restoration Using Probability-Inducing Nuclear Norm Minimization
* Image Segmentation and Recognition for Multi-Class Chinese Food
* Image Super-Resolution with Deep Dictionary
* Image Warp Preserving Content Intensity
* Image-Based Air Quality Forecasting Through Multi-Level Attention
* Image-Based CLIP-Guided Essence Transfer
* Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models
* Imaging Breast Microcalcifications Using Dark-Field Signal in Propagation-Based Phase-Contrast Tomography
* Imaging of Single Transducer-Harmonic Motion Imaging-Derived Displacements at Several Oscillation Frequencies Simultaneously
* IMC-NET: Learning Implicit Field with Corner Attention Network for 3D Shape Reconstruction
* Impact of Atmospheric Correction Methods Parametrization on Soil Organic Carbon Estimation Based on Hyperion Hyperspectral Data
* Impact of Downscaling on Adversarial Images
* Impact of Quasi-Biweekly Oscillation on Southeast Asian Cold Surge Rainfall Monitored by TRMM Satellite Observation
* Impact of Self-View Latency on Quality of Experience: Analysis of Natural Interaction in XR Environments
* Impartial Take to the CNN vs Transformer Robustness Contest, An
* Implicit Field Supervision for Robust Non-rigid Shape Matching
* Implicit Neural Representations for Image Compression
* Implicit Neural Representations for Variable Length Human Motion Generation
* Implicit Shape Biased Few-Shot Learning for 3D Object Generalization
* Improved DC Estimation for JPEG Compression Via Convex Relaxation
* Improved Hard Example Mining Approach for Single Shot Object Detectors
* Improved Landscape Expansion Index and Its Application to Urban Growth in Urumqi
* Improved Masked Image Generation with Token-Critic
* Improved Model-Based Forest Height Inversion Using Airborne L-Band Repeat-Pass Dual-Baseline Pol-InSAR Data
* Improvement of Typhoon Intensity Forecasting by Using a Novel Spatio-Temporal Deep Learning Model
* Improving Adversarial Robustness of 3D Point Cloud Classification Models
* Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers
* Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality
* Improving Deep Metric Learning with Virtual Classes and Examples Mining
* Improving Few-Shot Learning Through Multi-task Representation Learning Theory
* Improving Few-Shot Part Segmentation Using Coarse Supervision
* Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-boosting Attention Mechanism
* Improving GANs for Long-Tailed Data Through Group Spectral Regularization
* Improving Generalization in Federated Learning by Seeking Flat Minima
* Improving Generalization of Reinforcement Learning Using a Bilinear Policy Network
* Improving Image Restoration by Revisiting Global Information Aggregation
* Improving IQA Performance Based on Deep Mutual Learning
* Improving Model Adaptation for Semantic Segmentation by Learning Model-Invariant Features with Multiple Source-Domain Models
* Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation
* Improving Rgb-Infrared Pedestrian Detection by Reducing Cross-Modality Redundancy
* Improving Robustness by Enhancing Weak Subnets
* Improving Robustness to out-of-Distribution Data by Frequency-Based Augmentation
* Improving Self-Supervised Learning for Out-Of-Distribution Task via Auxiliary Classifier
* Improving Self-supervised Lightweight Model Learning via Hard-Aware Metric Distillation
* Improving Test-Time Adaptation Via Shift-Agnostic Weight Regularization and Nearest Source Prototypes
* Improving the Intra-class Long-Tail in 3D Detection via Rare Example Mining
* Improving the Perceptual Quality of 2D Animation Interpolation
* Improving the Reliability for Confidence Estimation
* Improving Vision Transformers by Revisiting High-Frequency Components
* In Defense of Image Pre-Training for Spatiotemporal Recognition
* In Defense of Online Models for Video Instance Segmentation
* InAction: Interpretable Action Decision Making for Autonomous Driving
* incDFM: Incremental Deep Feature Modeling for Continual Novelty Detection
* Incomplete Multi-view Domain Adaptation via Channel Enhancement and Knowledge Transfer
* Incremental Road Network Update Method with Trajectory Data and UAV Remote Sensing Imagery
* Incremental Task Learning with Incremental Rank Updates
* Incremental Translation Averaging
* Incrementally Semi-Supervised Classification of Arthritis Inflammation on a Clinical Dataset
* Individual Tree Species Classification Based on a Hierarchical Convolutional Neural Network and Multitemporal Google Earth Images
* Indoor Target-Driven Visual Navigation based on Spatial Semantic Information
* Indoor-Outdoor Point Cloud Alignment Using Semantic-Geometric Descriptor
* Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments
* InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images
* Influence of Image Degradation on Hyperspectral Image Classification, The
* Information Theoretic Approach for Attention-Driven Face Forgery Detection, An
* Information-Growth Swin Transformer Network for Image Super-Resolution
* Informed Spatial Regularizations For Fast Fusion Of Astronomical Images
* Infrared and Visible Image Fusion Using Bimodal Transformers
* Infrared and Visible Image Registration for Airborne Camera Systems
* Initialization and Alignment for Adversarial Texture Optimization
* Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis
* Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-curation
* Input-to-State Stability Analysis of Digital Filters With Generalized Overflow Arithmetic and Time-Varying Delay
* Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation
* Instance Contour Adjustment via Structure-Driven CNN
* INT: Towards Infinite-Frames 3D Detection with an Efficient Framework
* IntegratedPIFu: Integrated Pixel Aligned Implicit Function for Single-View Human Reconstruction
* Integrating Multi-Scale Remote-Sensing Data to Monitor Severe Forest Infestation in Response to Pine Wilt Disease
* Intelli-Paint: Towards Developing More Human-Intelligible Painting Agents
* Interactive Image Segmentation with Transformers
* Interclass Prototype Relation for Few-Shot Segmentation
* IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion
* Intermittent Estimator-Based Mixed Passive and H8 Control for High-Speed Train With Actuator Stochastic Fault
* Interpretable Concept-Based Prototypical Networks for Few-Shot Learning
* Interpretable Image Classification with Differentiable Prototypes Assignment
* Interpretable Open-Set Domain Adaptation via Angular Margin Separation
* Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps
* Interpreting Mangrove Habitat and Coastal Land Cover Change in the Greater Bay Area, Southern China, from 1924 to 2020 Using Historical Aerial Photos and Multiple Sources of Satellite Data
* Intra Prediction of Regular and Near-Regular Textures Via Graph-Based Inpainting
* Intra-Inter Prediction for Versatile Video Coding Using a Residual Convolutional Neural Network
* Intra-Modal Constraint Loss for Image-Text Retrieval
* Intra-Pulse Frequency Coding Design for a High-Resolution Radar against Smart Noise Jamming
* Intrinsic Neural Fields: Learning Functions on Manifolds
* Intrinsic Temporal Performance of the RF Receive Coil in Magnetic Resonance Imaging
* Invariant Feature Learning for Generalized Long-Tailed Classification
* Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
* Invertible Color-to-Grayscale Conversion Using Lossy Compression and High-Capacity Data Hiding
* Investigating Explainable Artificial Intelligence for MRI-based Classification of Dementia: A New Stability Criterion for Explainable Methods
* Investigating Inconsistencies in PRNU-Based Camera Identification
* Investigating Normalization Methods for CNN-Based Image Quality Assessment
* Investigation of Winter Wheat Leaf Area Index Fitting Model Using Spectral and Canopy Height Model Data from Unmanned Aerial Vehicle Imagery, An
* Invisible Black-Box Backdoor Attack Through Frequency Domain, An
* Irregularities recognition system for automotive pieces
* Is Appearance Free Action Recognition Possible?
* Is Geometry Enough for Matching in Visual Localization?
* Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation?
* Is the U-NET Directional-Relationship Aware?
* IS-MVSNet:Importance Sampling-Based MVSNet
* Iterative Contrastive Learning for Single Image Raindrop Removal
* Iterative Kernel Reconstruction for Deep Learning-Based Blind Image Super-Resolution
* Iterative Seeded Region Growing for Brain Tissue Segmentation
* IV-PSNR: The Objective Quality Metric for Immersive Video Applications
* Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows
* JEDE: Universal Jersey Number Detector for Sports
* Joint Classification and out-of-Distribution Detection Based on Structured Latent Space of Variational Auto-Encoders
* Joint Disentanglement of Labels and Their Features with VAE
* Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
* Joint Learning of Localized Representations from Medical Images and Reports
* Joint Motion Correction and 3D Segmentation with Graph-Assisted Neural Networks for Retinal OCT
* Joint Motion-Correction and Reconstruction in Cryo-Em Tomography
* Joint Optimization of k-t Sampling Pattern and Reconstruction of DCE MRI for Pharmacokinetic Parameter Estimation
* Joint Sample Enhancement and Instance-Sensitive Feature Learning for Efficient Person Search
* Joint Secret Image Sharing and Jpeg Compression Scheme, A
* Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
* JoJoGAN: One Shot Face Stylization
* JPEG Artifacts Removal via Contrastive Representation Learning
* JPEG Pleno Light Field Encoder with Breakpoint Dependent Affine Wavelet Transform for Disparity Maps
* JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes
* JPSS-1 VIIRS Prelaunch Reflective Solar Band Testing and Performance
* K-centered Patch Sampling for Efficient Video Recognition
* k-means Mask Transformer
* k-SALSA: k-Anonymous Synthetic Averaging of Retinal Images via Local Style Alignment
* KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-View Stereo
* Kendall Shape Space Approach to 3D Shape Estimation from 2D Landmarks, A
* Kernel Minimum Noise Fraction Transformation-Based Background Separation Model for Hyperspectral Anomaly Detection
* Kernel Relative-prototype Spectral Filtering for Few-Shot Learning
* KeypointNeRF: Generalizing Image-Based Volumetric Avatars Using Relative Spatial Encoding of Keypoints
* Kidney image classification using transfer learning with convolutional neural network
* KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients
* Knowledge Condensation Distillation
* Knowledge Distillation for Multi-Target Domain Adaptation in Real-Time Person Re-Identification
* Knowledge-Based Visual Question Generation
* KTN: Knowledge Transfer Network for Learning Multiperson 2D-3D Correspondences
* KVT: k-NN Attention for Boosting Vision Transformers
* KXNet: A Model-Driven Deep Neural Network for Blind Super-Resolution
* L-CoDer: Language-Based Colorization with Color-Object Decoupling Transformer
* L-Tracing: Fast Light Visibility Estimation on Neural Surfaces by Sphere Tracing
* L2AMF-Net: An L2-Normed Attention and Multi-Scale Fusion Network for Lunar Image Patch Matching
* L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training
* l8-Robustness and Beyond: Unleashing Efficient Adversarial Training
* LA3: Efficient Label-Aware AutoAugment
* Label-Guided Auxiliary Training Improves 3D Object Detector
* Label2Label: A Language Modeling Framework for Multi-attribute Learning
* LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments
* LaMAR: Benchmarking Localization and Mapping for Augmented Reality
* LANA: Latency Aware Network Acceleration
* Land Cover Background-Adaptive Framework for Large-Scale Road Extraction, A
* Lane Detection Transformer Based on Multi-frame Horizontal and Vertical Attention and Visual Transformer Module
* Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting
* Language-Driven Artistic Style Transfer
* Language-Grounded Indoor 3D Semantic Segmentation in the Wild
* Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation
* Large Scale Real-World Multi-person Tracking
* Large-Displacement 3D Object Tracking with Hybrid Non-local Optimization
* Large-Scale Multiple-Objective Method for Black-box Attack Against Object Detection, A
* Latency Compensation Through Image Warping For Remote Rendering-Based Volumetric Video Streaming
* Latency-Aware Collaborative Perception
* Latent Discriminant Deterministic Uncertainty
* Latent Partition Implicit with Surface Codes for 3D Representation
* Latent Preserving Generative Adversarial Network for Imbalance Classification
* Latent Space Smoothing for Individually Fair Representations
* Latent Vector Prototypes Guided Conditional Face Synthesis
* LaTeRF: Label and Text Driven Object Radiance Fields
* Layered Controllable Video Generation
* LB-NERF: Light Bending Neural Radiance Fields for Transparent Medium
* Le-BEiT: A Local-Enhanced Self-Supervised Transformer for Semantic Segmentation of High Resolution Remote Sensing Images
* Learn from All: Erasing Attention Consistency for Noisy Label Facial Expression Recognition
* Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Recognition
* Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
* Learnable Pixel Clustering Via Structure and Semantic Dual Constraints for Unsupervised Image Segmentation
* Learned Image Compression with Multi-Scale Spatial and Contextual Information Fusion
* Learned Monocular Depth Priors in Visual-Inertial Initialization
* Learned Variational Video Color Propagation
* Learned Vertex Descent: A New Direction for 3D Human Model Fitting
* Learned Video Compression With Residual Prediction And Feature-Aided Loop Filter
* Learning a Prototype Discriminator With RBF for Multimodal Image Synthesis
* Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
* Learning an Evolved Mixture Model for Task-Free Continual Learning
* Learning an Isometric Surface Parameterization for Texture Unwrapping
* Learning Audio-Video Modalities from Image Captions
* Learning Channel-Aware Correlation Filters for Robust Object Tracking
* Learning Contextually Fused Audio-Visual Representations for Audio-Visual Speech Recognition
* Learning Continuous Implicit Representation for Near-Periodic Patterns
* Learning Cross-Video Neural Representations for High-Quality Frame Interpolation
* Learning Deep Non-blind Image Deconvolution Without Ground Truths
* Learning Degradation Representations for Image Deblurring
* Learning Depth from Focus in the Wild
* Learning Discriminative Shrinkage Deep Networks for Image Deconvolution
* Learning Disentanglement with Decoupled Labels for Vision-Language Navigation
* Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis
* Learning Efficient Multi-Agent Cooperative Visual Exploration
* Learning Ego 3D Representation as Ray Tracing
* Learning Energy-Based Models with Adversarial Training
* Learning Extremely Lightweight and Robust Model with Differentiable Constraints on Sparsity and Condition Number
* Learning Frequency-Specific Quantization Scaling in VVC for Standard-Compliant Task-Driven Image Coding
* Learning from Designers: Fashion Compatibility Analysis Via Dataset Distillation
* Learning from Multiple Annotator Noisy Labels via Sample-Wise Label Fusion
* Learning from Noisy Labels via Meta Credible Label Elicitation
* Learning From Synthetic Data for Crowd Instance Segmentation in the Wild
* Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
* Learning Graph Features for Colored Mesh Visual Quality Assessment
* Learning Graph Neural Networks for Image Style Transfer
* Learning hetero-synaptic delays for motion detection in a single layer of spiking neurons
* Learning Hierarchy Aware Features for Reducing Mistake Severity
* Learning Implicit Feature Alignment Function for Semantic Segmentation
* Learning Implicit Templates for Point-Based Clothed Human Modeling
* Learning Instance and Task-Aware Dynamic Kernels for Few-Shot Learning
* Learning Instance-Specific Adaptation for Cross-Domain Segmentation
* Learning Invariant Visual Representations for Compositional Zero-Shot Learning
* Learning Linguistic Association Towards Efficient Text-Video Retrieval
* Learning Local Implicit Fourier Representation for Image Warping
* Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection
* Learning Mutual Modulation for Self-supervised Cross-Modal Super-Resolution
* Learning Object Placement via Dual-Path Graph Completion
* Learning Omnidirectional Flow in 360° Video via Siamese Representation
* Learning Online Multi-Sensor Depth Fusion
* Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction
* Learning Phase Mask for Privacy-Preserving Passive Depth Estimation
* Learning Prior Feature and Attention Enhanced Image Inpainting
* Learning Quality-aware Dynamic Memory for Video Object Segmentation
* Learning Regional Purity for Instance Segmentation on 3D Point Clouds
* Learning Selective Assignment Network for Scene-Aware Vehicle Detection
* Learning Self-prior for Mesh Denoising Using Dual Graph Convolutional Networks
* Learning Semantic Correspondence with Sparse Annotations
* Learning Semantic Segmentation from Multiple Datasets with Label Shifts
* Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution
* Learning Shadow Correspondence for Video Shadow Detection
* Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition
* Learning Spatio-Temporal Downsampling for Effective Video Upscaling
* Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
* Learning to Censor by Noisy Sampling
* Learning to Detect Every Thing in an Open World
* Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining
* Learning to Fit Morphable Models
* Learning to Generate High-Quality Images for Homography Estimation
* Learning to Generate Realistic LiDAR Point Clouds
* Learning to Jointly Segment the Liver, Lesions and Vessels from Partially Annotated Datasets
* Learning to Learn with Smooth Regularization
* Learning to Train a Point Cloud Reconstruction Network Without Matching
* Learning to Weight Samples for Dynamic Early-Exiting Networks
* Learning Topological Interactions for Multi-Class Medical Image Segmentation
* Learning Trajectory-Conditioned Relations to Predict Pedestrian Crossing Behavior
* Learning Transferable Parameters for Unsupervised Domain Adaptation
* Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling
* Learning Uncoupled-Modulation CVAE for 3D Action-Conditioned Human Motion Synthesis
* Learning Visibility for Robust Dense Human Body Estimation
* Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
* Learning Visual Styles from Audio-Visual Associations
* Learning Where to Look: Generative NAS is Surprisingly Efficient
* Learning with Free Object Segments for Long-Tailed Instance Segmentation
* Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection
* Learning with Recoverable Forgetting
* Learning-Based End-to-End Video Compression with Spatial-Temporal Adaptation
* Learning-Based Lossless Point Cloud Geometry Coding Using Sparse Tensors
* Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World
* LEDNet: Joint Low-Light Enhancement and Deblurring in the Dark
* Less Than Few: Self-shot Video Instance Segmentation
* LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds
* Level Set Theory for Neural Implicit Evolution Under Explicit Flows, A
* Levenshtein OCR
* Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation
* LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity
* LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation
* LiDAR and UAV SfM-MVS of Merapi Volcanic Dome and Crater Rim Change from 2012 to 2014
* LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection
* LiDAR Odometry and Mapping Based on Neighborhood Information Constraints for Rugged Terrain
* Lidar Point Cloud Guided Monocular 3D Object Detection
* LiDAR-Based Hatch Localization
* LidarNAS: Unifying and Searching Neural Architectures for 3D Point Clouds
* Lifecycle of a Neural Network in the Wild: A Multiple Instance Learning Study on Cancer Detection from Breast Biopsies Imaged with Novel Technique, The
* Light Field Image Quality Assessment with Dense Atrous Convolutions
* Light Field Integral Image Coding Optimization under 2D Hierarchical Coding Structure
* Lighter and Faster Two-Pathway CMRNet for Video Saliency Prediction
* Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
* Lightweight Dual-Domain Network for Real-Time Medical Image Segmentation
* Lightweight Network with Multi-Stage Feature Fusion Module for Single-View 3d Face Reconstruction, A
* Linear Discriminant Analysis Metric Learning Using Siamese Neural Networks
* Linguistic Steganalysis Merging Semantic and Statistical Features
* LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space
* Lipschitz Continuity Retained Binary Neural Network
* Lisnet: A Covid-19 Lung Infection Segmentation Network Based on Edge Supervision and Multi-Scale Context Aggregation
* Local and Global Fusion Network for Learned Image Compression
* Local Color Distributions Prior for Image Enhancement
* Local Embedding for Axial Attention
* Local-Scale Horizontal CO2 Flux Estimation Incorporating Differential Absorption Lidar and Coherent Doppler Wind Lidar
* Local-Sparse-Information-Aggregation Transformer with Explicit Contour Guidance for SAR Ship Detection, A
* LocalBins: Improving Depth Estimation by Learning Local Distributions
* Locality Guidance for Improving Vision Transformers on Tiny Datasets
* Localization and Classification of Parasitic Eggs in Microscpic Images Using An Efficientdet Detector
* Localizing Visual Sounds the Easy Way
* Locally Varying Distance Transform for Unsupervised Visual Anomaly Detection
* LocVTP: Video-Text Pre-training for Temporal Localization
* Long Movie Clip Classification with State-Space Video Models
* Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
* Long-tail Detection with Effective Class-Margins
* Long-Tailed Class Incremental Learning
* Long-Tailed Instance Segmentation Using Gumbel Optimized Loss
* Long-Term Baseflow Responses to Projected Climate Change in the Weihe River Basin, Loess Plateau, China
* Look Both Ways: Self-supervising Driver Gaze Estimation and Road Scene Saliency
* LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling
* Low SNR Multiframe Registration for Cubesats
* Low-Complexity Multi-Type Tree Partitioning for Versatile Video Coding Based on Machine Learning
* Low-Complexity Scaler Based on Convolutional Neural Networks for Adaptive Video Streaming
* Low-Cost GNSS and Real-Time PPP: Assessing the Precision of the u-blox ZED-F9P for Kinematic Monitoring Applications
* Low-Delay and Energy-Efficient Opportunistic Routing for Maritime Search and Rescue Wireless Sensor Networks
* Low-Light Image Enhancement Method by Using a Modified Gamma Transform for Convex Combination Coefficients
* Low-Rank Tensor Bayesian Filter Framework For Multi-Modal Analysis, A
* LSCIDMR: Large-Scale Satellite Cloud Image Database for Meteorological Research
* LssDet: A Lightweight Deep Learning Detector for SAR Ship Detection in High-Resolution SAR Images
* LWGNet: Learned Wirtinger Gradients for Fourier Ptychographic Phase Retrieval
* Maanu-Net: Multi-Level Attention and Atrous Pyramid Nested U-Net for Wrecked Objects Segmentation in Forward-Looking Sonar Images
* Machine Learning Based Efficient Qt-Mtt Partitioning for VVC Inter Coding
* Machine Learning Fusion Multi-Source Data Features for Classification Prediction of Lunar Surface Geological Units
* Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels
* MaCLR: Motion-Aware Contrastive Learning of Representations for Videos
* Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
* Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals
* Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing
* Manet: Improving Video Denoising with a Multi-Alignment Network
* MANET: Mitral Annulus Point Tracking Network in Cardiac Magnetic Resonance
* ManiFest: Manifold Deformation for Few-Shot Image Translation
* Manifold Adversarial Learning for Cross-Domain 3D Shape Representation
* Map-Free Visual Relocalization: Metric Pose Relative to a Single Image
* Mapping Forest Stock Volume Based on Growth Characteristics of Crown Using Multi-Temporal Landsat 8 OLI and ZY-3 Stereo Images in Planted Eucalyptus Forest
* Mapping Functional Changes in the Embryonic Heart of Atlantic Salmon Post Viral Infection Using AI Technique
* Mapping the Spatiotemporal Pattern of Sandy Island Ecosystem Health during the Last Decades Based on Remote Sensing
* Mask Guided Spatial-Temporal Fusion Network for Multiple Object Tracking
* Mask-Guided Attention and Episode Adaptive Weights for Few-Shot Segmentation
* Mask-Vit: an Object Mask Embedding in Vision Transformer for Fine-Grained Visual Classification
* Masked Autoencoders for Point Cloud Self-Supervised Learning
* Masked Discrimination for Self-supervised Learning on Point Clouds
* Masked Face Recognition via Self-Attention Based Local Consistency Regularization
* Masked Generative Distillation
* Masked Siamese Networks for Label-Efficient Learning
* Maskformer with Improved Encoder-Decoder Module for Semantic Segmentation of Fine-Resolution Remote Sensing Images
* Master of All: Simultaneous Generalization of Urban-Scene Segmentation to All Adverse Weather Conditions
* Max Pooling with Vision Transformers Reconciles Class and Shape in Weakly Supervised Semantic Segmentation
* Max-Flow Based Approach for Neural Architecture Search, A
* Maximum Likelihood Surface Profilometry Via Optical coherence Tomography
* MaxViT: Multi-axis Vision Transformer
* mc-BEiT: Multi-choice Discretization for Image BERT Pre-training
* MCFM: Mutual Cross Fusion Module for Intermediate Fusion-Based Action Segmentation
* MCGNet: Multi-Level Context-aware and Geometric-aware Network for 3D Object Detection
* MDNet: Motion Distinction Network for Effective Action Recognition
* Measurably Stronger Explanation Reliability Via Model Canonization
* Measuring Class-Imbalance Sensitivity of Deterministic Performance Evaluation Metrics
* Measuring Spatial Accessibility of Healthcare Facilities in Marinduque, Philippines
* Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation
* Medium-Resolution Mapping of Evapotranspiration at the Catchment Scale Based on Thermal Infrared MODIS Data and ERA-Interim Reanalysis over North Africa
* MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment
* Memory Reduction of CGH Calculation Based On Integrating Point Light Sources
* Memory-Augmented Model-Driven Network for Pansharpening
* Memory-Efficient Deformable Convolution Based Joint Denoising and Demosaicing for UHD Images
* Memory-Efficient Learned Image Compression with Pruned Hyperprior Module
* MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation
* MENet: A Memory-Based Network with Dual-Branch for Efficient Event Stream Processing
* Merged U-Net for Bone Tumors X-Ray Images Segmentation
* MeshLoc: Mesh-Based Visual Localization
* MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
* MeshUDF: Fast and Differentiable Meshing of Unsigned Distance Field Networks
* Meta Spatio-Temporal Debiasing for Video Scene Graph Generation
* Meta-BNS FOR Adversarial Data-Free Quantization
* Meta-GF: Training Dynamic-Depth Neural Networks Harmoniously
* Meta-Learned Initialization For 3D Human Recovery
* Meta-Learning with Less Forgetting on Large-Scale Non-Stationary Task Distributions
* Meta-sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds
* MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition
* Method of Whole-Network Adjustment for Clock Offset Based on Satellite-Ground and Inter-Satellite Link Observations, A
* Method to Estimate Clear-Sky Albedo of Paddy Rice Fields, A
* Methods and Algorithms of Subsurface Holographic Sounding
* Metric Learning Based Interactive Modulation for Real-World Super-Resolution
* MFIM: Megapixel Facial Identity Manipulation
* MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views
* Microscale Image Enhancement Via PCA and Well-Exposedness Maps
* Middle-Level Feature Fusion for Lightweight RGB-D Salient Object Detection
* MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval
* MIME: Minority Inclusion for Majority Group Enhancement of AI Performance
* Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification
* MimicME: A Large Scale Diverse 4D Database for Facial Expression Analysis
* Minconvnets: a New Class of Multiplication-Less Neural Networks
* Mind the Gap in Distilling StyleGANs
* MINER: Multiscale Implicit Neural Representation
* Minimal Neural Atlas: Parameterizing Complex Surfaces with Minimal Charts and Distortion
* Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection
* Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation
* Missing Link: Finding Label Relations Across Datasets, The
* Mixed Membership Generative Adversarial Networks
* Mixed-Precision Neural Network Quantization via Learned Layer-Wise Importance
* MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition
* Mixture of Teacher Experts for Source-Free Domain Adaptive Object Detection
* Mixup-Based Deep Metric Learning Approaches for Incomplete Supervision
* ML-BPM: Multi-teacher Learning with Bidirectional Photometric Mixing for Open Compound Domain Adaptation in Semantic Segmentation
* MLP-Stereo: Heterogeneous Feature Fusion in MLP for Stereo Matching
* MLS-GAN: Multi-Level Semantic Guided Image Colorization
* MLSA-UNet: End-to-End Multi-Level Spatial Attention Guided UNet for Industrial Defect Segmentation
* MMGL: Multi-Scale Multi-View Global-Local Contrastive Learning for Semi-Supervised Cardiac Image Segmentation
* MMSR: Multiple-Model Learned Image Super-Resolution Benefiting from Class-Specific Image Priors
* MoADNet: Mobile Asymmetric Dual-Stream Networks for Real-Time and Lightweight RGB-D Salient Object Detection
* MoDA: Map Style Transfer for Self-supervised Domain Adaptation of Embodied Agents
* Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-Identification
* MODE: Multi-view Omnidirectional Depth Estimation with 360° Cameras
* Model Attribution of Face-Swap Deepfake Videos
* Model-Based Quantitative Elasticity Reconstruction Using ADMM
* MOdel-Based SyntheTic Data-Driven Learning (MOST-DL): Application in Single-Shot T2 Mapping With Severe Head Motion Using Overlapping-Echo Acquisition
* Modeling Mask Uncertainty in Hyperspectral Image Reconstruction
* Modeling the HEVC Encoding Energy Using the Encoder Processing Time
* Modular and Lightweight Networks for Bi-Scale Style Transfer
* MoFaNeRF: Morphable Facial Neural Radiance Field
* MONet: Multi-Scale Overlap Network for Duplication Detection in Biomedical Images
* Monitored Distillation for Positive Congruent Depth Completion
* Monitoring and Mapping Vegetation Cover Changes in Arid and Semi-Arid Areas Using Remote Sensing Technology: A Review
* Monitoring Megathrust-Earthquake-Cycle-Induced Relative Sea-Level Changes near Phuket, South Thailand, Using (Space) Geodetic Techniques
* Monitoring of Atmospheric Carbon Dioxide over a Desert Site Using Airborne and Ground Measurements
* Monitoring of Varroa Infestation Rate in Beehives: A Simple AI Approach
* MONO6D: Monocular Vehicle 6D Pose Estimation with 3D Priors
* Monocular 3D Object Detection with Depth from Motion
* Monocular 3D Object Reconstruction with GAN Inversion
* Monocular Robust 3D Human Localization by Global and Body-Parts Depth Awareness
* MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene Flow Estimation with Monocular Images
* Monotonically Convergent Regularization by Denoising
* MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud
* MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
* MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
* Most and Least Retrievable Images in Visual-Language Query Systems
* MOTCOM: The Multi-Object Tracking Dataset Complexity Metric
* MOTFR: Multiple Object Tracking Based on Feature Recoding
* Motion and Appearance Adaptation for Cross-domain Motion Transfer
* Motion Inspired Unsupervised Perception and Prediction in Autonomous Driving
* Motion Sensitive Contrastive Learning for Self-Supervised Video Representation
* Motion Transformer for Unsupervised Image Animation
* MotionCLIP: Exposing Human Motion Generation to CLIP Space
* MOTR: End-to-End Multiple-Object Tracking with Transformer
* Mouse Arterial Wall Imaging and Analysis from Synchrotron X-Ray Microtomography
* MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
* Moving Object Detection in Noisy Video Sequences Using Deep Convolutional Disentangled Representations
* MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects
* MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection
* MR.TOMP: Inversion of the Modulo Radon Transform (MRT) via Orthogonal Matching Pursuit (OMP)
* MRS-XNet: An Explainable One-Dimensional Deep Neural Network for Magnetic Spectroscopic Data Classification
* MSCNet: A Multilevel Stacked Context Network for Oriented Object Detection in Optical Remote Sensing Images
* MSDESIS: Multitask Stereo Disparity Estimation and Surgical Instrument Segmentation
* MSL-FER: Mirrored Self-Supervised Learning for Facial Expression Recognition
* MSNet: Multifunctional Feature-Sharing Network for Land-Cover Segmentation
* MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning
* MTTrans: Cross-domain Object Detection with Mean Teacher Transformer
* MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration
* Multi Label Image Classification using Adaptive Graph Convolutional Networks (ML-AGCN)
* Multi Object Tracking Based on Uncertainty-Aware RE-ID
* Multi-agents system for breast tumour detection in mammography by deep learning pre-processing and watershed segmentation
* Multi-Curve Translator for High-Resolution Photorealistic Image Translation
* Multi-domain Learning for Updating Face Anti-spoofing Models
* Multi-domain Multi-definition Landmark Localization for Small Datasets
* Multi-Exit Semantic Segmentation Networks
* Multi-faceted Distillation of Base-Novel Commonality for Few-Shot Object Detection
* Multi-Feature Aggregation for Semantic Segmentation of an Urban Scene Point Cloud
* Multi-Field De-Interlacing Using Deformable Convolution Residual Blocks and Self-Attention
* Multi-Frame Video Prediction with Learnable Motion Encodings
* Multi-Granularity Aggregation Transformer for Light Field Image Super-Resolution
* Multi-granularity Distillation Scheme Towards Lightweight Semi-supervised Semantic Segmentation
* Multi-granularity Prediction for Scene Text Recognition
* Multi-granularity Pruning for Model Acceleration on Mobile Devices
* Multi-label Aerial Image Classification Based on Image-Specific Concept Graphs
* Multi-Latent GAN Inversion for Unsupervised 3D Shape Completion
* Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion
* Multi-modal Text Recognition Networks: Interactive Enhancements Between Visual and Semantic Features
* Multi-Modal Transformer for RGB-D Salient Object Detection
* Multi-Modality Diversity Fusion Network with Swintransformer for RGB-D Salient Object Detection
* Multi-Object Tracking and Segmentation Via Neural Message Passing
* Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement
* Multi-Query Video Retrieval
* Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation
* Multi-Scale Content-Insensitive Fusion CNN for Source Social Network Identification, A
* Multi-Scale Deformable Transformer Encoder Based Single-Stage Pedestrian Detection
* Multi-Scale end-to-End Learning for Point Cloud Geometry Compression
* Multi-Scale Gridded Gabor Attention for Cirrus Segmentation
* Multi-Scale Raft: Combining Hierarchical Concepts for Learning-Based Optical Flow Estimation
* Multi-Scale Transformer-Based Feature Combination for Image Retrieval
* Multi-Source Image Matching Network for UAV Visual Location, A
* Multi-Stage Duplex Fusion Convnet for Aerial Scene Classification, A
* Multi-Stage Feature Alignment Network for Video Super-Resolution
* Multi-Step Test-Time Adaptation with Entropy Minimization and Pseudo-Labeling
* Multi-Task Semantic Segmentation Network for Threat Detection in X-Ray Security Images, A
* Multi-view 3D Reconstruction from Video with Transformer
* Multi-View Feature Boosting Network for Deep Subspace Clustering
* Multiclass-SGCN: Sparse Graph-Based Trajectory Prediction with Agent Class Embedding
* Multifractal Anomaly Detection in Images via Space-Scale Surrogates
* Multifractal Correlation between Terrain and River Network Structure in the Yellow River Basin, China
* Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection
* MultiMAE: Multi-modal Multi-task Masked Autoencoders
* Multimodal Conditional Image Synthesis with Product-of-Experts GANs
* Multimodal Differential Evolution Algorithm in Initial Orbit Determination for a Space-Based Too Short Arc, A
* Multimodal Object Detection via Probabilistic Ensembling
* Multimodal Transformer for Automatic 3D Annotation and Object Detection
* Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation
* Multiple-Phases-Sectionalized-Modulation SAR Barrage Jamming Method Based on NLFM Signal
* Multiscale Low-Light Image Enhancement Network With Illumination Constraint
* Multistage Adaptive Point-Growth Network for Dense Point Cloud Completion
* Multitask learning via pseudo-label generation and ensemble prediction for parasitic egg cell detection: IEEE ICIP Challenge 2022
* Multitask Multigranularity Aggregation With Global-Guided Attention for Video Person Re-Identification
* Multitemporal Mountain Rice Identification and Extraction Method Based on the Optimal Feature Combination and Machine Learning, A
* Multiview Graph Restricted Boltzmann Machines
* Multiview Regenerative Morphing with Dual Flows
* Multiview Stereo with Cascaded Epipolar RAFT
* Multiview Subspace Clustering Using Low-Rank Representation
* MuLUT: Cooperating Multiple Look-Up Tables for Efficient Image Super-Resolution
* Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection
* MvDeCor: Multi-view Dense Correspondence Learning for Fine-Grained 3D Segmentation
* MVDG: A Unified Multi-view Framework for Domain Generalization
* MVMO: A Multi-Object Dataset for Wide Baseline Multi-View Semantic Segmentation
* MVP: Multimodality-Guided Visual Pre-training
* MVSalNet: Multi-view Augmentation for RGB-D Salient Object Detection
* MVSTER: Epipolar Transformer for Efficient Multi-view Stereo
* MWNET: A Tracking Method for Frequently Occluded Scenes Based on Matter Waves
* My View is the Best View: Procedure Learning from Egocentric Videos
* Narrowing the Gap: Improved Detector Training With Noisy Location Annotations
* NashAE: Disentangling Representations Through Adversarial Covariance Minimization
* Natural Image Matting with Shifted Window Self-Attention
* Natural Synthetic Anomalies for Self-supervised Anomaly Detection and Localization
* NBD-GAP: Non-Blind Image Deblurring without Clean Target Images
* NCTR: Neighborhood Consensus Transformer for Feature Matching
* NDF: Neural Deformable Fields for Dynamic Human Modelling
* Ndist2vec: Node with Landmark and New Distance to Vector Method for Predicting Shortest Path Distance along Road Networks
* NeFSAC: Neurally Filtered Minimal Samples
* Negative Samples are at Large: Leveraging Hard-Distance Elastic Loss for Re-identification
* Neighborhood Collective Estimation for Noisy Label Identification and Chiorrection
* NeILF: Neural Incident Light Field for Physically-based Material Estimation
* NeRF for Outdoor Scene Relighting
* NEST: Neural Event Stack for Event-Based Image Enhancement
* Network Binarization via Contrastive Learning
* NeuMan: Neural Human Radiance Field from a Single Video
* NeuMesh: Learning Disentangled Neural Mesh-Based Implicit Field for Geometry and Texture Editing
* Neural Architecture Search for Fracture Classification
* Neural Architecture Search for Spiking Neural Networks
* Neural Capture of Animatable 3D Human from Monocular Video
* Neural Color Operators for Sequential Image Retouching
* Neural Correspondence Field for Object Pose Estimation
* Neural Density-Distance Fields
* Neural Image Representations for Multi-Image Fusion and Layer Separation
* Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion
* Neural Network Fragile watermarking With No Model Performance Degradation
* Neural Network Lifting Based Secondary Transform for Improved Fully Scalable Image Compression in Jpeg 2000, A
* Neural Radiance Transfer Fields for Relightable Novel-View Synthesis with Global Illumination
* Neural Scene Decoration from a Single Photograph
* Neural Space-Filling Curves
* Neural Strands: Learning Hair Geometry and Appearance from Multi-view Images
* Neural Video Compression Using GANs for Detail Synthesis and Propagation
* Neural-Sim: Learning to Generate Training Data with NeRF
* NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors
* Neuro-Inspired Deep Neural Networks with Sparse, Strong Activations
* Neuromorphic Data Augmentation for Training Spiking Neural Networks
* New Active Learning Approach for Seabed Segmentation
* New Application: A Hand Air Writing System Based on Radar Dual View Sequential Feature Fusion Idea
* New Datasets and Models for Contextual Reasoning in Visual Dialog
* New Exospheric Temperature Model Based on CHAMP and GRACE Measurements, A
* New Regularization for Retinex Decomposition of Low-Light Images, A
* New Video Quality Assessment Dataset for Video Surveillance Applications, A
* NewsStories: Illustrating Articles with Visual Summaries
* NeXT: Towards High Quality Neural Radiance Fields via Multi-skip Transformer
* NLCMAP: A Framework for the Efficient Mapping of Non-Linear Convolutional Neural Networks on FPGA Accelerators
* No Token Left Behind: Explainability-Aided Image Classification and Generation
* No-Reference Measure for Uneven Illumination Assessment on Laparoscopic Images, A
* No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models
* Noise Preserving Sharpening Filter for CT Image Enhancement, A
* Non-Deterministic Face Mask Removal Based on 3d Priors
* Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning, A
* Non-Iterative Optimization of Pseudo-Labeling Thresholds for Training Object Detection Models from Multiple Datasets
* Non-Rigid Multiple Point Set Registration Using Latent Gaussian Mixture
* Non-Separable Filtering with Side-Information and Contextually-Designed Filters for Next Generation Video Codecs
* Non-Smooth Energy Dissipating Networks
* Non-uniform Step Size Quantization for Accurate Post-training Quantization
* Nondeterministic Deformation Analysis Using Quasiconformal Geometry
* Nonlinear Unmixing via Deep Autoencoder Networks for Generalized Bilinear Model
* NormAttention-PSN: A High-frequency Region Enhanced Photometric Stereo Network with Normalized Attention
* Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space
* Not Just Streaks: Towards Ground Truth for Single Image Deraining
* Novel Approaches in Tropical Forests Mapping and Monitoring-Time for Operationalization
* Novel Class Discovery Without Forgetting
* Novel Contrastive Learning Framework for Self-Supervised Anomaly Detection, A
* Novel Dual-Branch Neural Network Model for Flood Monitoring in South Asia Based on CYGNSS Data, A
* novel fast combine-and-conquer object detector based on only one-level feature map, A
* Novel Long-Term Iterative Mining Scheme for Video Salient Object Detection, A
* Novel Low-Cost GNSS Solution for the Real-Time Deformation Monitoring of Cable Saddle Pushing: A Case Study of Guojiatuo Suspension Bridge, A
* Novel Rank Correlation Measure for Manifold Learning on Image Retrieval and Person Re-ID, A
* Novel Reconstruction With Inter-Frame Motion Compensation For Fast Super-Resolution Live Cell Imaging
* Novel Self-Supervised Cross-Modal Image Retrieval Method in Remote Sensing, A
* Novel System for Deep Contour Classifiers Certification Under Filtering Attacks, A
* Novel Visual Feature and Gaze Driven Egocentric Video Retargeting, A
* NPCFORMER: Automatic Nasopharyngeal Carcinoma Segmentation Based on Boundary Attention and Global Position Context Attention
* NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition
* NullSpaceRDAR: Regularized discriminative adaptive nullspace for object tracking
* NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
* OASIS: Only Adversarial Supervision for Semantic Image Synthesis
* Object Detection as Probabilistic Set Prediction
* Object Discovery and Representation Networks
* Object Discovery via Contrastive Learning for Weakly Supervised Object Detection
* Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image
* Object Manipulation via Visual Target Localization
* Object Wake-Up: 3D Object Rigging from a Single Image
* Object-Aware Self-Supervised Multi-Label Learning
* Object-Centric and Memory-Guided Normality Reconstruction for Video Anomaly Detection
* Object-Centric Unsupervised Image Captioning
* Object-Compositional Neural Implicit Surfaces
* ObjectBox: From Centers to Boxes for Anchor-Free Object Detection
* Objects Can Move: 3D Change Detection by Geometric Transformation Consistency
* OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
* Occlusion-Invariant Representation Alignment for Entity Re-Identification
* OCR-Free Document Understanding Transformer
* OCTA Retinal Vessel Segmentation Based on Vessel Thickness Inconsistency Loss
* Offline-Online Associated Camera-Aware Proxies for Unsupervised Person Re-Identification
* OIMNet++: Prototypical Normalization and Localization-Aware Learning for Person Search
* OMNET: Real-Time Stereo Matching with Unsupervised Occlusion Mask
* On Adversarial Robustness of Deep Image Deblurring
* On Label Granularity and Object Localization
* On Mitigating Hard Clusters for Face Clustering
* On Monocular Depth Estimation and Uncertainty Quantification Using Classification Approaches for Regression
* On Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization and Beyond
* On Quantization of Image Classification Neural Networks for Compression Without Retraining
* On the Accuracy of Open Video Quality Metrics for Local Decision in AV1 Video Codec
* On the Angular Update and Hyperparameter Tuning of a Scale-Invariant Network
* On The Benefit of Parameter-Driven Approaches for the Modeling and the Prediction of Satisfied User Ratio for Compressed Video
* On The Limits of Perceptual Quality Measures for Enhanced Underwater Images
* On the Link Between Emotion, Attention and Content in Virtual Immersive Environments
* On The Relevance of Multi-Graph Matching for Sulcal Graphs
* On the Robustness of Quality Measures for GANs
* On the Versatile Uses of Partial Distance Correlation in Deep Learning
* One Size Does NOT Fit All: Data-Adaptive Adversarial Training
* One Where They Reconstructed 3D Humans and Environments in TV Shows, The
* One-Cycle Pruning: Pruning Convnets With Tight Training Budget
* One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement
* One-Trimap Video Matting
* OneFace: One Threshold for All
* Online Adaptive Personalization for Face Anti-Spoofing
* Online Continual Learning with Contrastive Vision Transformer
* Online Domain Adaptation for Semantic Segmentation in Ever-Changing Conditions
* Online multi-object tracking with d-GLMB filter based on occlusion and identity switch handling
* Online Segmentation of LiDAR Sequences: Dataset and Algorithm
* Online Task-free Continual Learning with Dynamic Sparse Distributed Memory
* OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images
* OPD: Single-View 3D Openable Part Detection
* Open Dataset for Video Coding for Machines Standardization, An
* Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
* Open-Set Semi-Supervised Object Detection
* Open-Vocabulary DETR with Conditional Matching
* Open-World Object Detection via Discriminative Class Prototype Learning
* Open-world Semantic Segmentation for LIDAR Point Clouds
* Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
* OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
* Optical Flow Training Under Limited Label Budget via Active Learning
* Optics Lens Design for Privacy-Preserving Scene Captioning
* Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning
* Optimal Noise-Aware Imaging with Switchable Prefilters
* Optimal On-Orbit Inspection of Satellite Formation
* Optimal Transport for Label-Efficient Visible-Infrared Person Re-Identification
* Optimal Transport with a New Preprocessing for Deep-Learning Full Waveform Inversion
* Optimal Transport-Based Graph Matching for 3D Retinal OCT Image Registration
* Optimality Conditions for Bilevel Imaging Learning Problems with Total Variation Regularization
* Optimization and Validation of Hyperspectral Estimation Capability of Cotton Leaf Nitrogen Based on SPA and RF
* Optimization of the Ecological Network Structure Based on Scenario Simulation and Trade-Offs/Synergies among Ecosystem Services in Nanping
* Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation
* Optimized Decoding-Energy-Aware Encoding In Practical VVC Implementations
* Optimized Learned Entropy Coding Parameters for Practical Neural-Based Image and Video Compression
* Optimizing AV1 Encoder for Real-Time Communication
* Optimizing Image Compression via Joint Learning with Denoising
* Optimizing Radar-Based Rainfall Estimation Using Machine Learning Models
* Order Learning Using Partially Ordered Data via Chainization
* Organic Priors in Non-rigid Structure from Motion
* Orthonormal Convolutions for the Rotation Based Iterative Gaussianization
* Osegnet: Operational Segmentation Network for Covid-19 Detection Using Chest X-Ray Images
* OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers
* Out-of-distribution Detection with Boundary Aware Learning
* Out-of-Distribution Detection with Semantic Mismatch Under Masking
* Out-of-Distribution Identification: Let Detector Tell Which I Am Not Sure
* Outpainting by Queries
* Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain
* Overlooked Poses Actually Make Sense: Distilling Privileged Knowledge for Human Motion Prediction
* Overview of the Low Complexity Enhancement Video Coding (LCEVC) Standard
* Overview of Vegetation Dynamics Revealed by Remote Sensing and Its Feedback to Regional and Global Climate, An
* Ownership Verification of DNN Architectures via Hardware Cache Side Channels
* P-Frame Coding with Generalized Difference: A Novel Conditional Coding Approach
* P-STMO: Pre-trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation
* PAC-Net: Highlight Your Video via History Preference Modeling
* PACS: A Dataset for Physical Audiovisual CommonSense Reasoning
* PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks
* Paint2Pix: Interactive Painting Based Progressive Image Synthesis and Editing
* Pairwise Contrastive Learning Network for Action Quality Assessment
* Pairwise Rotational-Difference LBP for Fine-Grained Leaf Image Retrieval
* PalGAN: Image Colorization with Palette Generative Adversarial Networks
* PalQuant: Accelerating High-Precision Networks on Low-Precision Accelerators
* PAM-DenseNet: A Deep Convolutional Neural Network for Computer-Aided COVID-19 Diagnosis
* PANDORA: A Panoramic Detection Dataset for Object with Orientation
* PANDORA: Polarization-Aided Neural Decomposition of Radiance
* PanoFormer: Panorama Transformer for Indoor 360 ° Depth Estimation
* Panoptic Scene Graph Generation
* Panoptic-Deeplab-DVA: Improving Panoptic Deeplab with Dual Value Attention and Instance Boundary Aware Regression
* Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation
* Panoramic Human Activity Recognition
* Panoramic Viewport Prediction Relying on Emotional Attention Map
* Panoramic Vision Transformer for Saliency Detection in 360° Videos
* Parallel Attribute Computation for Distributed Component Forests
* Parallel Electrical Conductivity at Low and Middle Latitudes in the Topside Ionosphere Derived from CSES-01 Measurements
* Parallel Partitioning: Path Reducing and Union-Find Based Watershed for the GPU
* Parallelizable Global Quasi-Conformal Parameterization of Multiply Connected Surfaces via Partial Welding
* Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration
* Parasitic Egg Detection and Classification by Utilizing the YOLO Algorithm with Deep Latent Space Image Restoration and GrabCut Augmentation
* Parasitic Egg Detection and Classification with Transformer-Based Architectures
* Parasitic Egg Detection with a Deep Learning Ensemble
* ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer
* Partial Point Cloud Registration Via Soft Segmentation
* Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories
* ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild
* PartImageNet: A Large, High-Quality Dataset of Parts
* Partition and Reunion: A Viewpoint-Aware Loss for Vehicle Re-Identification
* PASNET: A Self-AdaPtive Point Cloud Sorting APProach to an ImProved Feature Extraction
* PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification
* Passenger Flow Prediction of Scenic Spots in Jilin Province Based on Convolutional Neural Network and Improved Quantile Regression Long Short-Term Memory Network
* PASTS: Toward Effective Distilling Transformer for Panoramic Semantic Segmentation
* Patch Similarity Aware Data-Free Quantization for Vision Transformers
* Patch-Based Algorithm for Diverse and High Fidelity Single Image Generation, A
* Patch-Based Approach for Artistic Style Transfer Via Constrained Multi-Scale Image Matching, A
* PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation
* Patient Aware Active Learning for Fine-Grained OCT Classification
* PCA Event-Based Optical Flow: A Fast and Accurate 2D Motion Estimation
* PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry
* PCRP: Unsupervised Point Cloud Object Retrieval and Pose Estimation
* PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching
* PD-Flow: A Point Cloud Denoising Framework with Normalizing Flows
* PDE-Constrained Optimization for Nuclear Mechanics
* Perceiving and Modeling Density for Image Dehazing
* Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution
* Perception-Distortion Trade-Off in the SR Space Spanned by Flow Models
* Perceptual Artifacts Localization for Inpainting
* Perceptual Hashing With Complementary Color Wavelet Transform and Compressed Sensing for Reduced-Reference Image Quality Assessment
* Perceptual Quality Metric for Video Frame Interpolation, A
* Performance Assessment of GPM IMERG Products at Different Time Resolutions, Climatic Areas and Topographic Conditions in Catalonia
* Perovskite CsPbBr3 Single Crystal Detector for High Flux X-Ray Photon Counting
* PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark
* Person Re-Identification Baseline Based on Attention Block Neural Architecture Search, A
* Person Re-Identification in Panoramic Views Based on Bayesian Transformers
* Personalized Education: Blind Knowledge Distillation
* Personalizing Federated Medical Image Segmentation via Local Calibration
* Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation
* Perspective on the Impact of Grassland Degradation on Ecosystem Services for the Purpose of Sustainable Management, A
* Perspective Phase Angle Model for Polarimetric 3D Reconstruction
* Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow, A
* PET/CT Co-Segmentation Based on Hybrid Active Contour Model
* PETR: Position Embedding Transformation for Multi-View 3D Object Detection
* PGTNet: Prototype Guided Transfer Network for Few-Shot Anomaly Localization
* PGUNeT: Covid-19 CT Image Segmentation Using GAN and Feature Pyramid
* Photo-realistic Neural Domain Randomization
* Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches
* Physically-Based Editing of Indoor Scene Lighting from a Single Image
* PillarNet: Real-Time and High-Performance Pillar-Based 3D Object Detection
* PIP: Physical Interaction Prediction via Mental Simulation with Span Selection
* Pixel-Wise Energy-Biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes
* PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation
* Placial-Discursive Topologies of Violence: Volunteered Geographic Information and the Reproduction of Violent Places in Recife, Brazil
* PlaneFormers: From Sparse View Planes to 3D Reconstruction
* Planes vs. Chairs: Category-Guided 3D Shape Learning Without any 3D Cues
* Plant and Animal Species Recognition Based on Dynamic Vision Transformer Architecture
* Point Cloud Completion By Minimizing Prediction Errors In Both 2D And 3D Spaces
* Point Cloud Compression with Range Image-Based Entropy Model for Autonomous Driving
* Point Cloud Compression with Sibling Context and Surface Priors
* Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction
* Point MixSwap: Attentional Point Cloud Mixing via Swapping Matched Structural Divisions
* Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding
* Point Scene Understanding via Disentangled Instance Mesh Reconstruction
* Point-to-Box Network for Accurate Object Detection via Single Point Supervision
* PointCLM: A Contrastive Learning-based Framework for Multi-instance Point Cloud Registration
* PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation
* PointInst3D: Segmenting 3D Instances by Points
* Pointivae: Invertible Variational Autoencoder Framework for 3D Point Cloud Generation
* Pointly-Supervised Panoptic Segmentation
* PointMixer: MLP-Mixer for Point Cloud Understanding
* PointScatter: Point Set Representation for Tubular Structure Extraction
* PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees
* Polarimetric Pose Prediction
* PolarMOT: How Far Can Geometric Relations Take us in 3D Multi-object Tracking?
* PolSAR Models with Multimodal Intensities
* Polygon-Free: Unconstrained Scene Text Detection with Box Annotations
* Polygonal 3D Layout Reconstruction of an Indoor Environment via Voxel-Based Room Segmentation and Space Partition, The
* PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation
* POP: Mining POtential Performance of New Fashion Products via Webly Cross-modal Query Expansion
* Pose Calibrated Feature Aggregation for Face Set Recognition
* Pose for Everything: Towards Category-Agnostic Pose Estimation
* Pose Forecasting in Industrial Human-Robot Collaboration
* Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields
* Pose2Room: Understanding 3D Scenes from Human Activities
* PoseGPT: Quantization-Based 3D Human Motion Generation and Forecasting
* PoserNet: Refining Relative Camera Poses Exploiting Object Detections
* PoseScript: 3D Human Poses from Natural Language
* PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation
* Poseur: Direct Human Pose Regression with Transformers
* Positive Unlabeled Learning by Semi-Supervised Learning
* Posterior Refinement on Metric Matrix Improves Generalization Bound in Metric Learning
* Potential of ALOS2 Polarimetric Imagery to Support Management of Poplar Plantations in Northern Italy
* PPT: Anomaly Detection Dataset of Printed Products with Templates
* PPT: Token-Pruned Pose Transformer for Monocular and Multi-view Human Pose Estimation
* Practical and Scalable Desktop-Based High-Quality Facial Capture
* Practical Bulk Denoising Of Large Binary Images
* Pre-training Strategies and Datasets for Facial Representation Learning
* Predicting Habitat Properties Using Remote Sensing Data: Soil pH and Moisture, and Ground Vegetation Cover
* Predicting Human Perception of Scene Complexity
* Predicting Is Not Understanding: Recognizing and Addressing Underspecification in Machine Learning
* Predicting Path Loss Distributions of a Wireless Communication System for Multiple Base Station Altitudes from Satellite Images
* Predicting Radiologist Attention During Mammogram Reading with Deep and Shallow High-Resolution Encoding
* Predicting Soil Properties from Hyperspectral Satellite Images
* Predicting the Colors of Reference Surfaces for Color Constancy
* Prediction-Guided Distillation for Dense Object Detection
* PREF: Predictability Regularized Neural Motion Fields
* PressureVision: Estimating Hand Pressure from a Single RGB Image
* PreTraM: Self-supervised Pre-training via Connecting Trajectory and Map
* PRIF: Primary Ray-Based Implicit Function
* Primary Interannual Variability Patterns of the Growing-Season NDVI over the Tibetan Plateau and Main Climatic Factors
* PRIME: A Few Primitives Can Boost Robustness to Common Corruptions
* Primitive-Based Shape Abstraction via Nonparametric Bayesian Inference
* Prior Knowledge Guided Unsupervised Domain Adaptation
* Prior Semantic Harmonization Network for Few-Shot Semantic Segmentation
* Prior-Guided Adversarial Initialization for Fast Adversarial Training
* Privacy-Preserving Action Recognition via Motion Difference Quantization
* Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain
* PrivHAR: Recognizing Human Actions from Privacy-Preserving Lens
* Probability Model Estimation for M-Ary Random Variables
* Probing Seismogenic Faults with Machine Learning
* Progressive Multiscale Consistent Network for Multiclass Fundus Lesion Segmentation
* Progressive Training Enabled Fine-Grained Recognition
* Prohibited Object Detection in X-ray Images with Dynamic Deformable Convolution and Adaptive IoU
* Projective Parallel Single-Pixel Imaging to Overcome Global Illumination in 3D Structure Light Scanning
* PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images
* Prompting Visual-Language Models for Efficient Video Understanding
* Propagating Facial Prior Knowledge for Multitask Learning in Face Super-Resolution
* Proper Orthogonal Decomposition Approach for Parameters Reduction of Single Shot Detector Networks, A
* Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning
* ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection
* Protecting World Leader Using Facial Speaking Pattern Against Deepfakes
* Prototype Queue Learning for Multi-Class Few-Shot Semantic Segmentation
* Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation
* Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation
* ProX: A Reversed Once-for-All Network Training Paradigm for Efficient Edge Models Training in Medical Imaging
* Prune Your Model Before Distill It
* PS-NeRF: Neural Inverse Rendering for Multi-view Photometric Stereo
* PSCC-Net: Progressive Spatio-Channel Correlation Network for Image Manipulation Detection and Localization
* PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection
* Pseudo Decoder Guided Light-Weight Architecture for Image Inpainting
* PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds
* PseudoClick: Interactive Image Segmentation with Click Imitation
* psi-Net is an Efficient Tiny Defect Detector
* PSS: Progressive Sample Selection for Open-World Visual Representation Learning
* PT4AL: Using Self-supervised Pretext Tasks for Active Learning
* PTQ4ViT: Post-training Quantization for Vision Transformers with Twin Uniform Quantization
* PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection
* Pure Transformer with Integrated Experts for Scene Text Recognition
* Pure Versus Hybrid Transformers For Multi-Modal Brain Tumor Segmentation: A Comparative Study
* Pyramid Knowledge Distillation for Efficient Human Pose Estimation
* Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization
* QISTA-ImageNet: A Deep Compressive Image Sensing Framework Solving Lq-Norm Optimization Problem
* Quadtree-based Guided CNN for AV1 In-loop Filtering
* Quantification of Underwater Sargassum Aggregations Based on a Semi-Analytical Approach Applied to Sentinel-3/OLCI (Copernicus) Data in the Tropical Atlantic Ocean
* Quantify the Potential Spatial Reshaping Utility of Urban Growth Boundary (UGB): Evidence from the Constrained Scenario Simulation Model
* Quantifying Lidar Elevation Accuracy: Parameterization and Wavelength Selection for Optimal Ground Classifications Based on Time since Fire/Disturbance
* Quantitative Analysis of Tectonic Geomorphology Research Based on Web of Science from 1981 to 2021
* Quantitative Inversion of Lunar Surface Chemistry Based on Hyperspectral Feature Bands and Extremely Randomized Trees Algorithm
* Quantized GAN for Complex Music Generation from Dance Videos
* Quantum Motion Segmentation
* Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap
* Quaternion-based dynamic mode decomposition for background modeling in color videos
* Query Learning of Both Thing and Stuff for Panoptic Segmentation
* Query-Efficient Adversarial Attack Based On Latin Hypercube Sampling
* Ques-to-Visual Guided Visual Question Answering
* R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning
* R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis
* RA-Depth: Resolution Adaptive Self-supervised Monocular Depth Estimation
* Radatron: Accurate Detection Using Multi-resolution Cascaded MIMO Radar
* RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-Guided Disease Classification
* Rain-Prior Injected Knowledge Distillation for Single Image Deraining
* RamGAN: Region Attentive Morphing GAN for Region-Level Makeup Transfer
* Random Forest Algorithm for Landsat Image Chromatic Aberration Restoration Based on GEE Cloud Platform: A Case Study of Yucatan Peninsula, Mexico, A
* Random Generated Dictionaries for Convolutional Sparse Coding: An ELM Interpretation for Simple CSC Applications
* RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
* RAPID: A Single Stage Pruning Framework
* Rate-Distortion Optimal Transform Coefficient Selection for Unoccupied Regions in Video-Based Point Cloud Compression
* RAWtoBit: A Fully End-to-end Camera ISP Network
* Rayleigh EigenDirections (REDs): Nonlinear GAN Latent Space Traversals for Multidimensional Features
* RayTran: 3D Pose Estimation and Shape Reconstruction of Multiple Objects from Videos with Ray-Traced Transformers
* RBC: Rectifying the Biased Context in Continual Semantic Segmentation
* RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation
* RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
* RCLane: Relay Chain Prediction for Lane Detection
* RD-IWAN: Residual Dense Based Imperceptible Watermark Attack Network
* RDA: Reciprocal Distribution Alignment for Robust Semi-supervised Learning
* RDO-Q: Extremely Fine-Grained Channel-Wise Quantization via Rate-Distortion Optimization
* ReAct: Temporal Action Detection with Relational Queries
* Real Spike: Learning Real-Valued Spikes for Spiking Neural Networks
* Real World Dataset for Multi-view 3D Reconstruction, A
* Real- and Complex-Valued Neural Networks for SAR Image Segmentation Through Different Polarimetric Representations
* Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset
* Real-Time Intermediate Flow Estimation for Video Frame Interpolation
* Real-Time Neural Character Rendering with Pose-Guided Multiplane Images
* Real-Time Online Video Detection with Temporal Smoothing Transformers
* Real-World Image Super-Resolution Via Kernel Augmentation And Stochastic Variation
* Real-World Video Anomaly Detection by Extracting Salient Features
* RealFlow: EM-Based Realistic Optical Flow Dataset Generation from Videos
* Realism Metric for Generated LiDAR Point Clouds, A
* Realistic Blur Synthesis for Learning Image Deblurring
* Realistic One-Shot Mesh-Based Head Avatars
* RealPatch: A Statistical Matching Framework for Model Patching with Real Samples
* REALY: Rethinking the Evaluation of 3D Face Reconstruction
* Recognition-Aware Deep Video Compression for Remote Surveillance
* Recognizing Slanted Deck Scenes by Non-Manhattan Spatial Right Angle Projection
* ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion
* Recover Fair Deep Classification Models via Altering Pre-trained Structure
* Recurrent Attentive Decomposition Network for Low-Light Image Enhancement
* Recurrent Bilinear Optimization for Binary Neural Networks
* Reduced Dependency Fast Unsupervised 3D Face Reconstruction
* Reducing Information Loss for Spiking Neural Networks
* Reference Guided Reflection Removal Using Deep Visual Attribute Cues
* Reference-Based Blind Super-Resolution Kernel Estimation
* Reference-Based Image Super-Resolution with Deformable Attention Transformer
* Reference-Based JPEG Image Artifacts Removal
* Reference-Guided Texture and Structure Inference for Image Inpainting
* Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance
* Refined Model for Quad-Polarimetric Reconstruction from Compact Polarimetric Data, A
* Refining Self-Supervised Learning in Imaging: Beyond Linear Metric
* RegGeoNet: Learning Regular Representations for Large-Scale 3D Point Clouds
* Region-Of-Interest Coding Schemes for HTTP Adaptive Streaming With VVC
* Regional Saliency Map Attack for Medical Image Segmentation
* RegionCL: Exploring Contrastive Region Pairs for Self-supervised Representation Learning
* Registration Based Few-Shot Anomaly Detection
* Registration of Building Scan with IFC-Based BIM Using the Corner Points
* Regularizing Vector Embedding in Bottom-Up Human Pose Estimation
* Reinforcing Neuron Extraction from Calcium Imaging Data via Depth-Estimation Constrained Nonnegative Matrix Factorization
* Relation and context augmentation network for facial expression recognition
* Relation Enhanced Vision Language Pre-Training
* Relation-Guided Network for Image-Text Retrieval
* Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks
* Relationformer: A Unified Framework for Image-to-Graph Generation
* Relationship Spatialization for Depth Estimation
* Relative Contrastive Loss for Unsupervised Representation Learning
* Relative Pose from SIFT Features
* Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval
* Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation, A
* Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
* Relighting4D: Neural Relightable Human from Videos
* RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild
* Remote Respiration Monitoring of Moving Person Using Radio Signals
* Remote Sensing of Chlorophyll-a in Xinkai Lake Using Machine Learning and GF-6 WFV Images
* Remote Sensing of Coastal Wetland Degradation Using the Landscape Directional Succession Model
* Remote-Sensing Scene-Image Classification Method Based on Deep Multiple-Instance Learning with a Residual Dense Attention ConvNet, A
* RepMix: Representation Mixing for Robust Attribution of Synthesized Images
* REPNP: Plug-and-Play with Deep Reinforcement Learning Prior for Robust Image Restoration
* Representation Learning Optimization for 3D Point Cloud Quality Assessment Without Reference
* Representation Learning Using Rank Loss for Robust Neurosurgical Skills Evaluation
* Reproducing Sensory Induced Hallucinations via Neural Fields
* Repulsive Force Unit for Garment Collision Handling in Neural Networks, A
* Research into the Optimal Regulation of the Groundwater Table and Quality in the Southern Plain of Beijing Using Geographic Information Systems Data and Machine Learning Algorithms
* Research on Spatial Distribution Characteristics and Influencing Factors of Pension Resources in Shanghai Community-Life Circle
* Residual Graph Attention Network and Expression-Respect Data Augmentation Aided Visual Grounding
* Residual Swin Transformer Unet with Consistency Regularization for Automatic Breast Ultrasound Tumor Segmentation
* Residual U-Structure Nested Conditional Adversarial Nets Colorized CT Improves Deep Learning Based Abdominal Multi-Organ Segmentation
* Resolution-Free Point Cloud Sampling Network with Data Distillation
* Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction
* Response of Land Surface Temperature Changes to the Vegetation Dynamics in the Yangtze River Basin, The
* Responsive Listening Head Generation: A Benchmark Dataset and Baseline
* Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks
* Rethinking Closed-Loop Training for Autonomous Driving
* Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning
* Rethinking Confidence Calibration for Failure Prediction
* Rethinking Data Augmentation for Robust Visual Question Answering
* Rethinking Efficacy of Softmax for Lightweight Non-local Neural Networks
* Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark
* Rethinking Generic Camera Models for Deep Single Image Camera Calibration to Recover Rotation and Fisheye Distortion
* Rethinking IoU-based Optimization for Single-stage 3D Object Detection
* Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-person Human Pose Estimation
* Rethinking Learning Approaches for Long-Term Action Anticipation
* Rethinking Robust Representation Learning Under Fine-Grained Noisy Faces
* Rethinking Unified Spectral-Spatial-Based Hyperspectral Image Classification Under 3D Configuration of Vision Transformer
* Rethinking Unsupervised Neural Superpixel Segmentation
* Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior
* Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions
* Retina-Inspired Spatio-Temporal Filtering for Dynamic Video Coding
* Retrieval of Farmland Surface Soil Moisture Based on Feature Optimization and Machine Learning
* Reverse Error Modeling for Improved Semantic Segmentation
* Reversible Data Hiding With Brightness Preserving Contrast Enhancement by Two-Dimensional Histogram Modification
* Revisiting a kNN-Based Image Classification System with High-Capacity Storage
* Revisiting Artistic Style Transfer for Data Augmentation in A Real-Case Scenario
* Revisiting Batch Norm Initialization
* Revisiting Click-Based Interactive Video Object Segmentation
* Revisiting Natural Scene Statistical Modeling Using Deep Features for Opinion-Unaware Image Quality Assessment
* Revisiting Outer Optimization in Adversarial Training
* Revisiting Point Cloud Simplification: A Learnable Feature Preserving Approach
* Revisiting Spatial Inductive Bias with MLP-Like Model
* Revisiting the Critical Factors of Augmentation-Invariant Representation Learning
* Revisiting the Efficiency of UGC Video Quality Assessment
* Revisiting-Reciprocal Distance Re-Ranking for Skeleton-Based Person Re-Identification
* Reviving Iterative Training with Mask Guidance for Interactive Segmentation
* RFLA: Gaussian Receptive Field Based Label Assignment for Tiny Object Detection
* RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds
* RGB-T tracking by modality difference reduction and feature re-selection
* RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN
* Rice Crop Height Inversion from TanDEM-X PolInSAR Data Using the RVoG Model Combined with the Logistic Growth Equation
* RIChEx: A Robust Inter-Frame Change Exposure for Segmenting Moving Objects
* RigNet: Repetitive Image Guided Network for Depth Completion
* Rise of the Lottery Heroes: Why Zero-Shot Pruning is Hard, The
* Robust 3d Cell Segmentation: Extending The View Of Cellpose
* Robust Algorithm for Multi-GNSS Precise Positioning and Performance Analysis in Urban Environments, A
* Robust and Accurate Object Detection Via Self-Knowledge Distillation
* Robust Beamforming Based on Complex-Valued Convolutional Neural Networks for Sensor Arrays
* Robust Calibration-Marker and Laser-Line Detection For Underwater 3D Shape Reconstruction By Deep Neural Network
* Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features
* Robust Collaborative Learning of Patch-Level and Image-Level Annotations for Diabetic Retinopathy Grading From Fundus Image
* Robust Ensemble Model For Parasitic Egg Detection And Classification, A
* Robust Grid Detection in Historical Map Images
* Robust Landmark-Based Stent Tracking in X-ray Fluoroscopy
* Robust Misalignment Estimation Approach in Non-Aligned Double JPEG Compression Scenario, A
* Robust Multi-object Tracking by Marginal Inference
* Robust Network Architecture Search via Feature Distortion Restraining
* Robust Object Detection with Inaccurate Bounding Boxes
* Robust PCA Unrolling Network for Super-Resolution Vessel Extraction in X-Ray Coronary Angiography
* Robust real-world point cloud registration by inlier detection
* Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation
* Robust Visual Tracking by Segmentation
* Robustness Analysis of Distributed Kalman Filter for Estimation in Sensor Networks
* RocNet: Recursive octree network for efficient 3D processing
* Rotation Regularization Without Rotation
* Rotation-Equivariant Graph Convolutional Networks For Spherical Data Via Global-Local Attention
* Route Plans for UAV Aerial Surveys according to Different DEMs in Complex Mountainous Surroundings: A Case Study in the Zheduoshan Mountains, China
* RPFNET: Complementary Feature Fusion for Hand Gesture Recognition
* RRSR: Reciprocal Reference-Based Image Super-Resolution with Progressive Feature Alignment and Selection
* RSDet++: Point-Based Modulated Loss for More Accurate Rotated Object Detection
* Rupture Process of the 2022 Mw6.6 Menyuan, China, Earthquake from Joint Inversion of Accelerogram Data and InSAR Measurements
* RVSL: Robust Vehicle Similarity Learning in Real Hazy Scenes Based on Semi-supervised Learning
* RW-HAZE: A Real-World Benchmark Dataset to Evaluate Quantitatively Dehazing Algorithms
* RWN: Robust Watermarking Network for Image Cropping Localization
* S 2 Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-supervised Learning
* S2-VER: Semi-supervised Visual Emotion Recognition
* S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction
* S2N: Suppression-Strengthen Network for Event-Based Recognition Under Variant Illuminations
* S2Net: Stochastic Sequential Pointcloud Forecasting
* S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning
* SAFA: Sample-Adaptive Feature Augmentation for Long-Tailed Image Classification
* SAGA: Stochastic Whole-Body Grasping with Contact
* Saliency Detection via Global Context Enhanced Feature Fusion and Edge Weighted Loss
* Saliency Hierarchy Modeling via Generative Kernels for Salient Object Detection
* Salient Object Detection for Point Clouds
* Salient Object Detection via Dynamic Scale Routing
* SALISA: Saliency-Based Input Sampling for Efficient Video Object Detection
* SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas
* Sampling Agnostic Feature Representation for Long-Term Person Re-Identification
* SAR Image Super-Resolution Reconstruction Based on Full-Resolution Discrimination
* Sat: Self-Adaptive Training for Fashion Compatibility Prediction
* Satellite Image Change Detection Using Disjoint Information and Local Dissimilarity Map
* Satellite-Derived Photosynthetically Available Radiation at the Coastal Arctic Seafloor
* SAU: Smooth Activation Function Using Convolution with Approximate Identities
* SAVE: Spatial-Attention Visual Exploration
* SC-wLS: Towards Interpretable Feed-forward Camera Re-localization
* Scalability and Performance of LiDAR Point Cloud Data Management Systems: A State-of-the-Art Review
* Scalable Gamma-Driven Multilayer Network for Brain Workload Detection Through Functional Near-Infrared Spectroscopy
* Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models
* ScalableViT: Rethinking the Context-Oriented Generalization of Vision Transformer
* Scale Effects and Time Variation of Trade-Offs and Synergies among Ecosystem Services in the Pearl River Delta, China
* Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection
* ScaleNet: Searching for the Model to Scale
* Scaling Adversarial Training to Large Perturbation Bounds
* Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
* SCAM! Transferring Humans Between Images with Semantic Cross Attention Modulation
* Scanpath Prediction Via Semantic Representation of the Scene
* Scene Context Enhanced Network for Person Search
* Scene Reconstruction with Functional Objects for Robot Autonomy
* Scene Representation Learning from Videos Using Self-Supervised and Weakly-Supervised Techniques
* Scene Text Recognition with Permuted Autoregressive Sequence Models
* SCINet: Semantic Cue Infusion Network for Lane Detection
* Scraping Textures from Natural Images for Synthesis and Editing
* SdAE: Self-distillated Masked Autoencoder
* SecretGen: Privacy Recovery on Pre-trained Models via Distribution Discrimination
* Secrets of Event-Based Optical Flow
* Secure Distributed Adaptive Platooning Control of Automated Vehicles Over Vehicular Ad-Hoc Networks Under Denial-of-Service Attacks
* SeedFormer: Patch Seeds Based Point Cloud Completion with Upsample Transformer
* Seeing Far in the Dark with Patterned Flash
* Seeing Through a Black Box: Toward High-Quality Terahertz Imaging via Subspace-and-Attention Guided Restoration
* Segmentation-Free Super-Resolved 4D flow MRI Reconstruction Exploiting Navier-Stokes Equations and Spatial Regularization
* SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness
* Selection and Cross Similarity for Event-Image Deep Stereo
* SelectionConv: Convolutional Neural Networks for Non-rectilinear Image Data
* Selective Element and Two Orders Vectorization Networks for Automatic Depression Severity Diagnosis via Facial Changes
* Selective Intra-Image Similarity for Personalized Fixation-Based Object Segmentation
* Selective Query-Guided Debiasing for Video Corpus Moment Retrieval
* Selective TransHDR: Transformer-Based Selective HDR Imaging Using Ghost Region Mask
* Self-calibrating Photometric Stereo by Neural Inverse Rendering
* Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation
* Self-Contrastive Learning Framework for Skin Cancer Detection Using Histological Images, A
* Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving
* Self-distilled Feature Aggregation for Self-supervised Monocular Depth Estimation
* Self-Feature Distillation with Uncertainty Modeling for Degraded Image Recognition
* Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization
* Self-Promoted Supervision for Few-Shot Transformer
* Self-Regulated Feature Learning via Teacher-free Feature Distillation
* Self-slimmed Vision Transformer
* Self-Superflow: Self-Supervised Scene Flow Prediction in Stereo Sequences
* Self-Supervised Class-Cognizant Few-Shot Classification
* Self-Supervised Classification Network
* Self-Supervised Cooperative Colorization of Achromatic Faces
* Self-Supervised Domain Adaptation in Crowd Counting
* Self-Supervised Frontalization and Rotation GAN with Random Swap for Pose-Invariant Face Recognition
* Self-supervised Human Mesh Recovery with Cross-Representation Alignment
* Self-supervised Interactive Object Segmentation Through a Singulation-and-Grasping Approach
* Self-supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations
* Self-Supervised Learning for Texture Classification Using Limited Labeled Data
* Self-Supervised Learning of Optical Flow, Depth, Camera Pose and Rigidity Segmentation with Occlusion Handling
* Self-supervised Learning of Visual Graph Matching
* Self-Supervised Low-Light Image Enhancement Using Discrepant Untrained Network Priors
* Self-Supervised Method for Infrared and Visible Image Fusion, A
* Self-Supervised Pretraining for Deep Hash-Based Image Retrieval
* Self-supervised Social Relation Representation for Human Group Detection
* Self-supervised Sparse Representation for Video Anomaly Detection
* Self-Supervision Can Be a Good Few-Shot Learner
* Self-support Few-Shot Semantic Segmentation
* Self-Training Weakly-Supervised Framework for Pathologist-Like Histopathological Image Analysis, A
* Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields
* Semantic Alignment for Multi-Item Compression
* Semantic Graph Attention With Explicit Anatomical Association Modeling for Tooth Segmentation From CBCT Images
* Semantic Image Segmentation: Two Decades of Research
* Semantic Novelty Detection via Relational Reasoning
* Semantic Unfolding of StyleGAN Latent Space
* Semantic-Aware Fine-Grained Correspondence
* Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation
* Semantic-Guided Multi-mask Image Harmonization
* Semantic-Sparse Colorization Network for Deep Exemplar-Based Colorization
* SemAug: Semantically Meaningful Image Augmentations for Object Detection Through Language Grounding
* Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning
* Semi-Overcomplete Convolutional Auto-Encoder Embedding as Shape Priors for Deep Vessel Segmentation
* Semi-Supervised 3D Medical Image Segmentation Via Boundary-Aware Consistent Hidden Representation Learning
* Semi-supervised 3D Object Detection with Proficient Teachers
* Semi-supervised Deep Convolutional Transform Learning for Hyperspectral Image Classification
* Semi-supervised Keypoint Detector and Descriptor for Retinal Image Matching
* Semi-supervised Learning of Optical Flow by Flow Supervisor
* Semi-supervised Monocular 3D Object Detection by Multi-view Consistency
* Semi-Supervised Neuron Segmentation via Reinforced Consistency Learning
* Semi-supervised Object Detection via VC Learning
* Semi-Supervised Ranking for Object Image Blur Assessment
* Semi-supervised Single-View 3D Reconstruction via Prototype Shape Priors
* Semi-supervised Temporal Action Detection with Proposal-Free Masking
* Semi-supervised Vision Transformers
* SEMICON: A Learning-to-Hash Solution for Large-Scale Fine-Grained Image Retrieval
* Semidefinite Relaxations of Truncated Least-Squares in Robust Rotation Search: Tight or Not
* SentiStory: A Multi-Layered Sentiment-Aware Generative Model for Visual Storytelling
* SepLUT: Separable Image-Adaptive Lookup Tables for Real-Time Image Enhancement
* SeqDNet: Improving Missing Value by Sequential Depth Network
* SeqFormer: Sequential Transformer for Video Instance Segmentation
* SeqTR: A Simple Yet Universal Network for Visual Grounding
* Sequential Cross Attention Based Multi-Task Learning
* Sequential Multi-view Fusion Network for Fast LiDAR Point Motion Estimation
* SESS: Saliency Enhancing with Scaling and Sliding
* Severity Classification in Cases of Collagen Vi-Related Myopathy with Convolutional Neural Networks and Handcrafted Texture Features
* SFIC: Sparsity-Driven Facial Image Compression Network
* SFPN: Synthetic FPN for Object Detection
* SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
* SGUIE-Net: Semantic Attention Guided Underwater Image Enhancement With Multi-Scale Perception
* Shallow Sea Topography Detection from Multi-Source SAR Satellites: A Case Study of Dazhou Island in China
* Shap-CAM: Visual Explanations for Convolutional Neural Networks Based on Shapley Value
* Shape Matters: Deformable Patch Attack
* Shape Part Slot Machine: Contact-Based Reasoning for Generating 3D Shapes from Parts, The
* Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons
* ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization
* Share with Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency
* Shift-Tolerant Perceptual Similarity Metric
* Should All Proposals Be Treated Equally in Object Detection?
* Shuffle Attention Multiple Instances Learning for Breast Cancer Whole Slide Image Classification
* ShuffleCloudNet: A Lightweight Composite Neural Network-Based Method for Cloud Computation in Remote-Sensing Images
* SiamDoGe: Domain Generalizable Semantic Segmentation Using Siamese Network
* SiaTrans: Siamese transformer network for RGB-D salient object detection with depth image classification
* Sign-OPT+: An Improved Sign Optimization Adversarial Attack
* Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
* Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking
* SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation
* Similarity Distillation Guided Feature Refinement Network for Few-Shot Semantic Segmentation, A
* Simple and Robust Correlation Filtering Method for Text-Based Person Search, A
* Simple Approach and Benchmark for 21,000-Category Object Detection, A
* Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model, A
* Simple Baselines for Image Restoration
* Simple Open-Vocabulary Object Detection
* Simple Siamese Framework for Vibration Signal Representations, A
* Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation, A
* SimpleRecon: 3D Reconstruction Without 3D Convolutions
* Simulator Attack+ for Black-Box Adversarial Attack
* Simultaneous Learning and Compression for Convolution Neural Networks
* Simultaneous Obstacle Avoidance and Target Tracking of Multiple Wheeled Mobile Robots With Certified Safety
* Simultaneous Smoothing and Sharpening Using iWGIF
* Simurgh: A Framework for CAD-Driven Deep Learning Based X-Ray CT Reconstruction
* Single Frame Atmospheric Turbulence Mitigation: A Benchmark Study and a New Physics-Inspired Transformer Model
* Single Image Dehazing via Model-Based Deep-Learning
* Single Image Reflection Removal Based on Bi-Channels Prior
* Single Stage Virtual Try-On Via Deformable Attention Flows
* Single-Snapshot Nested Virtual Array Completion: Necessary and Sufficient Conditions
* Single-Stream Multi-level Alignment for Vision-Language Pretraining
* SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image
* SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding
* Skeleton-Free Pose Transfer for Stylized 3D Characters
* Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction
* Sketch is Worth a Thousand Words: Image Retrieval with Text and Sketch, A
* SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling
* Skip-MLP Network for Point Cloud Classification
* Sliced Recursive Transformer
* Slicing Aided Hyper Inference and Fine-Tuning for Small Object Detection
* SLiDE: Self-supervised LiDAR De-snowing Through Reconstruction Difficulty
* Sliding Window Detection and Analysis Method of Night-Time Light Remote Sensing Time Series: A Case Study of the Torch Festival in Yunnan Province, China
* Sliding Window Scheme for Online Temporal Action Localization, A
* Slim Scissors: Segmenting Thin Object from Synthetic Background
* SLIP: Self-supervision Meets Language-Image Pre-training
* SLTFILL: Spatial and Light Transformer for Multi-Reference Image Inpainting
* Small Low-Contrast Target Detection: Data-Driven Spatiotemporal Feature Fusion and Implementation
* Smart Learning of Click and Refine for Nuclei Segmentation on Histology Images
* Smile: Sequence-to-Sequence Domain Adaptation with Minimizing Latent Entropy for Text Image Recognition
* SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos
* SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data
* Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives
* Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations
* Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation
* Social-SSL: Self-supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction
* SocialVAE: Human Trajectory Prediction Using Timewise Latents
* Socio-Ecological Vulnerability in Aba Prefecture, Western Sichuan Plateau: Evaluation, Driving Forces and Scenario Simulation
* Soft Masking for Cost-Constrained Channel Pruning
* Soli Radar Image-Based Target Localization
* Solution Space Analysis of Essential Matrix Based on Algebraic Error Minimization
* SOS! Self-supervised Learning over Sets of Handled Objects in Egocentric Action Recognition
* Sound Localization by Self-supervised Time Delay Estimation
* Sound-Guided Semantic Video Generation
* Source-Free Domain Adaptation with Contrastive Domain Alignment and Self-supervised Exploration for Face Anti-spoofing
* Source-Free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition
* SP-Net: Slowly Progressing Dynamic Inference Networks
* SPA-GAN: SAR Parametric Autofocusing Method with Generative Adversarial Network
* Space-Partitioning RANSAC
* Sparse Distortionless Modal Beamforming for Spherical Microphone Arrays
* Sparse-Based Transformer Network With Associated Spatiotemporal Feature for Micro-Expression Recognition, A
* SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views
* Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding
* Spatial Gap-Filling of GK2A Daily Sea Surface Temperature (SST) around the Korean Peninsula Using Meteorological Data and Regression Residual Kriging (RRK)
* Spatial Modeling of COVID-19 Prevalence Using Adaptive Neuro-Fuzzy Inference System
* Spatial Moment Pooling Improves Neural Image Assessment
* Spatial Sensitive GRAD-CAM: Visual Explanations for Object Detection by Incorporating Spatial Sensitivity
* Spatial Validation of Spectral Unmixing Results: A Case Study of Venice City
* Spatial-Frequency Domain Information Integration for Pan-Sharpening
* Spatial-Semantic Attention for Grounded Image Captioning
* Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization
* SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection From Multi-view Camera Images With Global Cross-Sensor Attention
* Spatially Consistent Transformer for Colorization in Monochrome-Color Dual-Lens System
* Spatially Continuous Mapping of Forest Canopy Height in Canada by Combining GEDI and ICESat-2 with PALSAR and Sentinel
* Spatially Invariant Unsupervised 3D Object-Centric Learning and Scene Decomposition
* Spatially Non-Stationary Relationships between Changing Environment and Water Yield Services in Watersheds of China's Climate Transition Zones
* Spatio-Temporal Attention Graph for Monocular 3d Human Pose Estimation
* Spatio-Temporal Deformable Attention Network for Video Deblurring
* Spatio-Temporal Parallelization Scheme for HEVC Encoding on Multi-Computer Systems
* Spatio-Temporal Variability of the Impact of Population Mobility on Local Business Sales in Response to COVID-19 in Seoul, Korea
* Spatio-Temporal-Spectral Hierarchical Graph Convolutional Network with Semisupervised Active Learning for Patient-Specific Seizure Prediction
* Spatiotemporal Assessment of Satellite Image Time Series for Land Cover Classification Using Deep Learning Techniques: A Case Study of Reunion Island, France
* Spatiotemporal Change in Livestock Population and Its Correlation with Meteorological Disasters during 2000-2020 across Inner Mongolia
* Spatiotemporal Patterns and Driving Factors of Ecological Vulnerability on the Qinghai-Tibet Plateau Based on the Google Earth Engine
* Spatiotemporal Self-Attention Modeling with Temporal Patch Shift for Action Recognition
* SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement
* Speaker-Adaptive Lip Reading with User-Dependent Padding
* Spectral View of Randomized Smoothing Under Common Corruptions: Benchmarking and Improving Certified Robustness, A
* Spectral-Spatial-Dependent Global Learning Framework for Insufficient and Imbalanced Hyperspectral Image Classification, A
* Spectrum-Aware and Transferable Architecture Search for Hyperspectral Image Restoration
* SphereFed: Hyperspherical Federated Learning
* Spike Transformer: Monocular Depth Estimation for Spiking Camera
* SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks
* Spline Human Motion Recovery
* Sports Video Analysis on Large-Scale Data
* SPot-the-Difference Self-supervised Pre-training for Anomaly Detection and Segmentation
* SpOT: Spatiotemporal Modeling for 3D Object Tracking
* Spotting Temporally Precise, Fine-Grained Events in Video
* SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection
* SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning
* SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds
* SRK-Net: Learning to Detect Repeatable Keypoints with Local Saliency Knowledge
* SRL-SOA: Self-Representation Learning with Sparse 1D-Operational Autoencoder for Hyperspectral Image Band Selection
* SSAS: Spatiotemporal Scale Adaptive Selection for Improving Bias Correction on Precipitation
* SSAT: Self-Supervised Associating Network for Multiobject Tracking
* SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling
* SSIS-Seg: Simulation-Supervised Image Synthesis for Surgical Instrument Segmentation
* SSP-Regularizer: A Star Shape Prior Based Regularizer for Vessel Lumen Segmentation in OCT Images
* ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning
* ST-VTON: Self-supervised vision transformer for image-based virtual try-on
* ST3DNetCrime: Improved ST-3DNet Model for Crime Prediction at Fine Spatial Temporal Scales
* Stable Clustering Ensemble Based on Evidence Theory
* Stacked Topological Preserving Dynamic Brain Networks Representation and Classification
* Stacking Ensemble Learning Method to Classify the Patterns of Complex Road Junctions, A
* Stacking More Linear Operations with Orthogonal Regularization to Learn Better
* StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
* Starvqa: Space-Time Attention for Video Quality Assessment
* Static and Dynamic Concepts for Self-Supervised Video Representation Learning
* Statistical Analysis of Inter Coding in VVC Test Model (VTM)
* STEEX: Steering Counterfactual Explanations with Semantics
* Stereo Depth Estimation with Echoes
* Stochastic Binary-Ternary Quantization for Communication Efficient Federated Computation
* Stochastic Consensus: Enhancing Semi-Supervised Learning with Consistency of Stochastic Classifiers
* StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation
* Streamable Neural Fields
* Streaming Multiscale Deep Equilibrium Models
* Streaming-Capable High-Performance Architecture of Learned Image Compression Codecs
* StretchBEV: Stretching Future Instance Prediction Spatially and Temporally
* Stripformer: Strip Transformer for Fast Image Deblurring
* Strong-Weak Integrated Semi-Supervision for Unsupervised Domain Adaptation
* Structural Causal 3D Reconstruction
* Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation
* Structure and Motion from Casual Videos
* Structure From Motion Pipeline for Orthographic Multi-View Images, A
* Structure-Aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation
* Structure-Preserving Random Noise Attenuation Method for Seismic Data Based on a Flexible Attention CNN
* Structured Dropconnect for Uncertainty Inference in Image Classification
* Study of Deep Learning Networks for Motion Compensation in Cardiac Gated SPECT Images, A
* Study of Shape Modeling Against Noise, A
* Study on the Parameters of Ice Clouds Based on 1.5 mu-m Micropulse Polarization Lidar
* Studying Bias in GANs Through the Lens of Race
* Style Transfer Using Optimal Transport Via Wasserstein Distance
* Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment
* Style-Agnostic Reinforcement Learning
* Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos, A
* Style-Guided Shadow Removal
* Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation
* StyleBabel: Artistic Style Tagging and Captioning
* StyleFace: Towards Identity-Disentangled Face Generation on Megapixels
* StyleGAN-Human: A Data-Centric Odyssey of Human Generation
* StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
* StyleLight: HDR Panorama Generation for Lighting Estimation and Editing
* StyleSwap: Style-Based Generator Empowers Robust Face Swapping
* Sub-Aperture Feature Adaptation in Single Image Super-Resolution Model for Light Field Imaging
* Sub-pixel Optical Satellite Image Registration for Ground Deformation Using Deep Learning
* Subjective and Objective Quality Assessment of High-Motion Sports Videos at Low-Bitrates
* Subjective Assessment Of High Dynamic Range Videos Under Different Ambient Conditions
* Subjective Quality Evaluation of Point Clouds with 3D Stereoscopic Visualization
* Subjective Quality Study for Video Frame Interpolation, A
* Subspace Diffusion Generative Models
* Subspace Modeling for Fast Out-Of-Distribution and Anomaly Detection
* Subspace-PnP: A Geometric Constraint Loss for Mutual Assistance of Depth and Optical Flow Estimation
* SUNS: A User-Friendly Scheme for Seamless and Ubiquitous Navigation Based on an Enhanced Indoor-Outdoor Environmental Awareness Approach
* Super-Resolution 3D Human Shape from a Single Low-Resolution Image
* Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images
* Super-Resolution Magnetic Resonance Imaging using Segmented Signals in Phase-Scrambling Fourier Transform Imaging and Deep Learning
* Super-Resolution Photoacoustic Microscopy via Modified Phase Compounding
* SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud
* Superpixel Group-Correlation Network for Co-Saliency Detection
* SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
* Supervised Attribute Information Removal and Reconstruction for Image Manipulation
* Supervising Remote Sensing Change Detection Models With 3d Surface Semantics
* SUPR: A Sparse Unified Part-Based Human Representation
* Surprisingly Straightforward Scene Text Removal Method with Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis, The
* Surveillance Video Quality Assessment Based on Quality Related Retraining
* SVBR-Net: A Non-Blind Spatially Varying Defocus Blur Removal Network
* SVG Vector Font Generation for Chinese Characters with Transformer
* SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
* SWIS: Self-Supervised Representation Learning for Writer Independent Offline Signature Verification
* Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input
* Switchable CNN-Based Same-Resolution and Super-Resolution In-Loop Restoration for Next Generation Video Codecs
* Switchable Online Knowledge Distillation
* SYGNet: A SVD-YOLO based GhostNet for Real-time Driving Scene Parsing
* Symmetry Regularization and Saturating Nonlinearity for Robust Quantization
* Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
* Synergistic Effect of Atmospheric Boundary Layer and Regional Transport on Aggravating Air Pollution in the Twain-Hu Basin: A Case Study
* Synergistic Retrieval of Temperature and Humidity Profiles from Space-Based and Ground-Based Infrared Sounders Using an Optimal Estimation Method
* Synergistic Self-supervised and Quantization Learning
* Synthesizing Light Field Video from Monocular Video
* Synthetic Aperture Radar Doppler Tomography Reveals Details of Undiscovered High-Resolution Internal Structure of the Great Pyramid of Giza
* System Matrix Based Reconstruction for Pulsed Sequences in Magnetic Particle Imaging
* Tackling Background Distraction in Video Object Segmentation
* Tackling Long-Tailed Category Distribution Under Domain Shifts
* TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation
* TAFIM: Targeted Adversarial Attacks Against Facial Image Manipulations
* Tailoring Self-Supervision for Supervised Learning
* Talisman: Targeted Active Learning for Object Detection with Rare Classes and Slices Using Submodular Mutual Information
* TallFormer: Temporal Action Localization with a Long-Memory Transformer
* TAPE: Task-Agnostic Prior Embedding for Image Restoration
* Target-Absent Human Attention
* Task-Aware Few-Shot Visual Classification with Improved Self-Supervised Metric Learning
* Task-Driven Self-Supervised BI-Channel Networks Learning for Diagnosis of Breast Cancers with Mammography
* TAVA: Template-free Animatable Volumetric Actors
* Taxonomy Driven Learning of Semantic Hierarchy of Classes
* TD-Road: Top-Down Road Network Extraction with Holistic Graph Construction
* TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs
* TDViT: Temporal Dilated Video Transformer for Dense Video Tasks
* Teaching Where to Look: Attention Similarity Knowledge Distillation for Low Resolution Face Recognition
* Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions
* Technique to Navigate Autonomous Underwater Vehicles Using a Virtual Coordinate Reference Network during Inspection of Industrial Subsea Structures, A
* Telepresence Video Quality Assessment
* TEMOS: Generating Diverse Human Motions from Textual Descriptions
* TempFormer: Temporally Consistent Transformer for Video Denoising
* Temporal and Cross-modal Attention for Audio-Visual Zero-Shot Learning
* Temporal Axial Attention For Lidar-Based 3d Object Detection In Autonomous Driving
* Temporal Flow Mask Attention for Open-Set Long-Tailed Recognition of Wild Animals in Camera-Trap Images
* Temporal Lift Pooling for Continuous Sign Language Recognition
* Temporal Saliency Query Network for Efficient Video Recognition
* Temporal-MPI: Enabling Multi-plane Images for Dynamic Scene Modelling via Temporal Basis Learning
* Temporally Coherent Background Model for DIBR View Synthesis, A
* Temporally Consistent Semantic Video Editing
* Temporally Precise Action Spotting in Soccer Videos Using Dense Detection Anchors
* Tensor-Based Deepfake Detection in Scaled and Compressed Images
* TensoRF: Tensorial Radiance Fields
* Terrestrial Laser Scanning in Assessing the Effect of Different Thinning Treatments on the Competition of Scots Pine (Pinus sylvestris L.) Forests
* Text-Based Temporal Localization of Novel Events
* Text2LIVE: Text-Driven Layered Image and Video Editing
* TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
* Texture-Guided End-to-End Depth Map Compression
* Texturify: Generating Textures on 3D Shape Surfaces
* Theoretical Understanding of the Information Flow on Continual Learning Performance
* Thermal to Visible Image Synthesis Under Atmospheric Turbulence
* This Is My Unicorn, Fluffy: Personalizing Frozen Vision-Language Representations
* Three Things Everyone Should Know About Vision Transformers
* TICNet: A Target-Insight Correlation Network for Object Tracking
* TIDEE: Tidying Up Novel Rooms Using Visuo-Semantic Commonsense Priors
* Time-Lagged Ensemble Quantitative Precipitation Forecasts for Three Landfalling Typhoons in the Philippines Using the CReSS Model, Part II: Verification Using Global Precipitation Measurement Retrievals
* Time-rEversed DiffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection
* Time-Space Compression Effect of High-Speed Rail on Tourist Destinations in China
* TinyViT: Fast Pretraining Distillation for Small Vision Transformers
* Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification
* TIPS: Text-Induced Pose Synthesis
* TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation
* TL;DW? Summarizing Instructional Videos with Task Relevance and Cross-Modal Saliency
* TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts
* TO-Scene: A Large-Scale Dataset for Understanding 3D Tabletop Scenes
* TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement
* TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers
* Tomography of Turbulence Strength Based on Scintillation Imaging
* Topological Access Methods for Spatial and Spatiotemporal Data
* Topologically-Consistent Magnitude Pruning for Very Lightweight Graph Convolutional Networks
* Topology-Aware Flow-Based Point Cloud Generation
* Totems: Physical Objects for Verifying Visual Integrity
* Toward Robust Histology-Prior Embedding for Endomicroscopy Image Classification
* Toward Snow Removal Via the Diversity and Complexity of Snow Image
* Toward Stable Co-Saliency Detection and Object Co-Segmentation
* Toward Understanding and Boosting Adversarial Transferability from a Distribution Perspective
* Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
* Towards a Unified View of Unsupervised Non-Local Methods for Image Denoising: The NL-Ridge Approach
* Towards Accurate Active Camera Localization
* Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies
* Towards Accurate Network Quantization with Equivalent Smooth Regularizer
* Towards Accurate Open-Set Recognition via Background-Class Regularization
* Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-Tailed Learning
* Towards Classification of Architectural Styles of Chinese Traditional Settlements Using Deep Learning: A Dataset, a New Framework, and Its Interpretability
* Towards Comprehensive Representation Enhancement in Semantics-Guided Self-supervised Monocular Depth Estimation
* Towards Data-Efficient Detection Transformers
* Towards Effective and Robust Neural Trojan Defenses via Input Filtering
* Towards Efficient Adversarial Training on Vision Transformers
* Towards Efficient and Effective Self-supervised Learning of Visual Representations
* Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoiréing
* Towards Efficient Capsule Networks
* Towards Efficient Variational Auto-Encoder Using Wasserstein Distance
* Towards Forest Condition Assessment: Evaluating Small-Footprint Full-Waveform Airborne Laser Scanning Data for Deriving Forest Structural and Compositional Metrics
* Towards Generalizable DEEPFAKE Face Forgery Detection with Semi-Supervised Learning and Knowledge Distillation
* Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline
* Towards Grand Unification of Object Tracking
* Towards Hard-Positive Query Mining for DETR-Based Human-Object Interaction Detection
* Towards High-Fidelity Single-View Holistic Reconstruction of Indoor Scenes
* Towards Integrated Land Management: The Role of Green Infrastructure
* Towards Interpretable Video Super-Resolution via Alternating Optimization
* Towards Learning Neural Representations from Shadows
* Towards Lightweight Neural Network-based Chroma Intra Prediction for Video Coding
* Towards Metrical Reconstruction of Human Faces
* Towards Model Quantization on the Resilience Against Membership Inference Attacks
* Towards Open Set Video Anomaly Detection
* Towards Open-Vocabulary Scene Graph Generation with Prompt-Based Finetuning
* Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation
* Towards Real-World HDRTV Reconstruction: A Data Synthesis-Based Approach
* Towards Realistic Semi-supervised Learning
* Towards Regression-Free Neural Networks for Diverse Compute Platforms
* Towards Robust Face Recognition with Comprehensive Search
* Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics
* Towards Sequence-Level Training for Visual Tracking
* Towards Ultra Low Latency Spiking Neural Networks for Vision and Sequential Tasks Using Temporal Pruning
* Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
* Towards Zero-Latency Video Transmission Through Frame Extrapolation
* Trace Controlled Text to Image Generation
* Tracking by Associating Clips
* Tracking Every Thing in the Wild
* Tracking Objects as Pixel-Wise Distributions
* Trading Positional Complexity vs Deepness in Coordinate Networks
* Training Strategy for Limited Labeled Data by Learning from Confusion
* Training Vision Transformers with only 2040 Images
* Trajectory-Based Pattern of Life Analysis
* Transfer Learning from Vision Transformers or ConvNets for 360-Degree Images Quality Assessmentƒ
* Transfer Without Forgetting
* Transferable Learning Classification Model and Carbon Sequestration Estimation of Crops in Farmland Ecosystem, A
* TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation
* Transform Skip Inspired End-to-End Compression for Screen Content Image
* Transform Your Smartphone into a DSLR Camera: Learning the ISP in the Wild
* Transformation-Based Adversarial Defense Via Sparse Representation
* Transformed ROIs for capturing visual transformations in videos
* Transformer Based Self-Context Aware Prediction for Few-Shot Anomaly Detection in Videos
* Transformer Compressed Sensing Via Global Image Tokens
* Transformer Visual Tracker Based on Template Features Corresponding to Foreground Region
* Transformer with Implicit Edges for Particle-Based Physics Simulation
* Transformer-Based Approach for Document Layout Understanding
* Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining, A
* Transformers as Meta-learners for Implicit Neural Representations
* Transformers for Workout Video Segmentation
* TransGrasp: Grasp Pose Estimation of a Category of Objects by Transferring Grasps from Only One Labeled Instance
* Translated Skip Connections: Expanding the Receptive Fields of Fully Convolutional Neural Networks
* Translating a Visual LEGO Manual to a Machine-Executable Plan
* Translation of Illustration Artist Style Using Sailormoonredraw Data
* Translation, Scale and Rotation: Cross-Modal Alignment Meets RGB-Infrared Vehicle Detection
* TransMatting: Enhancing Transparent Objects Matting with Transformers
* TransVLAD: Focusing on Locally Aggregated Descriptors for Few-Shot Learning
* TransYOLO: High-Performance Object Detector for Forward Looking Sonar Images
* Trapped in Texture Bias? A Large Scale Comparison of Deep Instance Segmentation
* Tree Species Classification Based on Fusion Images by GF-5 and Sentinel-2A
* Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation
* Trellis-Coded Quantization for End-to-End Learned Image Compression
* TREND: Truncated Generalized Normal Density Estimation of Inception Embeddings for GAN Evaluation
* Trends in User Identity and Continuous Authentication
* Triangle Attack: A Query-Efficient Decision-Based Adversarial Attack
* Triplet Cross-Fusion Learning for Unpaired Image Denoising in Optical Coherence Tomography
* TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments
* Truncated Lottery Ticket for Deep Pruning
* Trust, but Verify: Using Self-supervised Probing to Improve Trustworthiness
* TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
* tSF: Transformer-Based Semantic Filter for Few-Shot Learning
* Tuning Neural ODE Networks to Increase Adversarial Robustness in Image Forensics
* Two Distillation Perspectives Based on Tanimoto Coefficient
* Two-Dimensional InSAR Monitoring of the Co- and Post-Seismic Ground Deformation of the 2021 Mw 5.9 Arkalochori (Greece) Earthquake and Its Impact on the Deformations of the Heraklion City Wall Relic
* Two-Stage Mesh Deep Learning for Automated Tooth Segmentation and Landmark Localization on 3D Intraoral Scans
* Two-Step Color-Polarization Demosaicking Network
* Two-Stream Non-Uniform Concentration Reasoning Network for Single Image Air Pollution Estimation
* U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search
* U-Deepdig: Scalable Deep Decision Boundary Instance Generation
* U-Net for Taiwan Shoreline Detection from SAR Images
* UAV-Mounted GPR for Object Detection Based on Cross-Correlation Background Subtraction Method
* UC-OWOD: Unknown-Classified Open World Object Detection
* UCF-CAP, Video Captioning in the Wild
* UCTNet: Uncertainty-Aware Cross-Modal Transformer Network for Indoor RGB-D Semantic Segmentation
* UFO: Unified Feature Optimization
* UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection
* Ultra-High-Resolution Unpaired Stain Transformation via Kernelized Instance Normalization
* Ultra-Low Bitrate Video Conferencing Using Deep Image Animation
* Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling
* Unbiased Manifold Augmentation for Coarse Class Subdivision
* Unbiased Multi-modality Guidance for Image Inpainting
* Unbiased Validation of the Algorithms for Automatic Needle Localization in Ultrasound-Guided Breast Biopsies
* Unblurring ISAR Imaging for Maneuvering Target Based on UFGAN
* Uncertainty Aware Multitask Pyramid Vision Transformer for UAV-Based Object Re-Identification
* Uncertainty Guided Multi-View Stereo Network for Depth Estimation
* Uncertainty Inspired Underwater Image Enhancement
* Uncertainty Learning in Kernel Estimation for Multi-stage Blind Image Super-Resolution
* Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression
* Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction
* Uncertainty-Based Spatial-Temporal Attention for Online Action Detection
* Uncertainty-DTW for Time Series and Sequences
* Uncertainty-Guided Source-Free Domain Adaptation
* Undersampled Dynamic Fourier Ptychography via Phaseless PCA
* Understanding Collapse in Non-contrastive Siamese Representation Learning
* Understanding the Dynamics of DNNs Using Graph Modularity
* Understanding Water Level Changes in the Great Lakes by an ICA-Based Merging of Multi-Mission Altimetry Measurements
* Unfolded Deep Kernel Estimation for Blind Image Super-Resolution
* UniCR: Universally Approximated Certified Robustness via Randomized Smoothing
* Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-Ahead Forward Ones
* UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation
* Unified Framework for Domain Adaptive Pose Estimation, A
* Unified Framework for Masked and Mask-Free Face Recognition Via Feature Rectification, A
* Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation
* Unified Implicit Neural Stylization
* Unifying Event Detection and Captioning as Sequence Generation via Pre-training
* Unifying Framework for Human-Agent Collaborative Systems: Part II: Design Procedure and Application, A
* Unifying Visual Contrastive Learning for Object Recognition from a Graph Perspective
* Unifying Visual Perception by Dispersible Points Learning
* UniMiSS: Universal Medical Self-supervised Learning via Breaking Dimensionality Barrier
* UniNet: Unified Architecture Search with Convolution, Transformer, and MLP
* Union-Set Multi-source Model Adaptation for Semantic Segmentation
* UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling
* Unitail: Detecting, Reading, and Matching in Retail Scene
* United Defocus Blur Detection and Deblurring via Adversarial Promoting Learning
* Unknown-Oriented Learning for Open Set Domain Adaptation
* Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
* Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning
* Unpaired Image Translation via Vector Symbolic Architectures
* UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture
* Unrolling Graph Total Variation for Light Field Image Denoising
* Unstructured Feature Decoupling for Vehicle Re-identification
* Unsupervised and Adaptive Perimeter Intrusion Detector
* Unsupervised and Semi-supervised Bias Benchmarking in Face Recognition
* Unsupervised Anomaly Detection with Self-Training and Knowledge Distillation
* Unsupervised Change Detection in Multitemporal VHR Images Based on Deep Kernel PCA Convolutional Mapping Network
* Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote Sensing, An
* Unsupervised Deep Event Stereo for Depth Estimation
* Unsupervised Deep Multi-Shape Matching
* Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-training
* Unsupervised Domain Adaptation for One-Stage Object Detector Using Offsets to Bounding Box
* Unsupervised Domain Adaptation Person Re-Identification by Camera-Aware Style Decoupling and Uncertainty Modeling
* Unsupervised Domain-Adaptive Person Re-Identification with Multi-Camera Constraints
* Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space
* Unsupervised Generative Network for Blind Hyperspectral Image Super-Resolution
* Unsupervised Generative Variational Continual Learning
* Unsupervised High-Fidelity Facial Texture Generation and Reconstruction
* Unsupervised Image Fusion Using Deep Image Priors
* Unsupervised Learning of 3D Semantic Keypoints with Mutual Reconstruction
* Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations
* Unsupervised Multi-Task Learning for 3D Subtomogram Image Alignment, Clustering and Segmentation
* Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression
* Unsupervised Outlier Detection Using Memory and Contrastive Learning
* Unsupervised Parameter-Free Nuclei Segmentation Method for Histology Images, An
* Unsupervised Person Re-Identification via Multi-Label Classification
* Unsupervised Point Cloud Pre-Training Via Contrasting and Clustering
* Unsupervised Pose-aware Part Decomposition for Man-Made Articulated Objects
* Unsupervised Segmentation in Real-World Images via Spelke Object Inference
* Unsupervised Selective Labeling for More Effective Semi-supervised Learning
* Unsupervised Video Segmentation Algorithms Based On Flexibly Regularized Mixture Models
* Unsupervised Visual Representation Learning by Synchronous Momentum Grouping
* Unsustainable Anthropogenic Activities: A Paired Watershed Approach of Lake Urmia (Iran) and Lake Van (Turkey)
* UPHDR-GAN: Generative Adversarial Network for High Dynamic Range Imaging With Unpaired Data
* Urbanization Intensifies the Mismatch between the Supply and Demand of Regional Ecosystem Services: A Large-Scale Case of the Yangtze River Economic Belt in China
* UrbanLF: A Comprehensive Light Field Dataset for Semantic Segmentation of Urban Scenes
* Usage of Vehicle Re-Identification Models for Improved Persistent Multiple Object Tracking in Wide Area Motion Imagery
* Using Convolutional Neural Networks for Cloud Detection on VENnuS Images over Multiple Land-Cover Types
* Using Deep Learning to Improve Detection and Decoding Of Barcodes
* Using Flickr Data to Understand Image of Urban Public Spaces with a Deep Learning Model: A Case Study of the Haihe River in Tianjin
* Using Machine Learning to Extract Building Inventory Information Based on LiDAR Data
* Using Remote Sensing Methods to Study Active Geomorphologic Processes on Cantabrian Coastal Cliffs
* Using Tracking Data to Identify Gaps in Knowledge and Conservation of the Critically Endangered Siberian Crane (Leucogeranus leucogeranus)
* Using Vision Transformers in 3-D Medical Image Classifications
* Utility and Feasibility of a Center Surround Event Camera
* Utilizing Excess Resources in Training Neural Networks
* V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer
* Variable-Scale Visualization of High-Density Polygonal Buildings on a Tile Map
* Variance-Aware Weight Initialization for Point Convolutional Neural Networks
* Variance-Reduced Randomized Kaczmarz Algorithm In Xfel Single-Particle Imaging Phase Retrieval
* Variational Depth Estimation on Hypersphere for Panorama
* Variational Hyperparameter Inference for Few-Shot Learning Across Domains
* VCT-NET: An Octa Retinal Vessel Segmentation Network Based on Convolution and Transformer
* VecGAN: Image-to-Image Translation with Interpretable Latent Directions
* Vector Quantized Image-to-Image Translation
* Vector-Based Efficient Data Hiding in Encrypted Images via Multi-MSB Replacement
* Vectorizing Images of Any Size
* VEFNet: an Event-RGB Cross Modality Fusion Network for Visual Place Recognition
* Vegetation Coverage in the Desert Area of the Junggar Basin of Xinjiang, China, Based on Unmanned Aerial Vehicle Technology and Multisource Data
* Vegetation Productivity and Precipitation Use Efficiency across the Yellow River Basin: Spatial Patterns and Controls
* Ventriloquist-Net: Leveraging Speech Cues for Emotive Talking Head Generation
* Vessel Segmentation and Dirt/Reflection Detection For Retinal Fundus Photographs
* VG-GAN: Conditional GAN Framework for Graphical Design Generation
* Vibration-Based Uncertainty Estimation for Learning from Limited Supervision
* Video Activity Localisation with Uncertainties in Temporal Boundary
* Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles
* Video Dialog as Conversation About Objects Living in Space-Time
* Video Extrapolation in Space and Time
* Video Graph Transformer for Video Question Answering
* Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer
* Video Interpolation by Event-Driven Anisotropic Adjustment of Optical Flow
* Video Mask Transfiner for High-Quality Video Instance Segmentation
* Video Person Re-Identification Using Attribute-Enhanced Features
* Video Question Answering with Iterative Video-Text Co-tokenization
* Video Restoration Framework and Its Meta-adaptations to Data-Poor Conditions
* Video Signal-Dependent Noise Estimation via Inter-Frame Prediction
* Video-Analytics Task-Aware Quad-Tree Partitioning and Quantization for HEVC
* Video-Grounded Dialogues with Joint Video and Image Training
* View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums
* ViewFormer: NeRF-Free Neural Rendering from Few Images Using Transformers
* Viewport-Oriented Panoramic Image Inpainting
* ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers
* VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data
* VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection
* Visual Cross-View Metric Localization with Dense Uncertainty Estimates
* Visual Knowledge Tracing
* Visual Navigation Perspective for Category-Level Object Pose Estimation, A
* Visual Prompt Tuning
* Visual Sentiment Prediction Using Cross-Way Few-Shot Learning Based on Knowledge Distillation
* Visual Servoing of Flexible-Link Manipulators by Considering Vibration Suppression Without Deformation Measurements
* Visual Sound Source Separation with Partial Supervision Learning
* Visual Tempo Contrastive Learning for Few-Shot Action Recognition
* Visual-Tactile Fused Graph Learning for Object Clustering
* ViTAS: Vision Transformer Architecture Search
* Vitranspad: Video Transformer Using Convolution And Self-Attention For Face Presentation Attack Detection
* VizWiz-FewShot: Locating Objects in Images Taken by People with Visual Impairments
* VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition
* VLCAP: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
* Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting
* VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer
* VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering
* VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder
* VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance
* VSA: Learning Varied-Size Window Attention in Vision Transformers
* VTC: Improving Video-Text Retrieval with User Comments
* W2N: Switching from Weak Supervision to Noisy Supervision for Object Detection
* Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal
* Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
* WaveGAN: Frequency-Aware GAN for High-Fidelity Few-Shot Image Generation
* Waymo Open Dataset: Panoramic Video Panoptic Segmentation
* Weakened Impacts of the East Asia-Pacific Teleconnection on the Interannual Variability of Summertime Precipitation over South China since the Mid-2000s
* Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination
* Weakly Supervised Grounding for VQA in Vision-Language Transformers
* Weakly Supervised Object Localization Through Inter-class Feature Similarity and Intra-class Appearance Consistency
* Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration
* Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation
* Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions
* Weather-degraded image semantic segmentation with multi-task knowledge distillation
* Webly Supervised Concept Expansion for General Purpose Vision Models
* Weekly Small Uncrewed Aerial System Surveys, Structure from Motion, and Empirical Orthogonal Function Analyses Reveal Unique Modes of Sediment Exchange Generated by Seasonal and Episodic Phenomena: Waikiki, Hawaii
* Weight Fixing Networks
* Weighted Supervised Contrastive Learning and Domain Mixture for Generalized Person Re-Identification
* WeLSA: Learning to Predict 6D Pose from Weakly Labeled Data Using Shape Alignment
* Wetlands Classification Using Quad-Polarimetric Synthetic Aperture Radar through Convolutional Neural Networks Based on Polarimetric Features
* What if Image Self-Similarity can be Better Exploited in Data Fidelity Terms?
* What Matters for 3D Scene Flow Network
* What to Hide from Your Students: Attention-Guided Masked Image Modeling
* When Active Learning Meets Implicit Semantic Data Augmentation
* When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition
* When Deep Classifiers Agree: Analyzing Correlations Between Learning Order and Image Statistics
* When is the Cleaning of Subjective Data Relevant to Train UGC Video Quality Metrics?
* Where in the World Is This Image? Transformer-Based Geo-localization in the Wild
* Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification
* Which Metrics for Network Pruning: Final Accuracy? Or Accuracy Drop?
* Width-Wise Parameter Sharing for Multi-Domain GAN Learning
* WiFi-Based Spatiotemporal Human Action Perception
* WISE: Whitebox Image Stylization by Example-Based Learning
* Word-Level Fine-Grained Story Visualization
* Worst Case Matters for Few-Shot Recognition
* WTM: The Site-Wise Empirical Wuhan University Tropospheric Model
* X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks
* X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
* XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
* YOLO-SD: Small Ship Detection in SAR Images by Multi-Scale Convolution and Feature Transformer Module
* Yolo-SG: Salience-Guided Detection Of Small Objects In Medical Images
* You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding
* You Should Look at All Objects
* Zero-Shot Attribute Attacks on Fine-Grained Recognition Models
* Zero-Shot Category-Level Object Pose Estimation
* Zero-Shot Learning for Reflection Removal of Single 360-Degree Image
* Zero-Shot Temporal Action Detection via Vision-Language Prompting
2999 for 2211

Index for "2"


Last update: 6-May-24 16:27:55
Use price@usc.edu for comments.