| _ | mask | _ |
| 30 m Resolution Surface Water | mask | Including Estimation of Positional and Thematic Differences Using Landsat 8, SRTM and OpenStreetMap: A Case Study in the Murray-Darling Basin, Australia, A |
| 3D Face | mask | Anti-spoofing via Deep Fusion of Dynamic Texture and Shape Clues |
| 3D face | mask | presentation attack detection based on intrinsic image analysis |
| 3D Face Recognition Using an Expression Insensitive Dynamic | mask | |
| 3D Facial Geometric Attributes Based Anti-Spoofing Approach against | mask | Attacks |
| 3D High-Fidelity | mask | Face Presentation Attack Detection Challenge |
| 3D human body modeling with orthogonal human | mask | image based on multi-channel Swin transformer architecture |
| 3D | mask | Face Anti-Spoofing Database with Real World Variations, A |
| 3D | mask | Face Anti-spoofing with Remote Photoplethysmography |
| 3D | mask | presentation attack detection via high resolution face parts |
| 3d Model-Based Approach for Fitting | mask | s To Faces In the Wild, A |
| 4DPM: Deepfake Detection With a Denoising Diffusion Probabilistic | mask | |
| Abandoned Objects Detection Using Double Illumination Invariant Foreground | mask | s |
| Accurate fingertip detection from binocular | mask | images |
| Accurate Object Localization with Shape | mask | s |
| Accurate Object Recognition with Shape | mask | s |
| Action Detection in Crowded Videos Using | mask | s |
| Active | mask | Hierarchies for Object Detection |
| Active | mask | Segmentation of Fluorescence Microscope Images |
| Adaptive active- | mask | image segmentation for quantitative characterization of mitochondrial morphology |
| Adaptive | mask | for Region-based Facial Micro-Expression Recognition |
| Adaptive | mask | -based Pyramid Network for Realistic Bokeh Rendering |
| Adding New Tasks to a Single Network with Weight Transformations Using Binary | mask | s |
| Adversarial | mask | Generation for Preserving Visual Privacy |
| AFNet-M: Adaptive Fusion Network with | mask | s for 2D+3D Facial Expression Recognition |
| AMGSN: Adaptive | mask | -guide supervised network for debiased facial expression recognition |
| Amodal Instance Segmentation of Thin Objects with Large Overlaps by Seed-to- | mask | Extending |
| Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D | mask | Tracking |
| Appearance Matching of Occluded Objects Using Coarse-to-Fine Adaptive | mask | s |
| Appearance Radii in Medial Axis Test | mask | for Small Planar Chamfer Norms |
| Applying Deep Learning to Clear-Sky Radiance Simulation for VIIRS with Community Radiative Transfer Model: Part 1: Develop AI-Based Clear-Sky | mask | |
| Arbitrarily Shaped Scene Text Detection With a | mask | Tightness Text Detector |
| Assessment of Cloud Cover Characteristics Over Calibration Test Sites Using Modis Cloud | mask | Products |
| Assessment of More Suitable Image Spatial Resolutions for Offshore Aquaculture Areas Automatic Monitoring Based on Coupled NDWI and | mask | R-CNN, The |
| Assessment of the GOES-16 Clear Sky | mask | Product over the Contiguous USA Using CALIPSO Retrievals |
| Assessment of the Reprocessed Suomi NPP VIIRS Enterprise Cloud | mask | Product |
| ASTER Cloud Coverage Assessment and Mission Operations Analysis Using Terra/MODIS Cloud | mask | Products |
| Asymmetric | mask | Scheme for Self-Supervised Real Image Denoising |
| Attentive | mask | CLIP |
| Attributes based skin lesion detection and recognition: A | mask | RCNN and transfer learning-based deep learning framework |
| Augmented Self- | mask | Attention Transformer for Naturalistic Driving Action Recognition |
| Automated SAR Image Thresholds for Water | mask | Production in Alberta's Boreal Region |
| Automated Stereo Retrieval of Smoke Plume Injection Heights and Retrieval of Smoke Plume | mask | s From AATSR and Their Assessment With CALIPSO and MISR |
| Automated tree-crown and height detection in a young forest plantation using | mask | region-based convolutional neural network (Mask R-CNN) |
| Automated tree-crown and height detection in a young forest plantation using | mask | region-based convolutional neural network (Mask R-CNN) |
| Automatic calculation of chamfer | mask | coefficients for large masks and anisotropic images pages. |
| Automatic calculation of chamfer | mask | coefficients for large masks and anisotropic images pages. |
| Automatic Calibration between Multi-Lines LiDAR and Visible Light Camera Based on Edge Refinement and Virtual | mask | Matching |
| Automatic detection of multiple sclerosis lesions using | mask | R-CNN on magnetic resonance scans |
| Automatic Diagnosis of Glaucoma on Color Fundus Images Using Adaptive | mask | Deep Network |
| Automatic Evaluation of Wheat Resistance to Fusarium Head Blight Using Dual | mask | -RCNN Deep Learning Frameworks in Computer Vision |
| Automatic Fitting of a Deformable Face | mask | Using a Single Image |
| Automatic Identification and Dynamic Monitoring of Open-Pit Mines Based on Improved | mask | R-CNN and Transfer Learning |
| Automatic Inspection System for Printed Wiring Board | mask | s, An |
| Automatic | mask | Extraction for PIV-Based Dam-Break Analysis |
| Automatic Mura Detection for Display Film Using | mask | Filtering in Wavelet Transform |
| Automatic semantic style transfer using deep convolutional neural networks and soft | mask | s |
| Automatic Sheep Behaviour Analysis Using | mask | R-CNN |
| Automatical Adaptation of Anatomical | mask | s to the Neocortex |
| Autonomous Detection of Disruptions in the Intensive Care Unit Using Deep | mask | R-CNN |
| BA-SAM: Scalable Bias-Mode Attention | mask | for Segment Anything Model |
| Background-Tolerant Object Classification With Embedded Segmentation | mask | For Infrared and Color Imagery |
| Benchmarking Segmentation Models with | mask | -Preserved Attribute Editing |
| Beyond Boxes: | mask | -Guided Spatio-Temporal Feature Aggregation for Video Object Detection |
| Beyond | mask | : Rethinking guidance types in few-shot segmentation |
| bi-directional fractional-order derivative | mask | for image processing applications, A |
| Bi-Modal Progressive | mask | Attention for Fine-Grained Recognition |
| Bi-Polar | mask | for Joint Cell and Nuclei Instance Segmentation |
| Bidirectional | mask | Selection for Zero-Shot Referring Image Segmentation |
| Blind Adaptive | mask | to Improve Intelligibility of Non-Stationary Noisy Speech |
| Boosting Adversarial Transferability With Learnable Patch-Wise | mask | s |
| Boosting binary | mask | s for multi-domain learning through affine transformations |
| Boosting Robust Multi-Focus Image Fusion With Frequency | mask | and Hyperdimensional Computing |
| BooW-VTON: Boosting In-the-Wild Virtual Try-On via | mask | -Free Pseudo Data Training |
| Boundary-preserving | mask | R-CNN |
| BshapeNet: Object detection and instance segmentation with bounding shape | mask | s |
| Building Extraction from High Resolution Remote Sensing Images Based on Improved | mask | R-CNN |
| Building Extraction from Satellite Images Using | mask | R-CNN with Building Boundary Regularization |
| Building Segmentation From Airborne VHR Images Using | mask | R-CNN |
| CACM-Net: Daytime Cloud | mask | for AGRI Onboard the FY-4A Satellite |
| Calculation Method of Surface Representation Using B-Spline | mask | |
| CAV-MAE Sync: Improving Contrastive Audio-Visual | mask | Autoencoders via Fine-Grained Alignment |
| Cell Detection and Segmentation in Microscopy Images with Improved | mask | R-CNN |
| Chamfer | mask | s: discrete distance functions, geometrical properties and optimization |
| Chinese Character Component Segmentation Based on Character Structure | mask | s |
| Class of Three-Dimensional Recursive Parallelpiped | mask | s, A |
| Class-Aware | mask | -guided feature refinement for scene text recognition |
| Classification and Segmentation of Rotated and Scaled Textured Images Using Texture Tuned | mask | s |
| Classification of forms with similar layouts based on Mixed Gaussian Weighted | mask | |
| Classification of Natural Textures by Means of Two-Dimensional Orthogonal | mask | s |
| Classifying, Segmenting, and Tracking Object Instances in Video with | mask | Propagation |
| CLME: Robust Screen-Shooting Watermarking With Contrastive Learning and | mask | -Guided Embedding |
| Cloud | mask | Detection by Combining Active and Passive Remote Sensing Data |
| cloud | mask | methodology for high resolution remote sensing data combining information from high and medium resolution optical sensors, A |
| CM-Net: Concentric | mask | Based Arbitrary-Shaped Text Detection |
| CMT-DeepLab: Clustering | mask | Transformers for Panoptic Segmentation |
| CNN Patch Pooling for Detecting 3D | mask | Presentation Attacks in NIR |
| Co-Saliency Detection via | mask | -Guided Fully Convolutional Networks With Multi-Scale Label Smoothing |
| Coarse | mask | Guided Interactive Object Segmentation |
| Coarse-to-Fine Adaptive | mask | s for Appearance Matching of Occluded Scenes |
| Coconut trees detection and segmentation in aerial imagery using | mask | region-based convolution neural network |
| CodedStereo: Learned Phase | mask | s for Large Depth-of-field Stereo |
| COFNet: Contrastive Object-Aware Fusion Using Box-Level | mask | s for Multispectral Object Detection |
| Co | mask | : Corresponding Mask-Based End-to-End Extrinsic Calibration of the Camera and LiDAR |
| Combining Cylindrical Voxel and | mask | R-CNN for Automatic Detection of Water Leakages in Shield Tunnel Point Clouds |
| Commentary Paper 1 on A Localized Approach to Abandoned Luggage Detection with Foreground- | mask | Sampling |
| Commentary Paper 2 on A Localized Approach to Abandoned Luggage Detection with Foreground- | mask | Sampling |
| Comments on Fast Convolution with Laplacian-of-Gaussian | mask | s |
| Comparison of Aqua/Terra MODIS and Himawari-8 Satellite Data on Cloud | mask | and Cloud Type Classification Using Split Window Algorithm |
| Comparison of Classical Methods and | mask | R-CNN for Automatic Tree Detection and Mapping Using UAV Imagery |
| Comparison of Cloud- | mask | Algorithms and Machine-Learning Methods Using Sentinel-2 Imagery for Mapping Paddy Rice in Jianghan Plain |
| Comparison of depth-of-focus-enhancing pupil | mask | s based on a signal-to-noise-ratio criterion after deconvolution |
| Comparison of MODIS/VIIRS Cloud | mask | s over Ice-Bearing River: On Achieving Consistent Cloud Masking and Improved River Ice Mapping, A |
| Complex Organ | mask | Guided Radiology Report Generation |
| Complexity Reduced Face Detection Using Probability-Based Face | mask | Prefiltering and Pixel-Based Hierarchical-Feature Adaboosting |
| ComplexMix: Semi-Supervised Semantic Segmentation Via | mask | -Based Data Augmentation |
| Component-Wise Edge Detection by Laplacian Operator | mask | s |
| Compressive Prior Guided | mask | Predictive Coding Approach for Video Analysis, A |
| Computing Multi-Colored Polygonal | mask | s in Pipeline Architectures and Its Application to Automated Visual Inspection |
| Concept | mask | : Large-Scale Segmentation from Semantic Concepts |
| Conditioning diffusion models via attributes and semantic | mask | s for face generation |
| Constrained Probabilistic | mask | Learning for Task-specific Undersampled MRI Reconstruction |
| Continual Diffusion with STAMINA: STack-And- | mask | INcremental Adapters |
| Continuity MODIS-VIIRS Cloud | mask | , The |
| Contrastive 3D Human Skeleton Action Representation Learning via CrossMoCo With Spatiotemporal Occlusion | mask | Data Augmentation |
| Convolution with Separable | mask | s for Early Image Processing |
| Convolutional Attribute | mask | with Two-step Attention for Fashion Image Retrieval |
| Convolutional Neural Network with Learnable | mask | s for EIT Based Tactile Sensing |
| Correlation Tracking via | mask | and Multi-peaks Re-prediction |
| Countermeasure for the protection of face recognition systems against | mask | attacks |
| COutfitGAN: Learning to Synthesize Compatible Outfits Supervised by Silhouette | mask | s and Fashion Styles |
| Crocos-V1: Enhancing | mask | Leakage and Bounding Box Localization for Real-Time Crop/Weed Instance Segmentation* |
| Crop Yield Estimation Using Time-Series MODIS Data and the Effects of Cropland | mask | s in Ontario, Canada |
| CSF: Closed- | mask | -guided semantic fusion method for semantic perception of unknown scenes |
| Curvature Estimation for Discrete Curves Based on Auto-adaptive | mask | s of Convolution |
| Damaged Building Extraction Using Modified | mask | R-CNN Model Using Post-Event Aerial Images of the 2016 Kumamoto Earthquake |
| Data-Driven Probabilistic Occlusion | mask | to Promote Visual Tracking |
| DCT- | mask | : Discrete Cosine Transform Mask Representation for Instance Segmentation |
| DCT- | mask | : Discrete Cosine Transform Mask Representation for Instance Segmentation |
| De-noising | mask | transformer for referring image segmentation |
| Decision Boundaries Using Bayes Factors: The Case of Cloud | mask | s |
| Deep 3D | mask | Volume for View Synthesis of Dynamic Scenes |
| Deep Free-Form Deformation Network for Object- | mask | Registration |
| Deep leaf: | mask | R-CNN based leaf detection and segmentation from digitized herbarium specimen images |
| Deep Learning Based | mask | Detection In Smart Home Entries During The Epidemic Process |
| Deep Learning Trained Clear-Sky | mask | Algorithm for VIIRS Radiometric Bias Assessment, A |
| Deep learning-based apple detection using a suppression | mask | R-CNN |
| Deep learning-based approach to latent overlapped fingerprints | mask | segmentation |
| Deepfake detection with domain generalization and | mask | -guided supervision |
| DeepLIR: Attention-Based Approach for | mask | -Based Lensless Image Reconstruction |
| DeepWindows: Windows Instance Segmentation through an Improved | mask | R-CNN Using Spatial Attention and Relation Modules |
| Delineation and Grading of Actual Crop Production Units in Modern Smallholder Areas Using RS Data and | mask | R-CNN, The |
| Delving Deeper Into | mask | Utilization in Video Object Segmentation |
| Demosaicing of Color Filter Array Captured Images Using Gradient Edge Detection | mask | s and Adaptive Heterogeneity-Projection |
| Denoising convolutional neural network with | mask | for salt and pepper noise |
| Dense Cross-Query-and-Support Attention Weighted | mask | Aggregation for Few-Shot Segmentation |
| Density-Based Flow | mask | Integration via Deformable Convolution for Video People Flux Estimation |
| Depth from Defocus on a Transmissive Diffraction | mask | -based Sensor |
| Design2Cloth: 3D Cloth Generation from 2D | mask | s |
| Designing a Validation Protocol for Remote Sensing Based Operational Forest | mask | s Applications. Comparison of Products Across Europe |
| Designing Phase | mask | s for Under-Display Cameras |
| Detecting Arbitrary Keypoints on Limbs and Skis with Sparse Partly Correct Segmentation | mask | s |
| Detecting Presentation Attacks from 3D Face | mask | s Under Multispectral Imaging |
| Detection and classification of opened and closed flowers in grape inflorescences using | mask | R-CNN |
| Detection of Face | mask | Wearing Conditions with Lightweight CNN Models on Raspberry Pi 4 and Jetson Nano |
| Detection of Intensity Changes with Subpixel Accuracy Using Laplacian-Gaussian | mask | s |
| Development of a novel simplification | mask | for multi-shot optical scanners |
| Devil is in the Points: Weakly Semi-Supervised Instance Segmentation via Point-Guided | mask | Representation, The |
| Devil is in the Queries: Advancing | mask | Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization |
| Differentiable | mask | for Pruning Convolutional and Recurrent Networks |
| Digital Color Halftoning with Generalized Error Diffusion and Multichannel Green-Noise | mask | s |
| Digital halftoning by means of green-noise | mask | s |
| Direct Differential Range Estimation Using Optical | mask | s |
| Directional | mask | s, Gaussian Masks, Canny etc. |
| Directional | mask | s, Gaussian Masks, Canny etc. |
| DMM-Net: Differentiable | mask | -Matching Network for Video Object Segmentation |
| Do Not | mask | What You Do Not Need to Mask: A Parser-free Virtual Try-on |
| Do Not | mask | What You Do Not Need to Mask: A Parser-free Virtual Try-on |
| Dual image and | mask | synthesis with GANs for semantic segmentation in optical coherence tomography |
| Dual-stream Framework for 3D | mask | Face Presentation Attack Detection, A |
| DWW: Robust Deep Wavelet-Domain Watermarking With Enhanced Frequency | mask | |
| Dyna | mask | : Dynamic Mask Selection for Instance Segmentation |
| Dynamic River | mask | s from Multi-Temporal Satellite Imagery: An Automatic Algorithm Using Graph Cuts Optimization |
| Edge Detection by Compass Gradient | mask | s |
| Edge Detection in Correlated Noise Using Latin Square | mask | s |
| Edge-and- | mask | Integration-Driven Diffusion Models for Medical Image Segmentation |
| Edge | mask | Former: Adapting Mask Transformer for Semantic Edge Detection |
| Effect of Cloud | mask | on the Consistency of Snow Cover Products from MODIS and VIIRS |
| Effect of Forest | mask | Quality in the Wall-to-Wall Estimation of Growing Stock Volume, The |
| Effect of Wearing a Face | mask | on Face Image Quality, The |
| Effectiveness of Detection-based and Regression-based Approaches for Estimating | mask | -Wearing Ratio |
| Efficient Image Fusion Network Exploiting Unifying Language and | mask | Guidance, An |
| Efficient | mask | Correction for Click-Based Interactive Image Segmentation |
| Efficient Neural Generation of 4k | mask | s for Homogeneous Diffusion Inpainting |
| Efficient scan | mask | techniques for connected components labeling algorithm |
| Efficient Space-time Video Super Resolution using Low-Resolution Flow and | mask | Upsampling |
| Efficient Ungrouped | mask | Method With two Learnable Parameters for 3D Object Detection, An |
| EFM-Net: Feature Extraction and Filtration with | mask | Improvement Network for Object Detection in Remote Sensing Images |
| emperor's new | mask | s: On demographic differences and disguises, The |
| enhanced approach for few-shot segmentation via smooth downsampling | mask | and label smoothing loss, An |
| Enhanced blind face inpainting via structured | mask | prediction |
| Enhanced geometrical superresolved imaging with moving binary random | mask | |
| Enhancing a Simple MODIS Cloud | mask | Algorithm for the Landsat Data Continuity Mission |
| Enhancing perceptual quality of watermarked high-definition video through composite | mask | |
| Evaluation of Cloud | mask | and Cloud Top Height from Fengyun-4A with MODIS Cloud Retrievals over the Tibetan Plateau |
| Event Probability | mask | (EPM) and Event Denoising Convolutional Neural Network (EDnCNN) for Neuromorphic Cameras |
| Extended evaluation of the effect of real and simulated | mask | s on face recognition performance |
| extended-shadow-code based approach for off-line signature verification. I. Evaluation of the bar | mask | definition, An |
| External | mask | Based Depth and Light Field Camera |
| Extracting foreground | mask | s towards object recognition |
| Extrapolating Satellite-Based Flood | mask | s by One-Class Classification: A Test Case in Houston |
| EyeGAN: Gaze-Preserving, | mask | -Mediated Eye Image Synthesis |
| Face | mask | Aware Robust Facial Expression Recognition During the Covid-19 Pandemic |
| Face | mask | detection using deep convolutional neural network and multi-stage image processing |
| Face | mask | Extraction in Video Sequence |
| Face Presentation Attack with Latex | mask | s in Multispectral Videos |
| Face Recognition Systems, Occlusions, | mask | s |
| Face- | mask | recognition for fraud prevention using Gaussian mixture model |
| Face- | mask | -aware Facial Expression Recognition based on Face Parsing and Vision Transformer |
| Facial biometry by stimulating salient singularity | mask | s |
| Facial expression transformation for anime-style image based on decoder control and attention | mask | |
| Facial | mask | Completion Using StyleGAN2 Preserving Features of the Person |
| Facial | mask | s and soft-biometrics: Leveraging face recognition CNNs for age and gender prediction on mobile ocular images |
| Facial Skin Beautification Using Adaptive Region-Aware | mask | s |
| Farey Sequences and the Planar Euclidean Medial Axis Test | mask | |
| Fast algorithm of phase | mask | s for image encryption in the Fresnel domain |
| Fast Convolution Method and Its Application in | mask | Optimization for Intensity Calculation Using Basis Expansion |
| Fast Convolution with Laplacian-of-Gaussian | mask | s |
| Fast Implementations of the Levelset Segmentation Method With Bias Field Correction in MR Images: Full Domain and | mask | -Based Versions |
| Fast Seismic Landslide Detection Based on Improved | mask | R-CNN |
| Fast Video Object Segmentation by Reference-Guided | mask | Propagation |
| Faster training of | mask | R-CNN by focusing on instance boundaries |
| FCOS- | mask | : Fully Convolution Neural Network for Face Mask Detection |
| FCOS- | mask | : Fully Convolution Neural Network for Face Mask Detection |
| Feature classification criterion for missing features | mask | estimation in robust speaker recognition |
| Feature | mask | network for person re-identification |
| Figure-ground segmentation by transferring window | mask | s |
| Filter-Wise | mask | Pruning and FPGA Acceleration for Object Classification and Detection |
| Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised | mask | Prediction |
| FlatCam: Replacing Lenses with | mask | s and Computation |
| Floating | mask | Method for Extracting Hand-Printed Character Features |
| Flow-based frame interpolation networks combined with occlusion-aware | mask | estimation |
| FMD-Yolo: An efficient face | mask | detection method for COVID-19 prevention and control in public |
| Focusing Properties of Annular Cylindrical Vector Beam Induce by Tunable | mask | |
| Foreground | mask | Network for Cell Counting, A |
| Fourier Interpretation of the Frei-Chen Edge | mask | s, A |
| FourierNet: Compact | mask | Representation for Instance Segmentation Using Differentiable Shape Decoders |
| Fractional Differential | mask | : A Fractional Differential-Based Approach for Multiscale Texture Enhancement |
| Franken | mask | : Manipulating semantic masks with transformers for face parts editing |
| Free-atm: Harnessing Free Attention | mask | s for Representation Learning on Diffusion-generated Images |
| Frequential and color analysis for hair | mask | segmentation |
| FSSDD: Few-shot steel defect detection based on multi-scale semantic enhancement representation and | mask | category information mapping |
| Fusion Target Attention | mask | Generation Network For Video Segmentation |
| Fusion Transformer with Object | mask | Guidance for Image Forgery Analysis |
| Gender Stereotypes in Interaction Design. Render Me: Augmented Reality | mask | s to Inhabit the Metaverse |
| General Framework to Generate Sizing Systems from 3D Motion Data Applied to Face | mask | Design, A |
| Generating | mask | s from Boxes by Mining Spatio-Temporal Consistencies in Videos |
| Generative Semantic Manipulation with | mask | -Contrasting GAN |
| GLAMD: Global and Local Attention | mask | Distillation for Object Detectors |
| Global Channel Pruning With Self-Supervised | mask | Learning |
| Global Land High-Resolution Cloud Climatology Based on an Improved MOD09 Cloud | mask | |
| GMT: Guided | mask | Transformer for Leaf Instance Segmentation |
| GNSS Multipath and Jamming Mitigation Using High- | mask | -Angle Antennas and Multiple Constellations |
| Gradient-Based Source and | mask | Optimization in Optical Lithography |
| GRS-Det: An Anchor-Free Rotation Ship Detector Based on Gaussian- | mask | in Remote Sensing Images |
| GSAM+Cutie: Text-Promptable Tool | mask | Annotation for Endoscopic Video |
| Gtms: A Gradient-driven Tree-guided | mask | -free Referring Image Segmentation Method |
| H.264 Intra Mode Decision for Reducing Complexity Using Directional | mask | s and Neighboring Modes |
| Hierarchical Dynamic | mask | s for Visual Explanation of Neural Networks |
| Hierarchical Improvement of Foreground Segmentation | mask | s in Background Subtraction |
| Hierarchical | mask | Prompting and Robust Integrated Regression for Oriented Object Detection |
| High angular resolution light field reconstruction with coded-aperture | mask | |
| High-Accuracy RGB-D Face Recognition via Segmentation-Aware Face Depth Estimation and | mask | -Guided Attention Network |
| High-Quality Damaged Building Instance Segmentation Based on Improved | mask | Transfiner Using Post-Earthquake UAS Imagery: A Case Study of the Luding Ms 6.8 Earthquake in China |
| High-Resolution Airborne Color-Infrared Camera Water | mask | for the NASA ABoVE Campaign, A |
| HINT: High-Quality INpainting Transformer With | mask | -Aware Encoding and Enhanced Attention |
| Human Detection Aided by Deeply Learned Semantic | mask | s |
| Human-Centric Visual Relation Segmentation Using | mask | R-CNN and VTransE |
| Hyperspectral Imaging With Random Printed | mask | |
| I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi- | mask | Inpainting |
| I Only Have Eyes for You: The Impact of | mask | s On Convolutional-Based Facial Expression Recognition |
| IBFusion: An Infrared and Visible Image Fusion Method Based on Infrared Target | mask | and Bimodal Feature Extraction Strategy |
| Identification, characterization, and segmentation of Halftone or stippled regions of binary images by growing a seed to a clipping | mask | |
| Image Animation with Perturbed | mask | s |
| Image Compression Using Biorthogonal Wavelet Transforms with Multiplierless 2-D Filter | mask | Operation |
| Image Denoising With Edge-Preserving and Segmentation Based on | mask | NHA |
| Image Inpainting by End-to-End Cascaded Refinement With | mask | Awareness |
| Image Mosaicing Without Distortion Using Projected | mask | For Image Digitization |
| Image processing by self-generated spatial and spectral | mask | s |
| Image splicing detection using | mask | -RCNN |
| Image Synthesis from Layout with Locality-Aware | mask | Adaption |
| IMP: Instance | mask | Projection for High Accuracy Semantic Segmentation of Things |
| Impact of Occlusion | mask | s on Gender Classification from Iris Texture |
| Improved fine-tuning of | mask | -aware transformer for personalized face inpainting with semantic-aware regularization |
| Improved Low-Complexity Algorithm for 2-D Integer Lifting-Based Discrete Wavelet Transform Using Symmetric | mask | -Based Scheme |
| Improved | mask | R-CNN for Instance Segmentation of Tree Crowns in Aerial Imagery, An |
| Improved | mask | R-CNN for Rural Building Roof Type Recognition from UAV High-Resolution Images: A Case Study in Hunan Province, China |
| Improvement of Bounding Box and Instance Segmentation Accuracy Using ResNet-152 FPN with Modulated Deformable ConvNets v2 Backbone-based | mask | Scoring R-CNN |
| Improving Boundary Detection Using Variable Resolution | mask | s |
| Improving patch-based synthesis by learning patch | mask | s |
| Improving ViT interpretability with patch-level | mask | prediction |
| IN-Loop Filter for Object | mask | Coding in Versatile Video Coding |
| Individual Tree Species Identification and Crown Parameters Extraction Based on | mask | R-CNN: Assessing the Applicability of Unmanned Aerial Vehicle Optical Images |
| influence of vegetation index thresholding on EO-based assessments of exposed soil | mask | s in Germany between 1984 and 2019, The |
| Infrared and Visible Image Fusion Method Based on Semantic-Sensitive | mask | Selection and Bidirectional-Collaboration Region Fusion, An |
| Initialize with | mask | : For More Efficient Federated Learning |
| Instance Segmentation for Large, Multi-Channel Remote Sensing Imagery Using | mask | -RCNN and a Mosaicking Approach |
| Instance Segmentation with | mask | -supervised Polygonal Boundary Transformers |
| Instant pose extraction based on | mask | transformer for occluded person re-identification |
| Instrument for Automatically Inspecting Integrated Circuit | mask | s for Pinholes and Spots |
| Integer approximation of 3D chamfer | mask | coefficients using a scaling factor in anisotropic grids |
| Integrating Binary | mask | Estimation With MRF Priors of Cochleagram for Speech Separation |
| Integrating Boxes and | mask | s: A Multi-Object Framework for Unified Visual Tracking and Segmentation |
| Integrating Pose and | mask | Predictions for Multi-person in Videos |
| Intercomparisons of Cloud | mask | Products Among Fengyun-4A, Himawari-8, and MODIS |
| Interpretation of eight-point discrete cosine and sine transforms as 3x3 orthogonal edge | mask | s in terms of the Frei-Chen masks |
| Interpretation of eight-point discrete cosine and sine transforms as 3x3 orthogonal edge | mask | s in terms of the Frei-Chen masks |
| Interpreting Image Classifiers by Generating Discrete | mask | s |
| Inverse image problem of designing phase shifting | mask | s in optical lithography |
| ISTR: | mask | -Embedding-Based Instance Segmentation Transformer |
| Iterative | mask | generation method for handling occlusion in optical flow assisted view interpolation |
| Iterative Optimization of Quarter Sampling | mask | s for Non-Regular Sampling Sensors |
| k-means | mask | Transformer |
| KDC-MAE: Knowledge Distilled Contrastive | mask | Auto-Encoder |
| kMaXU: Medical image segmentation U-Net with k-means | mask | Transformer and contrastive cluster assignment |
| KSM: Fast Multiple Task Adaption via Kernel-wise Soft | mask | Learning |
| Landslide Extraction Using | mask | R-CNN with Background-Enhancement Method |
| Latent-OFER: Detect, | mask | , and Reconstruct with Latent Vectors for Occluded Facial Expression Recognition |
| Layered Depth Refinement with | mask | Guidance |
| Layered image model using binary PCA transparency | mask | s |
| Learnable | mask | s for Pose-Guided View Synthesis |
| Learning Adaptive Patch Generators for | mask | -Robust Image Inpainting |
| Learning Adaptive Target-and-Surrounding Soft | mask | for Correlation Filter Based Visual Tracking |
| Learning Auxiliary Representations With Inconsistency-Guided Detail Regularization for | mask | -Guided Matting |
| Learning Box Regression and | mask | Segmentation Under Long-Tailed Distribution with Gradient Transfusing |
| Learning Efficient GANs for Image Translation via Differentiable | mask | s and Co-Attention Distillation |
| Learning Guided Attention | mask | s for Facial Action Unit Recognition |
| Learning Phase | mask | for Privacy-Preserving Passive Depth Estimation |
| Learning Sparse | mask | s for Diffusion-Based Image Inpainting |
| Learning Texture-Discrimination | mask | s |
| Learning to Generate Text-Grounded | mask | for Open-World Semantic Segmentation from Only Image-Text Pairs |
| Learning to Inpaint by Progressively Growing the | mask | Regions |
| Learning to | mask | and permute visual tokens for Vision Transformer pre-training |
| Lensless Imaging with Focusing Sparse Ura | mask | s in Long-wave Infrared and Its Application for Human Detection |
| Less Is More: Unsupervised | mask | -Guided Annotated CT Image Synthesis With Minimum Manual Segmentations |
| Lifting for Blind Deconvolution in Random | mask | Imaging: Identifiability and Convex Relaxation |
| Linking Broken Character Borders with Variable Sized | mask | s to Improve Recognition |
| LMQFormer: A Laplace-Prior-Guided | mask | Query Transformer for Lightweight Snow Removal |
| Local directional | mask | maximum edge patterns for image retrieval and face recognition |
| Localization of Craniomaxillofacial Landmarks on CBCT Images Using 3D | mask | R-CNN and Local Dependency Learning |
| Localized Approach to Abandoned Luggage Detection with Foreground- | mask | Sampling, A |
| LQMFormer: Language-Aware Query | mask | Transformer for Referring Image Segmentation |
| M2FNet: | mask | -Guided Multi-Level Fusion for RGB-T Pedestrian Detection |
| MA-SAM: A Multi-Atlas Guided SAM Using Pseudo | mask | Prompts Without Manual Annotation for Spine Image Segmentation |
| Machine-learned Regularization and Polygonization of Building Segmentation | mask | s |
| MACnet: | mask | augmented counting network for class-agnostic counting |
| MaGAT: | mask | -Guided Adversarial Training for Defending Face Editing GAN Models From Proactive Defense |
| MagConv: | mask | -Guided Convolution for Image Inpainting |
| MagDR: | mask | -guided Detection and Reconstruction for Defending Deepfakes |
| MagGAN: High-resolution Face Attribute Editing with | mask | -guided Generative Adversarial Network |
| MAP: | mask | -Pruning for Source-Free Model Intellectual Property Protection |
| Mapping of Dwellings in IDP/Refugee Settlements from Very High-Resolution Satellite Imagery Using a | mask | Region-Based Convolutional Neural Network |
| Mapping the Topographic Features of Mining-Related Valley Fills Using | mask | R-CNN Deep Learning and Digital Elevation Data |
| MART: | mask | -Aware Reasoning Transformer for Vehicle Re-Identification |
| Marten: Visual Question Answering with | mask | Generation for Multi-modal Document Understanding |
| MASIC: Deep | mask | Stereo Image Compression |
| mask | as Supervision: Leveraging Unified Mask Information for Unsupervised 3d Pose Estimation |
| mask | as Supervision: Leveraging Unified Mask Information for Unsupervised 3d Pose Estimation |
| mask | Aware Network for Masked Face Recognition in the Wild |
| mask | building for perceptually hiding frequency embedded watermarks |
| mask | Captioning Network |
| mask | Connectivity by Viscous Closings: Linking Merging Galaxies without Merging Double Stars |
| mask | Cross-Modal Hashing Networks |
| mask | Design for Optical Microlithography: An Inverse Imaging Problem |
| mask | Detection Method for Shoppers Under the Threat of COVID-19 Coronavirus, A |
| mask | DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation |
| mask | Encoding for Single Shot Instance Segmentation |
| mask | encoding: A general instance mask representation for object segmentation |
| mask | encoding: A general instance mask representation for object segmentation |
| mask | Grounding for Referring Image Segmentation |
| mask | Guided Attention for Fine-Grained Patchy Image Classification |
| mask | Guided Fusion for Group Activity Recognition in Images |
| mask | Guided Matting via Progressive Refinement Network |
| mask | Guided Spatial-Temporal Fusion Network for Multiple Object Tracking |
| mask | OBB: A Semantic Attention-Based Mask Oriented Bounding Box Representation for Multi-Category Object Detection in Aerial Images |
| mask | OBB: A Semantic Attention-Based Mask Oriented Bounding Box Representation for Multi-Category Object Detection in Aerial Images |
| mask | prior generation with language queries guided networks for referring image segmentation |
| mask | R-CNN |
| mask | R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery |
| mask | R-CNN With Pyramid Attention Network for Scene Text Detection |
| mask | R-CNN-Based Landslide Hazard Identification for 22.6 Extreme Rainfall Induced Landslides in the Beijiang River Basin, China |
| mask | RCNN algorithm for nuclei detection on breast cancer histopathological images |
| mask | Scoring R-CNN |
| mask | Selection and Propagation for Unsupervised Video Object Segmentation |
| mask | Sparse Representation Based on Semantic Features for Thermal Infrared Target Tracking |
| mask | spoofing in face recognition and countermeasures |
| mask | SSD: An Effective Single-Stage Approach to Object Instance Segmentation |
| mask | Textspotter v3: Segmentation Proposal Network for Robust Scene Text Spotting |
| mask | TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes |
| mask | Transfiner for High-Quality Instance Segmentation |
| mask | -Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation |
| mask | -Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation |
| mask | -and-Edge Co-Guided Separable Network for Camouflaged Object Detection |
| mask | -Attention-Free Transformer for 3D Instance Segmentation |
| mask | -Aware 3D axial transformer for video inpainting |
| mask | -Aware Hierarchical Aggregation Transformer for Occluded Person Re-Identification |
| mask | -Aware Light Field De-Occlusion With Gated Feature Aggregation and Texture-Semantic Attention |
| mask | -Aware Networks for Crowd Counting |
| mask | -Aware Pseudo Label Denoising for Unsupervised Vehicle Re-Identification |
| mask | -based anomaly segmentation in complex driving scenes |
| mask | -Based Invisible Backdoor Attacks on Object Detection |
| mask | -Based Second-Generation Connectivity and Attribute Filters |
| mask | -based Style-Controlled Image Synthesis Using a Mask Style Encoder |
| mask | -based Style-Controlled Image Synthesis Using a Mask Style Encoder |
| mask | -CNN: Localizing parts and selecting descriptors for fine-grained bird species categorization |
| mask | -edge connectivity: Theory, computation, and application to historical document analysis |
| mask | -Embedded Discriminator with Region-based Semantic Regularization for Semi-Supervised Class-Conditional Image Synthesis |
| mask | -Free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations |
| mask | -Free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations |
| mask | -Free Video Instance Segmentation |
| mask | -Guided Attention and Episode Adaptive Weights for Few-Shot Segmentation |
| mask | -Guided Attention Network and Occlusion-Sensitive Hard Example Mining for Occluded Pedestrian Detection |
| mask | -Guided Attention Network for Occluded Pedestrian Detection |
| mask | -Guided Contrastive Attention Model for Person Re-identification |
| mask | -Guided Cross-Modality Fusion Network for Visible-Infrared Vehicle Detection |
| mask | -guided cycle-GAN for specular highlight removal |
| mask | -Guided Discriminative Feature Network for Occluded Person Re-Identification |
| mask | -Guided Feature Extraction and Augmentation for Ultra-Fine-Grained Visual Categorization |
| mask | -guided image person removal with data synthesis |
| mask | -Guided Matting in the Wild |
| mask | -guided multiscale feature aggregation network for hand gesture recognition |
| mask | -guided network for image captioning |
| mask | -Guided Portrait Editing With Conditional GANs |
| mask | -Guided Siamese Tracking With a Frequency-Spatial Hybrid Network |
| mask | -guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction |
| mask | -Guided Teacher-Student Learning for Open-Vocabulary Object Detection in Remote Sensing Images |
| mask | -Guided Transformer Network with Topic Token for Remote Sensing Image Captioning, A |
| mask | -invariant Face Recognition through Template-level Knowledge Distillation |
| mask | -Pose Cascaded CNN for 2D Hand Pose Estimation From Single Color Image |
| mask | -ranking Network for Semi-supervised Video Object Segmentation |
| mask | -ShadowGAN: Learning to Remove Shadows From Unpaired Data |
| mask | -ShadowNet: Toward Shadow Removal via Masked Adaptive Instance Normalization |
| mask | -SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation |
| mask | -Specific Inpainting with Deep Neural Networks |
| mask | -streaming CNN for pedestrian detection |
| mask | -ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging |
| mask | -ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging |
| mask | -Vit: an Object Mask Embedding in Vision Transformer for Fine-Grained Visual Classification |
| mask | -Vit: an Object Mask Embedding in Vision Transformer for Fine-Grained Visual Classification |
| mask | 2Anomaly: Mask Transformer for Universal Open-Set Segmentation |
| mask | 2DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation |
| mask | 2map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks |
| mask | 4Align: Aligned Entity Prompting with Color Masks for Multi-Entity Localization Problems |
| mask | : An Object Identification Algorithm |
| mask | Clustering: View Consensus Based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation |
| mask | COV: A random mask covariance network for ultra-fine-grained visual categorization |
| mask | Diffusion: Boosting Text-to-Image Consistency with Conditional Mask |
| mask | ed Faces with Faced Masks |
| mask | ed-attention Mask Transformer for Universal Image Segmentation |
| mask | Flownet: Asymmetric Feature Matching With Learnable Occlusion Mask |
| mask | Gaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks |
| mask | GWM: A Generalizable Driving World Model with Video Mask Reconstruction |
| mask | Hunter: real-time object detection of face masks during the COVID-19 pandemic |
| mask | pan: Mask Prior Guided Network For Pansharpening |
| mask | Plus: Improving Mask Generation for Instance Segmentation |
| mask | renderer: 3D-infused multi-mask realistic face reenactment |
| mask | s based human action detection in crowded videos |
| Mass classification in mammograms based on two-concentric | mask | s and discriminating texton |
| MAT-MS: A | mask | -aware transformer for constructing gap-free MODIS normalized difference snow index products |
| MAT: | mask | -Aware Transformer for Large Hole Image Inpainting |
| MaX-DeepLab: End-to-End Panoptic Segmentation with | mask | Transformers |
| MC-PANDA: | mask | Confidence for Panoptic Domain Adaptation |
| ME-PCN: Point Completion Conditioned on | mask | Emptiness |
| Measuring Isotropic Local Contrast: A Circular | mask | Based Approach |
| Median-Type Filters with Model-Based Preselection | mask | s |
| Medical Transformer With Mix | mask | Generation for Thorax Disease Classification |
| Method and apparatus for displaying radiation image, and method and apparatus for calculating unsharp | mask | signal used for the same |
| Method and apparatus for halftone rendering of a gray scale image using a blue noise | mask | |
| Method for generating sprites for object-based coding sytems using | mask | s and rounding average |
| method for sparse disparity densification using voting | mask | propagation, A |
| MGMap: | mask | -Guided Learning for Online Vectorized HD Map Construction |
| MGPAN: | mask | Guided Pixel Aggregation Network |
| MGRLN-net: | mask | -guided Residual Learning Network for Joint Single-image Shadow Detection and Removal |
| MiM: | mask | in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis |
| MiM: | mask | in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis |
| Minimizing stochastic moire in frequency-modulated halftones by means of green-noise | mask | s |
| MixReorg: Cross-Modal Mixed Patch Reorganization is a Good | mask | Learner for Open-World Semantic Segmentation |
| MixSyn: Compositional Image Synthesis with Fuzzy | mask | s and Style Fusion |
| Mixup | mask | Adaptation: Bridging the gap between input saliency and representations via attention mechanism in feature mixup |
| MM-BSN: Self-Supervised Image Denoising for Real-World with Multi- | mask | based on Blind-Spot Network |
| MMAE: A universal image fusion method via | mask | attention mechanism |
| MMR: | mask | Based Multi-Resolution Images and Videos |
| Model Breadcrumbs: Scaling Multi-task Model Merging with Sparse | mask | s |
| Model-Based Edge Detector for Spectral Imagery Using Sparse Spatiospectral | mask | s |
| Modeling | mask | Uncertainty in Hyperspectral Image Reconstruction |
| Modeling Stroke | mask | for End-to-End Text Erasing |
| Modular Interactive Video Object Segmentation: Interaction-to- | mask | , Propagation and Difference-Aware Fusion |
| Monocular 3D object detection via | mask | -Revised Network and quality perception loss |
| MOOSS: | mask | -Enhanced Temporal Contrastive Learning for Smooth State Evolution in Visual Reinforcement Learning |
| Motion-Aware | mask | Feature Reconstruction for Skeleton-Based Action Recognition |
| MP-Former: | mask | -Piloted Transformer for Image Segmentation |
| MP2PMatch: A | mask | -guided Part-to-Part Matching network based on transformer for occluded person re-identification |
| MPOD123: One Image to 3D Content Generation Using | mask | -Enhanced Progressive Outline-to-Detail Optimization |
| MRG-T: | mask | -Relation-Guided Transformer for Remote Vision-Based Pedestrian Attribute Recognition in Aerial Imagery |
| MRI Motion Correction Through Disentangled CycleGAN Based on Multi- | mask | K-Space Subsampling |
| MT-DSNet: Mix- | mask | teacher-student strategies and dual dynamic selection plug-in module for fine-grained image recognition |
| MTA-CLIP: Language-guided Semantic Segmentation with | mask | -text Alignment |
| MTADiffusion: | mask | Text Alignment Diffusion Model for Object Inpainting |
| MTGAN: | mask | and Texture-driven Generative Adversarial Network for Lung Nodule Segmentation |
| MTRNet++: One-stage | mask | -based scene text eraser |
| Multi-dimensional multi-directional | mask | maximum edge pattern for bio-medical image retrieval |
| Multi-Modal Representation Learning with Text-Driven Soft | mask | s |
| Multi-object tracking using binary | mask | s |
| Multi-Resolution Texture Classifier Based on Multi-Resolution Tuned | mask | , A |
| Multi-scale Adaptive | mask | 3D Rigid Registration of Ultrasound and CT Images |
| Multi-Scale Guided | mask | Refinement for Coarse-to-Fine RGB-D Perception |
| Multi-Scale | mask | Convolution-Based Blind-Spot Network for Hyperspectral Anomaly Detection, A |
| Multi-Species Individual Tree Segmentation and Identification Based on Improved | mask | R-CNN and UAV Imagery in Mixed Forests |
| Multi-Swin | mask | Transformer for Instance Segmentation of Agricultural Field Extraction |
| Multi-Task Network with Distance- | mask | -Boundary Consistency Constraints for Building Extraction from Aerial Images, A |
| Multi-Temporal Pixel-Based Compositing for Cloud Removal Based on Cloud | mask | s Developed Using Classification Techniques |
| multimodal approach for 3D face modeling and recognition using 3D deformable facial | mask | , A |
| Multiple | mask | Enhanced Transformer for Robust Visual Tracking |
| Multiple-image encryption based on chaotic phase | mask | and equal modulus decomposition in quaternion gyrator domain |
| MWVOS: | mask | -Free Weakly Supervised Video Object Segmentation via promptable foundation model |
| Natural Adversarial | mask | for Face Identity Protection in Physical World |
| Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention | mask | |
| Neural implicit shape modeling for small planetary bodies from multi-view images using a | mask | -based classification sampling strategy |
| New Building | mask | Using the Gradient of Heights for Automatic Building Extraction, A |
| New Method for Computing Polygonal | mask | s In Image Processing Pipeline Architectures, A |
| new spatial perceptual | mask | for image watermarking, A |
| Non-Deterministic Face | mask | Removal Based on 3d Priors |
| Non-Semantics Suppressed | mask | Learning for Unsupervised Video Semantic Compression |
| Not just Compete, but Collaborate: Local Image-to-Image Translation via Cooperative | mask | Prediction |
| Note on the Coefficients of Compass | mask | Convolutions, A |
| novel attention-based enhancement framework for face | mask | detection in complicated scenarios, A |
| Novel Framework Based on | mask | R-CNN and Histogram Thresholding for Scalable Segmentation of New and Old Rural Buildings, A |
| novel method for stereo matching using Gabor Feature Image and Confidence | mask | , A |
| Novel presentation attack detection algorithm for face recognition system: Application to 3D face | mask | attack |
| novel scene coupling semantic | mask | network for remote sensing image segmentation, A |
| Object detection based on RGC | mask | R-CNN |
| Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic | mask | Learning |
| Occlusion and Deformation Handling Visual Tracking for UAV via Attention-Based | mask | Generative Network |
| Occlusion robust face recognition based on | mask | learning |
| Occlusion Robust Face Recognition Based on | mask | Learning With Pairwise Differential Siamese Network |
| Ocean Color Quality Control | mask | s Contain the High Phytoplankton Fraction of Coastal Ocean Observations |
| OMNet: Learning Overlapping | mask | for Partial-to-Partial Point Cloud Registration |
| OMNET: Real-Time Stereo Matching with Unsupervised Occlusion | mask | |
| On Advantages of | mask | -level Recognition for Outlier-aware Segmentation |
| On Hallucinating Context and Background Pixels from a Face | mask | using Multi-scale GANs |
| On the Robustness and Generalization Ability of Building Footprint Extraction on the Example of SegNet and | mask | R-CNN |
| One-dimensional frequency domain interpretation of compass roof edge and Frei-Chen line | mask | s |
| One-Shot Synthesis of Images and Segmentation | mask | s |
| Online and real-time | mask | -guided multi-person tracking and segmentation |
| Online Class Incremental Learning on Stochastic Blurry Task Boundary via | mask | and Visual Prompt Tuning |
| OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D | mask | Merging |
| Open-Vocabulary Semantic Segmentation with | mask | -adapted CLIP |
| Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D | mask | Guidance |
| OPMP: An Omnidirectional Pyramid | mask | Proposal Network for Arbitrary-Shape Scene Text Detection |
| Optical and SAR Image Registration in Equatorial Cloudy Regions Guided by Automatically Point-Prompted Cloud | mask | s |
| Optimization of Packetization | mask | s for Image Coding Based on an Objective Cost Function for Desired Packet Spreading |
| Optimization of Targeted Differential Interferometric Measurements for Wellpads Detected by | mask | Region-Based Convolutional Neural Network in the Tengiz Oilfield of the Caspian Sea Coast |
| Optimizing Eigenfaces by Face | mask | s for Facial Expression Recognition |
| Orientation and scale invariant mean shift using object | mask | -based kernel |
| Outlier Denoising Using a Novel Statistics-Based | mask | Strategy for Compressive Sensing |
| Overexposure | mask | Fusion: Generalizable Reverse ISP Multi-Step Refinement |
| Page segmentation using texture discrimination | mask | s |
| Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic | mask | |
| Parallel Thinning Algorithm Using KxK | mask | s, A |
| PATMAT: Person Aware Tuning of | mask | -Aware Transformer for Face inpainting |
| Perceptually-Weighted Evaluation Criteria for Segmentation | mask | s in Video Sequences |
| Person Search by Separated Modeling and A | mask | -Guided Two-Stream CNN Model |
| Person Search via a | mask | -Guided Two-Stream CNN Model |
| Personalized Privacy Protection | mask | Against Unauthorized Facial Recognition |
| PFENet++: Boosting Few-Shot Semantic Segmentation With the Noise-Filtered Context-Aware Prior | mask | |
| PhlatCam: Designed Phase- | mask | Based Thin Lensless Camera |
| PICK: Predict and | mask | for Semi-supervised Medical Image Segmentation |
| Piggyback: Adapting a Single Network to Multiple Tasks by Learning to | mask | Weights |
| Pineapples' Detection and Segmentation Based on Faster and | mask | R-CNN in UAV Imagery |
| Pixelated source and | mask | optimization for immersion lithography |
| PlanePDM: Boundary-aware 3D planar recovery by using parallel dilated | mask | head |
| PLFace: Progressive Learning for Face Recognition with | mask | Bias |
| Point Proposal Based Instance Segmentation with Rectangular | mask | s for Robot Picking Task |
| PointCMP: Contrastive | mask | Prediction for Self-supervised Learning on Point Cloud Videos |
| PolyMaX: General Dense Prediction with | mask | Transformer |
| Pose classification of human faces by weighting | mask | function approach |
| Power Exponent Based Weighting Criterion for DNN-Based | mask | Approximation in Speech Enhancement |
| PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region | mask | and Group-Level Consistency |
| Prediction of cirrhosis from liver ultrasound B-mode images based on Laws' | mask | s analysis |
| PRFormer: Matching Proposal and Reference | mask | s by Semantic and Spatial Similarity for Few-Shot Semantic Segmentation |
| Prior Season Crop Type | mask | s for Winter Wheat Yield Forecasting: A US Case Study |
| Prism- | mask | System for Multispectral Video Acquisition, A |
| Pro2sam: | mask | Prompt to Sam with Grid Points for Weakly Supervised Object Localization |
| Procedure for Generating Template | mask | s for Detecting Variable Signals |
| Production of a Dynamic Cropland | mask | by Processing Remote Sensing Image Series at High Temporal and Spatial Resolutions |
| Progressive | mask | Transformer With Edge Enhancement for Image Manipulation Localization |
| Proposal-Free Temporal Action Detection via Global Segmentation | mask | Learning |
| Proposal-Free Volumetric Instance Segmentation from Latent Single-Instance | mask | s |
| Protecting Facial Privacy: Generating Adversarial Identity | mask | s via Style-robust Makeup Transfer |
| Pseudo | mask | Augmented Object Detection |
| Pseudo- | mask | Matters in Weakly-supervised Semantic Segmentation |
| PU- | mask | : 3D Point Cloud Upsampling via an Implicit Virtual Mask |
| PU- | mask | : 3D Point Cloud Upsampling via an Implicit Virtual Mask |
| PW-NeRF: Progressive wavelet- | mask | guided neural radiance fields view synthesis |
| PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding | mask | Optimization |
| Pyramidal Multiple Instance Detection Network With | mask | Guided Self-Correction for Weakly Supervised Object Detection |
| Quantitative assessment of image quality enhancement due to unsharp- | mask | processing in x-ray fluoroscopy |
| Railway Fastener Pixel-Level Detection Based on Dual-Stream Encoder Network With | mask | Guidance |
| Randomized Channel-pass | mask | for Channel-wise Explanation of Black-box Models |
| Range-Imaging System Utilizing Nematic Liquid Crystal | mask | |
| Rapid Pattern Inspection of Shadow | mask | s by Machine Vision Integrated with Fourier Optics |
| Ray Contribution | mask | s for Structure Adaptive Sinogram Filtering |
| Real | mask | s and spoof faces: On the masked face presentation attack detection |
| Real-Time Text Detection With Similar | mask | in Traffic, Industrial, and Natural Scenes |
| Real-time tonal depiction method by reaction-diffusion | mask | |
| Recognizing apple leaf diseases using a novel parallel real-time processing framework based on | mask | RCNN and transfer learning: An application for smart agriculture |
| Recurrent | mask | Refinement for Few-Shot Medical Image Segmentation |
| Reducing randomness of non-regular sampling | mask | s for image reconstruction |
| Refining Biologically Inconsistent Segmentation | mask | s with Masked Autoencoders |
| Regularized | mask | Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models |
| Reinforcement Shrink- | mask | for Text Detection |
| Remote Photoplethysmography Correspondence Feature for 3D | mask | Face Presentation Attack Detection |
| Remote Pulse Estimation in the Presence of Face | mask | s |
| Removing the | mask | : Reconstructing a Real-Valued Field on the Sphere from a Masked Field by Spherical Fourier Analysis |
| Research on an embedded-based | mask | wearing detection system |
| Research on Landslides Automatic Extraction Model Based on the Improved | mask | R-CNN, A |
| Research on | mask | Wearing Detection Algorithm Based on Pedestrian Detection in the Post-Pandemic |
| Research on | mask | -Wearing Detection Algorithm Based on Improved YOLOv7-Tiny |
| Research on Ship's Officer Behavior Identification Based on | mask | R-CNN |
| Research On The Improved Image Dodging Algorithm Based On | mask | Technique |
| Resilience | mask | for Robust Audio Hashing, A |
| Resolution Improvement In FZA Lens-Less Camera By Synthesizing Images Captured With Different | mask | -Sensor Distances |
| Resolution-robust Large | mask | Inpainting with Fourier Convolutions |
| Respiratory Rate Estimation Based on Detected | mask | Area in Thermal Images |
| Restore Anything with | mask | s: Leveraging Mask Image Modeling for Blind All-in-one Image Restoration |
| Restore Anything with | mask | s: Leveraging Mask Image Modeling for Blind All-in-one Image Restoration |
| Restore Globally, Refine Locally: A | mask | -Guided Scheme to Accelerate Super-Resolution Networks |
| Rethinking referring relationships from a perspective of | mask | -level relational reasoning |
| Rethinking the sparse | mask | learning mechanism in sparse convolution for object detection on drone images |
| Reviving Iterative Training with | mask | Guidance for Interactive Segmentation |
| Ridge Linking Using an Adaptive Oriented | mask | Applied to Plant Root Images with Thin Structures |
| Road Marking Detection Based on | mask | R-CNN Instance Segmentation Model |
| Road Pothole Detection Based on Crowdsourced Data and Extended | mask | R-CNN |
| Robots Understanding Contextual Information in Human-Centered Environments Using Weakly Supervised | mask | Data Distillation |
| Robust and Fast Object Tracking Method Using a Dynamic | mask | and an Adaptive Search, A |
| Robust cost function for optimizing chamfer | mask | s |
| Robust fast corner detector based on filled circle and outer ring | mask | |
| Robust Perturbation for Visual Explanation: Cross-Checking | mask | Optimization to Avoid Class Distortion |
| Robust retinal blood vessel segmentation using line detectors with multiple | mask | s |
| Robust Tracking via Bidirectional Transduction With | mask | Information |
| RPMG-FSS: Robust Prior | mask | Guided Few-Shot Semantic Segmentation |
| rPPG-Based Spoofing Detection for Face | mask | Attack using Efficientnet on Weighted Spatial-Temporal Representation |
| rubber- | mask | technique, I: Pattern measurement and analysis, The |
| rubber- | mask | technique, II: Pattern Storage and Recognition, The |
| Salient Object Detection Using Window | mask | Transferring with Multi-layer Background Contrast |
| SAR interferogram filtering in the wavelet domain using a coherence map | mask | |
| SatSynth: Augmenting Image- | mask | Pairs Through Diffusion Models for Aerial Semantic Segmentation |
| Scalable, Detailed and | mask | -Free Universal Photometric Stereo |
| Scale-invariant | mask | -guided vehicle keypoint detection from a monocular image |
| Scatter Correction for Spectral CT Using a Primary Modulator | mask | |
| Sea-Land Segmentation of Remote-Sensing Images with Prompt | mask | -Attention |
| Security augmentation grounded on Fresnel and Arnold transforms using hybrid chaotic structured phase | mask | |
| SegDA: Maximum Separable Segment | mask | with Pseudo Labels for Domain Adaptive Semantic Segmentation |
| Segmentation | mask | guided end-to-end person search |
| Segmentation-Aware Convolutional Networks Using Local Attention | mask | s |
| Selective TransHDR: Transformer-Based Selective HDR Imaging Using Ghost Region | mask | |
| Self-Challenging | mask | for Cross-Domain Few-Shot Classification |
| self-immune to 3D | mask | s attacks face recognition system, A |
| Self-supervised random | mask | attention GAN in tackling pose-invariant face recognition |
| Self-Supervised Visual Representations Learning by Contrastive | mask | Prediction |
| Self2Channel: Self-supervised denoising of different regions using coalition game based channel | mask | |
| Sem2NeRF: Converting Single-View Semantic | mask | s to Neural Radiance Fields |
| Semantic feature refinement of YOLO for human | mask | detection in dense crowded |
| Semantic-Guided Multi- | mask | Image Harmonization |
| Semi-Automatic Generation Of Tight Binary | mask | s And Non-Convex Isosurfaces For Quantitative Analysis of 3D Biological Samples |
| Semi-Supervised Skin Lesion Segmentation via Iterative | mask | Optimization |
| Sensing increased image resolution using aperture | mask | s |
| Shadow cameras: Reciprocal views from illumination | mask | s |
| ShadowRefiner: Towards | mask | -free Shadow Removal via Fast Fourier Transformer |
| Shape and Texture Based Countermeasure to Protect Face Recognition Systems against | mask | Attacks |
| Siamese Dynamic | mask | Estimation Network for Fast Video Object Segmentation |
| SICNet: Learning selective inter-slice context via | mask | -Guided Self-knowledge distillation for NPC segmentation |
| SIM: Semantic-aware Instance | mask | Generation for Box-Supervised Instance Segmentation |
| Similar Pattern Discrimination by Filter | mask | Learning with Probabilistic Descent |
| Simple Framework for 3D Lensless Imaging with Programmable | mask | s, A |
| Simple Latent Diffusion Approach for Panoptic Segmentation and | mask | Inpainting, A |
| Simplified Concrete Dropout - Improving the Generation of Attribution | mask | s for Fine-grained Classification |
| Simultaneous Face Completion and Frontalization via | mask | Guided Two-Stage GAN |
| Single Patch Based 3D High-Fidelity | mask | Face Anti-Spoofing |
| Single- | mask | Inpainting for Voxel-based Neural Radiance Fields |
| SketchEdit: | mask | -Free Local Image Manipulation with Partial Sketches |
| Skin lesion segmentation based on | mask | RCNN, Multi Atrous Full-CNN, and a geodesic method |
| Sky | mask | : Attack-Agnostic Robust Federated Learning with Fine-grained Learnable Masks |
| Slice- | mask | Based 3d Cardiac Shape Reconstruction from CT Volume |
| Smart | mask | : Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control |
| Snippets Relation and Hard-Snippets | mask | Network for Weakly-Supervised Temporal Action Localization, A |
| Snow | mask | Guided Adaptive Residual Network for Image Snow Removal |
| SODAR: Exploring Locally Aggregated Learning of | mask | Representations for Instance Segmentation |
| Soft | mask | Correlation Filter for Visual Object Tracking |
| SoftShadow: Leveraging Soft | mask | s for Penumbra-Aware Shadow Removal |
| SOS | mask | Fuse: An Infrared and Visible Image Fusion Architecture Based on Salient Object Segmentation Mask |
| Spatial Localization of Broadleaf Species in Mixed Forests in Northern Japan Using UAV Multi-Spectral Imagery and | mask | R-CNN Model |
| Spatial | mask | -Based Adaptive Robust Training for Video Object Segmentation With Noisy Labels |
| Spatial-temporal saliency action | mask | attention network for action recognition |
| Spatio-temporal video interpolation and denoising using motion-assisted steering kernel ( | mask | ) regression |
| Spatiotemporal Sequence Prediction Framework Based on | mask | Reconstruction: Application to Short-Duration Precipitation Radar Echoes, A |
| Spectrally Constrained Unimodular Sequence Design Without Spectral Level | mask | |
| Spherical | mask | : Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation |
| SS R-CNN: Self-Supervised Learning Improving | mask | R-CNN for Ship Detection in Remote Sensing Images |
| Statistical Motion | mask | and Sliding Registration |
| Structured Semi-supervised Forest for Facial Landmarks Localization with Face | mask | Reasoning |
| Study of the Automatic Recognition of Landslides by Using InSAR Images and the Improved | mask | R-CNN Model in the Eastern Tibet Plateau |
| Subdivided | mask | Dispersion Framework for Semi-Supervised Semantic Segmentation |
| surprising impact of | mask | -head architecture on novel class segmentation, The |
| survey on 3D | mask | presentation attack detection and countermeasures, A |
| SweepCam: Depth-Aware Lensless Imaging Using Programmable | mask | s |
| Symmetric | mask | s for In-fill Pixel Interpolation on Discrete p:q Lattices |
| System for Medical | mask | Detection in the Operating Room Through Facial Attributes |
| S^2-Transformer for | mask | -Aware Hyperspectral Image Reconstruction |
| Teaching in adverse scenes: a statistically feedback-driven threshold and | mask | adjustment teacher-student framework for object detection in UAV images under adverse scenes |
| Template Enhancement and | mask | Generation for Siamese Tracking |
| Temporal Flow | mask | Attention for Open-Set Long-Tailed Recognition of Wild Animals in Camera-Trap Images |
| Temporal Similarity Analysis of Remote Photoplethysmography for Fast 3D | mask | Face Presentation Attack Detection |
| Tensor ring with alternative change | mask | for multitemporal hyperspectral image change detection |
| Ternary Feature | mask | s: zero-forgetting for task-incremental learning |
| Text-Enhanced Scene Image Super-Resolution via Stroke | mask | and Orthogonal Attention |
| Text-to-image via | mask | anchor points |
| TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform | mask | |
| TextDiff: Enhancing scene text image super-resolution with | mask | -guided residual diffusion models |
| Textual Query-driven | mask | Transformer for Domain Generalized Segmentation |
| TFM2: Training-Free | mask | Matching for Open-Vocabulary Semantic Segmentation |
| Thanka Mural Inpainting Based on Multi-Scale Adaptive Partial Convolution and Stroke-Like | mask | |
| Theoretical Characterization of Effect of | mask | s in Snapshot Compressive Imaging |
| Thin-Cloud | mask | Method for Remote Sensing Images Based on Sparse Dark Pixel Region Detection, A |
| Through-The- | mask | : Mask-based Motion Trajectories for Image-to-Video Generation |
| Through-The- | mask | : Mask-based Motion Trajectories for Image-to-Video Generation |
| Time- and Space-Optimal Algorithm for Boolean | mask | Operations for Orthogonal Polygons, A |
| Time-Frequency Feature and AMS-GMM | mask | for Acoustic Emotion Classification |
| TMBO-AOD: Transparent | mask | Background Optimization for Accurate Object Detection in Large-Scale Remote-Sensing Images |
| Towards Face Encryption by Generating Adversarial Identity | mask | s |
| Towards | mask | -robust Face Recognition |
| Towards PDE-Based Video Compression with Optimal | mask | s and Optic Flow |
| Track Cyclist Detection and Identification using | mask | R-CNN and K-means Clustering |
| Traditional Village Building Extraction Based on Improved | mask | R-CNN: A Case Study of Beijing, China |
| Transferability of the Deep Learning | mask | R-CNN Model for Automated Mapping of Ice-Wedge Polygons in High-Resolution Satellite and UAV Images |
| Transferable Belief Model for hair | mask | segmentation |
| Translation-tolerant | mask | matching using noncoherent reflective optics |
| Tree Health Assessment Using | mask | R-CNN on UAV Multispectral Imagery over Apple Orchards |
| TSDM: Tracking by SiamRPN++ with a Depth-refiner and a | mask | -generator |
| TubeFormer-DeepLab: Video | mask | Transformer |
| U-Noise: Learnable Noise | mask | s for Interpretable Image Segmentation |
| Understanding Deep Networks via Extremal Perturbations and Smooth | mask | s |
| Underwater image enhancement via brightness | mask | -guided multi-attention embedding |
| UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical | mask | Calibration |
| Unified Framework for | mask | ed and Mask-Free Face Recognition Via Feature Rectification, A |
| Unified | mask | Embedding and Correspondence Learning for Self-Supervised Video Segmentation |
| Unpaired translation of chest X-ray images for lung opacity diagnosis via adaptive activation | mask | s and cross-domain alignment |
| Unsharp | mask | Guided Filtering |
| Unstructured Multi-view Depth Estimation Using | mask | -Based Multiplane Representation |
| Unsupervised Image Retrieval With | mask | -Based Prominent Feature Accumulation |
| Unsupervised person re-identification via simultaneous clustering and | mask | prediction |
| Unsupervised Semantic Segmentation by Contrasting Object | mask | Proposals |
| Use of Vanishing Point for the Classification of Reflections From Foreground | mask | in Videos, The |
| Using | mask | -Based Enhancement and Feature Aggregation for Single Image Deraining |
| Using Multiple | mask | s to Improve End-to-End Face Recognition Performance |
| Utilizing | mask | R-CNN for Waterline Detection in Canoe Sprint Video Analysis |
| Validation of Copernicus Sentinel-2 Cloud | mask | s Obtained from MAJA, Sen2Cor, and FMask Processors Using Reference Cloud Masks Generated with a Supervised Active Learning Procedure |
| Validation of Copernicus Sentinel-2 Cloud | mask | s Obtained from MAJA, Sen2Cor, and FMask Processors Using Reference Cloud Masks Generated with a Supervised Active Learning Procedure |
| Variation Autoencoder of Spatial-Spectral Joint | mask | for Hyperspectral Anomaly Detection |
| Vehicular Social Dynamic Anomaly Detection With Recurrent Multi- | mask | Aggregator Enabled VAE |
| Very Fast Convolution with Laplacian-of-Gaussian | mask | s |
| Video Frame Prediction by Deep Multi-Branch | mask | Network |
| Video Instance Segmentation Without Using | mask | and Identity Supervision |
| Video | mask | Transfiner for High-Quality Video Instance Segmentation |
| Video Object Segmentation with Joint Re-identification and Attention-Aware | mask | Propagation |
| View Synthesis of Dynamic Scenes Based on Deep 3D | mask | Volume |
| Vision Transformers are Good | mask | Auto-Labelers |
| Visual and Textual Prior Guided | mask | Assemble for Few-Shot Segmentation and Beyond |
| Volumetric Model Reconstruction from Unrestricted Camera Views Based on the Photo-consistency of 3D Voxel | mask | |
| Voronoi tessellated halftone | mask | s |
| Watermarking image encryption using deterministic phase | mask | and singular value decomposition in fractional Mellin transform domain |
| Wavelet based fuzzy perceptual | mask | for images |
| Weakly supervised Branch Network with Template | mask | for Classifying Masses in 3D Automated Breast Ultrasound |
| Weakly supervised camouflaged object detection based on the SAM model and | mask | guidance |
| Weakly Supervised Few-Shot Semantic Segmentation via Pseudo | mask | Enhancement and Meta Learning |
| Weakly Supervised Group | mask | Network for Object Detection |
| Weakly Supervised Instance Segmentation for Videos with Temporal | mask | Consistency |
| Weakly-supervised semantic segmentation via online pseudo- | mask | correcting |
| Weather-Resilient Localizing Ground-Penetrating Radar via Adaptive Spatio-Temporal | mask | Alignment |
| When Sketch Face Recognition Meets | mask | Obfuscation: Database and Benchmark |
| Why Does Non-binary | mask | Optimisation Work for Diffusion-Based Image Compression? |
| Withdrawn: Incremental subspace and probability | mask | constrained tracking in smart and autonomous systems |
| Yield Estimation of Wheat Using Cropland | mask | s from European Common Agrarian Policy: Comparing the Performance of Enhanced Vegetation Index 2, Normalized Difference Vegetation Index, and MERIS Terrestrial Chlorophyll Index in Spanish Nomenclature of Territorial Units for Statistics Level 2 Regions |
| Zero-Shot Hyperspectral Image Denoising With Self-Completion with Patterned | mask | s |
794 for mask