MVA21
* *MVA
* Action Spotting and Temporal Attention Analysis in Soccer Videos
* Adversarial Defense Through High Frequency Loss Variational Autoencoder Decoder and Bayesian Update With Collective Voting
* Analysis of Evaluation Metrics with the Distance between Positive Pairs and Negative Pairs in Deep Metric Learning
* Angular Margin Constrained Loss for Automatic Liver Fibrosis Staging
* Attention Mining Branch for Optimizing Attention Map
* Augmenting Discriminative Correlation Filters with Stereo Blob Tracking for Long-Term Tracking of Underwater Animals
* AVM Image Quality Enhancement by Synthetic Image Learning for Supervised Deblurring
* baseline for semi-supervised learning of efficient semantic segmentation models, A
* Bi-directional Recurrent MVSNet for High-resolution Multi-view Stereo
* Boosting Semi-Supervised Anomaly Detection via Contrasting Synthetic Images
* Content Filtering in Streaming Video Using Domain Adaptation
* Contextual Information based Network with High-Frequency Feature Fusion for High Frame Rate and Ultra-Low Delay Small-Scale Object Detection
* Crack Segmentation for Low-Resolution Images using Joint Learning with Super- Resolution
* Critically Compressed Quantized Convolution Neural Network based High Frame Rate and Ultra-Low Delay Fruit External Defects Detection
* Cut and paste curriculum learning with hard negative mining for point-of-sale systems
* Data Augmentation for Human Motion Prediction
* Distant Bird Detection for Safe Drone Flight and Its Dataset
* Efficient transfer learning for multi-channel convolutional neural networks
* Encoding-free Incrementing Hough Transform for High Frame Rate and Ultra-low Delay Straight-line Detection
* Estimating Contribution of Training Datasets using Shapley Values in Data-scale for Visual Recognition
* Expandable Spherical Projection and Feature Fusion Methods for Object Detection from Fisheye Images
* Facial landmark detection transfer learning for a specific user in driver status monitoring systems
* FBNet: FeedBack-Recursive CNN for Saliency Detection
* Group Activity Recognition Using Joint Learning of Individual Action Recognition and People Grouping
* HMA-Depth: A New Monocular Depth Estimation Model Using Hierarchical Multi-Scale Attention
* Human-Object Interaction Detection with Missing Objects
* Illumination Planning for Measuring Per-Pixel Surface Roughness
* Image Information Assistance Neural Network for VideoPose3D-based Monocular 3D Pose Estimation
* Information Hiding Using a Coded Aperture as a Key
* Japanese Sentence Dataset for Lip- reading
* Joint Learning of Object Detection and Pose Estimation using Augmented Autoencoder
* Learning VAE with Categorical Labels for Generating Conditional Handwritten Characters
* Leveraging Frequency Based Salient Spatial Sound Localization to Improve 360° Video Saliency Prediction
* Live Video Action Recognition from Unsupervised Action Proposals
* Lossless AI: Toward Guaranteeing Consistency between Inferences Before and After Quantization via Knowledge Distillation
* Machine-learning-based Quality-level-estimation System for Inspecting Steel Microstructures
* Model-based Crack Width Estimation using Rectangle Transform
* Multi-Modal Pedestrian Detection with Large Misalignment Based on Modal-Wise Regression and Multi-Modal IoU
* Multi-physical and Temporal Feature Based Self-correcting Approximation Model for Monocular 3D Volleyball Trajectory Analysis
* Multiple Fisheye Camera Calibration and Stereo Measurement Methods for Uniform Distance Errors throughout Imaging Ranges
* Occlusion-Robust 3D Hand Pose Estimation from a Single RGB Image
* On the Influence of Viewpoint Change for Metric Learning
* Open-set Recognition with Supervised Contrastive Learning
* Optical Model for Show-through Cancellation in Ancient Document Imaging with Dark and Bright Mounts, An
* Output augmentation works well without any domain knowledge
* Pix2Point: Learning Outdoor 3D Using Sparse Point Clouds and Optimal Transport
* Position Estimation of Pedestrians in Surveillance Video Using Face Detection and Simple Camera Calibration
* Practical Descattering of Transmissive Inspection Using Slanted Linear Image Sensors
* Predicting Next Local Appearance for Video Anomaly Detection
* Recurrent RLCN-Guided Attention Network for Single Image Deraining
* Relational Subgraph for Graph-based Path Prediction
* ROT-Harris: A Dynamic Approach to Asynchronous Interest Point Detection
* Saliency based Subject Selection for Diverse Image Captioning
* Seeing Farther Than Supervision: Self-supervised Depth Completion in Challenging Environments
* Selecting an Iconic Pose From an Action Video
* Self-Supervised Deep Fisheye Image Rectification Approach using Coordinate Relations
* Semantic Hierarchy Preserving Deep Hashing for Large-Scale Image Retrieval
* Shape from shading and polarization constrained by approximate shape
* Shape-Based Floor Plan Retrieval Using Parse Tree Matching
* Synthetically Generating Motion Blur in a Depth Map from Time-of-Flight Sensors
* Temporal Extension for Encoder-Decoder-based Crowd Counting Approaches
* Understanding the Reason for Misclassification by Generating Counterfactual Images
* Video Summarization With Frame Index Vision Transformer
* Video-Based Camera Localization Using Anchor View Detection and Recursive 3D Reconstruction
* Weakly Supervised Domain Adaptation using Super-pixel labeling for Semantic Segmentation
66 for MVA21