| _ | benchmark | _ |
| 2D Human Pose Estimation: New | benchmark | and State of the Art Analysis |
| 360Loc: A Dataset and | benchmark | for Omnidirectional Visual Localization with Cross-Device Queries |
| 360VOT: A New | benchmark | Dataset for Omnidirectional Visual Object Tracking |
| 3D AffordanceNet: A | benchmark | for Visual Object Affordance Understanding |
| 3D Face Reconstruction Error Decomposed: A Modular | benchmark | for Fair and Fast Method Evaluation |
| 3D human pose estimation and action recognition using fisheye cameras: A survey and | benchmark | |
| 3d Shape | benchmark | For Retrieval And Automatic Classification Of Architectural Data, A |
| 3D Understanding of Deformable Linear Objects: Datasets and Transferability | benchmark | |
| 3D-ZeF: A 3D Zebrafish Tracking | benchmark | Dataset |
| 3DRef: 3D Dataset and | benchmark | for Reflection Detection in RGB and Lidar Data |
| : A Large-Scale | benchmark | for Rib Labeling and Anatomical Centerline Extraction |
| A-OKVQA: A | benchmark | for Visual Question Answering Using World Knowledge |
| Abingdon Cross | benchmark | Survey, The |
| ABO: Dataset and | benchmark | s for Real-World 3D Object Understanding |
| Accurate semantic segmentation of very high-resolution remote sensing images considering feature state sequences: From | benchmark | datasets to urban applications |
| Action Recognition and | benchmark | Using Event Cameras |
| Active Vision Dataset | benchmark | |
| ActivityNet: A large-scale video | benchmark | for human activity understanding |
| Adaptiope: A Modern | benchmark | for Unsupervised Domain Adaptation |
| Advancing Image Understanding in Poor Visibility Environments: A Collective | benchmark | Study |
| Advancing Saliency Ranking with Human Fixations: Dataset, Models and | benchmark | s |
| Advancing Visual Grounding with Scene Knowledge: | benchmark | and Method |
| Adversarial pruning: A survey and | benchmark | of pruning methods for adversarial robustness |
| Adversarial VQA: A New | benchmark | for Evaluating the Robustness of VQA Models |
| Adversarially Robust Panoptic Segmentation (arpas) | benchmark | |
| Aerial Photogrammetry | benchmark | Dataset for Point Cloud Segmentation and Style Translation, An |
| Aerial-Ground Cross-View Vehicle Re-Identification: A | benchmark | Dataset and Baseline |
| Affective Visual Dialog: A Large-scale | benchmark | for Emotional Reasoning Based on Visually Grounded Conversations |
| AG-VPReID: A Challenging Large-Scale | benchmark | for Aerial-Ground Video-based Person Re-Identification |
| AGQA: A | benchmark | for Compositional Spatio-Temporal Reasoning |
| AgriSen-COG, a Multicountry, Multitemporal Large-Scale Sentinel-2 | benchmark | Dataset for Crop Mapping Using Deep Learning |
| AI | benchmark | : All About Deep Learning on Smartphones in 2019 |
| AI | benchmark | : Running Deep Neural Networks on Android Smartphones |
| AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness | benchmark | |
| AID: A | benchmark | Data Set for Performance Evaluation of Aerial Scene Classification |
| AIDCON: An Aerial Image Dataset and | benchmark | for Construction Machinery |
| AIR-POLSAR-CR1.0: A | benchmark | Dataset for Cloud Removal in High-Resolution Optical Remote Sensing Images with Fully Polarized SAR |
| ALFRED: A | benchmark | for Interpreting Grounded Instructions for Everyday Tasks |
| Algorithm and | benchmark | dataset for stain separation in histology images |
| Alignment-Free RGBT Salient Object Detection: Semantics-Guided Asymmetric Correlation Network and a Unified | benchmark | |
| Amodal Intra-class Instance Segmentation: Synthetic Datasets and | benchmark | |
| AmsterTime: A Visual Place Recognition | benchmark | Dataset for Severe Domain Shift |
| Analysis of segmentation performance on the CEDAR | benchmark | database |
| Analyzing EEG Data with Machine and Deep Learning: A | benchmark | |
| Anatomy of Video Editing: A Dataset and | benchmark | Suite for AI-Assisted Video Editing, The |
| ANetQA: A Large-scale | benchmark | for Fine-grained Compositional Reasoning over Untrimmed Videos |
| AnimalTrack: A | benchmark | for Multi-Animal Tracking in the Wild |
| Anomaly detection in video sequences: A | benchmark | and computational model |
| Anomaly-Led Prompting Learning Caption Generating Model and | benchmark | |
| Anti-UAV410: A Thermal Infrared | benchmark | and Customized Scheme for Tracking Drones in the Wild |
| Anti-UAV: A Large-Scale | benchmark | for Vision-Based UAV Tracking |
| ApolloCar3D: A Large 3D Car Instance Understanding | benchmark | for Autonomous Driving |
| Appearance-Based Gaze Estimation With Deep Learning: A Review and | benchmark | |
| ArabSign: A Multi-modality Dataset and | benchmark | for Continuous Arabic Sign Language Recognition |
| Architectures for multi-threaded MVC-compliant multi-view video decoding and | benchmark | tests |
| Are They Going to Cross? A | benchmark | Dataset and Baseline for Pedestrian Crosswalk Behavior |
| Are we ready for autonomous driving? The KITTI vision | benchmark | suite |
| Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP | benchmark | |
| Aria Digital Twin: A New | benchmark | Dataset for Egocentric 3D Machine Perception |
| ARNOLD: A | benchmark | for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes |
| AssemblyNet: A Point Cloud Dataset and | benchmark | for Predicting Part Directions in an Exploded Layout |
| AthletePose3D: A | benchmark | Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements |
| Attack Modelling: Towards a Second Generation Watermarking | benchmark | |
| Audio Retrieval With Natural Language Queries: A | benchmark | Study |
| Audio-visual saliency prediction for movie viewing in immersive environments: Dataset and | benchmark | s |
| Augmentation of rPPG | benchmark | Datasets: Learning to Remove and Embed rPPG Signals via Double Cycle Consistent Learning from Unpaired Facial Videos |
| Auto Arborist Dataset: A Large-Scale | benchmark | for Multiview Urban Forest Monitoring Under Domain Shift, The |
| Autocropping: A Closer Look at | benchmark | Datasets |
| Autoeval-video: An Automatic | benchmark | for Assessing Large Vision Language Models in Open-ended Video Question Answering |
| Automatic measure of imitation during social interaction: A behavioral and hyperscanning-EEG | benchmark | |
| Automatic Production of Deep Learning | benchmark | Dataset for Affine-Invariant Feature Matching |
| Automatic Spectral Calibration of Hyperspectral Images: Method, Dataset and | benchmark | |
| AVQACL: A Novel | benchmark | for Audio-Visual Question Answering Continual Learning |
| BackdoorBench: A Comprehensive | benchmark | and Analysis of Backdoor Learning |
| Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive | benchmark | Study |
| BAMFORESTS: Bamberg | benchmark | Forest Dataset of Individual Tree Crowns in Very-High-Resolution UAV Images |
| BAWSeg: A UAV Multispectral | benchmark | for Barley Weed Segmentation |
| BCOT: A Markerless High-Precision 3D Object Tracking | benchmark | |
| BEDI: a comprehensive | benchmark | for evaluating embodied agents on UAVs |
| BehavePassDB: Public Database for Mobile Behavioral Biometrics and | benchmark | Evaluation |
| Behind the Magic, MERLIM: Multi-Modal Evaluation | benchmark | for Large Image-Language Models |
| benchmark | Analysis for Robustness of Multi-Scale Urban Road Networks Under Global Disruptions |
| benchmark | and a Baseline for Robust Multi-view Depth Estimation, A |
| benchmark | and Baseline for Language-driven Image Editing, A |
| benchmark | and Comparative Study of Video-Based Face Recognition on COX Face Database, A |
| benchmark | and comparison of active learning for logistic regression, A |
| benchmark | and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Models, A |
| benchmark | and Evaluation of Non-Rigid Structure from Motion, A |
| benchmark | and Evaluation of Surveillance Task |
| benchmark | and Investigation of Deep-learning-based Techniques for Detecting Natural Disasters in Aerial Images, A |
| benchmark | and Simulator for UAV Tracking, A |
| benchmark | Data and Method for Real-Time People Counting in Cluttered Scenes Using Depth Sensors |
| benchmark | Data Set and Method for Depth Estimation From Light Field Images |
| benchmark | database for fine-grained image classification of benthic macroinvertebrates |
| benchmark | Dataset and Effective Inter-Frame Alignment for Real-World Video Super-Resolution |
| benchmark | Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo, A |
| benchmark | Dataset and Evaluation Methodology for Video Object Segmentation, A |
| benchmark | Dataset and Pair-Wise Ranking Method for Quality Evaluation of Night-Time Image Enhancement |
| benchmark | Dataset and Saliency-Guided Stacked Autoencoders for Video-Based Salient Object Detection, A |
| benchmark | dataset and semantics-guided detection network for spatial-temporal human actions in urban driving scenes, A |
| benchmark | Dataset for Aircraft Detection in Optical Remote Sensing Imagery, A |
| benchmark | Dataset for Automated Diagnosis and Treatment Planning of Class III Malocclusion Using X-Rays and Profile Photos, A |
| benchmark | Dataset for Outdoor Foreground/Background Extraction, A |
| benchmark | Dataset for Performance Evaluation of Multi-Label Remote Sensing Image Retrieval, A |
| benchmark | Dataset for Performance Evaluation of Shape-from-X Algorithms, A |
| benchmark | dataset for real-time detection of icons in mobile apps and a small-scale feature module, A |
| benchmark | Dataset for Segmenting Liver, Vasculature and Lesions from Large-scale Computed Tomography Data, A |
| benchmark | Dataset to Study the Representation of Food Images, A |
| benchmark | Evaluation of a Model-Based Object Recognition System |
| benchmark | face detection using a face recognition database |
| benchmark | for Algorithms Segmenting the Left Atrium From 3D CT and MRI Datasets |
| benchmark | for Analyzing Chart Images, A |
| benchmark | for Automatic Visual Classification of Clinical Skin Disease Images, A |
| benchmark | for Background Subtraction Algorithms in monocular vision: A comparative study, A |
| benchmark | for best view selection of 3D objects, A |
| benchmark | for Chinese-English Scene Text Image Super-resolution, A |
| benchmark | for Controllable Text-Image-to-Video Generation, A |
| benchmark | for Edge-Preserving Image Smoothing, A |
| benchmark | for Epithelial Cell Tracking, A |
| benchmark | for Evaluating Pedestrian Action Prediction |
| benchmark | for Full Rotation Head Tracking, A |
| benchmark | for Generic Product Detection: A Low Data Baseline for Dense Object Detection |
| benchmark | for Graphics Recognition Systems, A |
| benchmark | for Hierarchical Emotion Cause Extraction in Spoken Dialogues, A |
| benchmark | for interactive image segmentation algorithms, A |
| benchmark | for Large-scale Heritage Point Cloud Semantic Segmentation, A |
| benchmark | for Multi-Modal LiDAR SLAM with Ground Truth in GNSS-Denied Environments, A |
| benchmark | for Sparse Coding: When Group Sparsity Meets Rank Minimization, A |
| benchmark | for Studying Diabetic Retinopathy: Segmentation, Grading, and Transferability, A |
| benchmark | for the Comparison of 3-D Motion Segmentation Algorithms, A |
| benchmark | for the robustness of image features in rainy conditions |
| benchmark | for unconstrained online handwritten Uyghur word recognition, A |
| benchmark | for Vehicle Re-Identification in Mixed Visible and Infrared Domains, A |
| benchmark | Framework for Multiregion Analysis of Vesselness Filters, A |
| benchmark | Framework for the Right Atrium Cavity Segmentation From LGE-MRIs, A |
| benchmark | Generation Framework with Customizable Distortions for Image Classifier Robustness |
| benchmark | Grocery Dataset of Realworld Point Clouds From Single View, A |
| benchmark | image database of isolated Bangla handwritten compound characters, A |
| benchmark | image dataset for industrial tools, A |
| benchmark | Kannada Handwritten Document Dataset and Its Segmentation, A |
| benchmark | of classifiers on feature drifting data streams, A |
| benchmark | of Computational Models of Saliency to Predict Human Fixations, A |
| benchmark | of DIBR Synthesized View Quality Assessment Metrics on a New Database for Immersive Media Applications, A |
| benchmark | Of Machine Learning Methods For Classification Of A Sentinel-2 Image |
| benchmark | of Metric Quality Assessment in Photogrammetric Reconstruction for Historical Film Footage |
| benchmark | of simulated range images for partial shape retrieval, A |
| benchmark | of Variance of Opinion Scores in Image Quality Assessment, A |
| benchmark | on Automatic Six-Month-Old Infant Brain Segmentation Algorithms: The iSeg-2017 Challenge |
| benchmark | Platform for Ultra-Fine-Grained Visual Categorization Beyond Human Performance |
| benchmark | Problems for Phase Retrieval |
| benchmark | results: the Abingdon cross |
| benchmark | Revision for HOG-SVM Pedestrian Detector Through Reinvigorated Training and Evaluation Methodologies |
| benchmark | : Performance Evaluation of Dashed Line Detection Algorithms, A |
| benchmark | s and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects |
| Berkeley Segmentation Dataset and | benchmark | , The |
| BEST: | benchmark | and Evaluation of Surveillance Task |
| Beyond Academic | benchmark | s: Critical Analysis and Best Practices for Visual Industrial Anomaly Detection |
| Beyond Image Classification: A Video | benchmark | and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning |
| Beyond Object Recognition: A New | benchmark | towards Object Concept Learning |
| Beyond PASCAL: A | benchmark | for 3D object detection in the wild |
| Beyond Photo-Domain Object Recognition: | benchmark | s for the Cross-Depiction Problem |
| Beyond Standard | benchmark | s: Parameterizing Performance Evaluation in Visual Object Tracking |
| BiasBench: A Reproducible | benchmark | for Tuning the Biases of Event Cameras |
| Biased Aerosol Wet Deposition CAM5 Simulations: A Result of Misrepresented Convective-Stratiform Precipitation Partitioning When | benchmarked | Against SPCAM |
| BigDetection: A Large-scale | benchmark | for Improved Object Detector Pre-training |
| BigHand2.2M | benchmark | : Hand Pose Dataset and State of the Art Analysis |
| BioDrone: A Bionic Drone-Based Single Object Tracking | benchmark | for Robust Vision |
| BioLab-ICAO: A new | benchmark | to evaluate applications assessing face image compliance to ISO/IEC 19794-5 standard |
| BlackboxBench: A Comprehensive | benchmark | of Black-Box Adversarial Attacks |
| Blinkvision: A | benchmark | for Optical Flow, Scene Flow and Point Tracking Estimation Using Rgb Frames and Events |
| BMAD: | benchmark | s for Medical Anomaly Detection |
| BMT-Bench: A | benchmark | Sports Dataset For Video Generation |
| Booster: A | benchmark | for Depth From Images of Specular and Transparent Surfaces |
| BOP: | benchmark | for 6D Object Pose Estimation |
| BrainGB: A | benchmark | for Brain Network Analysis With Graph Neural Networks |
| Breaking Barriers, Localizing Saliency: A Large-Scale | benchmark | and Baseline for Condition-Constrained Salient Object Detection |
| Breaking Common Sense: WHOOPS! A Vision-and-Language | benchmark | of Synthetic and Compositional Images |
| BTS: A Bi-lingual | benchmark | for Text Segmentation in the Wild |
| Building3D: An Urban-Scale Dataset and | benchmark | s for Learning Roof Structures from Point Clouds |
| BURST: A | benchmark | for Unifying Object Recognition, Segmentation and Tracking in Video |
| CabNIR: A | benchmark | for In-Vehicle Infrared Monocular Depth Estimation |
| CadastreVision: A | benchmark | dataset for cadastral boundary delineation from multi-resolution earth observation images |
| CADTalk: An Algorithm and | benchmark | for Semantic Commenting of CAD Programs |
| California Crop Yield | benchmark | : Combining Satellite Image, Climate, Evapotranspiration, and Soil Data Layers for County-Level Yield Forecasting of Over 70 Crops |
| Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified | benchmark | , A |
| Caltech Fish Counting Dataset: A | benchmark | for Multiple-Object Tracking and Counting, The |
| Cam4DOcc: | benchmark | for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications |
| Camouflaged Instance Segmentation In-the-Wild: Dataset, Method, and | benchmark | Suite |
| Can Machines Understand Composition? Dataset and | benchmark | for Photographic Image Composition Embedding and Understanding |
| CardioSyntax: End-to-End SYNTAX Score Prediction - Dataset, | benchmark | and Method |
| CARL-D: A vision | benchmark | suite and large scale dataset for vehicle detection and scene segmentation |
| CarPatch: A Synthetic | benchmark | for Radiance Field Evaluation on Vehicle Components |
| CASIA-SURF CeFA: A | benchmark | for Multi-modal Cross-Ethnicity Face Anti-spoofing |
| Casual Conversations v2 Dataset: A diverse, large | benchmark | for measuring fairness and robustness in audio/vision/speech models, The |
| CATS: A Color and Thermal Stereo | benchmark | |
| CDnet 2014: An Expanded Change Detection | benchmark | Dataset |
| CDTB: A Color and Depth Visual Object Tracking Dataset and | benchmark | |
| CDTD: A Large-Scale Cross-Domain | benchmark | for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection |
| CellTypeGraph: A New Geometric Computer Vision | benchmark | |
| CeyMo: See More on Roads - A Novel | benchmark | Dataset for Road Marking Detection |
| Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To | benchmark | |
| Challenging | benchmark | of Anime Style Recognition, A |
| Change Detection | benchmark | Website |
| Changedetection.net: A new change detection | benchmark | dataset |
| Changes in Facial Expression as Biometric: A Database and | benchmark | s of Identification |
| Chaotic World: A Large and Challenging | benchmark | for Human Behavior Understanding in Chaotic Events |
| ChartX and ChartVLM: A Versatile | benchmark | and Foundation Model for Complicated Chart Reasoning |
| Chasing Shadows: Solving Deepfake Detection | benchmark | s Using Irrelevant Features Only |
| CheckManual: A New Challenge and | benchmark | for Manual-based Appliance Manipulation |
| ChestX-Ray8: Hospital-Scale Chest X-Ray Database and | benchmark | s on Weakly-Supervised Classification and Localization of Common Thorax Diseases |
| ChildPlay: A New | benchmark | for Understanding Children's Gaze Behaviour |
| ChromSeg-P3GAN: A | benchmark | Dataset and Pix2Pix Patch Generative Adversarial Network for Chromosome Segmentation |
| CityFlow: A City-Scale | benchmark | for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification |
| CL-Cross VQA: A Continual Learning | benchmark | for Cross-Domain Visual Question Answering |
| CleanSea Set: A | benchmark | Corpus for Underwater Debris Detection and Recognition, The |
| ClearPose: Large-scale Transparent Object Dataset and | benchmark | |
| ClothPose: A Real-world | benchmark | for Visual Analysis of Garment Pose via An Indirect Recording Solution |
| Cloud-Based Evaluation of Anatomical Structure Segmentation and Landmark Detection Algorithms: VISCERAL Anatomy | benchmark | s |
| CloudSim: A fair | benchmark | for comparison of methods for times series reconstruction from cloud and atmospheric contamination |
| COCO-O: A | benchmark | for Object Detectors under Natural Distribution Shifts |
| Coding Framework and | benchmark | Towards Low-Bitrate Video Understanding, A |
| collaborative | benchmark | for region of interest detection algorithms, A |
| Collaborative License Plate Recognition via Association Enhancement Network With Auxiliary Learning and a Unified | benchmark | |
| collection of challenging motion segmentation | benchmark | datasets, A |
| ColorWater: A Diverse Dataset and | benchmark | for Semantic Water Surface Understanding |
| Com Kitchens: An Unedited Overhead-view Video Dataset as a Vision-language | benchmark | |
| Common Corruption Robustness of Point Cloud Detectors: | benchmark | and Enhancement |
| Comparative Analysis of | benchmark | Datasets for Face Recognition Algorithms Verification |
| Comparison and Evaluation of Different Techniques, Segmentation Evaluation, | benchmark | s |
| comparison of 3D shape retrieval methods based on a large-scale | benchmark | supporting multimodal queries, A |
| Comparison of Novel Hybrid and | benchmark | Machine Learning Algorithms to Predict Groundwater Potentiality: Case of a Drought-Prone Region of Medjerda Basin, Northern Tunisia |
| Competitive baseline methods set new standards for the NIPS 2003 feature selection | benchmark | |
| Complex Mountain Road Extraction in High-Resolution Remote Sensing Images via a Light Roadformer and a New | benchmark | |
| comprehensive | benchmark | analysis for sand dust image reconstruction, A |
| Comprehensive | benchmark | Analysis of Single Image Deraining: Current Challenges and Future Perspectives, A |
| Comprehensive | benchmark | for Evaluating Night-time Visual Object Tracking, A |
| Comprehensive | benchmark | for Single Image Compression Artifact Reduction, A |
| comprehensive | benchmark | of local binary pattern algorithms for texture retrieval, A |
| comprehensive survey of handwritten document | benchmark | s: Structure, usage and evaluation, A |
| Computer Vision Workshops -- Performance, Evaluation, | benchmark | s |
| ConCon-Chi: Concept-Context Chimera | benchmark | for Personalized Vision-Language Tasks |
| ConeQuest: A | benchmark | for Cone Segmentation on Mars |
| Consumer video understanding: a | benchmark | database and an evaluation of human and machine performance |
| Context-aware mathematical expression recognition: An end-to-end framework and a | benchmark | |
| Continual Deepfake Detection | benchmark | : Dataset, Methods, and Essentials, A |
| Continuous hand gesture recognition: | benchmark | s and methods |
| Continuous Optical Zooming: A | benchmark | for Arbitrary-Scale Image Super-Resolution in Real World |
| Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A | benchmark | |
| Cooperative Adaptive Cruise Control for String Stable Mixed Traffic: | benchmark | and Human-Centered Design |
| CoRBS: Comprehensive RGB-D | benchmark | for SLAM using Kinect v2 |
| COSOS-1k: A | benchmark | Dataset and Occlusion-Aware Uncertainty Learning for Multi-View Video Object Detection |
| Cost aggregation | benchmark | for light field depth estimation |
| Counting From Sky: A Large-Scale Data Set for Remote Sensing Object Counting and a | benchmark | Method |
| CoWs on Pasture: Baselines and | benchmark | s for Language-Driven Zero-Shot Object Navigation |
| Crackseg9k: A Collection and | benchmark | for Crack Segmentation Datasets and Frameworks |
| CRBeDaSet: A | benchmark | Dataset for High Accuracy Close Range 3D Object Reconstruction |
| Critical Review of Action Recognition | benchmark | s, A |
| Cross | benchmark | Assessment of a Deep Convolutional Neural Network for Face Recognition, A |
| Cross-Domain Document Object Detection: | benchmark | Suite and Method |
| Cross-Domain Facial Expression Recognition: A Unified Evaluation | benchmark | and Adversarial Graph Learning |
| Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT | benchmark | for Crowd Counting |
| Cross-Modal Learning for Anomaly Detection in Complex Industrial Process: Methodology and | benchmark | |
| Cross-platform Video Person ReID: A New | benchmark | Dataset and Adaptation Approach |
| Cross-Spectral Body Recognition with Side Information Embedding: | benchmark | s on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF |
| CrowdPose: Efficient Crowded Scenes Pose Estimation and a New | benchmark | |
| Current trends in deep learning for Earth Observation: An open-source | benchmark | arena for image classification |
| CUS3D: A New Comprehensive Urban-Scale Semantic-Segmentation 3D | benchmark | Dataset |
| dacl10k: | benchmark | for Semantic Bridge Damage Segmentation |
| Dailydvs-200: A Comprehensive | benchmark | Dataset for Event-based Action Recognition |
| Daimler Pedestrian Detection | benchmark | |
| Dancing in the Dark: A | benchmark | towards General Low-light Video Enhancement |
| DAPlankton: | benchmark | Dataset for Multi-Instrument Plankton Recognition Via Fine-Grained Domain Adaptation |
| DARPA Image Understanding | benchmark | for Parallel Computers, The |
| DARPA Image Understanding Motion | benchmark | , The |
| Data-centric is a novel perspective for UAV-based tracking: A new | benchmark | via efficient data utilization strategy |
| Dataset and | benchmark | for 3D Scene Plausibility Assessment, A |
| Dataset and | benchmark | for Large-Scale Multi-Modal Face Anti-Spoofing, A |
| DD-RobustBench: An Adversarial Robustness | benchmark | for Dataset Distillation |
| DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation | benchmark | |
| dedicated | benchmark | for contour-based corner detection evaluation, A |
| Deep Fashion3d: A Dataset and | benchmark | for 3d Garment Reconstruction from Single Images |
| Deep Learning-Based Point Cloud Compression: An In-Depth Survey and | benchmark | |
| Deep Temporal Graph Clustering: A Comprehensive | benchmark | and Datasets |
| Deep Visual Geo-localization | benchmark | |
| DeepChange: A Long-Term Person Re-Identification | benchmark | with Clothes Change |
| DeepFashion2: A Versatile | benchmark | for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images |
| DeepScoresV2 Dataset and | benchmark | for Music Object Detection, The |
| Dehazing Evaluation: Real-World | benchmark | Datasets, Criteria, and Baselines |
| Delving into Underwater Image Utility: | benchmark | Dataset and Prediction Model |
| Delving into Universal Lesion Segmentation: Method, Dataset, and | benchmark | |
| Dense-Haze: A | benchmark | for Image Dehazing with Dense-Haze and Haze-Free Images |
| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale | benchmark | and Baseline |
| Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new | benchmark | |
| Detection, Tracking, and Counting Meets Drones in Crowds: A | benchmark | |
| Developing the Raster Big Data | benchmark | : A Comparison of Raster Analysis on Big Data Platforms |
| Development of Efficient Nonlinear | benchmark | Bicycle Dynamics for Control Applications |
| Development of the Chinese Space-Based Radiometric | benchmark | Mission LIBRA |
| DEVIL is in the Details: A Diagnostic Evaluation | benchmark | for Video Inpainting, The |
| DexYCB: A | benchmark | for Capturing Hand Grasping of Objects |
| DFME: A New | benchmark | for Dynamic Facial Micro-Expression Recognition |
| Diagnosing a disorder in a classification | benchmark | |
| Diagnostic | benchmark | and Iterative Inpainting for Layout-Guided Image Generation |
| DiagViB-6: A Diagnostic | benchmark | Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities |
| Digital Surface Model Extraction and Re nement through Image Segmentation - Application to the ISPRS | benchmark | Stereo Dataset |
| DiLiGenT-Pi: Photometric Stereo for Planar Surfaces with Rich Details - | benchmark | Dataset and Beyond |
| DiLiGenT102: A Photometric Stereo | benchmark | Dataset with Controlled Shape and Material Variation |
| Disentangled Feature Learning Network and a Comprehensive | benchmark | for Vehicle Re-Identification |
| Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and | benchmark | s |
| Distributed Computing for Vision: Architecture and a | benchmark | Test |
| Diverse Embedding Expansion Network and Low-Light Cross-Modality | benchmark | for Visible-Infrared Person Re-identification |
| DLR HySU: A | benchmark | Dataset for Spectral Unmixing |
| DomainVerse: A | benchmark | Towards Real-World Distribution Shifts for Training-Free Adaptive Domain Generalization |
| DOTA: A Large-Scale | benchmark | and Challenges for Object Detection in Aerial Images |
| Drive4C: A Closed-Loop | benchmark | on what Foundation Models Really Need to be Capable of for Language-Guided Autonomous Driving |
| DriveTrack: A | benchmark | for Long-Range Point Tracking in Real-World Videos |
| Driving by the Rules: A | benchmark | for Integrating Traffic Sign Regulations into Vectorized HD Map |
| Drone-type-Set: Drone types detection | benchmark | for drone detection and tracking |
| DSM Accuracy Evaluation for the ISPRS Commission I Image Matching | benchmark | |
| DVGBench: Implicit-to-explicit visual grounding | benchmark | in UAV imagery with large vision-language models |
| Dynamic Degradation Intensity Estimation for Adaptive Blind Super-Resolution: A Novel Approach and | benchmark | Dataset |
| Dynamic Fusion Module Evolves Drivable Area and Road Anomaly Detection: A | benchmark | and Algorithms |
| E-MLB: Multilevel | benchmark | for Event-Based Camera Denoising |
| e-ViL: A Dataset and | benchmark | for Natural Language Explanations in Vision-Language Tasks |
| E3V-K5: An Authentic | benchmark | for Redefining Video-based Energy Expenditure Estimation |
| ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition | benchmark | |
| ECDaily: A Large-scale | benchmark | for Emotion Cause Extraction in Conversations |
| EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering | benchmark | |
| Efficient Deep Models for Real-Time 4K Image Super-Resolution. NTIRE 2023 | benchmark | and Report |
| Efficient Discovery and Effective Evaluation of Visual Perceptual Similarity: A | benchmark | and Beyond |
| EgoCart: A | benchmark | Dataset for Large-Scale Indoor Image-Based Localization in Retail Stores |
| Egocentric recognition of handled objects: | benchmark | and analysis |
| Egocvr: An Egocentric | benchmark | for Fine-grained Composed Video Retrieval |
| EgoGesture: A New Dataset and | benchmark | for Egocentric Hand Gesture Recognition |
| EgoHumans: An Egocentric 3D Multi-Human | benchmark | |
| EgoVQA: An Egocentric Video Question Answering | benchmark | Dataset |
| Embedded plant recognition: a | benchmark | for low footprint deep neural networks |
| Embil: An English-manipuri Bi-lingual | benchmark | for Scene Text Detection and Language Identification |
| Emotions in LatAm: A New Dataset and | benchmark | for Emotion Recognition in Latin America |
| Empirical Investigations on | benchmark | Tasks for Automatic Image Annotation |
| Encoding color information for visual tracking: Algorithms and | benchmark | |
| Enhancement of Detecting Permanent Water and Temporary Water in Flood Disasters by Fusing Sentinel-1 and Sentinel-2 Imagery Using Deep Learning Algorithms: Demonstration of Sen1Floods11 | benchmark | Datasets |
| Env-QA: A Video Question Answering | benchmark | for Comprehensive Understanding of Dynamic Environments |
| ePillID Dataset: A Low-Shot Fine-Grained | benchmark | for Pill Identification |
| Establishing Good | benchmark | s and Baselines for Face Recognition |
| EuroCity Persons: A Novel | benchmark | for Person Detection in Traffic Scenes |
| Evaluating Crowd Flow Forecasting Algorithms for Indoor Pedestrian Spaces: A | benchmark | Using a Synthetic Dataset |
| Evaluating Image Super-resolution Performance on Mobile Devices: An Online | benchmark | |
| Evaluating Shape Correspondence for Statistical Shape Analysis: A | benchmark | Study |
| Evaluating Text-to-Video Alignment: A Hierarchical | benchmark | for Video Generation Models |
| Evaluation and | benchmark | for biological image segmentation |
| Evaluation of Descriptors and Distance Measures on | benchmark | s and First-Person-View Videos for Face Identification |
| Evaluation of LBP and Deep Texture Descriptors with a New Robustness | benchmark | |
| EvCSLR: Event-Guided Continuous Sign Language Recognition and | benchmark | |
| Event Stream based Human Action Recognition: A High-Definition | benchmark | Dataset and Algorithms |
| Event Stream-Based Visual Object Tracking: A High-Resolution | benchmark | Dataset and A Novel Baseline |
| Event-based Head Pose Estimation: | benchmark | and Method |
| Event-driven Re-Id: A New | benchmark | and Method Towards Privacy-Preserving Person Re-Identification |
| EVREAL: Towards a Comprehensive | benchmark | and Analysis Suite for Event-based Video Reconstruction |
| Exert Diversity and Mitigate Bias: Domain Generalizable Person Re-identification with a Comprehensive | benchmark | |
| Explore Spatio-Temporal Aggregation for Insubstantial Object Detection: | benchmark | Dataset and Baseline |
| Exploring Endogenous Shift for Cross-domain Detection: A Large-scale | benchmark | and Perturbation Suppression Network |
| Extensive | benchmark | and Survey of Modeling Methods for Scene Background Initialization |
| Face Analysis, Evaluations, | benchmark | s, Databases of Images |
| Face Verification, Authentication, Evaluations, Verification | benchmark | s |
| FaceScape: 3D Facial Dataset and | benchmark | for Single-View 3D Face Reconstruction |
| FACET: Fairness in Computer Vision Evaluation | benchmark | |
| Factory Extraction from Satellite Images: | benchmark | and Baseline |
| FAIR1M: A | benchmark | dataset for fine-grained object recognition in high-resolution remote sensing imagery |
| FakePoI: A Large-Scale Fake Person of Interest Video Detection | benchmark | and a Strong Baseline |
| Family of Two-Dimensional | benchmark | Data Sets and Its Application to Comparing Different Cluster Validation Indices, A |
| Fast Human Classification Of 3d Object | benchmark | s |
| Fast large-scale image enlargement method with a novel evaluation approach: | benchmark | function-based peak signal-to-noise ratio |
| Fast motion estimation for field sequential imaging: Survey and | benchmark | |
| FDDB: Face Detection Data Set and | benchmark | |
| Feature-based multimodal remote sensing image matching: | benchmark | and state-of-the-art |
| Federated Learning for Generalization, Robustness, Fairness: A Survey and | benchmark | |
| FedGait: A | benchmark | for Federated Gait Recognition |
| Few-Shot Image Classification | benchmark | s are Too Far From Reality: Build Back Better with Semantic Task Sampling |
| FGPR: A large-scale dataset and | benchmark | for fine-grained product retrieval |
| FiGVCL: Fine-Grained | benchmark | and Method for Video Copy Localization |
| Fine-Grained Butterfly Recognition with Deep Residual Networks: A New Baseline and | benchmark | |
| Fine-grained classification of pedestrians in video: | benchmark | and state of the art |
| Fine-tuning Convolutional Neural Networks: a comprehensive guide and | benchmark | analysis for Glaucoma Screening |
| First Facial Landmark Tracking in-the-Wild Challenge: | benchmark | and Results, The |
| First Results of the LEM | benchmark | Database for Agricultural Applications |
| First-Person Hand Action | benchmark | with RGB-D Videos and 3D Hand Pose Annotations |
| FISBe: A Real-World | benchmark | Dataset for Instance Segmentation of Long-Range thin Filamentous Structures |
| FishEye8K: A | benchmark | and Dataset for Fisheye Camera Object Detection |
| Fishing Gear Classification from Vessel Trajectories and Velocity Profiles: Database and | benchmark | |
| FishNet: A Large-scale Dataset and | benchmark | for Fish Recognition, Detection, and Functional Trait Prediction |
| Fishyscapes | benchmark | : Measuring Blind Spots in Semantic Segmentation, The |
| Fishyscapes: A | benchmark | for Safe Semantic Segmentation in Autonomous Driving |
| Fitting Facial Models to Spatial Points: Blendshape Approaches and | benchmark | |
| Fixation prediction for advertising images: Dataset and | benchmark | |
| Fixation-Based 360° | benchmark | Dataset For Salient Object Detection, A |
| FLAG3D++: A | benchmark | for 3D Fitness Activity Comprehension With Language Instruction |
| Flexible-Modal Face Anti-Spoofing: A | benchmark | |
| FloW: A Dataset and | benchmark | for Floating Waste Detection in Inland Waters |
| FLYBO: A Unified | benchmark | Environment for Autonomous Flying Robots |
| Forensics-Bench: A Comprehensive Forgery Detection | benchmark | Suite for Large Vision Language Models |
| ForgeryNet: A Versatile | benchmark | for Comprehensive Forgery Analysis |
| FOSS4G Date Assessment On the Isprs Optical Stereo Satellite Data: A | benchmark | for DSM Generation |
| FP60 and FSNet: A | benchmark | Dataset and a Family-Species Network for Forestry Pest Recognition |
| FPHA-Afford: A Domain-Specific | benchmark | Dataset for Occluded Object Affordance Estimation in Human-Object-Robot Interaction |
| Framework for Making Face Detection | benchmark | Databases, A |
| From Aardvark to Zorro: A | benchmark | for Mammal Image Classification |
| From Appearance to Inherence: A Hyperspectral Image Dataset and | benchmark | of Material Classification for Surveillance |
| From Laboratory to Real World: A New | benchmark | Towards Privacy-Preserved Visible-Infrared Person Re-Identification |
| From Sky to the Ground: A Large-scale | benchmark | and Simple Baseline Towards Real Rain Removal |
| From Words to Structured Visuals: A | benchmark | and Framework for Text-to-Diagram Generation and Editing |
| FSBench: A Figure Skating | benchmark | for Advancing Artistic Sports Understanding |
| FungiTastic: A Multi-Modal Dataset and | benchmark | for Image Categorization |
| Gain-first or Exposure-first: | benchmark | for Better Low-light Video Photography and Enhancement |
| Gait Recognition in the Wild with Dense 3D Representations and A | benchmark | |
| Gait Recognition in the Wild: A | benchmark | |
| Gait Recognition in the Wild: A Large-Scale | benchmark | and NAS-Based Baseline |
| Gait Recognition With Drones: A | benchmark | |
| GazeSearch: Radiology Findings Search | benchmark | |
| GEB+: A | benchmark | for Generic Event Boundary Captioning, Grounding and Retrieval |
| GeneCIS: A | benchmark | for General Conditional Image Similarity |
| Generating synthetic test matrices as a | benchmark | for the computational behavior of typical testor-finding algorithms |
| Generation of a | benchmark | Dataset Using Historical Photographs for An Automated Evaluation of Different Feature Matching Methods |
| Generic Event Boundary Detection: A | benchmark | for Event Segmentation |
| GeoBIM | benchmark | 2019: Design and Initial Results |
| GeoBIM | benchmark | 2019: Intermediate Results |
| GeoSPARQL Compliance | benchmark | , A |
| GigaMVS: A | benchmark | for Ultra-Large-Scale Gigapixel-Level 3D Reconstruction |
| Glitch in the matrix: A large scale | benchmark | for content driven audio-visual forgery detection and localization |
| GMOT-40: A | benchmark | for Generic Multiple Object Tracking |
| GOAT-Bench: A | benchmark | for Multi-Modal Lifelong Navigation |
| Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive | benchmark | Study |
| Google Landmarks Dataset v2: A Large-Scale | benchmark | for Instance-Level Recognition and Retrieval |
| GOT-10k: A Large High-Diversity | benchmark | for Generic Object Tracking in the Wild |
| GPR1200: A | benchmark | for General-Purpose Content-Based Image Retrieval |
| Graph Attention Layer Evolves Semantic Segmentation for Road Pothole Detection: A | benchmark | and Algorithms |
| GraspNet-1Billion: A Large-Scale | benchmark | for General Object Grasping |
| GREYC keystroke: A | benchmark | for keystroke dynamics biometric systems |
| Grid Anchor Based Image Cropping: A New | benchmark | and An Efficient Model |
| GSLAM: A General SLAM Framework and | benchmark | |
| Gudalur Spectral Target Detection (GST-D): A New | benchmark | Dataset and Engineered Material Target Detection in Multi-Platform Remote Sensing Data |
| H-Patches: A | benchmark | and Evaluation of Handcrafted and Learned Local Descriptors |
| H2O: A | benchmark | for Visual Human-human Object Handover Analysis |
| H3WB: Human3.6M 3D WholeBody Dataset and | benchmark | |
| hand pose tracking | benchmark | from stereo matching, A |
| Handwritten isolated Bangla compound character recognition: A new | benchmark | using a novel deep learning approach |
| Hard-Copy | benchmark | Suite for Image Understanding in Manufacturing |
| Hardware and Software Cache Prefetching Techniques for MPEG | benchmark | s |
| HazeRD: An outdoor scene dataset and | benchmark | for single image dehazing |
| HBA Vision Architecture: Built and | benchmarked | |
| HCI | benchmark | Suite: Stereo and Flow Ground Truth with Uncertainties for Urban Autonomous Driving, The |
| HDR light field imaging of dynamic scenes: A learning-based method and a | benchmark | dataset |
| HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world | benchmark | Dataset |
| Head Pose Estimation in Single- and Multi-view Environments: Results on the CLEAR'07 | benchmark | s |
| HFD-Net: A | benchmark | framework of foreign object detection for high-speed train |
| HiCervix: An Extensive Hierarchical Dataset and | benchmark | for Cervical Cytology Classification |
| HICO: A | benchmark | for Recognizing Human-Object Interactions in Images |
| Hier R-CNN: Instance-Level Human Parts Detection and A New | benchmark | |
| Hierarchic Multi-atlas Based Segmentation for Anatomical Structures: Evaluation in the VISCERAL Anatomy | benchmark | s |
| HiEve: A Large-Scale | benchmark | for Human-Centric Video Analysis in Complex Events |
| High Dynamic Range Video Compression: A Large-Scale | benchmark | Dataset and A Learned Bit-depth Scalable Compression Algorithm |
| High-Quality Landmarked Infrared Eye Video Dataset (IREye4Task): Eye Behaviors, Insights and | benchmark | s for Wearable Mental State Analysis, A |
| High-Resolution Feature Evaluation | benchmark | |
| HIMO: A New | benchmark | for Full-body Human Interacting with Multiple Objects |
| HOD: New Harmful Object Detection | benchmark | s for Robust Surveillance |
| HoloVic:Large-scale Dataset and | benchmark | for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative |
| HOOT: Heavy Occlusions in Object Tracking | benchmark | |
| How Many Are in This Image A Safety Evaluation | benchmark | for Vision LLMs |
| How not to | benchmark | Image Processors |
| How Severe Is | benchmark | -Sensitivity in Video Self-Supervised Learning? |
| How to | benchmark | Vision Foundation Models for Semantic Segmentation? |
| How to Collect Segmentations for Biomedical Images? A | benchmark | Evaluating the Performance of Experts, Crowdsourced Non-experts, and Algorithms |
| How to Train Neural Field Representations: A Comprehensive Study and | benchmark | |
| HRS-Bench: Holistic, Reliable and Scalable | benchmark | for Text-to-Image Models |
| HRVQA: A Visual Question Answering | benchmark | for high-resolution aerial images |
| Human running detection: | benchmark | and baseline |
| Human-Centric Behavior Description in Videos: New | benchmark | and Model |
| HUMBI: A Large Multiview Dataset of Human Body Expressions and | benchmark | Challenge |
| HuPerFlow: A Comprehensive | benchmark | for Human vs. Machine Motion Estimation Comparison |
| HuPR: A | benchmark | for Human Pose Estimation Using Millimeter Wave Radar |
| HyperDehazing: A hyperspectral image dehazing | benchmark | dataset and a deep learning model for haze removal |
| Hyperspectral Image Classification on Large-Scale Agricultural Crops: The Heilongjiang | benchmark | Dataset, Validation Procedure, and Baseline Results |
| Hytas: A Hyperspectral Image Transformer Architecture Search | benchmark | and Analysis |
| I-HAZE: A Dehazing | benchmark | with Real Hazy and Haze-Free Indoor Images |
| IARPA Janus | benchmark | A (IJB-A) dataset |
| IARPA Janus | benchmark | -B Face Dataset |
| IC9600: A | benchmark | Dataset for Automatic Image Complexity Assessment |
| IceBench: A | benchmark | for Deep-Learning-Based Sea-Ice Type Classification |
| IDD-AW: A | benchmark | for Safe and Robust Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather |
| Identifying Cropland Non-Agriculturalization with High Representational Consistency from Bi-Temporal High-Resolution Remote Sensing Images: From | benchmark | Datasets to Real-World Application |
| Identifying rural roads in remote sensing imagery: From | benchmark | dataset to coarse-to-fine extraction network: A case study in China |
| IIIT-CFW: A | benchmark | Database of Cartoon Faces in the Wild |
| Illumination Distillation Framework for Nighttime Person Re-Identification and a New | benchmark | |
| IM-IAD: Industrial Image Anomaly Detection | benchmark | in Manufacturing |
| IMC: A | benchmark | for Invariant Learning Under Multiple Causes |
| Implementing the Abingdon Cross | benchmark | on the ASP |
| Improving face verification using facial marks and deep CNN: IARPA Janus | benchmark | -A |
| Improving the Robustness of 3D Human Pose Estimation: A | benchmark | Dataset and Learning from Noisy Input |
| iNatAg: Multi-Class Classification Models Enabled by a Large-Scale | benchmark | Dataset with 4.7M Images of 2,959 Crop and Weed Species |
| Indexing in large scale image collections: Scaling properties and | benchmark | |
| Indian Movie Face Database: A | benchmark | for face recognition under wide variations |
| Indoor Modelling | benchmark | for 3D Geometry Extraction |
| Insect Classification Using Squeeze-and-Excitation and Attention Modules: a | benchmark | Study |
| Instrument Development: Chinese Radiometric | benchmark | of Reflected Solar Band Based on Space Cryogenic Absolute Radiometer |
| Integrated Understanding | benchmark | : Recognition of a 2 1/2 D MOBILE, An |
| Interactive Medical Image Segmentation: A | benchmark | Dataset and Baseline |
| IntPhys 2019: A | benchmark | for Visual Intuitive Physics Understanding |
| Intra-Camera Supervised Person Re-Identification: A New | benchmark | |
| Introduction to a Large-Scale General Purpose Ground Truth Database: Methodology, Annotation Tool and | benchmark | s |
| Inverse Visual Question Answering: A New | benchmark | and VQA Diagnosis Tool |
| Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification | benchmark | Study |
| Invisible gas detection: An RGB-thermal cross attention network and a new | benchmark | |
| Iotbench: A | benchmark | Suite for Intelligent Internet of Things Edge Devices |
| IP102: A Large-Scale | benchmark | Dataset for Insect Pest Recognition |
| IPN Hand: A Video Dataset and | benchmark | for Real-Time Continuous Hand Gesture Recognition |
| Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based | benchmark | for Future Long Video Generation |
| ISAR: A | benchmark | for Single- and Few-Shot Object Instance Segmentation and Re-Identification |
| ISPRS | benchmark | on Indoor Modelling - Preliminary Results, The |
| ISPRS | benchmark | On Urban Object Classification And 3d Building Reconstruction, The |
| ISPRS | benchmark | on urban object detection and 3D building reconstruction |
| ISPRS | benchmark | s |
| ISPRS-Eurosdr GeoBIM | benchmark | 2019, The |
| ISTD-PDS7: A | benchmark | Dataset for Multi-Type Pavement Distress Segmentation from CCD Images in Complex Scenarios |
| IU Parallel Processing | benchmark | |
| JHU-CROWD++: Large-Scale Crowd Counting Dataset and A | benchmark | Method |
| JRDB: A Dataset and | benchmark | of Egocentric Robot Visual Perception of Humans in Built Environments |
| JRDB: Visual Perception for Navigation in Human Environments: The JackRabbot Social Grouping and Activity Dataset and | benchmark | |
| K-Lane: Lidar Lane Dataset and | benchmark | for Urban Roads and Highways |
| Khmerst: A Low-resource Khmer Scene Text Detection and Recognition | benchmark | |
| KITTI Vision | benchmark | Suite, The |
| KITTI-360: A Novel Dataset and | benchmark | s for Urban Scene Understanding in 2D and 3D |
| KOFFVQA: An Objectively Evaluated Free-Form VQA | benchmark | for Large Vision-Language Models in the Korean Language |
| LAM Dataset: A Novel | benchmark | for Line-Level Handwritten Text Recognition, The |
| LAMP-HQ: A Large-Scale Multi-pose High-Quality Database and | benchmark | for NIR-VIS Face Recognition |
| LaMPilot: An Open | benchmark | Dataset for Autonomous Driving with Language Model Programs |
| Landmark based head pose estimation | benchmark | and method |
| Large Language Model-Driven Structured Output: A Comprehensive | benchmark | and Spatial Data Generation Framework |
| Large Language Models, Evaluations, | benchmark | s, Surveys |
| Large-scale Annotated Mechanical Components | benchmark | for Classification and Retrieval Tasks with Deep Neural Networks, A |
| large-scale | benchmark | dataset for event recognition in surveillance video, A |
| large-scale combinatorial | benchmark | for sign language recognition, A |
| Large-Scale Deep Learning Based Binary and Semantic Change Detection in Ultra High Resolution Remote Sensing Imagery: From | benchmark | Datasets to Urban Application |
| large-scale drone based thermal infrared | benchmark | and inception transformer network for crowd counting, A |
| Large-Scale Homography | benchmark | , A |
| Large-Scale Outdoor Multi-modal Dataset and | benchmark | for Novel View Synthesis and Implicit Scene Reconstruction, A |
| Large-Scale Spatio-Temporal Person Re-Identification: Algorithms and | benchmark | |
| Large-scale Study of Spatiotemporal Representation Learning with a New | benchmark | on Action Recognition, A |
| Large-scale Video Panoptic Segmentation in the Wild: A | benchmark | |
| LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and | benchmark | |
| LasHeR: A Large-Scale High-Diversity | benchmark | for RGBT Tracking |
| LaSOT: A High-Quality | benchmark | for Large-Scale Single Object Tracking |
| LaSOT: A High-quality Large-scale Single Object Tracking | benchmark | |
| Latent Fingerprint Quality Assessment for Criminal Investigations: A | benchmark | Dataset and Method |
| Layeredflow: A Real-world | benchmark | for Non-lambertian Multi-Layer Optical Flow |
| Learning Adaptive Spatio-Temporal Inference Transformer for Coarse-to-Fine Animal Visual Tracking: Algorithm and | benchmark | |
| Learning End-to-End Lossy Image Compression: A | benchmark | |
| Learning Gait Representation From Massive Unlabelled Walking Videos: A | benchmark | |
| Learning real-world heterogeneous noise models with a | benchmark | dataset |
| Lessons and Insights from Creating a Synthetic Optical Flow | benchmark | |
| LibEER: A Comprehensive | benchmark | and Algorithm Library for EEG-Based Emotion Recognition |
| LIBSVX: A Supervoxel Library and | benchmark | for Early Video Processing |
| Line segment matching: A | benchmark | |
| LIT: A System and | benchmark | for Light Understanding |
| Long-Term Tracking in the Wild: A | benchmark | |
| Long-Term Visual Object Tracking | benchmark | |
| LongVALE: Vision-Audio-Language-Event | benchmark | Towards Time-Aware Omni-Modal Perception of Long Videos |
| Look into Person: Joint Body Parsing & Pose Estimation Network and a New | benchmark | |
| Look into Person: Self-Supervised Structure-Sensitive Learning and a New | benchmark | for Human Parsing |
| Looking at Words and Points with Attention: A | benchmark | for Text-to-Shape Coherence |
| Low-Cost and Scalable Framework to Build Large-Scale Localization | benchmark | for Augmented Reality, A |
| Low-level multiscale image segmentation and a | benchmark | for its evaluation |
| LvBench: A | benchmark | for Long-form Video Understanding with Versatile Multi-modal Question Answering |
| LVLM-EHub: A Comprehensive Evaluation | benchmark | for Large Vision-Language Models |
| LVOS: A | benchmark | for Large-Scale Long-Term Video Object Segmentation |
| LVOS: A | benchmark | for Long-term Video Object Segmentation |
| LWIRPOSE: A Novel Long Wave Infrared Thermal Image Pose Dataset and | benchmark | |
| m&m's: A | benchmark | to Evaluate Tool-use for multi-step multi-modal Tasks |
| M2FPA: A Multi-Yaw Multi-Pitch High-Quality Dataset and | benchmark | for Facial Pose Analysis |
| M3-UDA: A New | benchmark | for Unsupervised Domain Adaptive Fetal Cardiac Structure Detection |
| M3D: A | benchmark | Dataset and Model for Microscopic 3D Shape Reconstruction |
| Mago Approach for Semantic Segmentation: the Case Study of UAVid | benchmark | Dataset |
| MammalNet: A Large-Scale Video | benchmark | for Mammal Recognition and Behavior Understanding |
| MAPLM: A Real-World Large-Scale Vision-Language | benchmark | for Map and Traffic Scene Understanding |
| MARS: A Video | benchmark | for Large-Scale Person Re-Identification |
| MarsLS-Net: Martian Landslides Segmentation Network and | benchmark | Dataset |
| MaSS13K: A Matting-level Semantic Segmentation | benchmark | |
| MC-Blur: A Comprehensive | benchmark | for Image Deblurring |
| MC-GTA: A Synthetic | benchmark | for Multi-camera Vehicle Tracking |
| Measuring the Utilization of Public Open Spaces by Deep Learning: A | benchmark | Study at the Detroit Riverfront |
| mEBAL2 database and | benchmark | : Image-based multispectral eyeblink detection |
| Medium Scale | benchmark | for Cricket Excited Actions Understanding |
| MegaFace | benchmark | : 1 Million Faces for Recognition at Scale, The |
| Menpo | benchmark | for Multi-pose 2D and 3D Facial Landmark Localisation and Tracking, The |
| MerCulture: A Comprehensive | benchmark | to Evaluate Vision-Language Models on Cultural Understanding in Singapore |
| Meta Omnium: A | benchmark | for General-Purpose Learning-to-Learn |
| Meta Self-Learning for Multi-Source Domain Adaptation: A | benchmark | |
| Methodology and | benchmark | for Automated Driving Theory Test of Large Language Models |
| MeViS: A Large-scale | benchmark | for Video Segmentation with Motion Expressions |
| MFC Datasets: Large-Scale | benchmark | Datasets for Media Forensic Challenge Evaluation |
| MicroVQA: A Multimodal Reasoning | benchmark | for Microscopy-Based Scientific Research |
| Mind the Gap - A | benchmark | for Dense Depth Prediction Beyond Lidar |
| Mind the Prompt: A Novel | benchmark | for Prompt-Based Class-Agnostic Counting |
| MIO-TCD: A New | benchmark | Dataset for Vehicle Classification and Localization |
| MIP-GAF: A MLLM-Annotated | benchmark | for Most Important Person Localization and Group Context Understanding |
| Mitigating Representation Bias in Action Recognition: Algorithms and | benchmark | s |
| MITS: A large-scale multimodal | benchmark | dataset for Intelligent Traffic Surveillance |
| Mix MSTAR: A Synthetic | benchmark | Dataset for Multi-Class Rotation Vehicle Detection in Large-Scale SAR Images |
| MM-Safetybench: A | benchmark | for Safety Evaluation of Multimodal Large Language Models |
| MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning | benchmark | for Expert AGI |
| MMP-2k: A | benchmark | Multi-Labeled Macro Photography Image Quality Assessment Database |
| MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking | benchmark | |
| MMVR: Millimeter-wave Multi-view Radar Dataset and | benchmark | for Indoor Perception |
| Mnd: A New Dataset and | benchmark | of Movie Scenes Classified by Their Narrative Function |
| Mobile Visual Assistive Apps: | benchmark | s of Vision Algorithm Performance |
| Modality-missing RGBT Tracking: Invertible Prompt Learning and High-quality | benchmark | s |
| Modified Neutral Models as | benchmark | s to Evaluate the Dynamics of Land System (DLS) Model Performance |
| MODS: A USV-Oriented Object Detection and Obstacle Segmentation | benchmark | |
| Mono2Stereo: A | benchmark | and Empirical Study for Stereo Conversion |
| Monocular Image-Based 3-D Model Retrieval: A | benchmark | |
| MOOD 2020: A Public | benchmark | for Out-of-Distribution Detection and Localization on Medical Images |
| Mosaic of Modalities: A Comprehensive | benchmark | for Multimodal Graph Learning |
| MOTA Object Tracking | benchmark | |
| MOTChallenge: A | benchmark | for Single-Camera Multiple Target Tracking |
| MovieCuts: A New Dataset and | benchmark | for Cut Type Recognition |
| MovingFashion: a | benchmark | for the Video-to-Shop Challenge |
| MRSSC: A | benchmark | Dataset for Multimodal Remote Sensing Scene Classification |
| MS-Celeb-1M: A Dataset and | benchmark | for Large-Scale Face Recognition |
| Msd: A | benchmark | Dataset for Floor Plan Generation of Building Complexes |
| MSSDet: Multi-Scale Ship-Detection Framework in Optical Remote-Sensing Images and New | benchmark | |
| MTA-VPS: A Large-Scale | benchmark | for Video-Based Person Search |
| MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking | benchmark | |
| Multi-Dataset | benchmark | s for Masked Identification using Contrastive Representation Learning |
| Multi-feature based | benchmark | for cervical dysplasia classification evaluation |
| Multi-interactive Feature Learning and a Full-time Multi-modality | benchmark | for Image Fusion and Segmentation |
| Multi-Label Continual Learning for the Medical Domain: A Novel | benchmark | |
| multi-modal universe of fast-fashion: the Visuelle 2.0 | benchmark | , The |
| Multi-Perspective Assessment Method with a Dynamic | benchmark | for Human Activity Impacts on Alpine Ecosystem under Climate Change, A |
| Multi-Purpose Realistic Haze | benchmark | With Quantifiable Haze Levels and Ground Truth, A |
| Multi-shot Temporal Event Localization: a | benchmark | |
| Multi-View Multi-Focus Image Fusion: A Novel | benchmark | Dataset and Method |
| Multi-View Photometric Stereo: A Robust Solution and | benchmark | Dataset for Spatially Varying Isotropic Materials |
| Multi-view Stereo | benchmark | with High-Resolution Images and Multi-camera Videos, A |
| MultiEYE: Dataset and | benchmark | for OCT-Enhanced Retinal Disease Recognition From Fundus Images |
| MultiFire20K: A semi-supervised enhanced large-scale UAV-based | benchmark | for advancing multi-task learning in fire monitoring |
| Multimodal | benchmark | and Improved Architecture for Zero Shot Learning, A |
| Multimodal | benchmark | Dataset and Model for Crop Disease Diagnosis, A |
| Multimodal Brain Tumor Image Segmentation | benchmark | (BRATS), The |
| Multimodal remote sensing | benchmark | datasets for land cover classification with a shared and specific feature learning model |
| Multispectral Airborne Laser Scanning for Tree Species Classification: A | benchmark | of Machine Learning and Deep Learning Algorithms |
| Multispectral pedestrian detection: | benchmark | dataset and baseline |
| Multispectral Video Semantic Segmentation: A | benchmark | Dataset and Baseline |
| MultiVeg: A Very High-Resolution | benchmark | for Deep Learning-Based Multi-Class Vegetation Segmentation |
| MultiVENT 2.0: A Massive Multilingual | benchmark | for Event-Centric Video Retrieval |
| Multiview Depth-based Motion Capture | benchmark | Dataset for Human Motion Denoising and Enhancement Research, A |
| MUVA: A New Large-Scale | benchmark | for Multi-view Amodal Instance Segmentation in the Shopping Scenario |
| MUVOD: A Novel Multi-View Video Object Segmentation Dataset and a | benchmark | for 3D Segmentation |
| MVBench: A Comprehensive Multi-modal Video Understanding | benchmark | |
| MVHM: A Large-Scale Multi-View Hand Mesh | benchmark | for Accurate 3D Hand Pose Estimation |
| MVPOD: A Dataset and | benchmark | for Multi-Vertical-Perspective Object Detection in Multi-Platform Remote Sensing Images |
| MyMultiMediaWorld.com: A | benchmark | platform for 3D compression algorithms |
| National-Standards- and Deep-Learning-Oriented Raster and Vector | benchmark | Dataset (RVBD) for Land-Use/Land-Cover Mapping in the Yangtze River Basin |
| Need for Speed: A | benchmark | for Higher Frame Rate Object Tracking |
| Needles & Haystacks: Dataset and | benchmark | for Domain-Agnostic Image-Based Rigid Slice-to-Volume Registration |
| Nemo: An Open-Source Transformer-Supercharged | benchmark | for Fine-Grained Wildfire Smoke Detection |
| NEPose: A novel | benchmark | dataset with an improved framework for vision-based nasal endoscope pose estimation |
| Neural network analysis of MINERVA scene analysis | benchmark | |
| Neural network based cognitive approaches from face perception with human performance | benchmark | |
| Neural network-based framework for wide visibility dehazing with synthetic | benchmark | s |
| New | benchmark | and Baseline for Real-Time High-Resolution Image Inpainting on Edge Devices, A |
| New | benchmark | and Low Computational Cost Localization Method for Cephalometric Analysis, A |
| New | benchmark | Database and Objective Metric for Light Field Image Quality Evaluation, A |
| new | benchmark | on the recognition of handwritten Bangla and Farsi numeral characters, A |
| New | benchmark | : Clinical Uncertainty and Severity Aware Labeled Chest X-Ray Images With Multi-Relationship Graph Learning, A |
| New | benchmark | : On the Utility of Synthetic Data with Blender for Bare Supervised Learning and Downstream Domain Adaptation, A |
| New Comprehensive | benchmark | for Semi-supervised Video Anomaly Detection and Anticipation, A |
| New Image Data Set and | benchmark | for Cervical Dysplasia Classification Evaluation, A |
| New Method and | benchmark | for Detecting Co-Saliency Within a Single Image, A |
| New People-Object Interaction Dataset and NVS | benchmark | s, A |
| New Shape | benchmark | for 3D Object Retrieval, A |
| New Stereo Dense Matching | benchmark | Dataset for Deep Learning, A |
| New Trajectory Based Motion Segmentation | benchmark | Dataset (UdG-MS15), A |
| NH-HAZE: An Image Dehazing | benchmark | with Non-Homogeneous Hazy and Haze-Free Images |
| NineRec: A | benchmark | Dataset Suite for Evaluating Transferable Recommendation |
| NIR-Assisted Image Denoising: A Selective Fusion Approach and a Real-World | benchmark | Dataset |
| NIRPed: A Novel | benchmark | for Nighttime Pedestrian and Its Distance Joint Detection |
| nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation | benchmark | |
| NOAA-AVHRR Orbital Drift Correction: Validating Methods Using MSG-SEVIRI Data as a | benchmark | Dataset |
| Nothing Stands Still: A spatiotemporal | benchmark | on 3D point cloud registration under large geometric and temporal change |
| Novel | benchmark | for Refinement of Noisy Localization Labels in Autolabeled Datasets for Object Detection, A |
| Novel | benchmark | RGBD Dataset for Dormant Apple Trees and Its Application to Automatic Pruning, A |
| Novel | benchmark | s and Approaches for Real-World Continual Learning |
| NSD-Imagery: A | benchmark | dataset for extending fMRI vision decoding methods to mental imagery |
| NT-VOT211: A Large-scale | benchmark | for Night-time Visual Object Tracking |
| NTU RGB+D 120: A Large-Scale | benchmark | for 3D Human Activity Understanding |
| Numerical methods for shape-from-shading: A new survey with | benchmark | s |
| NWPU-Crowd: A Large-Scale | benchmark | for Crowd Counting and Localization |
| O-HAZE: A Dehazing | benchmark | with Real Hazy and Haze-Free Outdoor Images |
| Object Detection in Aerial Images: A Large-Scale | benchmark | and Challenges |
| Object detection in optical remote sensing images: A survey and a new | benchmark | |
| Object Detection Using Event Camera: A MoE Heat Conduction Based Detector and A New | benchmark | Dataset |
| Object Folder | benchmark | : Multisensory Learning with Neural and Real Objects, The |
| Object Tracking | benchmark | |
| Occluded Video Instance Segmentation: A | benchmark | |
| Off-Nadir Satellite Image Scene Classification: | benchmark | Dataset, Angle-Aware Active Domain Adaptation, and Angular Impact Analysis |
| Offline Cursive Character Challenge: a New | benchmark | for Machine Learning and Pattern Recognition Algorithms. |
| OK-VQA: A Visual Question Answering | benchmark | Requiring External Knowledge |
| OMAD-6: Advancing Offshore Mariculture Monitoring with a Comprehensive Six-Type Dataset and Performance | benchmark | |
| Omni-Crack30k: A | benchmark | for Crack Segmentation and the Reasonable Effectiveness of Transfer Learning |
| Omni-Scene Infrared Vehicle Detection: An Efficient Selective Aggregation approach and a unified | benchmark | |
| Omni3D: A Large | benchmark | and Model for 3D Object Detection in the Wild |
| Omni6dpose: A | benchmark | and Model for Universal 6d Object Pose Estimation and Tracking |
| Omniact: A Dataset and | benchmark | for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web |
| OmniLabel: A Challenging | benchmark | for Language-Based Object Detection |
| OmniMedVQA: A New Large-Scale Comprehensive Evaluation | benchmark | for Medical LVLM |
| OmniMMI: A Comprehensive Multi-modal Interaction | benchmark | in Streaming Video Contexts |
| On Training Traffic Predictors via Broad Learning Structures: A | benchmark | Study |
| On-Board Crowd Counting and Density Estimation Using Low Altitude Unmanned Aerial Vehicles: Looking beyond Beating the | benchmark | |
| On-line Handwriting Recognition of Indian Scripts: The First | benchmark | |
| Online and offline handwritten Chinese character recognition: A comprehensive study and new | benchmark | |
| Online Object Tracking: A | benchmark | |
| OOD-CV-v2: An Extended | benchmark | for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images |
| OOD-CV: A | benchmark | for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images |
| Open | benchmark | Dataset for Forest Characterization from Sentinel-1 and -2 Time Series, An |
| Open-Source | benchmark | of Deep Learning Models for Audio-Visual Apparent and Self-Reported Personality Recognition, An |
| Open-TransMind: A New Baseline and | benchmark | for 1st Foundation Model Challenge of Intelligent Transportation |
| Open-Vocabulary Video Question Answering: A New | benchmark | for Evaluating the Generalizability of Video Question Answering Models |
| Open-World Group Retrieval with Ambiguity Removal: A | benchmark | |
| Open-World, Diverse, Cross-Spatial-Temporal | benchmark | for Dynamic Wild Person Re-Identification, An |
| OpenCapBench: A | benchmark | to Bridge Pose Estimation and Biomechanics |
| Opencc: An open | benchmark | data set for Corpus Callosum Segmentation and Evaluation |
| OpenEarthMap: A | benchmark | Dataset for Global High-Resolution Land Cover Mapping |
| OpenGait: A Comprehensive | benchmark | Study for Gait Recognition Toward Better Practicality |
| OpenING: A Comprehensive | benchmark | for Judging Open-ended Interleaved Image-Text Generation |
| OpenMIBOOD: Open Medical Imaging | benchmark | s for Out-Of-Distribution Detection |
| OpenMonkeyChallenge: Dataset and | benchmark | Challenges for Pose Estimation of Non-human Primates |
| OpenOccupancy: A Large Scale | benchmark | for Surrounding Semantic Occupancy Perception |
| Ophnet: A Large-scale Video | benchmark | for Ophthalmic Surgical Workflow Understanding |
| Opportunity challenge: A | benchmark | database for on-body sensor-based activity recognition, The |
| OpticalNet: An Optical Imaging Dataset and | benchmark | Beyond the Diffraction Limit |
| Optimized Dual Fire Attention Network and Medium-Scale Fire Classification | benchmark | |
| Orientation Of Oblique Airborne Image Sets: Experiences From The ISPRS/EUROSDR | benchmark | On Multi-platform Photogrammetry |
| Oriented Cell Dataset: A Dataset and | benchmark | for Oriented Cell Detection and Applications |
| Oriented Tiny Object Detection: A Dataset, | benchmark | , and Dynamic Unbiased Learning |
| Orion Pottery Repository: A Publicly Available 3D Objects' | benchmark | Database with Texture Information, The |
| OTCBVS | benchmark | Dataset Collection |
| OTCBVS | benchmark | Dataset Collection |
| PANet: A multi-scale temporal decoupling network and its high-resolution | benchmark | dataset for detecting pseudo changes in cropland non-agriculturalization |
| Pano3D: A Holistic | benchmark | and a Solid Baseline for 360° Depth Estimation |
| ParaLBench: A Large-Scale | benchmark | for Computational Paralinguistics Over Acoustic Foundation Models |
| Pars-OFF: A | benchmark | for Offensive Language Detection on Farsi Social Media |
| Part-Based RDF for Direction Classification of Pedestrians, and a | benchmark | |
| PartNet: A Large-Scale | benchmark | for Fine-Grained and Hierarchical Part-Level 3D Object Understanding |
| PathBench: Advancing the | benchmark | of Large Multimodal Models for Pathology Image Understanding at Patch and Whole Slide Level |
| Pathmmu: A Massive Multimodal Expert-level | benchmark | for Understanding and Reasoning in Pathology |
| PatternNet: A | benchmark | dataset for performance evaluation of remote sensing image retrieval |
| Pedestrian detection at night time in FIR domain: Comprehensive study about temperature and brightness and new | benchmark | |
| Pedestrian detection: A | benchmark | |
| Perceptual Quality Assessment of Enhanced Colonoscopy Images: A | benchmark | Dataset and an Objective Method |
| Perceptual Quality Assessment of Face Video Compression: A | benchmark | and An Effective Method |
| Perceptual Quality Assessment of High-Dynamic-Range Image: A | benchmark | Dataset and a No Reference Method |
| Perceptually Motivated | benchmark | for Video Matting |
| perceptually motivated online | benchmark | for image matting, A |
| Performance | benchmark | of DSP and FPGA Implementations of Low-Level Vision Algorithms |
| PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane | benchmark | |
| Personalized | benchmark | for Face Anti-spoofing, A |
| Petface: A Large-scale Dataset and | benchmark | for Animal Identification |
| PETS 2001 | benchmark | Data |
| PETS 2006 | benchmark | Data |
| PETS | benchmark | Datasets |
| PetsRS - a Dataset and | benchmark | for Pet Recognition on a Climate Disaster Scenario |
| Phantom of | benchmark | Dataset: Resolving Label Ambiguity Problem on Image Recognition in the Wild |
| PhenoBench: A Large Dataset and | benchmark | s for Semantic Image Interpretation in the Agricultural Domain |
| PIDray: A Large-Scale X-ray | benchmark | for Real-World Prohibited Item Detection |
| PKU-DyMVHumans: A Multi-View Video | benchmark | for High-Fidelity Dynamic Human Modeling |
| PKUBench: A context rich mobile visual search | benchmark | |
| Place Recognition in Gardens by Learning Visual Representations: Data Set and | benchmark | Analysis |
| PlanarTrack: A high-quality and challenging | benchmark | for large-scale planar object tracking |
| PlanarTrack: A Large-scale Challenging | benchmark | for Planar Object Tracking |
| Plant Disease Recognition: A Large-Scale | benchmark | Dataset and a Visual Region and Loss Reweighting Approach |
| Playing for | benchmark | s |
| Point cloud-based scene flow estimation on realistically deformable objects: A | benchmark | of deep learning-based methods |
| PointCloud-Text Matching: | benchmark | Dataset and Baseline |
| Pose-to-Pose: A New Task and | benchmark | for Human Pose Transition in Yoga |
| PoseTrack: A | benchmark | for Human Pose Estimation and Tracking |
| PosterLayout: A New | benchmark | and Approach for Content-Aware Visual-Textual Presentation Layout |
| PQPP: A Joint | benchmark | for Text-to-Image Prompt and Query Performance Prediction |
| Privacy-Aware Visualization of Volunteered Geographic Information (VGI) to Analyze Spatial Activity: A | benchmark | Implementation |
| ProAI: An Efficient Embedded AI Hardware for Automotive Applications: A | benchmark | Study |
| Probabilistic Speech-Driven 3D Facial Motion Synthesis: New | benchmark | s, Methods, and Applications |
| Progress On Isprs | benchmark | On Multisensory Indoor Mapping And Positioning |
| PTB-TIR: A Thermal Infrared Pedestrian Tracking | benchmark | |
| Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and | benchmark | Method |
| Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus | benchmark | A |
| Pyramid Pooling Module-Based Semi-Siamese Network: A | benchmark | Model for Assessing Building Damage from xBD Satellite Imagery Datasets |
| Q-Bench+: A | benchmark | for Multi-Modal Foundation Models on Low-Level Vision From Single Images to Pairs |
| Q-Bench-Video: | benchmark | the Video Quality Understanding of LMMs |
| Quad-Pixel Image Defocus Deblurring: A New | benchmark | and Model |
| Quality Criteria | benchmark | for Hyperspectral Imagery |
| RareAnom: A | benchmark | Video Dataset for Rare Type Anomalies |
| Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and | benchmark | |
| Real-RawVSR: Real-World Raw Video Super-Resolution with a | benchmark | Dataset |
| Real-Time Tone Mapping: A Survey and Cross-Implementation Hardware | benchmark | |
| Real-World Animation Super-Resolution | benchmark | With Color Degradation and Multi-Scale Multi-Frequency Alignment, A |
| Real-World Underwater Enhancement: Challenges, | benchmark | s, and Solutions Under Natural Light |
| Real-World Video Deblurring: A | benchmark | Dataset and an Efficient Recurrent Neural Network |
| Real-world Video Super-resolution: A | benchmark | Dataset and A Decomposition based Learning Scheme |
| Realfred: An Embodied Instruction Following | benchmark | in Photo-realistic Environments |
| REAP: A Large-Scale Realistic Adversarial Patch | benchmark | |
| Reconstruction of 3D Objects of Assets and Facilities by Using | benchmark | Points |
| RED: A Simple but Effective Baseline Predictor for the TrajNet | benchmark | |
| Registration of large-scale terrestrial laser scanner point clouds: A review and | benchmark | |
| Reinforcement Learning | benchmark | for Autonomous Driving in General Urban Scenarios, A |
| Relative Saliency and Ranking: Models, Metrics, Data and | benchmark | s |
| Remote Respiration Measurement with RGB Cameras: A Review and | benchmark | |
| Remote Sensing Image Scene Classification: | benchmark | and State of the Art |
| Remote-Sensing Cross-Domain Scene Classification: A Dataset and | benchmark | |
| Rendered | benchmark | Data Set for Evaluation of Occlusion-Handling Strategies of a Parts-Based Car Detector |
| Report on the Results of the DARPA Integrated Image Understanding | benchmark | Exercise, A |
| Responsive Listening Head Generation: A | benchmark | Dataset and Baseline |
| Results of the ISPRS | benchmark | on urban object detection and 3D building reconstruction |
| Rethinking Few-Shot Object Detection on a Multi-Domain | benchmark | |
| Rethinking the Low-Light Video Enhancement: | benchmark | Datasets and Methods |
| RETOUCH: The Retinal OCT Fluid Detection and Segmentation | benchmark | and Challenge |
| Review on Deep Learning Algorithms and | benchmark | Datasets for Pairwise Global Point Cloud Registration |
| Revisiting Point Cloud Classification: A New | benchmark | Dataset and Classification Model on Real-World Data |
| Revisiting pre-trained remote sensing model | benchmark | s: resizing and normalization matters |
| Revisiting RGBT Tracking | benchmark | s From the Perspective of Modality Validity: A New Benchmark, Problem, and Solution |
| Revisiting RGBT Tracking | benchmark | s From the Perspective of Modality Validity: A New Benchmark, Problem, and Solution |
| Revisiting Shadow Detection: A New | benchmark | Dataset for Complex World |
| Revisiting Video Saliency: A Large-Scale | benchmark | and a New Model |
| RGB-D Human Matting: A Real-World | benchmark | Dataset and a Baseline Method |
| RGB-Sonar Tracking | benchmark | and Spatial Cross-Attention Transformer Tracker |
| RGB-T Crowd Counting from Drone: A | benchmark | and Mmccn Network |
| RGB-T object tracking: | benchmark | and baseline |
| RGBD Salient Object Detection: A | benchmark | and Algorithms |
| RGBT Salient Object Detection: A Large-Scale Dataset and | benchmark | |
| RGBT Salient Object Detection: | benchmark | and A Novel Cooperative Ranking Approach |
| RiceStageSeg: A Multimodal | benchmark | Dataset for Semantic Segmentation of Rice Growth Stages |
| Rip Current Segmentation: A Novel | benchmark | and YOLOv8 Baseline Results |
| RipVIS: Rip Currents Video Instance Segmentation | benchmark | for Beach Monitoring and Safety |
| RNVE: A Real Nighttime Vision Enhancement | benchmark | and Dual-Stream Fusion Network |
| RoadSocial: A Diverse VideoQA Dataset and | benchmark | for Road Event Understanding from Social Video Narratives |
| RoBIC: A | benchmark | Suite for Assessing Classifiers Robustness |
| RoboSense: Large-scale Dataset and | benchmark | for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments |
| RoboTwin: Dual-Arm Robot | benchmark | with Generative Digital Twins |
| Robust AD: A Real World | benchmark | Dataset for Robustness in Industrial Anomaly Detection |
| Robust Multi-Drone Multi-Target Tracking to Resolve Target Occlusion: A | benchmark | |
| Robust Texture-Aware Computer-Generated Image Forensic: | benchmark | and Algorithm |
| RobustCLEVR: A | benchmark | and Framework for Evaluating Robustness in Object-centric Learning |
| RockCloud-Align: A High-Precision | benchmark | for Rock-Mass Point-Cloud Registration |
| RSAR: Restricted State Angle Resolver and Rotated SAR | benchmark | |
| RSGPT: A remote sensing vision language model and | benchmark | |
| RT-POSE: A 4d Radar Tensor-based 3d Human Pose Estimation and Localization | benchmark | |
| RUBIK: A Structured | benchmark | for Image Matching across Geometric Challenges |
| RViDeformer: Efficient Raw Video Denoising Transformer With a Larger | benchmark | Dataset |
| RW-HAZE: A Real-World | benchmark | Dataset to Evaluate Quantitatively Dehazing Algorithms |
| S-Hock dataset: A new | benchmark | for spectator crowd analysis, The |
| Salient Object Detection: A | benchmark | |
| Salient Object Detection: A | benchmark | |
| SALVE: A 3D Reconstruction | benchmark | of Wounds from Consumer-Grade Videos |
| SAREval: A Multi-Dimensional and Multi-Task | benchmark | for Evaluating Visual Language Models on SAR Image Understanding |
| Satellite Video Multi-Label Scene Classification With Spatial and Temporal Feature Cooperative Encoding: A | benchmark | Dataset and Method |
| Satellite video single object tracking: A systematic review and an oriented object tracking | benchmark | |
| Scalable Person Re-identification: A | benchmark | |
| Scaling object recognition: | benchmark | of current state of the art techniques |
| ScanNeRF: a Scalable | benchmark | for Neural Radiance Fields |
| SceneFake: An initial dataset and | benchmark | s for scene fake audio detection |
| Score-Level Fusion | benchmark | Database for Biometric Authentication, A |
| Screen Content Quality Assessment: Overview, | benchmark | , and Beyond |
| SCUT-COUCH2009: A comprehensive online unconstrained Chinese handwriting database and | benchmark | evaluation |
| SCUT-FBP5500: A Diverse | benchmark | Dataset for Multi-Paradigm Facial Beauty Prediction |
| SCUT-HCCDoc: A new | benchmark | dataset of handwritten Chinese text in unconstrained camera-captured documents |
| SeaDronesSee: A Maritime | benchmark | for Detecting Humans in Open Water |
| SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and | benchmark | |
| Seeing Through the Data: A Statistical Evaluation of Prohibited Item Detection | benchmark | Datasets for X-ray Security Screening |
| Segmentation of Pericardial Adipose Tissue in CMR Images: A | benchmark | Dataset MRPEAT and a Triple-Stage Network 3SUnet |
| segmented and annotated IAPR TC-12 | benchmark | , The |
| Select Informative Samples for Night-Time Vehicle Detection | benchmark | in Urban Scenes |
| Self-Supervised Anomaly Detection and a New | benchmark | for X-Ray Cargo Images |
| Self-Supervised Skeleton-Based Action Representation Learning: A | benchmark | and Beyond |
| Semantic Boundaries Dataset and | benchmark | |
| Semantic Segmentation in Thermal Videos: A New | benchmark | and Multi-Granularity Contrastive Learning-Based Framework |
| Semantic segmentation on Swiss3DCities: A | benchmark | study on aerial photogrammetric 3D pointcloud dataset |
| Semiglobal Matching Results on the ISPRS Stereo Matching | benchmark | |
| Separability Criteria for the Evaluation of Boundary Detection | benchmark | s |
| SeriesBench: A | benchmark | for Narrative-Driven Drama Series Understanding |
| set of | benchmark | s for Handwritten Text Recognition on historical documents, A |
| Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and | benchmark | |
| ShapeMed-Knee: A Dataset and Neural Shape Model | benchmark | for Modeling 3D Femurs |
| Show or Tell? A | benchmark | to Evaluate Visual and Textual Prompts in Semantic Segmentation |
| SHREC'11: Robust Feature Detection And Description | benchmark | |
| SI Traceable Solar Spectral Irradiance Measurement Based on a Quantum | benchmark | : A Prototype Design |
| SID4VAM: A | benchmark | Dataset With Synthetic Images for Visual Attention Modeling |
| SIFT and SURF Performance Evaluation against Various Image Deformations on | benchmark | Dataset |
| Signature Detection, Restoration, and Verification: A Novel Chinese Document Signature Forgery Detection | benchmark | |
| Signavatars: A Large-scale 3d Sign Language Holistic Motion Dataset and | benchmark | |
| Silhouette Body Measurement | benchmark | s |
| Sim2RealVS: A New | benchmark | for Video Stabilization with a Strong Baseline |
| SIMBA: Split Inference: mechanisms, | benchmark | s and Attacks |
| Simple Approach and | benchmark | for 21,000-Category Object Detection, A |
| Simulation Experiments of Different Metaheuristics Algorithms using | benchmark | Functions: A Performance Study |
| Single Frame Atmospheric Turbulence Mitigation: A | benchmark | Study and a New Physics-Inspired Transformer Model |
| Single Image Deraining: A Comprehensive | benchmark | Analysis |
| Single- and cross- database | benchmark | s for gender classification under unconstrained settings |
| Single-Image Super-Resolution: A | benchmark | |
| SIXray: A Large-Scale Security Inspection X-Ray | benchmark | for Prohibited Item Discovery in Overlapping Images |
| SkatingVerse: A large-scale | benchmark | for comprehensive evaluation on human action understanding |
| Skeleton Ground Truth Extraction: Methodology, Annotation Tool and | benchmark | s |
| Skin_hair Dataset: Setting the | benchmark | for Effective Hair Inpainting Methods for Improving the Image Quality of Dermoscopic Images |
| Sliding Window Based Micro-expression Spotting: A | benchmark | |
| SLPDR: A | benchmark | for Ship License Plate Detection and Recognition |
| Small object detection in aerial traffic imagery: A | benchmark | for motorbike-dominated road scenes |
| SmartHome-Bench: A Comprehensive | benchmark | for Video Anomaly Detection in Smart Homes Using Multi-Modal Large Language Models |
| SMTPD: A New | benchmark | for Temporal Prediction of Social Media Popularity |
| SoccerNet-Tracking: Multiple Object Tracking Dataset and | benchmark | in Soccer Videos |
| SoccerNet-v2: A Dataset and | benchmark | s for Holistic Understanding of Broadcast Soccer Videos |
| Social-IQ: A Question Answering | benchmark | for Artificial Social Intelligence |
| SOK-Bench: A Situated Video Reasoning | benchmark | with Aligned Open-World Knowledge |
| Soma Segmentation | benchmark | in Full Adult Fly Brain, A |
| Spatial457: A Diagnostic | benchmark | for 6D Spatial Reasoning of Large Multimodal Models |
| SpatialSense: An Adversarially Crowdsourced | benchmark | for Spatial Relation Recognition |
| Spatio-Temporal Mitosis Detection in Time-Lapse Phase-Contrast Microscopy Image Sequences: A | benchmark | |
| Special issue on visual concept detection in the MIRFLICKR/ImageCLEF | benchmark | |
| Spherical Superpixels: | benchmark | and Evaluation |
| Spring: A High-Resolution High-Detail Dataset and | benchmark | for Scene Flow, Optical Flow and Stereo |
| STAR: A First-Ever Dataset and a Large-Scale | benchmark | for Scene Graph Generation in Large-Size Satellite Imagery |
| Static facial expression analysis in tough conditions: Data, evaluation protocol and | benchmark | |
| Stitched Wide Field of View Light Field Image Quality Assessment: | benchmark | Database and Objective Metric |
| StudioGAN: A Taxonomy and | benchmark | of GANs for Image Synthesis |
| SUES-200: A Multi-Height Multi-Scene Cross-View Image | benchmark | Across Drone and Satellite |
| SUM: A | benchmark | dataset of Semantic Urban Meshes |
| SUN RGB-D: A RGB-D scene understanding | benchmark | suite |
| SUNRGBD: A RGB-D Scene Understanding | benchmark | Suite |
| Super-CLEVR: A Virtual | benchmark | to Diagnose Domain Robustness in Visual Reasoning |
| Superpixel segmentation: A | benchmark | |
| Supervised Raw Video Denoising With a | benchmark | Dataset on Dynamic Scenes |
| Surface Reconstruction From Point Clouds: A Survey and a | benchmark | |
| Surface Reconstruction from SLAM-Based Point Clouds: Results from the Datasets of the 2023 SIFET | benchmark | |
| Survey and | benchmark | of Automatic Surface Reconstruction From Point Clouds, A |
| Survey of State of the Art Large Vision Language Models: Alignment, | benchmark | , Evaluations and Challenges, A |
| Survey: How good are the current advances in image set based face identification?: Experiments on three popular | benchmark | s with a naive approach |
| SUTD-TrafficQA: A Question Answering | benchmark | and an Efficient Network for Video Reasoning over Traffic Events |
| SVEA: A Small-scale | benchmark | for Validating the Usability of Post-hoc Explainable AI Solutions in Image and Signal Recognition |
| SVIRO: Synthetic Vehicle Interior Rear Seat Occupancy Dataset and | benchmark | |
| SyB3R: A Realistic Synthetic | benchmark | for 3D Reconstruction from Images |
| Synergetic Assessment of Quality and Aesthetic: Approach and Comprehensive | benchmark | Dataset |
| SynMVCrowd: A Large Synthetic | benchmark | for Multi-view Crowd Counting and Localization |
| Systematic Evaluation and | benchmark | for Person Re-Identification: Features, Metrics, and Datasets, A |
| T2I-CompBench++: An Enhanced and Comprehensive | benchmark | for Compositional Text-to-Image Generation |
| T2ISafety: | benchmark | for Assessing Fairness, Toxicity, and Privacy in Image Generation |
| T2V-CompBench: A Comprehensive | benchmark | for Compositional Text-to-video Generation |
| TAD16K: An enhanced | benchmark | for autonomous driving |
| TAO: A Large-scale | benchmark | for Tracking Any Object |
| Target Tracking Techniques, Performance Evaluation, Comparison, | benchmark | s, Datasets, Survey |
| Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality | benchmark | to Fuse Infrared and Visible for Object Detection |
| TE141K: Artistic Text | benchmark | for Text Effect Transfer |
| Technical Solution Discussion for Key Challenges of Operational Convolutional Neural Network-Based Building-Damage Assessment from Satellite Imagery: Perspective from | benchmark | xBD Dataset |
| Tencent-MVSE: A Large-Scale | benchmark | Dataset for Multi-Modal Video Similarity Evaluation |
| Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified | benchmark | |
| Textinvision: Text and Prompt Complexity Driven Visual Text Generation | benchmark | |
| Texture segmentation | benchmark | |
| Texture Segmentation | benchmark | |
| TGIF: A New Dataset and | benchmark | on Animated GIF Description |
| Thermal Infrared Video | benchmark | for Visual Analysis, A |
| thermal Object Tracking | benchmark | , A |
| Thinking Image Color Aesthetics Assessment: Models, Datasets and | benchmark | s |
| Thorough | benchmark | and a New Model for Light Field Saliency Detection, A |
| Three-dimensional singular spectrum analysis for precise land cover classification from UAV-borne hyperspectral | benchmark | datasets |
| THRONE: An Object-Based Hallucination | benchmark | for the Free-Form Generations of Large Vision-Language Models |
| TinyPedSeg: A Tiny Pedestrian Segmentation | benchmark | for Top-Down Drone Images |
| TO-LF: A Texture and Occlusion-Oriented | benchmark | Dataset for Light Field Disparity Estimation |
| Toolkit to | benchmark | Point Cloud Quality Metrics with Multi-Track Evaluation Criteria, A |
| Tools for BIM-GIS Integration (IFC Georeferencing and Conversions): Results from the GeoBIM | benchmark | 2019 |
| Toulouse Hyperspectral Data Set: A | benchmark | data set to assess semi-supervised spectral representation learning and pixel-wise classification techniques |
| Toward an objective | benchmark | for video completion |
| Toward Chinese Food Understanding: A Cross-Modal Ingredient-Level | benchmark | |
| Toward Efficient Video Compression Artifact Detection and Removal: A | benchmark | Dataset |
| Toward Explainable 3D Grounded Visual Question Answering: A New | benchmark | and Strong Baseline |
| Toward Multi-Source Sky-Ground Re-Identification: A New | benchmark | and an Innovative Approach |
| Toward RAW Object Detection: A New | benchmark | and A New Model |
| Toward Real-World Multi-View Object Classification: Dataset, | benchmark | , and Analysis |
| Toward Real-World Single Image Super-Resolution: A New | benchmark | and a New Model |
| Toward Realistic Hierarchical Object Detection: Problem, | benchmark | , and Solution |
| Toward Video Anomaly Retrieval From Video Anomaly Detection: New | benchmark | s and Model |
| Toward Weather-Robust 3D Human Body Reconstruction: Millimeter-Wave Radar-Based Dataset, | benchmark | , and Multi-Modal Fusion |
| Towards 3D Colored Mesh Saliency: Database and | benchmark | s |
| Towards Automatic Power Battery Detection: New Challenge, | benchmark | Dataset and Baseline |
| Towards contactless palmprint recognition: A novel device, a new | benchmark | , and a collaborative representation based identification approach |
| Towards Fast and Accurate Real-World Depth Super-Resolution: | benchmark | Dataset and Baseline |
| Towards Generic 3D Tracking in RGBD Videos: | benchmark | and Baseline |
| Towards Large-Scale Small Object Detection: Survey and | benchmark | s |
| Towards lifelong object recognition: A dataset and | benchmark | |
| Towards Long-Horizon Vision-Language Navigation: Platform, | benchmark | and Method |
| Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and | benchmark | |
| Towards More Practical Group Activity Detection: A New | benchmark | and Model |
| Towards Natural Language-Based Document Image Retrieval: New Dataset and | benchmark | |
| Towards Natural Language-guided Drones: Geotext-1652 | benchmark | with Spatial Relation Matching |
| Towards pen-holding hand pose recognition: A new | benchmark | and a coarse-to-fine PHHP recognition network |
| Towards Real-World Burst Image Super-Resolution: | benchmark | and Method |
| Towards Real-World HDR Video Reconstruction: A Large-Scale | benchmark | Dataset and A Two-Stage Alignment Network |
| Towards Real-World Prohibited Item Detection: A Large-Scale X-ray | benchmark | |
| Towards Real-world Video Face Restoration: A New | benchmark | |
| Towards Real-world X-ray Security Inspection: A High-Quality | benchmark | And Lateral Inhibition Module For Prohibited Items Detection |
| Towards reliable domain generalization: Insights from the PF2HC | benchmark | and dynamic evaluations |
| Towards Robust Monocular Depth Estimation: A New Baseline and | benchmark | |
| Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive | benchmark | Analysis and Beyond |
| Towards Scalable 3D Anomaly Detection and Localization: A | benchmark | via 3D Anomaly Synthesis and A Self-Supervised Learning Network |
| Towards Scalable Human-aligned | benchmark | for Text-guided Image Editing |
| Towards Segmenting Consumer Stereo Videos: | benchmark | , Baselines and Ensembles |
| Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, | benchmark | s and Challenges |
| Towards Temporal Event Detection: A Dataset, | benchmark | s and Challenges |
| Towards to real world vehicle privacy protection: A new dataset and | benchmark | |
| Towards Unified Deep Image Deraining: A Survey and a New | benchmark | |
| Tracking Reflected Objects: A | benchmark | |
| Tracking Revisited Using RGBD Camera: Unified | benchmark | and Baselines |
| Tracking Small and Fast Moving Objects: A | benchmark | |
| TrackingNet: A Large-Scale Dataset and | benchmark | for Object Tracking in the Wild |
| Traffic Accident | benchmark | for Causality Recognition |
| Trafficnight: An Aerial Multimodal | benchmark | for Nighttime Vehicle Surveillance |
| TrainFors: A Large | benchmark | Training Dataset for Image Manipulation Detection and Localization |
| Transformers in Small Object Detection: A | benchmark | and Survey of State-of-the-Art |
| Transparent Object Tracking | benchmark | |
| TUM-DLR Multimodal Earth Observation Evaluation | benchmark | , The |
| TUM2TWIN: Introducing the large-scale multimodal urban digital twin | benchmark | dataset |
| Tunevlseg: Prompt Tuning | benchmark | for Vision-language Segmentation Models |
| UA-DETRAC | benchmark | Suite |
| UA-DETRAC: A new | benchmark | and protocol for multi-object detection and tracking |
| UAL-Bench: The First Comprehensive Unusual Activity Localization | benchmark | |
| UAV-Human: A Large | benchmark | for Human Behavior Understanding with Unmanned Aerial Vehicles |
| UAV-Rain1k: A | benchmark | for Raindrop Removal from UAV Aerial Imagery |
| UAVPairs: A | benchmark | for match pair retrieval of large-scale UAV images |
| UBnormal: New | benchmark | for Supervised Open-Set Video Anomaly Detection |
| UG^2: a Video | benchmark | for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition |
| UIT-OpenViIC: An open-domain | benchmark | for evaluating image captioning in Vietnamese |
| Uli-Ri: A | benchmark | for Person Re-Identification With Quantitative Annotations |
| Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel | benchmark | |
| Ultra-High-Definition Image Restoration: New | benchmark | s and a Dual Interaction Prior-Driven Solution |
| UMPM | benchmark | : A multi-person dataset with synchronized video and motion capture data for evaluation of articulated human motion and interaction |
| Unbiasing the Estimation of Chlorophyll from Hyperspectral Images: A | benchmark | Dataset, Validation Procedure and Baseline Results |
| Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video | benchmark | s |
| Uncertainty-sensitive Activity Recognition: A Reliability | benchmark | and the CARING Models |
| uncompressed | benchmark | image dataset for colour imaging, An |
| Unconstrained | benchmark | Urdu Handwritten Sentence Database with Automatic Line Segmentation, An |
| Uncovering what, why and How: A Comprehensive | benchmark | for Causation Understanding of Video Anomaly |
| Underwater Image Enhancement | benchmark | Dataset and Beyond, An |
| Underwater Image Enhancement Quality Evaluation: | benchmark | Dataset and Objective Metric |
| Underwater Image Quality Assessment: | benchmark | Database and Objective Method |
| Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive | benchmark | Analysis |
| Underwater Photogrammetry: Potentialities and Problems Results of The | benchmark | Session of the 2019 Sifet Congress |
| Unidentified Video Objects: A | benchmark | for Dense, Open-World Segmentation |
| Unified Video Segmentation | benchmark | : Annotation, Metrics and Analysis, A |
| UniMod1K: Towards a More Universal Large-Scale Dataset and | benchmark | for Multi-modal Learning |
| UNIPEN project of on-line data exchange and recognizer | benchmark | s |
| UniRTL: A universal RGBT and low-light | benchmark | for object tracking |
| Universal Protocol to | benchmark | Camera Calibration for Sports, A |
| Unmanned Aerial Vehicle | benchmark | : Object Detection and Tracking, The |
| Unmanned Aerial Vehicle | benchmark | : Object Detection, Tracking and Baseline, The |
| Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance | benchmark | |
| Unveiling Deep Shadows: A Survey and | benchmark | on Image and Video Shadow Detection, Removal, and Generation in the Deep Learning Era |
| Unveiling the Anomalies in an Ever-Changing World: A | benchmark | for Pixel-Level Anomaly Detection in Continual Learning |
| UOW-Vessel: A | benchmark | Dataset of High-Resolution Optical Satellite Images for Vessel Detection and Segmentation |
| Urban Waterlogging Detection: A Challenging | benchmark | and Large-small Model Co-adapter |
| UrbanSARFloods: Sentinel-1 SLC-Based | benchmark | Dataset for Urban and Open-Area Flood Mapping |
| Urvos: Unified Referring Video Object Segmentation Network with a Large-scale | benchmark | |
| Using the Overlapping Score to Improve Corruption | benchmark | s |
| USOD10K: A New | benchmark | Dataset for Underwater Salient Object Detection |
| USTC-TD: A Test Dataset and | benchmark | for Image and Video Coding in 2020s |
| USVTrack: A | benchmark | for Multi-Object Tracking in Complex Water Surface Scenes |
| Utb180: A High-quality | benchmark | for Underwater Tracking |
| UVEB: A Large-scale | benchmark | and Baseline Towards Real-World Underwater Video Enhancement |
| V-RSIR: A Web-based Tool and | benchmark | Dataset for Remote Sensing Image Retrieval |
| VBench++: Comprehensive and Versatile | benchmark | Suite for Video Generative Models |
| VBench: Comprehensive | benchmark | Suite for Video Generative Models |
| vCLIMB: A Novel Video Class Incremental Learning | benchmark | |
| Vehicle detection in aerial imagery: A small target detection | benchmark | |
| Vehicle detection with sub-class training using R-CNN for the UA-DETRAC | benchmark | |
| Vehicle Lane Merge Visual | benchmark | |
| Vesselness Filters: A Survey with | benchmark | s Applied to Liver Imaging |
| VFHQ: A High-Quality Dataset and | benchmark | for Video Face Super-Resolution |
| VHRShips: An Extensive | benchmark | Dataset for Scalable Deep Learning-Based Ship Detection Applications |
| Video Class Agnostic Segmentation | benchmark | for Autonomous Driving |
| Video Crowd Localization With Multifocus Gaussian Neighborhood Attention and a Large-Scale | benchmark | |
| Video Quality Assessment of User Generated Content: A | benchmark | Study and a New Model |
| Video text detection and recognition: Dataset and | benchmark | |
| Video-Bench: Human-Aligned Video Generation | benchmark | |
| Video-MME: The First-Ever Comprehensive Evaluation | benchmark | of Multi-modal LLMs in Video Analysis |
| View-Based 3-D Model Retrieval: A | benchmark | |
| VIFB: A Visible and Infrared Image Fusion | benchmark | |
| VinaBench: | benchmark | for Faithful and Consistent Visual Narratives |
| VisDA: A Synthetic-to-Real | benchmark | for Visual Domain Adaptation |
| Visible-Thermal Tiny Object Detection: A | benchmark | Dataset and Baselines |
| Visible-Thermal UAV Tracking: A Large-Scale | benchmark | and New Baseline |
| Vision-Based Parking-Slot Detection: A DCNN-Based Approach and a Large-Scale | benchmark | Dataset |
| Visionary vigilance: Optimized YOLOV8 for fallen person detection with large-scale | benchmark | dataset |
| Visual Attention Modeling for Stereoscopic Video: A | benchmark | and Computational Model |
| Visual | benchmark | for Autonomous Driving in Open-Pit Mines, A |
| Visual Interestingness Prediction: A | benchmark | Framework and Literature Review |
| visual object tracking | benchmark | for cell motility in time-lapse imaging, A |
| Visual Perception for Navigation in Human Environments: The JackRabbot Human Body Pose Dataset and | benchmark | |
| Visual Question Answering, Datasets, | benchmark | s, Surveys |
| Visual Robustness | benchmark | for Visual Question Answering (VQA) |
| Visual tracking in camera-switching outdoor sport videos: | benchmark | and baselines for skiing |
| VL-RewardBench: A Challenging | benchmark | for Vision-Language Generative Reward Models |
| VNL-STES: A | benchmark | Dataset and Model for Spatiotemporal Event Spotting in Volleyball Analytics |
| VPCFormer: A transformer-based multi-view finger vein recognition model and a new | benchmark | |
| VSCHH 2023: A | benchmark | for the View Synthesis Challenge of Human Heads |
| WATB: Wild Animal Tracking | benchmark | |
| WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and | benchmark | s for Autonomous Driving on Water Surfaces |
| Weakly Supervised Learning Guided by Activation Mapping Applied to a Novel Citrus Pest | benchmark | |
| WebFace260M: A | benchmark | for Million-Scale Deep Face Recognition |
| WebFace260M: A | benchmark | Unveiling the Power of Million-Scale Deep Face Recognition |
| Webly Supervised Fine-Grained Recognition: | benchmark | Datasets and an Approach |
| WebUAV-3M: A | benchmark | for Unveiling the Power of Million-Scale Deep UAV Tracking |
| WEPDTOF: A Dataset and | benchmark | Algorithms for In-the-Wild People Detection and Tracking from Overhead Fisheye Cameras |
| When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework and a New | benchmark | |
| When Human Pose Estimation Meets Robustness: Adversarial Algorithms and | benchmark | s |
| When Pedestrian Detection Meets Multi-modal Learning: Generalist Model and | benchmark | Dataset |
| When Sketch Face Recognition Meets Mask Obfuscation: Database and | benchmark | |
| When Visual Grounding Meets Gigapixel-Level Large-Scale Scenes: | benchmark | and Approach |
| WHU-Railway3D: A Diverse Dataset and | benchmark | for Railway Point Cloud Semantic Segmentation |
| WHU-STree: A multi-modal | benchmark | dataset for street tree inventory |
| WIDER FACE: A Face Detection | benchmark | |
| Wifbs: A Web-Based Image Feature | benchmark | System |
| Wild Face Anti-Spoofing Challenge 2023: | benchmark | and Results |
| WildDash: Creating Hazard-Aware | benchmark | s |
| Wildfish++: A Comprehensive Fish | benchmark | for Multimedia Research |
| WIMANS: A | benchmark | Dataset for WIFI-based Multi-User Activity Sensing |
| WSRD: A Novel | benchmark | for High Resolution Image Shadow Removal |
| XFMP: A | benchmark | for Explainable Fine-Grained Abnormal Behavior Recognition on Medical Personal Protective Equipment |
| YACCLAB: Yet Another Connected Components Labeling | benchmark | |
| YouTube-8M: A Large-Scale Video Classification | benchmark | |
| ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation | benchmark | Dataset |
1131 for benchmark
| _ | benchmarking | _ |
| 3D Compression | benchmarking | with Mymultimediaworld.com |
| 4Seasons: | benchmarking | Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions |
| @BENCH: | benchmarking | Vision-Language Models for Human-centered Assistive Technology |
| Advanced | benchmarking | for Image Compositing Evaluation, An |
| Affect in Multimedia: | benchmarking | Violent Scenes Detection |
| AI for dating stars: a | benchmarking | study for gyrochronology |
| AIGV-Assessor: | benchmarking | and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM |
| Algorithm for | benchmarking | an SIMD Pyramid with the Abingdon Cross, An |
| Analysis and | benchmarking | of Extending Blind Face Image Restoration to Videos |
| Annotation and | benchmarking | of a Video Dataset under Degraded Complex Atmospheric Conditions and Its Visibility Enhancement Analysis for Moving Object Detection |
| ANTHROPOS-V: | benchmarking | the Novel Task of Crowd Volume Estimation |
| approach towards | benchmarking | of table structure recognition results, An |
| ARTeFACT: | benchmarking | Segmentation Models on Diverse Analogue Media Damage |
| Assessment and | benchmarking | of Spatially Enabled RDF Stores for the Next Generation of Spatial Data Infrastructure |
| BD Open LULC Map: High-Resolution Land Use Land Cover Mapping and | benchmarking | For Urban Development In Dhaka, Bangladesh |
| BenchLMM: | benchmarking | Cross-Style Visual Capability of Large Multimodal Models |
| benchmarking | 3D Face De-Identification with Preserving Facial Attributes |
| benchmarking | 3D Pose Estimation for Face Recognition |
| benchmarking | 6DOF Outdoor Visual Localization in Changing Conditions |
| benchmarking | a Multimodal and Multiview and Interactive Dataset for Human Action Recognition |
| benchmarking | a Reduced Multivariate Polynomial Pattern Classifier |
| benchmarking | Access Structures for High-Dimensional Multimedia Data |
| benchmarking | Adversarial Robustness on Image Classification |
| benchmarking | Algorithmic Bias in Face Recognition: An Experimental Approach Using Synthetic Faces and Human Evaluation |
| benchmarking | Anchor-Based and Anchor-Free State-of-the-Art Deep Learning Methods for Individual Tree Detection in RGB High-Resolution Images |
| benchmarking | and Analysis of Unsupervised Object Segmentation from Real-World Single Images |
| benchmarking | and Analyzing Generative Data for Visual Recognition |
| benchmarking | and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial Examples |
| benchmarking | and Error Diagnosis in Multi-instance Pose Estimation |
| benchmarking | and hardware implementation of JPEG-LS |
| benchmarking | and Improving Bird's Eye View Perception Robustness in Autonomous Driving |
| benchmarking | and quality analysis of DEM generated from high and very high resolution optical stereo satellite data |
| benchmarking | and scaling of deep learning models for land cover image classification |
| benchmarking | asymmetric 3D-2D face recognition systems |
| benchmarking | Audio Visual Segmentation for Long-Untrimmed Videos |
| benchmarking | Automatic Bundle Adjustment Results |
| benchmarking | Binarisation Schemes for Deep Face Templates |
| benchmarking | Cameras for Open VSLAM Indoors |
| benchmarking | Campaign for the Multimodal Detection of Violent Scenes in Movies, A |
| benchmarking | classification models for emotion recognition in natural speech: A multi-corporal study |
| benchmarking | Close-range Structure from Motion 3D Reconstruction Software Under Varying Capturing Conditions |
| benchmarking | commercial OCR engines for technical drawings indexing |
| benchmarking | Data Efficiency and Computational Efficiency of Temporal Action Localization Models |
| benchmarking | Dataset for Performance Evaluation of Automatic Surface Reconstruction Algorithms, A |
| benchmarking | Datasets for Breast Cancer Computer-Aided Diagnosis (CADx) |
| benchmarking | Deep Learning for On-Board Space Applications |
| benchmarking | Deep Learning Models for Cloud Detection in Landsat-8 and Sentinel-2 Images |
| benchmarking | deep learning techniques for face recognition |
| benchmarking | deep models on retinal fundus disease diagnosis and a large-scale dataset |
| benchmarking | deep models on salient object detection |
| benchmarking | Denoising Algorithms with Real Photographs |
| benchmarking | Different SfM-MVS Photogrammetric and iOS LiDAR Acquisition Methods for the Digital Preservation of a Short-Lived Excavation: A Case Study from an Area of Sinkhole Related Subsidence |
| benchmarking | discriminative approaches for word spotting in handwritten documents |
| benchmarking | Elevation Plus Land Surface Parameters Finds FathomDEM and Copernicus DEM Win as Best Global DEMs |
| benchmarking | equivariance for Deep Learning based optical flow estimators |
| benchmarking | Facial Image Analysis Technologies |
| benchmarking | Facial Image Analysis Technologies (BeFIT) |
| benchmarking | federated learning for semantic datasets: Federated scene graph generation |
| benchmarking | Framework for SAR Despeckling |
| benchmarking | Geometry-Based Leaf-Filtering Algorithms for Tree Volume Estimation Using Terrestrial LiDAR Scanners |
| benchmarking | Geospatial High-Value Data Openness Using GODI Plus Methodology: A Regional Level Case Study |
| benchmarking | GPU-Based Phase Correlation for Homography-Based Registration of Aerial Imagery |
| benchmarking | graph-based clustering algorithms |
| benchmarking | Head Pose Estimation in-the-Wild |
| benchmarking | HEp-2 Cells Classification Methods |
| benchmarking | HEp-2 specimen cells classification using linear discriminant analysis on higher order spectra features of cell shape |
| benchmarking | High Density Image Matching for Oblique Airborne Imagery |
| benchmarking | Hough Transform Architectures for Real-Time |
| benchmarking | human face similarity using identical twins |
| benchmarking | Image Classifiers for Physical Out-of-Distribution Examples Detection |
| benchmarking | Image Retrieval Diversification Techniques for Social Media |
| benchmarking | Image Retrieval for Visual Localization |
| benchmarking | Image Segmentation Algorithms |
| benchmarking | Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAM |
| benchmarking | Initiative for Multimedia Evaluation: MediaEval 2016, The |
| benchmarking | Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning |
| benchmarking | large-scale Fine-Grained Categorization |
| benchmarking | Laryngeal Neoplasm Segmentation: A Multicenter Dataset and an Effective Method |
| benchmarking | Local Orientation Extraction in Fingerprint Recognition |
| benchmarking | Localization for Augmented Reality in Large Scale Environments |
| benchmarking | Low-Light Image Enhancement and Beyond |
| benchmarking | Low-Shot Robustness to Natural Distribution Shifts |
| benchmarking | Machine Learning Algorithms for Instantaneous Net Surface Shortwave Radiation Retrieval Using Remote Sensing Data |
| benchmarking | Micro-Action Recognition: Dataset, Methods, and Applications |
| benchmarking | Mobile Laser Scanning Systems Using A Permanent Test Field |
| benchmarking | Monocular Depth Estimation Models for VR Content Creation from a User Perspective |
| benchmarking | MSWEP Precipitation Accuracy in Arid Zones Against Traditional and Satellite Measurements |
| benchmarking | Multi-Modal Semantic Segmentation Under Sensor Failures: Missing and Noisy Modality Robustness |
| benchmarking | Multi-target Tracking MOTChallenge |
| benchmarking | Object Detection Robustness against Real-World Corruptions |
| benchmarking | Object Detectors under Real-World Distribution Shifts in Satellite Imagery |
| benchmarking | Object Detectors with Coco: A New Path Forward |
| benchmarking | of algorithms for 3D tissue reconstruction |
| benchmarking | of algorithms for automatic correspondence localisation |
| benchmarking | of Blind Video Deblurring Methods on Long Exposure and Resource Poor Settings |
| benchmarking | of Bootstrap Temporal Stereo using Statistical and Physical Scene Modelling |
| benchmarking | of Convolutional Neural Network Approaches for Vegetation Land Cover Mapping |
| benchmarking | of data fusion algorithms in support of earth observation based Antarctic wildlife monitoring |
| benchmarking | of Deep Architectures for Segmentation of Medical Images |
| benchmarking | of Fingerprint Sensors |
| benchmarking | of High-resolution Land Cover Maps In Africa |
| benchmarking | of Image Registration Methods for Differently Stained Histological Slides |
| benchmarking | of Individual Tree Segmentation Methods in Mediterranean Forest Based on Point Clouds from Unmanned Aerial Vehicle Imagery and Low-Density Airborne Laser Scanning |
| benchmarking | of Natural Scene Image Dataset In Degraded Conditions for Visibility Enhancement |
| benchmarking | of objective quality metrics for HDR image quality assessment |
| benchmarking | of Update Learning Strategies on Digit Classifier Systems |
| benchmarking | of wildland fire colour segmentation algorithms |
| benchmarking | Omni-Vision Representation Through the Lens of Visual Realms |
| benchmarking | Out-of-Distribution Detection in Visual Question Answering |
| benchmarking | Page Segmentation Algorithms |
| benchmarking | parts based face processing in-the-wild for gender recognition and head pose estimation |
| benchmarking | Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD) |
| benchmarking | Performance of Object Detection Under Image Distortions in an Uncontrolled Environment |
| benchmarking | Platform for Mitotic Cell Classification of ANA IIF HEp-2 Images, A |
| benchmarking | Probabilistic Deep Learning Methods for License Plate Recognition |
| benchmarking | Protocol for Watermarking Methods, A |
| benchmarking | Representation Learning for Natural World Image Collections |
| benchmarking | result diversification in social image retrieval |
| benchmarking | Robots by Inducing Failures in Competition Scenarios |
| benchmarking | Robustness Beyond LP Norm Adversaries |
| benchmarking | Robustness in Neural Radiance Fields |
| benchmarking | Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving |
| benchmarking | Robustness to Text-Guided Corruptions |
| benchmarking | Segmentation Models with Mask-Preserved Attribute Editing |
| benchmarking | Self-Supervised Learning on Diverse Pathology Datasets |
| benchmarking | Single-Image Dehazing and Beyond |
| benchmarking | Single-Image Reflection Removal Algorithms |
| benchmarking | Single-Image Reflection Removal Algorithms |
| benchmarking | Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking Sequences |
| benchmarking | Spurious Bias in Few-shot Image Classifiers |
| benchmarking | Stereo Data (Not the Matching Algorithms) |
| benchmarking | Still-to-Video Face Recognition via Partial and Local Linear Discriminant Analysis on COX-S2V Dataset |
| benchmarking | Texture Classification Algorithms |
| benchmarking | the Applicability of Ontology in Geographic Object-Based Image Analysis |
| benchmarking | the Complementary-View Multi-human Association and Tracking |
| benchmarking | the Retrieval of Biomass in Boreal Forests Using P-Band SAR Backscatter with Multi-Temporal C- and L-Band Observations |
| benchmarking | the Robustness of Cross-view Geo-localization Models |
| benchmarking | the Robustness of LiDAR Semantic Segmentation Models |
| benchmarking | the Robustness of LiDAR-Camera Fusion for 3D Object Detection |
| benchmarking | the Robustness of Semantic Segmentation Models |
| benchmarking | the Robustness of Semantic Segmentation Models with Respect to Common Corruptions |
| benchmarking | the Robustness of Temporal Action Detection Models Against Temporal Corruptions |
| benchmarking | tool for MAV visual pose estimation, A |
| benchmarking | tree instance segmentation of terrestrial laser scanning point clouds |
| benchmarking | Two Algorithms for People Detection from Top-View Depth Cameras |
| benchmarking | Ultra-High-Definition Image Super-resolution |
| benchmarking | Under- and Above-Canopy Laser Scanning Solutions for Deriving Stem Curve and Volume in Easy and Difficult Boreal Forest Conditions |
| benchmarking | Visual Localization for Autonomous Navigation |
| benchmarking | VLMs' Reasoning About Persuasive Atypical Images |
| benchmarking | Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity |
| Better, Faster Small Hazard Detection: Instance-Aware Techniques, Metrics and | benchmarking | |
| Beyond Supervised vs. Unsupervised: Representative | benchmarking | and Analysis of Image Representation Learning |
| BlenderGym: | benchmarking | Foundational Model Systems for Graphics Editing |
| Bongard-HOI: | benchmarking | Few-Shot Visual Reasoning for Human-Object Interactions |
| Boundary Detection | benchmarking | : Beyond F-Measures |
| Building Better Models: | benchmarking | Feature Extraction and Matching for Structure from Motion at Construction Sites |
| Classifying Anti-nuclear Antibodies HEp-2 Images: A | benchmarking | Platform |
| Closer Look at | benchmarking | Self-supervised Pre-training with Image Classification, A |
| ComfyBench: | benchmarking | LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems |
| Comparison of watermarking algorithms via a GA-based | benchmarking | tool |
| Completely Automated Multiresolution Edge Snapper: A New Technique for an Accurate Carotid Ultrasound IMT Measurement: Clinical Validation and | benchmarking | on a Multi-Institutional Database |
| Comprehensive | benchmarking | Framework for Sentinel-2 Sharpening: Methods, Dataset, and Evaluation Metrics, A |
| Comprehensive Database for | benchmarking | Imaging Systems, A |
| Comprehensive Multi-Illuminant Dataset for | benchmarking | of the Intrinsic Image Algorithms, A |
| Comprehensive Study on Robustness of Image Classification Models: | benchmarking | and Rethinking, A |
| corpus for | benchmarking | of people detection algorithms, A |
| CoSpace: | benchmarking | Continuous Space Perception Ability for Vision-Language Models |
| COUNTS: | benchmarking | Object Detectors and Multimodal Large Language Models under Distribution Shifts |
| Crop Water Productivity Mapping and | benchmarking | Using Remote Sensing and Google Earth Engine Cloud Computing |
| CrowdFaceDB: Database and | benchmarking | for face verification in crowd |
| CXPMRG-Bench: Pre-training and | benchmarking | for X-ray Medical Report Generation on CheXpert Plus Dataset |
| DaisyRec 2.0: | benchmarking | Recommendation for Rigorous Evaluation |
| Dataset for | benchmarking | Image-Based Localization, A |
| Design of supervision-scalable learning systems: Methodology and performance | benchmarking | |
| DexArt: | benchmarking | Generalizable Dexterous Manipulation with Articulated Objects |
| Digital Elevation Model Intercomparison Experiment Demix, A Community-based Approach at Global DEM | benchmarking | , The |
| Dissecting Dissonance: | benchmarking | Large Multimodal Models Against Self-contradictory Instructions |
| Dora: Sampling and | benchmarking | for 3D Shape Variational Auto-Encoders |
| ECVnet Workshop on | benchmarking | |
| EDFace-Celeb-1 M: | benchmarking | Face Hallucination With a Million-Scale Dataset |
| Efficient | benchmarking | via Bias-Bounded Subset Selection |
| EgoPlan-Bench: | benchmarking | Multimodal Large Language Models for Human-Level Planning |
| Empirical Investigation into | benchmarking | Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification, An |
| ENRICH: Multi-purposE dataset for | benchmarking | In Computer vision and pHotogrammetry |
| EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and | benchmarking | |
| EvalCrafter: | benchmarking | and Evaluating Large Video Generation Models |
| Evaluating SEE-a | benchmarking | system for document page segmentation |
| EventAid: | benchmarking | Event-Aided Image/Video Enhancement Algorithms With Real-Captured Hybrid Dataset |
| Extended StirTrace | benchmarking | of biometric and forensic qualities of morphed face images |
| Extracting Compact Information from Image | benchmarking | Tools: The SAR Despeckling Case |
| face biometric | benchmarking | review and characterisation, A |
| FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for | benchmarking | Face Perception MLLMs |
| Feature Detection Performance Based | benchmarking | of Motion Deblurring Methods: Applications to Vision for Legged Robots |
| FLHetBench: | benchmarking | Device and State Heterogeneity in Federated Learning |
| FPGA-Based Hardware Accelerator for CNNs Inference on Board Satellites: | benchmarking | with Myriad 2-Based Solution for the CloudScout Case Study, An |
| FPGA-Based On-Board Hyperspectral Imaging Compression: | benchmarking | Performance and Energy Efficiency against GPU Implementations |
| FRAMES-VQA: | benchmarking | Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering |
| framework for developing and | benchmarking | sampling and denoising algorithms for Monte Carlo rendering, A |
| FreeMan: Towards | benchmarking | 3D Human Pose Estimation Under Real-World Conditions |
| From 2D Silhouettes to 3D Object Retrieval: Contributions and | benchmarking | |
| GenderBias-VL: | benchmarking | Gender Bias in Vision Language Models via Counterfactual Probing |
| GeoNet: | benchmarking | Unsupervised Adaptation across Geographies |
| GMOBench: | benchmarking | generic moving objects |
| Good at captioning, bad at counting: | benchmarking | GPT-4V on Earth observation data |
| GPU-Supported Image Compression for Remote Visualization: Realization and | benchmarking | |
| Ground Truth Correspondence Measure for | benchmarking | , A |
| Ground-Truthing and | benchmarking | Document Page Segmentation |
| Guidelines for Underwater Image Enhancement Based on | benchmarking | of Different Methods |
| Handwritten Digit Recognition: | benchmarking | of State-of-the-Art Techniques |
| Haze visibility enhancement: A Survey and quantitative | benchmarking | |
| Holographic Data Coding: | benchmarking | and Extending HEVC With Adapted Transforms |
| Humanrefiner: | benchmarking | Abnormal Human Generation and Refining with Coarse-to-fine Pose-reversible Guidance |
| Illusory VQA: | benchmarking | and Enhancing Multimodal Models on Visual Illusions |
| image database for | benchmarking | of automatic face detection and recognition algorithms, An |
| ImageNet-D: | benchmarking | Neural Network Robustness on Diffusion Synthetic Object |
| ImageNet-E: | benchmarking | Neural Network Robustness via Attribute Editing |
| ImageNet-Patch: A dataset for | benchmarking | machine learning robustness against adversarial patches |
| Improving biometric recognition by means of score ratio, the likelihood ratio for non-probabilistic classifiers. A | benchmarking | study |
| International | benchmarking | of terrestrial laser scanning approaches for forest inventories |
| International | benchmarking | of the Individual Tree Detection Methods for Modeling 3-D Canopy Structure for Silviculture and Forest Ecology Using Airborne Laser Scanning |
| InViG: | benchmarking | Open-Ended Interactive Visual Grounding with 500K Dialogues |
| Is Synthetic Data all We Need? | benchmarking | the Robustness of Models Trained with Synthetic Images |
| K-Sort Arena: Efficient and Reliable | benchmarking | for Generative Models via K-wise Human Preferences |
| LaMAR: | benchmarking | Localization and Mapping for Augmented Reality |
| Land8Fire: A Complete Study on Wildfire Segmentation Through Comprehensive Review, Human-Annotated Multispectral Dataset, and Extensive | benchmarking | |
| large database of graphs and its use for | benchmarking | graph isomorphism algorithms, A |
| Lazy Man's Approach to | benchmarking | : Semisupervised Classifier Evaluation and Recalibration, A |
| LidarGait: | benchmarking | 3D Gait Recognition with Point Clouds |
| Loli-street: | benchmarking | Low-light Image Enhancement and Beyond |
| Manipal-UAV person detection dataset: A step towards | benchmarking | dataset and algorithms for small object detection |
| Measures for | benchmarking | of Automatic Correspondence Algorithms |
| METU dataset: A big dataset for | benchmarking | trademark retrieval |
| MLVU: | benchmarking | Multi-task Long Video Understanding |
| MotionBench: | benchmarking | and Improving Fine-Grained Video Motion Understanding for Vision Language Models |
| Multi-Camera Action Dataset for Cross-Camera Action Recognition | benchmarking | |
| Multi-Source Image Matching Algorithms for UAV Positioning: | benchmarking | , Innovation, and Combined Strategies |
| Multimodal mathematical reasoning embedded in aerial vehicle imagery: | benchmarking | , analysis, and exploration |
| NATS-Bench: | benchmarking | NAS Algorithms for Architecture Topology and Size |
| natural and synthetic corpus for | benchmarking | of hand gesture recognition systems, A |
| NICO++: Towards Better | benchmarking | for Domain Generalization |
| Novel Pre-Processing Approach and | benchmarking | Analysis for Faster, Robust, and Improved Small Object Detection Methods, A |
| Novel Video Dataset for Change Detection | benchmarking | , A |
| Omnia de EgoTempo: | benchmarking | Temporal Understanding of Multi-Modal LLMs in Egocentric Videos |
| OmniDocBench: | benchmarking | Diverse PDF Document Parsing with Comprehensive Annotations |
| On | benchmarking | camera calibration and multi-view stereo for high resolution imagery |
| On | benchmarking | of document analysis systems |
| On | benchmarking | of Invoice Analysis Systems |
| On | benchmarking | Optical Flow |
| Online and offline handwritten Chinese character recognition: | benchmarking | on new databases |
| Open Architecture for End-to-End Document Analysis | benchmarking | , An |
| OpenCIL: | benchmarking | out-of-distribution detection in class incremental learning |
| Optical Water Type Guided | benchmarking | of Machine Learning Generalization for Secchi Disk Depth Retrieval |
| Optimization of H.263 video encoding using a single processor computer: performance tradeoffs and | benchmarking | |
| Optimizing traffic signal control for continuous-flow intersections: | benchmarking | against a state-of-practice model |
| Pairwise-Comparison-Based Rank Learning for | benchmarking | Image Restoration Algorithms |
| Para-Lane: Multi-Lane Dataset Registering Parallel Scans for | benchmarking | Novel View Synthesis |
| Partly First Among Equals: Semantic Part-Based | benchmarking | for State-of-the-Art Object Recognition Systems |
| PathBench: A | benchmarking | Platform for Classical and Learned Path Planning Algorithms |
| Performance | benchmarking | of RVC based multimedia specifications |
| Performance Evaluation and | benchmarking | of Algorithms or Systems for Calibration, Orientation and Surface Reconstruction |
| Performance Evaluation and | benchmarking | of Six Texture-Based Feature Sets for Segmenting Historical Documents |
| Performance Evaluation and | benchmarking | of Six-Page Segmentation Algorithms |
| PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel | benchmarking | and Prompt Engineering Approach |
| Pink Panther: A Complete Environment for Ground Truthing and | benchmarking | Document Page Segmentation |
| Plenoptic 2.0 Toolbox: | benchmarking | of Depth Estimation Methods for MLA-Based Focused Plenoptic Cameras, The |
| PP4AV: A | benchmarking | Dataset for Privacy-preserving Autonomous Driving |
| Predbench: | benchmarking | Spatio-temporal Prediction Across Diverse Disciplines |
| Putting Nonnegative Matrix Factorization to the Test: A tutorial derivation of pertinent Cramer-Rao bounds and performance | benchmarking | |
| Quality Index for | benchmarking | Image Inpainting Algorithms with Guided Regional Statistics |
| RCP-Bench: | benchmarking | Robustness for Collaborative Perception Under Diverse Corruptions |
| RCV2023 Challenges: | benchmarking | Model Training and Inference for Resource-Constrained Deep Learning |
| Real-IAD: A Real-World Multi-View Dataset for | benchmarking | Versatile Industrial Anomaly Detection |
| Real-world Blur Dataset for Learning and | benchmarking | Deblurring Algorithms |
| Recording and Playback of Camera Shake: | benchmarking | Blind Deconvolution with a Real-World Database |
| Revisiting Oxford and Paris: Large-Scale Image Retrieval | benchmarking | |
| RobustNav: Towards | benchmarking | Robustness in Embodied Navigation |
| RoDLA: | benchmarking | the Robustness of Document Layout Analysis Models |
| R^2-Bench: | benchmarking | the Robustness of Referring Perception Models Under Perturbations |
| Saliency | benchmarking | Made Easy: Separating Models, Maps and Metrics |
| Satellite-based Measurements For | benchmarking | Regional Irrigation Performance In Goulburn-murray Catchment |
| Scaling and | benchmarking | Self-Supervised Visual Representation Learning |
| Securing the Skies: a Comprehensive Survey on Anti-Uav Methods, | benchmarking | , and Future Directions |
| SEED-Bench: | benchmarking | Multimodal Large Language Models |
| Semantic Correspondence: Unified | benchmarking | and a Strong Baseline |
| Set-Based | benchmarking | Method for Address Block Location on Arbitrarily Complex Grey Level Images, A |
| SHOWMe: | benchmarking | Object-agnostic Hand-Object 3D Reconstruction |
| Sim2e: | benchmarking | the Group Equivariant Capability of Correspondence Matching Algorithms |
| Six-CD: | benchmarking | Concept Removals for Text-to-image Diffusion Models |
| Sketchtopia: A Dataset and Foundational Agents for | benchmarking | Asynchronous Multimodal Communication with Iconic Feedback |
| SMPLy | benchmarking | 3D Human Pose Estimation in the Wild |
| Spectral View of Randomized Smoothing Under Common Corruptions: | benchmarking | and Improving Certified Robustness, A |
| SQAD: Automatic Smartphone Camera Quality Assessment and | benchmarking | |
| SUM Parts: | benchmarking | Part-Level Semantic Segmentation of Urban Meshes |
| Sun-Induced Chlorophyll Fluorescence III: | benchmarking | Retrieval Methods and Sensor Characteristics for Proximal Sensing |
| Survey on Efficient Vision Transformers: Algorithms, Techniques, and Performance | benchmarking | , A |
| SVLTA: | benchmarking | Vision-Language Temporal Alignment via Synthetic Video Situation |
| T2VBench: | benchmarking | Temporal Dynamics for Text-to-Video Generation |
| TACO: | benchmarking | Generalizable Bimanual Tool-ACtion-Object Understanding |
| TARGO and TARGO-Net: | benchmarking | Target-Driven Object Grasping Under Occlusions |
| Territorial Competitiveness and Smart City: | benchmarking | Analysis Of Dubai, Abu Dhabi, Riyadh, Cairo, and Rabat |
| Texture feature | benchmarking | and evaluation for historical document image analysis |
| Time-Series FY4A Datasets for Super-Resolution | benchmarking | of Meteorological Satellite Images |
| TIMTQE: | benchmarking | Machine Translation Quality Estimation for Text Images |
| Toward Bridging the Simulated-to-Real Gap: | benchmarking | Super-Resolution on Real Data |
| Towards | benchmarking | and Assessing Visual Naturalness of Physical World Adversarial Attacks |
| Towards | benchmarking | Automated Calibration, Orientation and Surface Reconstruction from Images |
| Towards | benchmarking | of real-world stereo data |
| Towards | benchmarking | Scene Background Initialization |
| Towards Causal | benchmarking | of Bias in Face Analysis Algorithms |
| Towards Cyberbullying Detection: Building, | benchmarking | and Longitudinal Analysis of Aggressiveness and Conflicts/Attacks Datasets From Twitter |
| Towards Efficient | benchmarking | of Foundation Models in Remote Sensing: A Capabilities Encoding Approach |
| Tune It or Don't Use It: | benchmarking | Data-Efficient Image Classification |
| UGC-VQA: | benchmarking | Blind Video Quality Assessment for User Generated Content |
| UNIIR: Training and | benchmarking | Universal Multimodal Information Retrievers |
| Unsupervised and Semi-supervised Bias | benchmarking | in Face Recognition |
| User variance and its impact on video retrieval | benchmarking | |
| VELOCITI: | benchmarking | Video-Language Compositional Reasoning with Strict Entailment |
| VG-SSL: | benchmarking | Self-Supervised Representation Learning Approaches for Visual Geo-Localization |
| Video is Worth 10,000 Words: Training and | benchmarking | with Diverse Captions for Better Long Video Retrieval, A |
| Virtual Environment Tool for | benchmarking | Face Analysis Systems, A |
| VISCO: | benchmarking | Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning |
| Visual appearance based document classification methods: Performance evaluation and | benchmarking | |
| Visual attention quality database for | benchmarking | performance evaluation metrics |
| Which Vegetation Index? | benchmarking | Multispectral Metrics to Hyperspectral Mixture Models in Diverse Cropland |
| WildVideo: | benchmarking | LMMs for Understanding Video-Language Interaction |
| Workshop on | benchmarking | Facial Image Analysis Technologies |
| Workshop on | benchmarking | Multi-target Tracking |
| XLD: A Cross-Lane Dataset for | benchmarking | Novel Driving View Synthesis |
| YOLOBench: | benchmarking | Efficient Object Detectors on Embedded Systems |
329 for benchmarking