| _ | only | _ |
| 140-MHz 94 K Gates HD1080p 30-Frames/s Intra- | only | Profile H.264 Encoder, A |
| 3D-aware virtual try-on using | only | 2D inputs |
| A&B BNN: Add&Bit-Operation- | only | Hardware-Friendly Binary Neural Network |
| Acquisition Of 3d Information For Vanished Structure By Using | only | An Ancient Picture |
| Adapt | only | once: Fast unsupervised person re-identification via relevance-aware guidance |
| Adaptive Beampattern Synthesis for Partially-Clustered Array Through K-Means and Amplitude- | only | MVDR |
| Adaptive Bearing- | only | Formation Tracking Control for Nonholonomic Multiagent Systems |
| Adaptive Formation Control of Networked Robotic Systems With Bearing- | only | Measurements |
| Adaptive Formation Tracking Control of Multiple Vertical Takeoff and Landing UAVs With Bearing- | only | Measurements |
| Affixal approach for Arabic decomposable vocabulary recognition a validation on printed word in | only | one font |
| Algorithm of Adaptive Fading Memory UKF in Bearings- | only | Target Tracking |
| ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels | only | |
| Amplitude- | only | log Radon transform for geometric invariant shape descriptor |
| Anatomy-Aware MR-Imaging- | only | Radiotherapy |
| Assessment of IRNSS- | only | Data Processing: Availability, Single-Frequency SPP and Short-Baseline RTK |
| Attenuation map estimation with SPECT emission data | only | |
| AttGAN: Facial Attribute Editing by | only | Changing What You Want |
| Auto QA: The Question Is Not | only | What, but Also Where |
| Automatic frontal face annotation and AAM building for arbitrary expressions from a single frontal image | only | |
| Automatic Kappa Angle Estimation For Air Photos Based On Phase | only | Correlation |
| Automatic Recognition of Partial Shoeprints Based on Phase- | only | Correlation |
| Autonomous Manipulation Learning for Similar Deformable Objects via | only | One Demonstration |
| Autonomous robot exploration and cognitive map building in unknown environments using omnidirectional visual information | only | |
| BasicTAD: An astounding RGB- | only | baseline for temporal action detection |
| Bearing- | only | Formation Control With Prespecified Convergence Time |
| Bearings- | only | Target Tracking with an Unbiased Pseudo-Linear Kalman Filter |
| Bias-Compensated Diffusion Pseudolinear Kalman Filter Algorithm for Censored Bearings- | only | Target Tracking |
| Biases in CloudSat Falling Snow Estimates Resulting from Daylight- | only | Operations |
| Boundary-Active- | only | Adaptive Power-Reduction Scheme for Region-Growing Video-Segmentation |
| Brima: Low-Overhead Browser- | only | Image Annotation Tool (Preprint) |
| Bus Passenger Origin-Destination Flow Estimation Using Entry- | only | Smartcard Data: A Self-Supervised Learning Method Without Alighting Data |
| Cam4DOcc: Benchmark for Camera- | only | 4D Occupancy Forecasting in Autonomous Driving Applications |
| Camera- | only | 3D Panoptic Scene Completion for Autonomous Driving Through Differentiable Object Shapes |
| CAMixerSR: | only | Details Need More Attention |
| CERES Energy Balanced and Filled (EBAF) from Afternoon- | only | Satellite Orbits |
| Chasing Shadows: Solving Deepfake Detection Benchmarks Using Irrelevant Features | only | |
| CLIPPO: Image-and-Language Understanding from Pixels | only | |
| Clothes-Changing Person Re-identification with RGB Modality | only | |
| CNN Depression Severity Level Estimation from Upper Body vs. Face- | only | Images |
| Codestream Domain Scrambling of Moving Objects Based on DCT Sign- | only | Correlation for Motion JPEG Movies |
| Color categories | only | affect post-perceptual processes when same- and different-category colors are equally discriminable |
| Comment on a Method for Computing Points of a Circle Using | only | Integers |
| COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint- | only | Modality |
| Comprehensive Study of Decoder- | only | LLMs for Text-to-Image Generation, A |
| Computed tomography image generation from magnetic resonance imaging using Wasserstein metric for MR- | only | radiation therapy |
| Computing similarity transformations from | only | image correspondences |
| Continuous-Time Range- | only | Pose Estimation |
| Convergence comparison of least squares based bearing- | only | SLAM algorithms using different landmark parametrizations |
| Cross-Lingual Universal Dependency Parsing | only | From One Monolingual Treebank |
| D'OH: Decoder- | only | Random Hypernetworks for Implicit Neural Representations |
| Data-Independent Phase- | only | Beamforming of FDA-MIMO Radar for Swarm Interference Suppression |
| Decoder- | only | Image Registration |
| DecoderTracker: Decoder- | only | end-to-end method for multiple-object tracking |
| Deep Learning for Land Cover Classification Using | only | a Few Bands |
| Deep-learning architecture to forecast destinations of bus passengers from entry- | only | smart-card data |
| Designing Incoherent Frames With | only | Matrix Vector Multiplications |
| Development and Initial Results of a Brain PET Insert for Simultaneous 7-Tesla PET/MRI Using an FPGA- | only | Signal Digitization Method |
| Direct Approach for Local Quasi-Geoid Modeling Based on Spherical Radial Basis Functions Using a Noisy Satellite- | only | Global Gravity Field Model, A |
| Direct Methods for Evaluating the Planarity and Rigidity of a Surface Using | only | 2D Views |
| Discrimination Between Native and Non-Native Speech Using Visual Features | only | |
| Distributed Particle Filter for Bearings- | only | Tracking on Spherical Surfaces, A |
| Double-random-phase encryption with photon counting for image authentication using | only | the amplitude of the encrypted image |
| Drawing an Automatic Sketch of Deformable Objects Using | only | a Few Images |
| DSM-to-DTM Reconstruction Using | only | DSM-Derived Inputs with Residual Learning and CSF Priors |
| DTrOCR: Decoder- | only | Transformer for Optical Character Recognition |
| EEG-Guided Adversarial Alignment for EOG- | only | Vigilance Estimation |
| Effective intra- | only | rate control for H.264/AVC |
| Efficient Coding and Mapping Algorithms for Software- | only | Real-Time Video Coding at Low Bit Rates |
| Efficient Compression of Amplitude- | only | Images for the Image Trading System, An |
| Efficient parallel architecture of an intra- | only | scalable multi-layer HEVC encoder |
| Ego- | only | : Egocentric Action Detection without Exocentric Transferring |
| EM algorithm for estimating SPECT emission and transmission parameters from emission data | only | , An |
| Encoder- | only | Image Registration |
| Encrypting | only | AC coefficient signs considered harmful |
| Energy consumption analysis and modelling of a H.264/AVC intra- | only | based encoder dedicated to WVSNs |
| Estimating a Route Travel Time Distribution Function With Segment Correlations Using Segment-Level Travel Time Data | only | : A Moment-Based Method |
| Estimating Plant Nitrogen by Developing an Accurate Correlation between VNIR- | only | Vegetation Indexes and the Normalized Difference Nitrogen Index |
| Estimating Squinted SAR Data: An Efficient Multivariate Minimization Approach Using | only | Essential 3-D Target Information |
| Estimating the Number of Correct Matches Using | only | Spatial Order |
| Euclidean reconstruction of a circular truncated cone | only | from its uncalibrated contours |
| Evaluating Classification Performance with | only | Positive and Unlabeled Samples |
| Experimental Comparison of Optical Binary Phase- | only | Filter and High-Pass Matched Filter Correlation |
| Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation | only | |
| Fast Edge- | only | Matching Techniques for Robot Pattern Matching |
| Fast Intrinsic-Extrinsic Calibration for Pose- | only | Structure-from-Motion |
| Few-Shot Learning of Video Action Recognition | only | Based on Video Contents |
| Fine-grained classification of identity document types with | only | one example |
| Finger-Knuckle-Print Verification Based on Band-Limited Phase- | only | Correlation |
| Fixed-Time Slip Control With Extended-State Observer Using | only | Wheel Speed for Anti-Lock Braking Systems of Electric Vehicles |
| For your eyes | only | |
| Fractal dimension, | only | a fraction of the truth? |
| Framework for Scalable Vision- | only | Navigation, A |
| framework of three-dimensional object recognition which needs | only | a few reference images, A |
| Fusion Unbiased Pseudo-Linear Kalman Filter-Based Bearings- | only | Target Tracking |
| Gait Recognition Based on Modified Phase | only | Correlation |
| Gait Recognition Based on Modified Phase- | only | Correlation |
| Gaussian-Sum Cubature Kalman Filter with Improved Robustness for Bearings- | only | Tracking |
| general quantitative cryptanalysis of permutation- | only | multimedia ciphers against plaintext attacks, A |
| Generalized Labeled Multi-Bernoulli Multi-Target Tracking with Doppler- | only | Measurements |
| Geolocation Correction for Geostationary Satellite Observations by a Phase- | only | Correlation Method Using a Visible Channel |
| Globally Optimal Relative Pose and Scale Estimation from | only | Image Correspondences with Known Vertical Direction |
| GMM-SVM Fingerprint Verification Based on Minutiae | only | |
| Gradual Domain Adaptation with Pseudo-Label Denoising for SAR Target Recognition When Using | only | Synthetic Data for Training |
| GraphVid: It | only | Takes a Few Nodes to Understand a Video |
| HeightFormer: Explicit Height Modeling Without Extra Data for Camera- | only | 3D Object Detection in Bird's Eye View |
| High definition video intra- | only | coding based on node-cell macroblock pixel structure and 2-D interleaved DCT |
| High quality facial expression recognition in video streams using shape related information | only | |
| High-Accuracy Rotation Estimation Algorithm Based on 1D Phase- | only | Correlation, A |
| How far can you get with a modern face recognition test set using | only | simple features? |
| How to Make AdaBoost.M1 Work for Weak Base Classifiers by Changing | only | One Line of the Code |
| Hybrid forests for left ventricle segmentation using | only | the first slice label |
| I can't believe there's no images!: Learning Visual Tasks Using | only | Language Supervision |
| I | only | Have Eyes for You: The Impact of Masks On Convolutional-Based Facial Expression Recognition |
| I-ViT: Integer- | only | Quantization for Efficient Vision Transformer Inference |
| Identifying Vegetation in Arid Regions Using Object-Based Image Analysis with RGB- | only | Aerial Imagery |
| Illumination Strategies for Intensity- | only | Imaging |
| Image scale and rotation from the phase- | only | bispectrum |
| Image- | only | Real-Time Incremental UAV Image Mosaic for Multi-Strip Flight |
| Impacts of Species Misidentification on Species Distribution Modeling with Presence- | only | Data |
| Improved Hatch Filter Algorithm towards Sub-Meter Positioning Using | only | Android Raw GNSS Measurements without External Augmentation Corrections, An |
| Improved Medium Baseline RTK Positioning Performance Based on BDS/Galileo/GPS Triple-Frequency- | only | Observations |
| Improved Sea State Bias Estimation for Altimeter Reference Missions With Altimeter- | only | Three-Parameter Models |
| Improving Angle- | only | Orbit Determination Accuracy for Earth-Moon Libration Orbits Using a Neural-Network-Based Approach |
| Improving Forest Height Retrieval by Reducing the Ambiguity of Volume- | only | Coherence Using Multi-Baseline PolInSAR Data |
| influence of communication bandwidth on target tracking with angle | only | measurements from two platforms, The |
| Initialization of Particle Filter and Posterior Cramer-Rao Bound for Bearings- | only | Tracking in Modified Polar Coordinate System |
| Innovative Curvelet- | only | -Based Approach for Automated Change Detection in Multi-Temporal SAR Imagery, An |
| Integer-arithmetic- | only | Certified Robustness for Quantized Neural Networks |
| Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions | only | |
| Intensity- | only | signal-subspace-based imaging |
| Inversion of the sliding Fourier transform using | only | two frequency bins and its application to source separation |
| Iris Biometric Security Challenges and Possible Solutions: For your eyes | only | ? Using the iris as a key |
| Is a detector | only | good for detection? |
| Jailbreak and Guard Aligned Language Models With | only | Few In-Context Demonstrations |
| Keep it Unreal: Bridging the Realism Gap for 2.5D Recognition with Geometry Priors | only | |
| Label- | only | Model Inversion Attacks via Boundary Repulsion |
| LabOR: Labeling | only | if Required for Domain Adaptive Semantic Segmentation |
| Language- | only | Efficient Training of Zero-shot Composed Image Retrieval |
| Large-Scale Content- | only | Video Recommendation |
| largest empty rectangle containing | only | a query object in Spatial Databases, The |
| Laser | only | feature based multi robot SLAM |
| Learning 3D Semantic Segmentation with | only | 2D Image Supervision |
| Learning a Bias Correction for Lidar- | only | Motion Estimation |
| Learning based automatic face annotation for arbitrary poses and expressions from frontal images | only | |
| Learning data association for multi-object tracking using | only | coordinates |
| Learning of Prototypes and Decision Boundaries for a Verification Problem Having | only | Positive Samples |
| Learning to Generate Text-Grounded Mask for Open-World Semantic Segmentation from | only | Image-Text Pairs |
| LED- | only | BRDF measurement device, An |
| Lerojd: Lidar Extended Radar- | only | Object Detection |
| LiDAR- | only | 3D object detection based on spatial context |
| LISO: Lidar- | only | Self-supervised 3d Object Detection |
| Low-Cost Real-Time Remote Sensing and Geolocation of Moving Targets via Monocular Bearing- | only | Micro UAVs |
| Low-Resolution- | only | Microscopy Super-Resolution Models Generalizing to Non-Periodicities at Atomic Scale |
| (L_1,L_2)-RIP and Projected Back-Projection Reconstruction for Phase- | only | Measurements |
| Machine Learning-Based Multiple Cloud Vertical Structure Parameter Prediction Algorithm | only | Using OCO-2 Oxygen A-Band Measurements, A |
| Magnitude- | only | Reconstruction of Two-Dimensional Sequences with Finite Regions of Support |
| mChartQA and mChartQABench: A multimodal- | only | solution for complex chart question-answering |
| Medical image registration using Phase- | only | Correlation for distorted dental radiographs |
| MeshGPT: Generating Triangle Meshes with Decoder- | only | Transformers |
| Method for Blur and Affine Invariant Object Recognition Using Phase- | only | Bispectrum, A |
| method for computing points of a circle using | only | integers, A |
| Modeling an effectual multi-section You | only | Look Once for enhancing lung cancer prediction |
| Motion Models that | only | Work Sometimes |
| Motion Recovery from Image Sequences Using | only | First Order Optical Flow Information |
| Motion- | only | video compression |
| Multi-depth phase- | only | hologram optimization using the L-BFGS algorithm with sequential slicing |
| Multifrequency Interferometric Imaging with Intensity- | only | Measurements |
| Multimodal Neurons in Pretrained Text- | only | Transformers |
| multiplierless pruned DCT-like transformation for image and video compression that requires ten additions | only | , A |
| Need | only | One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes |
| Neural sentence embedding using | only | in-domain sentences for out-of-domain sentence detection in dialog systems |
| New Approach for Automatic Detection of Buildings in Airborne Laser Scanner Data Using first Echo | only | |
| New camera chip captures | only | what it needs |
| new method for fingerprint matching using phase- | only | auto- and cross-bispectrum, A |
| New Solution to the Relative Orientation Problem Using | only | 3 Points and the Vertical Direction, A |
| new state vector and a map joining algorithm for range- | only | SLAM, A |
| No-MambAAD: Revitalizing Conv- | only | Networks for Unsupervised Anomaly Detection |
| Non-Coherent Direction of Arrival Estimation from Magnitude- | only | Measurements |
| non-rigid registration method for medical volume data using 3D Phase- | only | Correlation, A |
| Not | only | Look, But Also Listen: Learning Multimodal Violence Detection Under Weak Supervision |
| Not | only | size matters: Regularized partial matching of nonrigid shapes |
| Not | only | Text: Exploring Compositionality of Visual Representations in Vision-Language Models |
| Note on Using | only | Position Equations for Robotic Hand/Eye Calibration, A |
| novel fast combine-and-conquer object detector based on | only | one-level feature map, A |
| Novel View Synthesis from | only | a 6-DoF Camera Pose by Two-stage Networks |
| OASIS: | only | Adversarial Supervision for Semantic Image Synthesis |
| Object Recognition and Detection by a Combination of Support Vector Machine and Rotation Invariant Phase | only | Correlation |
| Object Tracking With | only | Background Cues |
| Observability Metrics for Single-Target Tracking With Bearings- | only | Measurements |
| Off-axis complex hologram encoding method for holographic display with amplitude- | only | modulation |
| On-Device Self-Supervised Learning of Low-Latency Monocular Depth from | only | Events |
| | only | Image Based For The 3d Metric Survey Of Gothic Structures By Using Frame Cameras And Panoramic Cameras |
| | only | Once Attack: Fooling the Tracker With Adversarial Template |
| | only | Time Can Tell: Discovering Temporal Data for Temporal Modeling |
| | only | -Reference Video Quality Assessment for Video Coding Using Convolutional Neural Network |
| Optical Flow Requires Multiple Strategies (but | only | One Network) |
| optical reconstruction of hologram recorded by OSH using amplitude- | only | SLM and phase-only SLM, An |
| optical reconstruction of hologram recorded by OSH using amplitude-only SLM and phase- | only | SLM, An |
| Optimal ROS-Based Symmetric Phase- | only | Filter for Fingerprint Verification, The |
| Performance evaluation of a geometric correction method for multi-projector display using SIFT and Phase- | only | Correlation |
| Performance Evaluation of Troposphere Estimated from Galileo- | only | Multi-Frequency Observations |
| Phantom Studies of Fused-Data TREIT Using | only | Biopsy-Probe Electrodes |
| Phase algorithm for blocking artifact reduction in reconstructions from analysis- | only | AM-FM models |
| phase | only | transform for unsupervised surface defect detection, The |
| Phase retrieval from intensity- | only | data by relative entropy minimization |
| Phase- | only | Image Based Kernel Estimation for Single Image Blind Deblurring |
| Phase- | only | Optimization Null Control Method for FDA-MIMO Based on ADMM, A |
| Photometric stereo with | only | two images: A theoretical study and numerical resolution |
| PIZZA: A Powerful Image- | only | Zero-Shot Zero-CAD Approach to 6 DoF Tracking |
| Pose- | only | Solution to Visual Reconstruction and Navigation, A |
| Poseidon Technologies: The world's first and | only | computer-aided drowning detection system |
| Posterior Cramer-Rao Lower Bounds for Target Tracking in Sensor Networks With Quantized Range- | only | Measurements |
| PPP and PPP-AR Kinematic Post-Processed Performance of GPS- | only | , Galileo-Only and Multi-GNSS |
| PPP and PPP-AR Kinematic Post-Processed Performance of GPS-Only, Galileo- | only | and Multi-GNSS |
| Pre-Estimating Self-Localization Error of NDT-Based Map-Matching From Map | only | |
| Precise and Automatic 3-D Absolute Geolocation of Targets Using | only | Two Long-Aperture SAR Acquisitions |
| Predicting Ball Ownership in Basketball from a Monocular View Using | only | Player Trajectories |
| Predicting temperate forest stand types using | only | structural profiles from discrete return airborne lidar |
| Presence- | only | Geographical Priors for Fine-Grained Image Classification |
| Progressive Visual Object Detection with Positive Training Examples | only | |
| Prompt-Tuning SAM: From Generalist to Specialist with | only | 2,048 Parameters and 16 Training Images |
| PromptAD: Learning Prompts with | only | Normal Samples for Few-Shot Anomaly Detection |
| QDTrack: Quasi-Dense Similarity Learning for Appearance- | only | Multiple Object Tracking |
| Quantization and Training of Neural Networks for Efficient Integer-Arithmetic- | only | Inference |
| quaternion phase- | only | correlation algorithm for color images, A |
| Query by Strings and Return Ranking Word Regions with | only | One Look |
| RandAR: Decoder- | only | Autoregressive Visual Generation in Random Orders |
| Range- | only | SLAM with Interpolated Range Data |
| RAW Image Reconstruction Using a Self-Contained sRGB-JPEG Image with | only | 64 KB Overhead |
| Read- | only | Prompt Optimization for Vision-Language Few-shot Learning |
| Real-Time Pose-Invariant Face Recognition by Triplet Pose Sparse Matrix from | only | a Single Image |
| Real-Time Single-Workstation Obstacle Avoidance Using | only | Wide-Field Flow Divergence |
| Realizing a Low-Power Head-Mounted Phase- | only | Holographic Display by Light-Weight Compression |
| Recognizing Faces from the Eyes | only | |
| Red-Pill Robots | only | , Please |
| RefineDetLite: A Lightweight One-stage Object Detection Framework for CPU- | only | Devices |
| Reflectance Computation for a Specular | only | V-Cavity |
| ReidTrack: Reid- | only | Multi-target Multi-camera Tracking |
| reliability of estimated confidence intervals for classification error rates when | only | a single sample is available, The |
| Robust Reflection Removal With Flash- | only | Cues in the Wild |
| Robust Reflection Removal with Reflection-free Flash- | only | Cues |
| Robust UAV Visual Teach and Repeat Using | only | Sparse Semantic Object Features |
| Robust Underwater Vehicle Pose Estimation via Convex Optimization Using Range- | only | Remote Sensing Data |
| Rotation | only | |
| Rotation- | only | Bundle Adjustment |
| SAR Tomography at the Limit: Building Height Reconstruction Using | only | 3-5 TanDEM-X Bistatic Interferograms |
| SatViT-Seg: A Transformer- | only | Lightweight Semantic Segmentation Model for Real-Time Land Cover Mapping of High-Resolution Remote Sensing Imagery on Satellites |
| Scanning | only | Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos |
| Second-order event-triggered tracking control with | only | position measurements |
| See the Silence: Improving Visual- | only | Voice Activity Detection by Optical Flow and RGB Fusion |
| Semantic- | only | Visual Odometry based on dense class-level segmentation |
| Sequence and Network Embedding Method for Bus Arrival Time Prediction Using GPS Trajectory Data | only | , A |
| Shape Descriptor Combining Logarithmic-Scale Histogram of Radon Transform and Phase- | only | Correlation Function, A |
| SimLingo: Vision- | only | Closed-Loop Autonomous Driving with Language-Action Alignment |
| Simultaneous in-plane motion estimation and point matching using geometric cues | only | |
| Simultaneous Pose Estimation and Velocity Estimation of an Ego Vehicle and Moving Obstacles Using LiDAR Information | only | |
| Soccer Player Detection with | only | Color Features Selected Using Informed Haar-like Features |
| software | only | scalable video delivery system for multimedia applications over heterogeneous networks, A |
| Software- | only | Videocodec Using Pixelwise Conditional Differential Replenishment and Perceptual Enhancements |
| Sparse Signal Recovery from a Mixture of Linear and Magnitude- | only | Measurements |
| Special ciphertext- | only | attack to double random phase encryption by plaintext shifting with speckle correlation |
| Special Issue Retraction: Innovative approach for multimodal fusion recognition based feature extraction using band-limited phase- | only | correlation and discrete orthonormal Stockwell transform |
| Spectral-daylight recovery by use of | only | a few sensors |
| Spiking Transformer: Introducing Accurate Addition- | only | Spiking Self-Attention for Transformer |
| Splat-SLAM: Globally Optimized RGB- | only | SLAM with 3D Gaussians |
| Stereovision- | only | Based Interactive Mobile Robot for Human-Robot Face-to-Face Interaction |
| Sub-Pixel Stereo Correspondence Technique Based on 1D Phase- | only | Correlation, A |
| Super-Resolution Training Paradigm Based on Low-Resolution Data | only | to Surpass the Technical Limits of STEM and STM Microscopy, A |
| SuS-X: Training-Free Name- | only | Transfer of Vision-Language Models |
| SyDPose: Object Detection and Pose Estimation in Cluttered Real-World Depth Images Trained using | only | Synthetic Data |
| Symmetrical Phase- | only | Matched Filtering of Fourier-Mellin Transforms for Image Registration and Recognition |
| Target Motion Analysis Using Range- | only | Measurements: Algorithms, Performance and Application to ISAR Data |
| Target Motion Analysis Using Single Sensor Bearings- | only | Measurements |
| Target tracking with Bearings- | only | Measurements |
| Test-Time Adaptation for Super-Resolution: You | only | Need to Overfit on a Few More Images |
| Text | only | Analysis, Natural Language |
| text- | only | weakly supervised learning framework for text spotting via text-to-polygon generator, A |
| Three-Dimensional Signal Source Localization with Angle- | only | Measurements in Passive Sensor Networks |
| Time and energy modeling of an INTRA- | only | HEVC encoder |
| Toward Hardware Building Blocks for Software- | only | Real-Time Video Processing: The MOVIE Approach |
| Toward Sustainable Last-Mile Deliveries: A Comparative Study of Energy Consumption and Delivery Time for Drone- | only | and Drone-Aided Public Transport Approaches in Urban Areas |
| Towards an automatic on-line signature verifier using | only | one reference per signer |
| Towards automatic reconstruction of axonal structures in volumetric microscopy images depicting | only | active synapses |
| Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using | only | Images |
| Training of classifiers using virtual samples | only | |
| Training Superpixel Network | only | Once |
| Training Vision Transformers with | only | 2040 Images |
| TransGrasp: Grasp Pose Estimation of a Category of Objects by Transferring Grasps from | only | One Labeled Instance |
| Translation | only | |
| Transverse or axial superresolution in a 4Pi-confocal microscope by phase- | only | filters |
| Tree Species Classification from TLS Point Clouds Using Multi-Task Learning and Woody- | only | Point Cloud Generation |
| Tree, Shrub, and Grass Classification Using | only | RGB Images |
| Trinocular stereo image rectification in closed-form | only | using fundamental matrices |
| TSBA: A two-stage poison- | only | backdoor attack on visual object tracking |
| UniEnc-CASSNAT: An Encoder- | only | Non-Autoregressive ASR for Speech SSL Models |
| UniHOPE: A Unified Approach for Hand- | only | and Hand-Object Pose Estimation |
| Universal Software | only | Radar with All Waveforms Simultaneously on a Single Platform |
| Unknown Prompt, the | only | Lacuna: Unveiling CLIP's Potential for Open Domain Generalization |
| Unsupervised Segmentation of Anomalies in Sequential Data, Images and Volumetric Data Using Multiscale Fourier Phase- | only | Analysis |
| Urban Archaeology: How to Communicate a Story of a Site, 3D Virtual Reconstruction but not | only | |
| Using Remote Sensing Techniques to Document and Identify the Largest Underwater Object of the Baltic Sea: Case Study of the | only | German Aircraft Carrier, Graf Zeppelin |
| Utilization of Noise- | only | Samples in Array Processing With Prior Knowledge |
| Variable-rate learned image compression with integer-arithmetic- | only | inference |
| Vector quantization with memory and multi-labeling for isolated video- | only | automatic speech recognition |
| VerbDiff: Text- | only | Diffusion Models with Enhanced Interaction Awareness |
| Video you | only | look once: Overall temporal convolutions for action recognition |
| View invariant gait recognition using | only | one uniform model |
| Vision- | only | Localization |
| Vital information is | only | worth one thumbnail: Towards efficient human pose estimation |
| Watch | only | Once: An End-to-End Video Action Detection Framework |
| We Don't Need No Bounding-Boxes: Training Object Class Detectors Using | only | Human Verification |
| Weak Mesoscale Variability in the Optimum Interpolation Sea Surface Temperature (OISST)-AVHRR- | only | Version 2 Data before 2007 |
| Weaklier Supervised Semantic Segmentation With | only | One Image Level Annotation per Category |
| WECROMCL: Weakly Supervised Cross-modality Contrastive Learning for Transcription- | only | Supervised Text Spotting |
| What If We | only | Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels |
| When It Is Not | only | About Color: The Importance of Hyperspectral Imaging Applied to the Investigation of Paintings |
| Which Phoneme-to-Viseme Maps Best Improve Visual- | only | Computer Lip-Reading? |
| Word Channel Based Multiscale Pedestrian Detection without Image Resizing and Using | only | One Classifier |
| Word2Scene: Efficient remote sensing image scene generation with | only | one word via hybrid intelligence and low-rank representation |
| YOLC: You | only | Look Clusters for Tiny Object Detection in Aerial Images |
| YOLO, You | only | Look Once, Family Object Detection |
| YOLOH: You | only | Look One Hourglass for Real-Time Object Detection |
| YOLOMM: You | only | Look Once for Multi-modal Multi-tasking |
| YOLSO: You | only | Look Small Object |
| You Don't | only | Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking |
| You | only | label once: A self-adaptive clustering-based method for source-free active domain adaptation |
| You | only | Learn One Query: Learning Unified Human Query for Single-stage Multi-person Multi-task Human-centric Perception |
| You | only | Look Intensity Once: Event-Driven Long-Term High-Speed Object Detection |
| You | only | Look Once: Unified, Real-Time Object Detection |
| You | only | Look One Step: Accelerating Backpropagation in Diffusion Sampling With Gradient Shortcuts |
| You | only | Look One-level Feature |
| You | only | Look Yourself: Unsupervised and Untrained Single Image Dehazing Neural Network |
| You | only | Need 80k Parameters to Enhance Image: Learning Periodic Features for Image Enhancement |
| You | only | Need Clear Images: Self-Supervised Single Image Dehazing |
| You | only | Need Less Attention at Each Stage in Vision Transformers |
| You | only | Need One Step: Fast Super-resolution with Stable Diffusion via Scale Distillation |
| You | only | Need The Image: Unsupervised Few-Shot Semantic Segmentation With Co-Guidance Network |
| You | only | Search Once: Single Shot Neural Architecture Search via Direct Sparse Optimization |
| You | only | Segment Once: Towards Real-Time Panoptic Segmentation |
| You | only | Train Once: A Unified Framework for Both Full-Reference and No-Reference Image Quality Assessment |
| You | only | Train Once: Learning General and Distinctive 3D Local Descriptors |
| You- | only | -Look-Once Multiple-Strategy Printed Circuit Board Defect Detection Model |
| Your Large Vision-Language Model | only | Needs A Few Attention Heads For Visual Grounding |
| YOWO: You | only | Walk Once to Jointly Map an Indoor Scene and Register Ceiling-Mounted Cameras |
344 for only