| _ | lingual | _ |
| Analyzing Zero-shot Cross- | lingual | Transfer in Supervised NLP Tasks |
| Automatic Separation of Words in Indian Multi- | lingual | Multi-Script Documents |
| Automatic separation of words in multi- | lingual | multi-script Indian documents |
| Baseline detection of multi- | lingual | unconstrained handwritten text lines |
| BTS: A Bi- | lingual | Benchmark for Text Segmentation in the Wild |
| Camera based mixed- | lingual | card reader for mobile device |
| CiCo: Domain-Aware Sign Language Retrieval via Cross- | lingual | Contrastive Learning |
| COCO-CN for Cross- | lingual | Image Tagging, Captioning, and Retrieval |
| Controllable Multi- | lingual | Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization, A |
| Cross | lingual | handwritten character recognition using long short term memory network with aid of elephant herding optimization algorithm |
| Cross- | lingual | Adaptation for Vision-Language Model via Multimodal Semantic Distillation |
| Cross- | lingual | few-shot sign language recognition |
| Cross- | lingual | font generation via patch-level style contrastive learning and relative position awareness |
| Cross- | lingual | font style transfer with full-domain convolutional attention |
| Cross- | lingual | Summarization method based on cross-lingual Fact-relationship Graph Generation, A |
| Cross- | lingual | Summarization method based on cross-lingual Fact-relationship Graph Generation, A |
| Cross- | lingual | Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic |
| Cross- | lingual | Text Image Recognition via Multi-Task Sequence to Sequence Learning |
| Cross- | lingual | transfer learning: A PARAFAC2 approach |
| Cross- | lingual | Universal Dependency Parsing Only From One Monolingual Treebank |
| Cross- | lingual | Vocal Emotion Recognition in Five Native Languages of Assam Using Eigenvalue Decomposition |
| cViL: Cross- | lingual | Training of Vision-Language Models using Knowledge Distillation |
| Dual-View Curricular Optimal Transport for Cross- | lingual | Cross-Modal Retrieval |
| Embedded Heterogeneous Attention Transformer for Cross- | lingual | Image Captioning |
| Embil: An English-manipuri Bi- | lingual | Benchmark for Scene Text Detection and Language Identification |
| FBN: Federated Bert Network with client-server architecture for cross- | lingual | signature verification |
| Feature Aggregation in Zero-Shot Cross- | lingual | Transfer Using Multilingual BERT |
| Geospatial Information Categories Mapping in a Cross- | lingual | Environment: A Case Study of 'Surface Water' Categories in Chinese and American Topographic Maps |
| Grounding Scene Graphs on Natural Images via Visio- | lingual | Message Passing |
| Harnessing the Power of Multi- | lingual | Datasets for Pre-training: Towards Enhancing Text Spotting Performance |
| HC2L: Hybrid and Cooperative Contrastive Learning for Cross- | lingual | Spoken Language Understanding |
| Image Retrieval With | lingual | And Visual Paraphrasing Via Generative Models |
| Improving Continuous Sign Language Recognition with Cross- | lingual | Signs |
| MAdVerse: A Hierarchical Dataset of Multi- | lingual | Ads from Diverse Sources and Categories |
| Measuring novelty and redundancy with multiple modalities in cross- | lingual | broadcast news |
| Multi- | lingual | and Multi-modal Speech Processing and Applications |
| Multi- | lingual | City Name Recognition for Indian Postal Automation |
| Multi- | lingual | Offline Handwriting Recognition Using Hidden Markov Models: A Script-Independent Approach |
| Multi- | lingual | Phoneme Recognition and Language Identification Using Phonotactic Information |
| Multi- | lingual | Recognition System for Arabic and Latin Handwriting, A |
| Multi- | lingual | scene text detection and language identification |
| Multi- | lingual | text recognition from video frames |
| Multi-Oriented and Multi- | lingual | Scene Text Detection With Direct Regression |
| Multi-Task WaveRNN With an Integrated Architecture for Cross- | lingual | Voice Conversion |
| Multimodal Cross- | lingual | Summarization for Videos: A Revisit in Knowledge Distillation Induced Triple-Stage Training Method |
| On the Robustness of Cross- | lingual | Speaker Recognition using Transformer-based Approaches |
| Phonetically-Anchored Domain Adaptation for Cross- | lingual | Speech Emotion Recognition |
| Reinforced, Incremental and Cross- | lingual | Event Detection From Social Messages |
| Self-Supervised Discovery of Cross- | lingual | Shared Knowledge for Continual Text Recognition |
| semantic content based recommendation system for cross- | lingual | news, A |
| Shape Code Based Word-Image Matching for Retrieval of Indian Multi- | lingual | Documents |
| SimpleElastix: A User-Friendly, Multi- | lingual | Library for Medical Image Registration |
| Thousand Frames in Just a Few Words: | lingual | Description of Videos through Latent Topics and Sparse Object Stitching, A |
| Tools for enabling digital access to multi- | lingual | Indic documents |
| Toward Text-independent Cross- | lingual | Speaker Recognition Using English-Mandarin-Taiwanese Dataset |
| UC2: Universal Cross- | lingual | Cross-modal Vision-and-Language Pre-training |
| Vector Field Decomposition-Based Flow Matching for Zero-Shot Cross- | lingual | Text-to-Speech |
| Wasserstein GAN based on Autoencoder with back-translation for cross- | lingual | embedding mappings |
| Word level identification of Kannada, Hindi and English scripts from a tri- | lingual | document |
59 for lingual
| _ | linguistic | _ |
| Achieving | linguistic | Provenance via Plagiarism Detection |
| Ae Textspotter: Learning Visual and | linguistic | Representation for Ambiguous Text Spotting |
| Aggregating Local and Global Text Features for | linguistic | Steganalysis |
| ALIP: The Automatic | linguistic | Indexing of Pictures System |
| ALiSa: Acrostic | linguistic | Steganography Based on BERT and Gibbs Sampling |
| approach to use | linguistic | and model-based fuzzy expert knowledge for the analysis of MRT images, An |
| Approximate World Models: Incorporating Qualitative and | linguistic | Information into Vision Systems |
| Automatic Classification and | linguistic | Analysis of Extremist Online Material |
| Automatic | linguistic | indexing of pictures by a statistical modeling approach |
| Blind | linguistic | Steganalysis against Translation Based Steganography |
| Boosting Generic Visual- | linguistic | Representation With Dynamic Contexts |
| Can | linguistic | features extracted from geo-referenced tweets help building function classification in remote sensing? |
| Combining Two Synchronisation Methods in a | linguistic | Model to Describe Sign Language |
| Compressing Visual- | linguistic | Model via Knowledge Distillation |
| Computational | linguistic | retrieval framework using negative bootstrapping for retrieving transliteration variants |
| Computational | linguistic | s processing in indigenous language |
| Computational Models for Integrating | linguistic | and Visual Information: A Survey |
| Computer | linguistic | analysis of line drawings |
| Concept of a | linguistic | Variable and Its Application to Approximate Reasoning, I, II and III, The |
| Conceptual description of visual scenes from | linguistic | models |
| Continuous Emotion Recognition using Visual-audio- | linguistic | Information: A Technical Report for ABAW3 |
| Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio- | linguistic | Compositional Understanding |
| CPG-LS: Causal Perception Guided | linguistic | Steganography |
| Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual- | linguistic | Features |
| Dense Video Captioning With Early | linguistic | Information Fusion |
| DeVLBert: Out-of-distribution Visio- | linguistic | Pretraining with Causality |
| DIA: Deriving | linguistic | information from auxiliary languages for remote sensing image captioning |
| Discovering Hidden Visual Concepts Beyond | linguistic | Input in Infant Learning |
| Distilling DETR with Visual- | linguistic | Knowledge for Open-Vocabulary Object Detection |
| Domain-Assisted Few-Shot | linguistic | Steganalysis in Imbalanced Class Scenarios |
| Efficiently Fusing Pretrained Acoustic and | linguistic | Encoders for Low-Resource Speech Recognition |
| Enhancing Machine Translation by Integrating | linguistic | Knowledge in the Word Alignment Module |
| Evaluation strategies for automatic | linguistic | indexing of pictures |
| Exploring Global and Local | linguistic | Representations for Text-to-Image Synthesis |
| Exploring Pairwise Relationships Adaptively From | linguistic | Context in Image Captioning |
| Flexible | linguistic | pattern recognition |
| framework for | linguistic | relevance feedback in content-based image retrieval using fuzzy logic, A |
| fuzzy | linguistic | -based software tool for seismic image interpretation, A |
| General Facial Representation Learning in a Visual- | linguistic | Manner |
| General Steganalysis of Generative | linguistic | Steganography Based on Dynamic Segment-Level Lexical Association Extraction |
| Generalized Robot Vision-Language Model via | linguistic | Foreground-Aware Contrast |
| Generating Multi-Level | linguistic | Spatial Descriptions form Range Sensor Readings Using the Histogram of Forces |
| Gesture Modelling for | linguistic | Purposes |
| Godfather vs. Chaos: Comparing | linguistic | Analysis Based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation, The |
| graph-based multi-level | linguistic | representation for document understanding, A |
| Hand Gesture Recognition within a | linguistic | s-Based Framework |
| Handling Out-of-Vocabulary Words and Recognition Errors Based on Word | linguistic | Context for Handwritten Sentence Recognition |
| Heterogeneous Domain Remapping for Universal Detection of Generative | linguistic | Steganography |
| HGA: Hierarchical Feature Extraction With Graph and Attention Mechanism for | linguistic | Steganalysis |
| High-performance | linguistic | Steganalysis, Capacity Estimation and Steganographic Positioning |
| Improving language-supervised object detection with | linguistic | structure analysis |
| Improving Visual Grounding with Visual- | linguistic | Verification and Iterative Reasoning |
| Incorporating | linguistic | Information to Statistical Word-Level Alignment |
| Incorporating | linguistic | Model Adaptation into Whole-Book Recognition |
| Indirect-Effect-Incorporated | linguistic | Z-Number Petri Nets and Its Application to Evaluate Generalized Eco-Driving Behaviors, An |
| Integration of | linguistic | and Geospatial Features Using Global Context Embedding for Automated Text Geocoding, The |
| Intelligent Home 3D: Automatic 3D-House Design From | linguistic | Descriptions Only |
| Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual- | linguistic | Tasks |
| Irregular text block recognition via decoupling visual, | linguistic | , and positional information |
| Layout and language: exploring text block discovery in tables using | linguistic | resources |
| Learning Better Visual Dialog Agents with Pretrained Visual- | linguistic | Representation |
| Learning | linguistic | Association Towards Efficient Text-Video Retrieval |
| Learning to Collocate Visual- | linguistic | Neural Modules for Image Captioning |
| LingoSent: A Platform for | linguistic | Aware Sentiment Analysis for Social Media Messages |
| linguistic | Analysis of Laser Speckle Contrast Images Recorded at Rest and During Biological Zero: Comparison With Laser Doppler Flowmetry Data |
| linguistic | approach to classification of bacterial genomes, A |
| linguistic | Context In Vision |
| linguistic | description of relative positions in images |
| linguistic | Dynamic Analysis of Traffic Flow Based on Social Media: A Case Study |
| linguistic | Feature Vector for the Visual Interpretation of Sign Language, A |
| linguistic | fuzzy recogniser of off-line handwritten characters, A |
| linguistic | Generative Steganography With Enhanced Cognitive-Imperceptibility |
| linguistic | Hallucination for Text-Based Video Retrieval |
| linguistic | integration information in the aabatas arabic text analysis system |
| linguistic | Landscape of Arabs in New York City: Application of a Geosemiotics Analysis |
| linguistic | Landscapes on Street-Level Images |
| linguistic | Methods for the Description of a Straight Line on a Grid |
| linguistic | Methods in Picture Processing: A Survey |
| linguistic | Steganalysis by Enhancing and Integrating Local and Global Features |
| linguistic | Steganalysis Merging Semantic and Statistical Features |
| linguistic | Steganalysis via LLMs: Two Modes for Efficient Detection of Strongly Concealed Stego |
| linguistic | Steganalysis via Probabilistic Weighted Contrastive Learning |
| linguistic | Steganalysis via Text Dual Attention Fusing Statistical and Multi-Layer Semantic Features |
| linguistic | Steganalysis With Graph Neural Networks |
| linguistic | Steganography: From Symbolic Space to Semantic Space |
| linguistic | Steganography: Hiding Information in Syntax Space |
| linguistic | Structure Guided Context Modeling for Referring Image Segmentation |
| linguistic | Structures as Weak Supervision for Visual Scene Graph Generation |
| linguistic | summarization of video for fall detection using voxel person and fuzzy logic |
| linguistic | -Aware Patch Slimming Framework for Fine-Grained Cross-Modal Alignment |
| linguistic | s-aware Masked Image Modeling for Self-supervised Scene Text Recognition |
| LiVLR: A Lightweight Visual- | linguistic | Reasoning Framework for Video Question Answering |
| Luminate: | linguistic | Understanding and Multi-Granularity Interaction for Video Object Segmentation |
| Machine vs humans in a cursive script reading experiment without | linguistic | knowledge |
| Mathematical | linguistic | s in Cognitive Medical Image Interpretation Systems |
| Model Diagnosis and Correction via | linguistic | and Implicit Attribute Editing |
| Modelling and recognition of the | linguistic | components in American Sign Language |
| Multi-Classification of | linguistic | Steganography Driven by Large Language Models |
| Multi- | linguistic | Optical Font Recognition Using Stroke Templates |
| Multilevel Post-Processing for Korean Character-Recognition Using Morphological Analysis and | linguistic | Evaluation |
| Multimedia Search Without Visual Analysis: The Value of | linguistic | and Contextual Information |
| New | linguistic | -Perceptual Event Model for Spatio-Temporal Event Detection and Personalized Retrieval of Sports Video, A |
| Novel Computational | linguistic | Measures, Dialogue System and the Development of SOPHIE: Standardized Online Patient for Healthcare Interaction Education |
| novel unsupervised ensemble framework using concept-based | linguistic | methods and machine learning for twitter sentiment analysis, A |
| On Nonparametric and | linguistic | Approaches to Pattern Recognition |
| Open-Set Mixed Domain Adaptation via Visual- | linguistic | Focal Evolving |
| PNG-Stega: Progressive Non-Autoregressive Generative | linguistic | Steganography |
| Prediction-Based Audiovisual Fusion for Classification of Non- | linguistic | Vocalisations |
| Problem Reduction Representation for the | linguistic | Analysis of Waveforms |
| Quantifying Urban | linguistic | Diversity Related to Rainfall and Flood across China with Social Media Data |
| Recognizing Affect from | linguistic | Information in 3D Continuous Space |
| Reconstructing Signing Avatars from Video Using | linguistic | Priors |
| Recovering the | linguistic | components of the manual signs in American Sign Language |
| Referring expression comprehension model with matching detection and | linguistic | feedback |
| RLS-DTS: Reinforcement-Learning | linguistic | Steganalysis in Distribution-Transformed Scenario |
| RSTeller: Scaling up visual language modeling in remote sensing with rich | linguistic | semantics from openly available data and large language models |
| Saying What You're Looking For: | linguistic | s Meets Video Search |
| ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio- | linguistic | Models in 3D Scenes |
| Scene Text Recognition Using Structure-Guided Character Detection and | linguistic | Knowledge |
| Secure and Disambiguating Approach for Generative | linguistic | Steganography, A |
| Segmentation of Digital Curves Using | linguistic | Techniques |
| Semantically Relevant Image Retrieval by Combining Image and | linguistic | Analysis |
| SeSy: | linguistic | Steganalysis Framework Integrating Semantic and Syntactic Features |
| Shared | linguistic | Resources for the Meeting Domain |
| Show, Conceive and Tell: Image Captioning with Prospective | linguistic | Information |
| Singing Robots: How Embodiment Affects Emotional Responses to Non- | linguistic | Utterances |
| Small-Scale | linguistic | Steganalysis for Multi-Concealed Scenarios |
| Solving a decision problem with | linguistic | information |
| Some | linguistic | and statistical problems in pattern recognition |
| speech understanding and dialog system with a homogeneous | linguistic | knowledge base, A |
| Spoken document classification with SVMs using | linguistic | unit weighting and probabilistic couplers |
| STVGBert: A Visual- | linguistic | Transformer based Framework for Spatio-temporal Video Grounding |
| SUPP: Understanding Moving Picture Patterns Based on | linguistic | Knowledge |
| system for recognizing Vietnamese document images based on HMM and | linguistic | s, A |
| Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual | linguistic | Instruction |
| Test-Time Entropy Minimization Method for Cross-Domain | linguistic | Steganalysis, A |
| Towards General Visual- | linguistic | Face Forgery Detection |
| Understanding VQA for Negative Answers Through Visual and | linguistic | Inference |
| Unit Selection Using | linguistic | , Prosodic and Spectral Distance for Developing Text-to-Speech System in Hindi |
| Unsupervised Learning of Hand-Printed Characters with | linguistic | Information, An |
| Unsupervised Visual- | linguistic | Reference Resolution in Instructional Videos |
| Using | linguistic | Context for Image Interpretation and Annotation |
| Using | linguistic | Models for Image Retrieval |
| Utilizing overt and latent | linguistic | structure to improve keystroke-based authentication |
| Visual Relationship Detection with Internal and External | linguistic | Knowledge Distillation |
| Visual word proximity and | linguistic | s for semantic video indexing and near-duplicate retrieval |
| Visual- | linguistic | Feature Alignment With Semantic and Kinematic Guidance for Referring Multi-Object Tracking |
| Visual- | linguistic | Methods for Receipt Field Recognition |
| VL-LTR: Learning Class-wise Visual- | linguistic | Representation for Long-Tailed Visual Recognition |
| VL-SAT: Visual- | linguistic | Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud |
| VLMAH: Visual- | linguistic | Modeling of Action History for Effective Action Anticipation |
| VSNet: Focusing on the | linguistic | Characteristics of Sign Language |
| Weakly-Supervised 3D Scene Graph Generation via Visual- | linguistic | Assisted Pseudo-Labeling |
| Weakly-Supervised Visual Grounding of Phrases with | linguistic | Structures |
| What You Say Is Not What You Do: Studying Visio- | linguistic | Models for TV Series Summarization |
| What's in a Caption? Dataset-Specific | linguistic | Diversity and Its Effect on Visual Description Models and Metrics |
| Winoground: Probing Vision and Language Models for Visio- | linguistic | Compositionality |
157 for linguistic