MultiCamera11
* *Activity Monitoring by Multi-Camera Surveillance Systems
* Determining operational measures from multi-camera surveillance systems using soft biometrics
* game-theoretic design for collaborative tracking in a video camera network, A
* HSV and RGB color histograms comparing for objects tracking among non overlapping FOVs, using CBTF
* Improved person detection in industrial environments using multiple self-calibrated cameras
* Multi-camera detection association for 3D localisation
* Multiple views based human motion tracking in surveillance videos
* Real time complex event detection for resource-limited multimedia sensor networks
8 for MultiCamera11
MultInfoRetr( Vol No. )
* *International Journal of Multimedia Information Retrieval
MultInfoRetr(1)
* Acquisition of Multimedia Ontology: An Application in Preservation of Cultural Heritage
* Bridging the gap between expert and novice users for video search
* Cost-sensitive learning in social image tagging: Review, New Ideas and Evaluation
* Directional local extrema patterns: a new descriptor for content based image retrieval
* efficient framework for location-based scene matching in image databases, An
* Exploiting contextual information for image re-ranking and rank aggregation
* Fast shape retrieval using a graph theoretic approach
* heterogeneous feature selection with structural sparsity for multimedia annotation and hashing: a survey, The
* Interactive search in image retrieval: a survey
* Large-scale near-duplicate image retrieval by kernel density estimation
* Leveraging visual concepts and query performance prediction for semantic-theme-based video retrieval
* Multimedia semantics-aware query-adaptive hashing with bits reconfigurability
* Multimodal Image Retrieval
* New Grand Challenge for Multimedia Information Retrieval: Bridging the Utility Gap
* Optical music recognition: state-of-the-art and open issues
* Semantics-based selection of everyday concepts in visual lifelogging
* study on video data mining, A
* Video concept detection by audio-visual grouplets
* Á trous gradient structure descriptor for content based image retrieval
19 for MultInfoRetr(1)
MultInfoRetr(2)
* 3D object retrieval using salient views
* Best papers in multimedia information retrieval
* Beyond audio and video retrieval: Topic-oriented multimedia summarization
* Bundle min-Hashing
* Combining usage and content in an online recommendation system for music in the Long Tail
* Content analysis meets viewers: linking concept detection with demographics on YouTube
* Event-related image retrieval: exploring geographical and temporal distribution of user tags
* Exploiting semantics on external resources to gather visual examples for video retrieval
* Genre-specific modeling of visual features for efficient content based video shot classification and retrieval
* geometrical distance measure for determining the similarity of musical harmony, A
* High-level event recognition in unconstrained videos
* Hybrid music information retrieval
* intelligent content-based image retrieval system for clinical decision support in brain tumor diagnosis, An
* Intrinsic spatial pyramid matching for deformable 3D shape retrieval
* Location-aware music recommendation
* Minimal test collections for low-cost evaluation of Audio Music Similarity and Retrieval systems
* Mobile video concept classification
* Multimodal biomedical image retrieval using hierarchical classification and modality fusion
* Searching for images by video
* Tonal representations for music retrieval: from version identification to query-by-humming
* Very large scale nearest neighbor search: Ideas, strategies and challenges
* When music makes a scene
22 for MultInfoRetr(2)
MultInfoRetr(3)
* ACM ICMR 2014 best papers in image retrieval
* Adaptive diversification for tag-based social image retrieval
* Context-assisted face clustering framework with human-in-the-loop
* Editorial of the special issue on cross-media analysis
* Image re-ranking system based on closed frequent patterns
* Improving the quality of K-NN graphs through vector sparsification: application to image databases
* incremental evolutionary learning method for optimizing content-based image indexing algorithms, An
* Indexing heterogeneous features with superimages
* Information extraction from multimedia web documents: An open-source platform and testbed
* Interactive cross and multimodal biomedical image retrieval based on automatic region-of-interest (ROI) identification and classification
* MET: media-embedded target for connecting paper to digital media
* Multimedia information retrieval: best papers and expanding frontiers
* Multivariate time series modeling of geometric features of spatio-temporal volumes for content based video retrieval
* Optimization of information retrieval for cross media contents in a best practice network
* Parallel incremental power mean SVM for the classification of large-scale image datasets
* Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast
* Self-similarity-based partial near-duplicate video retrieval and alignment
* sparse kernel relevance model for automatic image annotation, A
* Statistical framework for content-based medical image retrieval based on wavelet orthogonal polynomial model with multiresolution structure
* Topic detection in cross-media: A semi-supervised co-clustering approach
* Video Browser Showdown: a live evaluation of interactive video search tools, The
21 for MultInfoRetr(3)
MultInfoRetr(4)
* Aligning plot synopses to videos for story-based retrieval
* aMM: Towards adaptive ranking of multi-modal documents
* Bregman pooling: feature-space local pooling for image classification
* Building effective SVM concept detectors from clickthrough data for large-scale image retrieval
* Detection of social events in streams of social multimedia
* Distributed cross-media multiple binary subspace learning
* Generic multivariate model for color texture classification in RGB color space
* ImageCLEF annotation with explicit context-aware kernel maps
* influence of image descriptors' dimensions' value cardinalities on large-scale similarity search, The
* Large image modality labeling initiative using semi-supervised and optimized clustering
* Learning to detect concepts with Approximate Laplacian Eigenmaps in large-scale and online settings
* Multi-Bin search: improved large-scale content-based image retrieval
* novel framework for CBCD using integrated color and acoustic features, A
* On-the-fly learning for visual search of large-scale image and video datasets
* Optimizing visual dictionaries for effective image retrieval
* Region-based Image Retrieval Using Shape-Adaptive DCT
* Region-based Image Retrieval Using Shape-Adaptive DCT
* Special issue on concept detection with big data
* Special issue on video retrieval
* Studying the impact of sequence clustering on near-duplicate video retrieval: an experimental comparison
* VIDCAR: an unsupervised CBVR framework for identifying similar videos with prominent object motion
* Video classification with Densely extracted HOG/HOF/MBH features: An evaluation of the accuracy/computational efficiency trade-off
* Weakly supervised detection of video events using hidden conditional random fields
23 for MultInfoRetr(4)
MultInfoRetr(5)
* Automatic environmental sound concepts discovery for video retrieval
* Blind late fusion in multimedia event retrieval
* Boosting local texture descriptors with Log-Gabor filters response for improved image retrieval
* Bundling centre for landmark image discovery
* Classification of color texture images based on modified WLD
* Deep shape-aware descriptor for nonrigid 3D object retrieval
* efficient method for video shot boundary detection and keyframe extraction using SIFT-point distribution histogram, An
* Image recommendation based on keyword relevance using absorbing Markov chain and image features
* Improving content-based image retrieval with compact global and local multi-features
* IR_URFS_VF: image recommendation with user relevance feedback session and visual features in vertical image search
* Learning content-social influential features for influence analysis
* Learning initial feature weights for CBIR using query augmentation
* Major events in multimedia information retrieval
* MGraph: multimodal event summarization in social media using topic models and graph-based ranking
* novel approach for shape-based object recognition with curvelet transform, A
* On the coupled use of signal and semantic concepts to bridge the semantic and user intention gaps for visual content retrieval
* On the use of commonsense ontology for multimedia event recounting
* Open and free datasets for multimedia retrieval
* Robust facial expression recognition system based on hidden Markov models
* Special issue on visual information retrieval
* Text-to-video: a semantic search engine for internet videos
* User-adaptive image retrieval via fusing pointwise and pairwise labels
22 for MultInfoRetr(5)
MultInfoRetr(6)
* ACSIR: ANOVA Cosine Similarity Image Recommendation in vertical search
* Computational framework for emotional VAD prediction using regularized Extreme Learning Machine
* DBAHCL: database for Arabic handwritten characters and ligatures
* Editorial for the ICMR 2016 special issue
* Fast discrete curvelet transform-based anisotropic feature extraction for biomedical image indexing and retrieval
* Instance search retrospective with focus on TRECVID
* Investigating country-specific music preferences and music recommendation algorithms with the LFM-1b dataset
* Learning hierarchical video representation for action recognition
* Multi-frame twin-channel descriptor for person re-identification in real-time surveillance videos
* Multicontext-adaptive indexing and search for large-scale video navigation
* Multilingual visual sentiment concept clustering and analysis
* overview of approaches for content-based medical image retrieval, An
* overview of traffic sign detection and classification methods, An
* OVIS: ontology video surveillance indexing and retrieval system
* Query-by-example music information retrieval by score-based genre prediction and similarity measure
* Script identification algorithms: a survey
* Shot boundary detection using perceptual and semantic information
* survey of tag-based information retrieval, A
* survey on camera-captured scene text detection and extraction: towards Gurmukhi script, A
* Survey on handwritten documents word spotting, A
* Toward semantic content-based image retrieval using Dempster-Shafer theory in multi-label classification framework
* Unsupervised group feature selection for media classification
22 for MultInfoRetr(6)
MultInfoRetr(8)
* 3D local circular difference patterns for biomedical image retrieval
* automatic feature extraction and fusion model: Application to electromyogram (EMG) signal classification, An
* Automatic visual pattern mining from categorical image dataset
* Balancing search space partitions by sparse coding for distributed redundant media indexing and retrieval
* Brain disease diagnosis using local binary pattern and steerable pyramid
* Color-independent classification of animation video
* complete person re-identification model using Kernel-PCA-based Gabor-filtered hybrid descriptors, A
* Content-based medical image retrieval of CT images of liver lesions using manifold learning
* Cross-specificity: modelling data semantics for cross-modal matching and retrieval
* Current challenges and visions in music recommender systems research
* Detection and visualization of misleading content on Twitter
* DHFML: deep heterogeneous feature metric learning for matching photograph and cartoon pairs
* Digital watermarking for deep neural networks
* Editorial for the ICMR 2017 special issue
* Editorial for the ICMR 2018 special issue
* efficient content-based medical image indexing and retrieval using local texture feature descriptors, An
* End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss
* Estimating the information gap between textual and visual representations
* faceted approach to reachability analysis of graph modelled collections, A
* Hybrid descriptors and Weighted PCA-EFMNet for Face Verification in the Wild
* Improvement of image description using bidirectional LSTM
* Joint embeddings with multimodal cues for video-text retrieval
* Mining exoticism from visual content with fusion-based deep neural networks
* MSRC: multimodal spatial regression with semantic context for phrase grounding
* Multi-dimensional multi-directional mask maximum edge pattern for bio-medical image retrieval
* Multi-view collective tensor decomposition for cross-modal hashing
* Multimodal analysis of user behavior and browsed content under different image search intents
* Order, context and popularity bias in next-song recommendations
* Pedestrian detection using first- and second-order aggregate channel features
* Probabilistic selection of frames for early action recognition in videos
* review of semantic segmentation using deep neural networks, A
* review on robust video copy detection, A
* Robustness of DR-LDP over PCANet for face analysis
* Semi-supervised domain adaptation for pedestrian detection in video surveillance based on maximum independence assumption
* Spatiotemporal wavelet correlogram for human action recognition
* Survey on brain tumor segmentation and feature extraction of MR images
* survey paper on secret image sharing schemes, A
* Three-dimensional spatio-temporal trajectory descriptor for human action recognition
* Transferred Semantic Scores for Scalable Retrieval of Histopathological Breast Cancer Images
* Using visual features based on MPEG-7 and deep learning for movie recommendation
* Video instance search via spatial fusion of visual words and object proposals
41 for MultInfoRetr(8)
MultInfoRetr(9)
* Characterization and classification of semantic image-text relations
* ContextNet: representation and exploration for painting classification and retrieval in context
* Editorial for the ICMR 2019 special issue
* Effective video hyperlinking by means of enriched feature sets and monomodal query combinations
* Focus-Aspect-Value model for predicting subjective visual attributes, The
* Hierarchical attentive deep neural networks for semantic music annotation through multiple music representations
* Hypergraph learning with collaborative representation for image search reranking
* Image annotation: the effects of content, lexicon and annotation method
* Learning visual features for relational CBIR
* Multi-level context extraction and attention-based contextual inter-modal fusion for multimodal sentiment analysis and emotion classification
* retrieval-based approach for diverse and image-specific adversary selection, A
* Single-image crowd counting: a comparative survey on deep learning-based approaches
* Special issue on deep learning in image and video retrieval
* study on deep learning spatiotemporal models and feature extraction techniques for video understanding, A
* survey of traditional and deep learning-based feature descriptors for high dimensional data in computer vision, A
* survey on instance segmentation: state of the art, A
16 for MultInfoRetr(9)
MultiSP( Vol No. )
* *Multidimensional Systems and Signal Processing
MultiSP(6)
* On the Smoothness Constraint in the Intensity-Based Estimation of the Parallax Field
MultiSP(8)
* Adaptive Morphological Representation of Signals: Polynomial and Wavelet Methods
* Grobner Bases and Multidimensional FIR Multirate Systems
* Low Bit-Rate Design Considerations for Wavelet-Based Image-Coding
* Multidimensional Filter Banks and Wavelets: Research Developments and Applications - Preface
* Multiresolution Vector Quantization for Video Coding
* Multiscale, Statistical Anomaly Detection Analysis and Algorithms for Linearized Inverse Scattering Problems
* New Bit-Rate Control of MPEG with Predictive and Adaptive Perceptual Quantization, A
* On the Scalability of 2-D Discrete Wavelet Transform Algorithms
* On Translation Invariant Subspaces and Critically Sampled Wavelet Transforms
* Reconstruction and Decomposition Algorithms for Biorthogonal Multiwavelets
* Zero-Phase Filter Bank and Wavelet Code R-Matrices: Properties, Triangular Decompositions, and a Fast Algorithm
* Zero-Phase Filter Bank and Wavelet Code R-Matrices: Properties, Triangular Decompositions, and a Fast Algorithm
12 for MultiSP(8)
MultiSP(9)
* Low-Bit-Rate VQ: A Projection Based Approach
* ROI Search Method for Still Images Based on Set Descriptions, An
MultiTemp11
* *International Workshop on the Analysis of Multi-temporal Remote Sensing Images
* Active-learning based cascade classification of multitemporal images for updating land-cover maps
* Analysis of earth observation time series to investigate the relation between rainfall, vegetation dynamic and streamflow in the Uele' basin (Central African Republic)
* Analysis of LULC changes and urban expansion of the resort city of Al Ain using remote sensing and GIS
* Analysis of NOAA/AVHRR multitemporal images, climate conditions and cultivated land of sugarcane fields applied to agricultural monitoring
* Analytical description of pseudo-invariant features (PIFs)
* Assessing the impact of the orbital drift of SPOT-VGT1 by comparing with SPOT-VGT2 data
* Automated backdating of transportation networks with Landsat imagery
* Automatic interpolation of phenological phases in Germany
* Bathymetry from fusion of multi-temporal Landsat and radar altimetery
* Braided river dynamics determined using satellite imagery: Upper Rakaia River, Canterbury, New Zealand
* Change detection in very high resolution imagery based on dynamic time warping: An implementation for Haiti earthquake damage assessment
* Classification of dynamic evolutions from satellitar image time series based on similarity measures
* Clustering analysis applied to NDVI/NOAA multitemporal images to improve the monitoring process of sugarcane crops
* Clustering of satellite image time series under Time Warping
* Coarse to fine patches-based multitemporal analysis of very high resolution satellite images
* Comparison of two remote sensing time series analysis methods for monitoring forest decline
* Deriving plant phenology from remote sensing
* Detection of small changes in airborne hyperspectral imagery: Experimental results over urban areas
* Does evapotranspiration influence the strength of the North American monsoon? Multitemporal satellite analysis of evapotranspiration and its effects
* Dynamic mapping of cropland areas in Sub-Saharan Africa using MODIS time series
* Effect of the learning algorithm on the accuracy of sub-pixel land use classifications with multilayer perceptrons
* Effects of multitemporal scene changes on pansharpening fusion
* Exploring the capacity to grasp multi-annual seasonal variability of winter wheat in Continental Climates with MODIS
* Feature extraction for NDVI AVHRR/NOAA time series classification
* Generation of 250m MODIS LAI time series by temporal regression
* Greenland inland ice melt-off: Analysis of global gravity data from the GRACE satellites
* hyperspectral reflectance data based model inversion methodology to detect reniform nematodes in cotton, A
* Identification of grazed and mown grasslands using a time series of high-spatial-resolution remote sensing images
* impact of inter-annual variability in remote sensing time series on modeling tree species distributions, The
* Investigation of evolutionary feature subset selection in multi-temporal datasets for harmful algal bloom detection
* Land cover change detection thresholds for Landsat data samples
* Land cover classification by using multi-temporal COSMO-SkyMed data
* Low and high spatial resolution time series fusion for improved land cover map production
* method for change detection with multi-temporal satellite images based on Principal Component Analysis, A
* Monitoring a fuzzy object: The case of Lake Naivasha
* Monitoring African surface water dynamic using medium resolution daily data allows anomalies detection in nearly real time
* Monitoring crop growth inter-annual variability from MODIS time series: Performance comparison between crop specific green area index and current global leaf area index products
* Monitoring environmental change in the Andes based on SPOT-VGT and NOAA-AVHRR time series analysis
* Monitoring global vegetation with the Yearly Land Cover Dynamics (YLCD) method
* Monitoring land cover changes in Hulun Buir by using object-oriented method
* Multi-temporal analysis of a mangrove ecosystem in Southeastern Brazil using object-based classification applied to IKONOS II data
* Multi-temporal damage assessment of linear infrastructural objects using Dynamic Bayesian Networks
* Multi-temporal SAR classification according to change detection operators
* multilevel approach to change detection for port surveillance with very high resolution SAR images, A
* Multitemporal classification of natural vegetation cover in Brazilian Cerrado
* Multitemporal data management and exploitation infrastructure
* Multitemporal fusion of Landsat and MERIS images
* NDVI time series and Markov chains to model the change of fuzzy vegetative drought classes
* Phenology of the natural vegetation: A land cover specific approach for a reference dataset in Central Africa
* PhenoSat: A tool for vegetation temporal analysis from satellite image data
* Producing global land cover maps consistent over time to respond the needs of the climate modelling community
* Quantification of LAI interannual anomalies by adjusting climatological patterns
* robust approach for phenological change detection within satellite image time series, A
* robust change detection feature for Cosmo-SkyMed detected SAR images, A
* SAR imagery change detection method for Land Border Monitoring
* Semi-automated generation of a multi-temporal forest depletion layer with the Landcover Change Mapper (LCM)
* Snow cover monitoring in alpine regions with COSMO-SkyMed images by using a multitemporal approach and depolarization ratio
* Spatial and temporal mapping of leaf area index in Alpine pastures and meadows with satellite MODIS imagery
* Spatiotemporal dimensionality and time-space characterization of vegetation phenology from multitemporal MODIS EVI
* Spatiotemporal mining of ENVISAT SAR interferogram time series over the Haiyuan fault in China
* Spectral-Temporal Analysis by Response Surface applied to detect deforestation in the Brazilian Amazon
* Time-series analysis of rainforest clearing in Sabah, Borneo using Landsat imagery
* Tools for multitemporal analysis and classification of multisource satellite imagery
* Unravelling long-term vegetation change patterns in a binational watershed using multitemporal land cover data and historical photography
* Urbanization analysis by mutual information based change detection between SPOT 5 panchromatic images
* Use of multi-annual MODIS Land Surface Temperature data for the characterization of the heat requirements for grapevine varieties
* Using NASA'S Long Term Data Record version 3 for the monitoring of land surface vegetation
* Utilization of spectral measurements and phenological observations to detect grassland-habitats with a RapidEye intra-annual time-series
* Year-to-year variability of NDVI in croplands and grasslands across a regional grasslands-forest ecotone in Central Alberta, Canada
70 for MultiTemp11
MultiTemp15
* *International Workshop on the Analysis of Multi-temporal Remote Sensing Images
* 13 Years of changes in the extent and physiognomy of mangroves after shrimp farming abandonment, Bali
* 3D displacement retrieval on glacial areas by airborne multi-view photogrammetry
* Agricultural monitoring with polarimetric SAR time series
* Alpine algorithms-time series of innovative remote sensing products for Alpine areas: Snow cover leaf area index and soil moisture
* alternative representation of coarse-resolution remote sensing images for time-series processing, An
* Building profile reconstruction using TerraSAR-X data time-series and tomographic techniques
* Change analysis of dual polarimetric Sentinel-1 SAR image time series using stationary wavelet transform and change detection matrix
* Change detection in bi-temporal data by canonical information analysis
* Change detection of coral reef habitats from multi-temporal and multi-source satellite imagery in Bunaken, Indonesia
* Change detection using multiscale segmentation and Kullback-Leibler divergence: Application on road damage extraction
* Characteristics of spatial-temporal sprawl in specific Chinese coastal cities from 1979 to 2013
* Cloud removal in image time series through unmixing
* CloudSim: A fair benchmark for comparison of methods for times series reconstruction from cloud and atmospheric contamination
* Comparison between spatial and temporal estimation of entropy on polarimetric SAR images
* Consistent forest change maps 198-2000 from the AVHRR time series: Case studies for South America and Indonesia
* Coupling of phenological information and synthetically generated time-series for crop types as indicator for vegetation coverage information
* Data assimilation in multiscale complex systems
* Data fusion approach for Urban area identification using multisensor information
* Data stream mining for multitemporal remote sensing data
* Dealing whith occultation when accounting for observation error correlation in a wavelet space
* Deformation estimation on low coherence areas by means of polarimetric differential SAR interferometry
* Determining the effects of ENSO phenomena on Andean areas by applying radiometric indices on long time series
* dynamical model to classify the content of multitemporal images employing distributed computing techniques, A
* Elevation changes and X-band ice and snow penetration inferred from TanDEM-X data of the Mont-Blanc area
* Evaluating the temporal stability of synthetically generated time-series for crop types in Central Germany
* Exploiting satelitte image time series for monitoring ecological quality parameters of french reservoirs
* Exploring the validity of the long term data record V4 database for land surface monitoring
* Extracting characteristics of satellite image time series with decision trees
* Fine co-registration of VHR images for multitemporal Urban area analysis
* Fluctuations of Caucasian glaciers in 20th century
* Global snow cover mapping using a multi-temporal multi-sensor approach
* Ground echoes filtering using the completed local binary pattern and the support vector machine
* Improved crop classification using multitemporal RapidEye data
* Inpainting restoration for inland waters Mexico ecosystems
* keypoint approach for change detection between SAR images based on graph theory, A
* Land cover change dynamics and multi-factor analysis in high mountains basins of Colombian Andes
* Landscape features that prevent or foster urban sprawl
* Mapping the snow line altitude for large glacier samples from multitemporal Landsat imagery
* Modeling high rainfall regions for flash flood nowcasting
* Monitoring forest recovery with change metrics derived from Landsat time series stacks
* Multitemporal classification without new labels: A solution with optimal transport
* Multitemporal data mining: From biomass monitoring to nuclear proliferation detection
* Multivariate statistical modeling for multi-temporal SAR change detection using wavelet transforms
* Normalized difference phytoplankton index (NDPI) and spatio-temporal cloud filtering for multitemporal cyanobacteria pollution analysis on Erie Lake in 2014
* Numerical models to forecast the sugarcane production in regional scale based on time series of NDVI/AVHRR images
* Prediction of NDVI for grassland habitats by fusing RapidEye and Landsat imagery
* Primal sketch of image series with edge preserving filtering application to change detection
* Processing polarimetric SAR time series over urban areas with binary partition trees
* rapid mapping approach to quantify damages caused by the 2003 bam earthquake using high resolution multitemporal optical images, A
* Recent elevation and velocity changes of Astrolabe Glacier, Terre Adelie, Antarctica
* Region-based change detection of PolSAR images using analytic information-theoretic divergence
* Regional glacier mapping from time-series of Landsat type data
* Retrieving daily evapotranspiration from the combination of geostationary and polar-orbit satellite data
* Robust glacier displacements using knowledge-based image matching
* Satellite image time series classification and analysis using an adapted graph labeling
* scalable spatiotemporal inference framework based on statistical shape analysis for natural ecosystem monitoring by remote sensing, A
* Sparse-smooth decomposition models for multi-temporal SAR images
* Spatio-temporal characterization in satellite image time series
* statistical approach for predicting grassland degradation in disturbance-driven landscapes, A
* Superpixel-based change detection in high resolution SAR images using region covariance features
* swap randomization approach for mining motion field time series over the Argentiere glacier, A
* Temporal stability of mangrove multispectral signatures at fine scales: Stability of mangrove multispectral signatures
* Testing satellite rainfall estimates for yield simulation of a rainfed cereal in West Africa
* Time series analysis of multi-frequency SAR backscatter and bistatic coherence in the context of flood mapping
* Towards the large-scale assessment of vegetation biomass production stability
* Tree species discrimination in temperate woodland using high spatial resolution Formosat-2 time series
* Trends in 15-year MODIS NDVI time series for Mexico
68 for MultiTemp15
MultiTemp17
* *International Workshop on the Analysis of Multi-temporal Remote Sensing Images
* Agricultural monitoring using clustering techniques on satellite image time series of low spatial resolution
* Analysis of multitemporal Sentinel-2 images in the framework of the ESA Scientific Exploitation of Operational Missions
* Analysis of Riparian forest buffers dynamics in Colombian basins by Landsat Time Series
* Angular normalisation of PROBA-V 300m NDVI
* ASAP - Anomaly hot Spots of Agricultural Production, a new early warning decision support system developed by the Joint Research Centre
* Assessing hypertemporal SENTINEL-1 COHERENCE maps for LAND COVER monitoring
* Assessment of ALOS PALSAR 25-m mosaic data for land cover mapping
* Assessment of AquaCrop for winter wheat using satellite derived fCover data
* Assessment of time series consistency of terrestrial Essential Climate Variables
* Automatic production of large-scale cloud-free orthomosaics from multitemporal satellite images
* Automatic smoothing of remote sensing data
* Built-up areas mapping at global scale based on adaptive parametric thresholding of Sentinel-1 intensity coherence time series
* Change detection in a series of Sentinel-1 SAR data
* Circular change detection in image time series inspired by two-dimensional phase unwrapping
* Classification of anthropogenic landscapes
* Combined use of SAR and optical time series data for near real-time forest disturbance mapping
* Detecting the spread of invasive species in central Chile with a Sentinel-2 time-series
* Estimate yield at parcel level from S2 time serie in sub-Saharan smallholder farming systems
* Estimating total aboveground, stem and branch biomass using multi-frequency SAR
* European Space agency (ESA) Landsat MSS/TM/ETM+/OLI archive: 42 years of our history
* Evaluating an energy balance setting and random forest-based downscaling for the estimation of daily ET at sub-kilometer spatial resolution
* Filtering mislabeled data for improving time series classification
* Glacier ice loss monitored through the Planet cubesat constellation
* Global climatic drivers of vegetation based on wavelet analysis
* Handling coherence measures of displacement field time series: Application to Greenland ice sheet glaciers
* Harbour pattern of life analysis with time series of medium resolution satellite images
* Humid tropical forest monitoring with multi-temporal L-, C- and X-band SAR data
* Identifying crops in smallholder farms using time series of WorldView-2 images
* Image representation alternatives for the analysis of satellite image time series
* Investigating the control of ocean-atmospheric oscillations over global terrestrial evaporation using a simple supervised learning method
* Joint retrieval of surface reflectance and aerosol properties from PROBA-V observations, part I: Algorithm performance evaluation
* Joint Surface Reflectance and AeRosol properties retrieval in the PV-LAC framework, part II: Validation
* Land cover change detection in Satellite Image Time Series using an active learning method
* Land surface phenology from Copernicus Global Land time series
* Land-cover evolution class analysis in Image Time Series of Landsat and Sentinel-2 based on Latent Dirichlet Allocation
* Lava emplacement mapping with SAR and optical satellite data
* Leveraging Sentinel-1 time-series data for mapping agricultural land cover and land use in the tropics
* Mapping of season length anomalies in Mexico
* Mapping small reservoirs in semi-arid regions using multitemporal SAR: Methods and applications
* Mapping tree species of forests in southwest France using Sentinel-2 image time series
* Monitoring pasture intesification in Brazilian Amazon biome with MODIS time series
* Mountain crop monitoring with multitemporal Sentinel-1 and Sentinel-2 imagery
* Multi temporal data visualization in EO mobile apps
* Multi-temporal and multi-source alpine glacier cover classification
* Multitemporal Sentinel-2 data: remarks and observations
* non-linear data-driven approach to reveal global vegetation sensitivity to climate, A
* novel method for unsupervised multiple Change Detection in hyperspectral images based on binary Spectral Change Vectors, A
* On the use of guided regularized random forests to identify crops in smallholder farm fields
* Optimizing SAR change detection based on log-ratio features
* Potato monitoring in Belgium with WatchITGrow
* Potential of Sentinel-2 and SPOT5 (Take5) time series for the estimation of grasslands biodiversity indices
* Preliminary exploration of introducing spatial correlation information into the probabilistic patch-based similarity measure
* Proba-V cloud detection Round Robin: Validation results and recommendations
* Remote sensing monitoring of land restoration interventions in semi-arid environments using a before-after control-impact statistical design
* Retrospective analysis of long-term landscape evolution based on archive satellite imagery and historical maps
* RGB SAR product exploiting multitemporal: General processing and applications
* Sea Surface Temperature changes analysis, an Essential Climate Variable for Ecosystem Services provisioning
* SITS for estimating sugarcane production
* Spatial relationships between natural resources and land use dynamics in the Amazonian agricultural frontier
* Spatio-temporal evolution of crop fields in Sentinel-2 Satellite Image Time Series
* Spatiotemporal variations of alpine climate, snow cover and phenology
* Support for Multi-temporal and Multi-mission data processing: The ESA Research and Service Support
* Survey of current hyperspectral Earth observation applications from space and synergies with Sentinel-2
* Temporal analysis of SAR imagery for permanent and evolving Earth land cover behavior assessment
* Temporal relationships between daily precipitation and NDVI time series in Mexico
* Unsupervised change detection of remote sensing images using superpixel segmentation and variational Gaussian mixture model
* Urban area change detection based on generalized likelihood ratio test
* use of Landsat time series for identification of forest degradation levels in the eastern Brazilian Amazon (Paragominas), The
* Using Landsat-8 and Sentinel-1 data for Above Ground Biomass assessment in the Tamar valley and Dartmoor
* Variations in mangrove regeneration rates under different management plans: An analysis of Landsat time-series in the Matang Mangrove Forest Reserve, Peninsular Malaysia
71 for MultiTemp17
MultiView07
* *Beyond Multiview Geometry: Robust Estimation and Organization of Shapes from Multiple Cues
* Active Visual Object Reconstruction using D-, E-, and T-Optimal Next Best Views
* Constrained Optimization for Retinal Curvature Estimation Using an Affine Camera
* Joint Priors for Variational Shape and Appearance Modeling
* MRF and Gaussian Curvature Based Shape Representation for Shape Matching, An
* Multiview normal field integration using level set methods
* Opti-Acoustic Stereo Imaging, System Calibration and 3-D Reconstruction
* Robust Click-Point Linking: Matching Visually Dissimilar Local Regions
* Scene-space Feature Detectors
9 for MultiView07
Multiview17
* *Multiview Relationships in 3D Data
* Accurate Depth Map Estimation from Small Motions
* Camera Pose Filtering with Local Regression Geodesics on the Riemannian Manifold of Dual Quaternions
* Combining Exemplar-Based Approach and learning-Based Approach for Light Field Super-Resolution Using a Hybrid Imaging System
* Computer Vision Meets Geometric Modeling: Multi-view Reconstruction of Surface Points and Normals Using Affine Correspondences
* Content-Aware Metric for Stitched Panoramic Image Quality Assessment, A
* Edge SLAM: Edge Points Based Monocular Visual SLAM
* KPPF: Keypoint-Based Point-Pair-Feature for Scalable Automatic Global Registration of Large RGB-D Scans
* Multiview Absolute Pose Using 3D-2D Perspective Line Correspondences and Vertical Direction
* On Tablet 3D Structured Light Reconstruction and Registration
* Probabilistic Surfel Fusion for Dense LiDAR Mapping
* Use-Case Study on Multi-view Hypothesis Fusion for 3D Object Classification, A
12 for Multiview17
MultLearnApp18
* *Multimodal Learning and Applications Workshop
* Boosting LiDAR-Based Semantic Labeling by Cross-modal Training Data Generation
* CentralNet: A Multilayer Approach for Multimodal Fusion
* Generalized Bayesian Canonical Correlation Analysis with Missing Modalities
* Learning from #Barcelona Instagram Data What Locals and Tourists Post About Its Neighbourhoods
* Learning to Learn from Web Data Through Deep Semantic Embeddings
* Structured Listwise Approach to Learning to Rank for Image Tagging, A
* ThermalGAN: Multimodal Color-to-Thermal Image Translation for Person Re-identification in Multispectral Dataset
* Unpaired Thermal to Visible Spectrum Transfer Using Adversarial Training
* Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach
* Visually Indicated Sound Generation by Perceptually Optimized Classification
* Where and What Am I Eating? Image-Based Food Menu Recognition
12 for MultLearnApp18
MultMed
* *IEEE Transactions on Multimedia
MultMed(1)
* Content-Based Video Indexing Retrieval
* Detection of Moving Cast Shadows for Object Segmentation
MultMed(10)
* Admission Control Scheme Based on Online Measurement for VBR Video Streams Over Wireless Home Networks, An
* Association and Temporal Rule Mining for Post-Filtering of Semantic Concept Detection in Video
* Association Rule-Based Method to Support Medical Image Diagnosis With Efficiency, An
* Audio-Visual Affective Expression Recognition Through Multistream Fused HMM
* Batch Nearest Neighbor Search for Video Retrieval
* Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos
* Channel Aware Multiuser Scalable Video Streaming Over Lossy Under-Provisioned Channels: Modeling and Analysis
* Color-Based Image Salient Region Segmentation Using Novel Region Merging Strategy
* Comprehensive Survey on Three-Dimensional Mesh Watermarking, A
* Compression of 3-D Point Visual Data Using Vector Quantization and Rate-Distortion Optimization
* Constrained Probabilistic Petri Net Framework for Human Activity Detection in Video, A
* Content-Aware Playout and Packet Scheduling for Video Streaming Over Wireless Links
* Content-Aware Prediction Algorithm With Inter-View Mode Decision for Multiview Video Coding
* Content-Based Image Retrieval Using Multiresolution Color and Texture Features
* Cross-Dimensional Perceptual Quality Assessment for Low Bit-Rate Videos
* Delay-Constrained and R-D Optimized Transrating for High-Definition Video Streaming Over WLANs
* DISCOV: A Framework for Discovering Objects in Video
* Discriminant Graph Structures for Facial Expression Recognition
* Document Image Processing for Paper Side Communications
* Efficient Deblocking With Coefficient Regularization, Shape-Adaptive Filtering, and Quantization Constraint
* Efficient Watermarking Method Based on Significant Difference of Wavelet Coefficient Quantization, An
* Energy-Constrained Distortion Reduction Optimization for Wavelet-Based Coded Image Transmission in Wireless Sensor Networks
* Face Annotation Using Transductive Kernel Fisher Discriminant
* Fast Best-Match Shape Searching in Rotation-Invariant Metric Spaces
* Fast Inter Mode Decision Using Spatial Property of Motion Field
* Fragile Watermarking With Error-Free Restoration Capability
* Graph-Based Multiplayer Detection and Tracking in Broadcast Soccer Videos
* Highly Efficient VLSI Architecture for H.264/AVC CAVLC Decoder, A
* Human Age Estimation With Regression on Discriminative Aging Manifold
* Image Retrieval Over Networks: Active Learning Using Ant Algorithm
* Image Retrieval With Relevance Feedback Based on Graph-Theoretic Region Correspondence Estimation
* Implementing the 2-D Wavelet Transform on SIMD-Enhanced General-Purpose Processors
* Improving Robustness of Quantization-Based Image Watermarking via Adaptive Receiver
* Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation
* Interactive Transmission of JPEG2000 Images Using Web Proxy Caching
* Intra/Inter Macroblock Mode Decision for Error-Resilient Transcoding
* Joined Spectral Trees for Scalable SPIHT-Based Multispectral Image Compression
* Joint Source-Channel Video Coding Scheme Based on Distributed Source Coding, A
* Linear Rate Control and Optimum Statistical Multiplexing for H.264 Video Broadcast
* Low Complexity Detection of Discrete Cross Differences for Fast H.264/AVC Intra Prediction, A
* Low-Complexity Heterogeneous Video Transcoding Using Data Mining
* Mining Appearance Models Directly From Compressed Video
* Multilevel Asymmetric Scheme for Digital Fingerprinting, A
* Multimodal and Multilevel Ranking Scheme for Large-Scale Video Retrieval, A
* Multimodal Scheme for Program Segmentation and Representation in Broadcast Video Streams, A
* No-Reference PSNR Estimation for Quality Monitoring of Motion JPEG2000 Video Over Lossy Packet Networks
* Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video, A
* Optimizing Multiple Object Tracking and Best View Video Synthesis
* Paired Subimage Matching Watermarking Method on Ordered Dither Images and Its High-Quality Progressive Coding
* Partitioning of Multiple Fine-Grained Scalable Video Sequences Concurrently Streamed to Heterogeneous Clients
* Predicting Visual Focus of Attention From Intention in Remote Collaborative Tasks
* Real-Time Vision and Speech Driven Avatars for Multimedia Applications
* Recognizing Human Emotional State From Audiovisual Signals
* Recognizing Human Emotional State From Audiovisual Signals*
* Robust Audio-Visual Speech Recognition Based on Late Integration
* Robust Image Corner Detection Based on the Chord-to-Point Distance Accumulation Technique
* Scalable 3-D Terrain Visualization Through Reversible JPEG2000-Based Blind Data Hiding
* Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces
* Shot Change Detection via Local Keypoint Matching
* Spatiotemporal Motion Analysis for the Detection and Classification of Moving Targets
* Synthesis of Silhouettes and Visual Hull Reconstruction for Articulated Humans
* Using Webcast Text for Semantic Event Detection in Broadcast Sports Video
* Video Annotation Based on Kernel Linear Neighborhood Propagation
* Video Error Concealment Using Spatio-Temporal Boundary Matching and Partial Differential Equation
* Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework
* Video Streaming for Mobile Video Surveillance
* Video-Based Human Movement Analysis and Its Application to Surveillance Systems
* Vision-Based Augmented-Reality System For Multiuser Collaborative Environments, A
68 for MultMed(10)
MultMed(11)
* 3-D Face Detection, Landmark Localization, and Registration Using a Point Distribution Model
* Architectures for Fast Transcoding of H.264/AVC to Quality-Scalable SVC Streams
* Attack on Watermarking Method Based on Significant Difference of Wavelet Coefficient Quantization
* Bandwidth Aggregation-Aware Dynamic QoS Negotiation for Real-Time Video Streaming in Next-Generation Wireless Networks
* Blind Robust 3-D Mesh Watermarking Based on Oblate Spheroidal Harmonics
* Capacity Gain of Mixed Multicast/Unicast Transport Schemes in a TV Distribution Network
* Character Identification in Feature-Length Films Using Global Face-Name Matching
* Coherent Phrase Model for Efficient Image Near-Duplicate Retrieval
* Community Streaming With Interactive Visual Overlays: System and Optimization
* Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding
* Content-Aware Distortion-Fair Video Streaming in Congested Networks
* Content-Based Attention Ranking Using Visual and Contextual Attention Model for Baseball Videos
* Context-Aware Person Identification in Personal Photo Collections
* Control-Theoretic Approach to Rate Control for Streaming Videos, A
* Controlling Virtual Cameras Based on a Robust Model-Free Pose Acquisition Technique
* Design of a Scalable Multicast Scheme With an Application-Network Cross-Layer Approach
* Discriminant Subspace Analysis: An Adaptive Approach for Image Classification
* Dynamic Resource Allocation for MGS H.264/AVC Video Transmission Over Link-Adaptive Networks
* Effective Annotation and Search for Video Blogs with Integration of Context and Content Analysis
* Efficient Background Subtraction and Shadow Removal for Monochromatic Video Sequences
* Efficient Mode Selection Prior to the Actual Encoding for H.264/AVC Encoder, An
* Efficient Near-Duplicate Video Shot Detection Method Using Shot-Based Interest Points, An
* Ellipsoidal Harmonics for 3-D Shape Description and Retrieval
* Event Tactic Analysis Based on Broadcast Sports Video
* Expression-Invariant Face Recognition With Constrained Optical Flow Warping
* FaceSeg: Automatic Face Segmentation for Real-Time Video
* Fast Motion Estimation on Graphics Hardware for H.264 Video Encoding
* Fast-Mesh: A Low-Delay High-Bandwidth Mesh for Peer-to-Peer Live Streaming
* Generalized PCRTT Offline Bandwidth Smoothing Based on SVM and Systematic Video Segmentation
* Hierarchical Modeling and Adaptive Clustering for Real-Time Summarization of Rush Videos
* Human Perception of Audio-Visual Synthetic Character Emotion Expression in the Presence of Ambiguous and Conflicting Information
* Image Annotation Within the Context of Personal Photo Collections Using Hierarchical Event and Scene Models
* Image Retargeting Using Mesh Parametrization
* Island Multicast: Combining IP Multicast With Overlay Data Distribution
* Joint Source Coding and Network-Supported Distributed Error Control for Video Streaming in Wireless Multihop Networks
* LayerP2P: Using Layered Video Chunks in P2P Live Streaming
* Lipreading With Local Spatiotemporal Descriptors
* Liveness Enforcing Supervision of Video Streaming Systems Using Nonsequential Petri Nets
* Low-Complexity Cross-Layer Optimization Algorithm for Video Communication Over Wireless Networks, A
* Multicue Bayesian State Estimator for Gaze Prediction in Open Signed Video, A
* Multiuser Rate Allocation Games for Multimedia Communications
* No-Reference Video Quality Monitoring for H.264/AVC Coded Video
* Novel Video Summarization Based on Mining the Story-Structure and Semantic Relations Among Concept Entities, A
* Optimal Channel Adaptation of Scalable Video Over a Multicarrier-Based Multicell Environment
* Optimal Packet Loss Protection of Progressively Compressed 3-D Meshes
* Optimized H.264 Video Encoding and Packetization for Video Transmission Over Pipeline Forwarding Networks
* Picture Collage
* Quality-Driven Cross-Layer Solution for MPEG Video Streaming Over WiMAX Networks, A
* Real-Time Near-Duplicate Elimination for Web Video Search With Content and Context
* Registration Based on Scene Recognition and Natural Features Tracking Techniques for Wide-Area Augmented Reality Systems
* Reliable Application Layer Multicast Over Combined Wired and Wireless Networks
* Rhombic Dodecahedron Map: An Efficient Scheme for Encoding Panoramic Video, The
* Robust Scaling-Based Image Watermarking Using Maximum-Likelihood Decoder With Optimum Strength Factor
* Salient Region Detection by Modeling Distributions of Color and Orientation
* Scalable Video Multicast Using Expanding Window Fountain Codes
* Scale-Invariant Visual Language Modeling for Object Categorization
* Scene Detection in Videos Using Shot Clustering and Sequence Alignment
* Segmentation-Driven Image Fusion Based on Alpha-Stable Modeling of Wavelet Coefficients
* Sketch-Based Spatial Queries for Retrieving Human Locomotion Patterns From Continuously Archived GPS Data
* Smooth Control of Adaptive Media Playout for Video Streaming
* Spatial Correlation Model for Visual Information in Wireless Multimedia Sensor Networks, A
* Sports Video Mining via Multichannel Segmental Hidden Markov Models
* Statistical Scheduling of Offline Comparative Subjective Evaluations for Real-Time Multimedia
* Structural Descriptors for Category Level Object Detection
* Support Vector Machine Approach for Detection and Localization of Transmission Errors Within Standard H.263++ Decoders, A
* Syntactic Matching of Trajectories for Ambient Intelligence Applications
* Tensor-Based Transductive Learning for Multimodality Video Semantic Concept Detection
* Trade-Offs in Bit-Rate Allocation for Wireless Video Streaming
* Unified Traffic Model for MPEG-4 and H.264 Video Traces, A
* Using RTT Variability for Adaptive Cross-Layer Approach to Multimedia Delivery in Heterogeneous Networks
* Using Visual Context and Region Semantics for High-Level Concept Detection
71 for MultMed(11)
MultMed(12)
* 3-D Audio-Visual Corpus of Affective Communication, A
* 3-D Model Search and Retrieval From Range Images Using Salient Features
* Adaptation of Multimedia Presentations for Different Display Sizes in the Presence of Preferences and Temporal Constraints
* Adaptive Computational Model for Salient Object Detection, An
* Affective Audio-Visual Words and Latent Topic Driving Model for Realizing Movie Affective Scene Classification
* Affective Visualization and Retrieval for Music Video
* Authentication of Scalable Video Streams With Low Communication Overhead
* Bayesian Approach to Automated Creation of Tactile Facial Images, A
* Blind Audiovisual Source Separation Based on Sparse Redundant Representations
* Bridging the Semantic Gap Between Image Contents and Tags
* Browsing Video Along Multiple Threads
* Camera Motion-Based Analysis of User Generated Video
* Combining Context, Consistency, and Diversity Cues for Interactive Image Categorization
* Comparison of Perceptually-Based Metrics for Objective Evaluation of Geometry Processing, A
* Constructing Concept Lexica With Small Semantic Gaps
* Controlling the Bit Rate of Multi-Object Videos With Noncooperative Game Theory
* Cross-Media Alignment of Names and Faces
* Digital Cinema Watermarking for Estimating the Position of the Pirate
* Dynamic FEC Algorithms for TFRC Flows
* Efficient and Robust Algorithm for Shape Indexing and Retrieval, An
* Emotion Recognition in Text for 3-D Facial Expression Rendering
* Energy Efficient H.263 Video Transmission in Power Saving Wireless LAN Infrastructure
* Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior
* Fine-Granularity Transmission Distortion Modeling for Video Packet Scheduling Over Mesh Networks
* Framework of Enhancing Image Steganography With Picture Quality Optimization and Anti-Steganalysis Based on Simulated Annealing Algorithm, A
* Image Annotation by Graph-Based Inference With Integrated Multiple/Single Instance Representations
* Image Classification With Kernelized Spatial-Context
* Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering, An
* Impact of Network Dynamics on User's Video Quality: Analytical Framework and QoS Provision
* In-Image Accessibility Indication
* Information-Theoretic Analysis of Input Strokes in Visual Object Cutout
* Joint Compressive Video Coding and Analysis
* Lightweight SCTP for Partially Reliable Overlay Video Multicast Service for Mobile Terminals, A
* Low-Complexity Analytical Modeling for Cross-Layer Adaptive Error Protection in Video Over WLAN, A
* Mining Compositional Features From GPS and Visual Cues for Event Recognition in Photo Collections
* Mining Group Nonverbal Conversational Patterns Using Probabilistic Topic Models
* Multi-View Video Summarization
* Multihop Packet Delay Bound Violation Modeling for Resource Allocation in Video Streaming Over Mesh Networks
* Multimedia Quality-Driven Network Resource Management Architecture for Wireless Sensor Networks With Stream Authentication, A
* Multitransform Architecture for H.264/AVC High-Profile Coders, A
* Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference, A
* Network Awareness of P2P Live Streaming Applications: A Measurement Study
* On Energy Efficient Encryption for Video Streaming in Wireless Sensor Networks
* On the Annotation of Web Videos by Efficient Near-Duplicate Search
* Practical Online Near-Duplicate Subsequence Detection for Continuous Video Streams
* Predicting Speaker Head Nods and the Effects of Affective Information
* Real-Time Framework for Video Time and Pitch Scale Modification, A
* Real-Time Visual Concept Classification
* Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study
* Robust Block-Based Image/Video Registration Approach for Mobile Imaging Devices, A
* Robust Symbolic Dual-View Facial Expression Recognition With Skin Wrinkles: Local Versus Global Approach
* Scalable Intraband and Composite Wavelet-Based Coding of Semiregular Meshes
* Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context
* SPANC: Optimizing Scheduling Delay for Peer-to-Peer Live Streaming
* Special Issue on Multimodal Affective Interaction
* Stochastic Approach to Image Retrieval Using Relevance Feedback and Particle Swarm Optimization, A
* Synchronization of Multiple Camera Videos Using Audio-Visual Features
* System for Real-Time Multimodal Analysis of Nonverbal Affective Social Interaction in User-Centric Media, A
* Towards a Relevant and Diverse Search of Social Images
* TURINstream: A Totally pUsh, Robust, and effIcieNt P2P Video Streaming Architecture
* Video Annotation Through Search and Graph Reinforcement Mining
* Video Précis: Highlighting Diverse Aspects of Videos
* Visualizing Image Collections Using High-Entropy Layout Distributions
63 for MultMed(12)
MultMed(13)
* Adaptive Context-Tree-Based Statistical Filtering for Raster Map Image Denoising
* Adaptive Learning for Target Tracking and True Linking Discovering Across Multiple Non-Overlapping Cameras
* Adaptive Resource Allocation for Layer-Encoded IPTV Multicasting in IEEE 802.16 WiMAX Wireless Networks
* Algorithm and Architecture Design of Perception Engine for Video Coding Applications
* Analysis and Exploitation of Musician Social Networks for Recommendation and Discovery
* Audiovisual Discrimination Between Speech and Laughter: Why and When Visual Information Might Help
* Automated Assembly of Shredded Pieces From Multiple Photos
* Autonomous Framework to Produce and Distribute Personalized Team-Sport Video Summaries: A Basketball Case Study, An
* Balancing Attended and Global Stimuli in Perceived Video Quality Assessment
* Bayesian Visual Reranking
* Collaborative Face Recognition for Improved Face Annotation in Personal Photo Collections Shared on Online Social Networks
* ConnectBoard: Enabling Genuine Eye Contact and Accurate Gaze in Remote Collaboration
* Connotative Space for Supporting Movie Affective Recommendation, A
* Content-Aware Display Adaptation and Interactive Editing for Stereoscopic Images
* Cooperative Layered Video Multicast Using Randomized Distributed Space Time Codes
* Cost-Sensitive Multi-Label Learning for Audio Tag Annotation and Retrieval
* Cross-Layer Optimization for Downlink Wavelet Video Transmission
* Depth Image-Based Rendering With Advanced Texture Synthesis for 3-D Video
* Editing by Viewing: Automatic Home Video Summarization by Viewing Behavior Analysis
* Effective Method for Movable Projector Keystone Correction, An
* Effective Pseudonoise Sequence and Decoding Function for Imperceptibility and Robustness Enhancement in Time-Spread Echo-Based Audio Watermarking
* Effective Semantic Annotation by Image-to-Concept Distribution Model
* Efficient Algorithms for Multi-Sender Data Transmission in Swarm-Based Peer-to-Peer Streaming Systems
* Efficient Feature Detection and Effective Post-Verification for Large Scale Near-Duplicate Image Search
* Empowering Visual Categorization With the GPU
* Enabling Composition-Based Video-Conferencing for the Home
* Energy-Efficient Multicasting of Scalable Video Streams Over WiMAX Networks
* Event-Based Semantic Image Adaptation for User-Centric Mobile Display Devices
* Exploiting Visual-Audio-Textual Characteristics for Automatic TV Commercial Block Detection and Segmentation
* Exploring Distributional Discrepancy for Multidimensional Point Set Retrieval
* Exposing Digital Image Forgeries by Detecting Discrepancies in Motion Blur
* Fast Action Detection via Discriminative Random Forest Voting and Top-K Subvolume Search
* Fast Visual Retrieval Using Accelerated Sequence Matching
* Feature-Based Sparse Representation for Image Similarity Assessment
* Flash Translation Layer for NAND Flash-Based Multimedia Storage Devices, A
* Fuzzy Clustering Algorithm for Virtual Character Animation Representation, A
* Fuzzy Similarity-Based Emotional Classification of Color Images
* Game-Theoretic Strategies and Equilibriums in Multimedia Fingerprinting Social Networks
* Geometric Invariant Audio Watermarking Based on an LCM Feature
* Guided Face Cartoon Synthesis
* High-Quality Visualization for Geographically Distributed 3-D Teleimmersive Applications
* Human Psychology of Common Appraisal: The Reddit Score
* Image Quality Assessment by Separately Evaluating Detail Losses and Additive Impairments
* Image Retagging Using Collaborative Tag Propagation
* Impact of Spectrum Sensing Frequency and Packet-Loading Scheme on Multimedia Transmission Over Cognitive Radio Networks, The
* In-Network Packet Scheduling and Rate Allocation: A Content Delivery Perspective
* Integrating Visual Saliency and Consistency for Re-Ranking Image Search Results
* Interactive 3-D Audio System With Loudspeakers, An
* Interactive Image Segmentation With Multiple Linear Reconstructions in Windows
* Introduction to the ICME2010 Special Issue
* IRS: A Detour Routing System to Improve Quality of Online Games
* Kernel Framework for Content-Based Artist Recommendation System in Music, A
* Layer-Aware Forward Error Correction for Mobile Broadcast of Layered Media
* Layered Internet Video Adaptation (LIVA): Network-Assisted Bandwidth Sharing and Transient Loss Protection for Video Streaming
* Layered Multicast With Inter-Layer Network Coding for Multimedia Streaming
* Learning Visual Contexts for Image Annotation From Flickr Groups
* Less is More: Efficient 3-D Object Retrieval With Query View Selection
* Low Complexity Sign Detection and Text Localization Method for Mobile Applications, A
* Low-Complexity Inverse Transforms of Video Codecs in an Embedded Programmable Platform
* Markup SVG: An Online Content-Aware Image Abstraction and Annotation Tool
* MIMiC: Multimodal Interactive Motion Controller
* Missing Image Data Reconstruction Based on Adaptive Inverse Projection via Sparse Representation
* MobiUP: An Upsampling-Based System Architecture for High-Quality Video Streaming on Mobile Devices
* Moving Region Segmentation From Compressed Video Using Global Motion Estimation and Markov Random Fields
* Multi-Core Platforms for Beamforming and Wave Field Synthesis
* Multi-Gesture Interaction System Using a 3-D Iris Disk Model for Gaze Estimation and an Active Appearance Model for 3-D Hand Pointing, A
* Multi-Resolution Design for Large-Scale and High-Resolution Monitoring
* Nongeometric Distortion Smoothing Approach for Depth Map Preprocessing
* Object Retrieval Using Visual Query Context
* On Complexity Modeling of H.264/AVC Video Decoding and Its Application for Energy Efficient Decoding
* On Distributed Multimedia Scheduling With Constrained Control Channels
* On-the-Fly Erasure Coding for Real-Time Video Applications
* One-Pulse FEC Coding for Robust CELP-Coded Speech Transmission Over Erasure Channels
* Online Buffer Fullness Estimation Aided Adaptive Media Playout for Video Streaming
* Online Video Stream Abstraction and Stylization
* Optimal Bandwidth Assignment for Multiple-Description-Coded Video
* Optimal Layered Video IPTV Multicast Streaming Over Mobile WiMAX Systems
* Optimizing FEC Transmission Strategy for Minimizing Delay in Lossless Sequential Streaming
* Optimizing Multi-Rate Peer-to-Peer Video Conferencing Applications
* Optimizing Visual Search Reranking via Pairwise Learning
* Perceptually Guided Fast Compression of 3-D Motion Capture Data
* Performance Evaluation of IPTV Over Wireless Home Networks
* Practical Image Quality Metric Applied to Image Coding
* Prioritized Distributed Video Delivery With Randomized Network Coding
* Probabilistic Novelty Detection for Acoustic Surveillance Under Real-World Conditions
* Rate and Distortion Modeling of CGS Coded Scalable Video Content
* Reduced-Reference Image Quality Assessment Using Reorganized DCT-Based Image Representation
* Robust Camera Calibration and Player Tracking in Broadcast Basketball Video
* Robust Luby Transform Encoding Pattern-Aware Symbol Packetization Algorithm for Video Streaming Over Wireless Network, A
* Robust Spatial Matching for Object Retrieval and Its Parallel Implementation on GPU
* Routing-Aware Multiple Description Video Coding Over Mobile Ad-Hoc Networks
* Scalable Video Multicast in Hybrid 3G/Ad-Hoc Networks
* Selection of Network Coding Nodes for Minimal Playback Delay in Streaming Overlays
* Semi-Automatic Tagging of Photo Albums via Exemplar Selection and Tag Inference
* Sensitivity Analysis of the Human Visual System for Depth Cues in Stereoscopic 3-D Displays
* Spatial Audio Object Coding With Two-Step Coding Structure for Interactive Audio Service
* Spatial Correlation-Based Image Compression Framework for Wireless Multimedia Sensor Networks, A
* Special Section on Interactive Multimedia
* Spread Spectrum Visual Sensor Network Resource Management Using an End-to-End Cross-Layer Design
* Stratification-Based Keyframe Cliques for Effective and Efficient Video Representation
* Subjective Quality Evaluation via Paired Comparison: Application to Scalable Video Coding
* Superchunk-Based Efficient Search in P2P-VoD System
* Survey of Audio-Based Music Classification and Annotation, A
* Tag Tagging: Towards More Descriptive Keywords of Image Content
* Temporal Color Consistency-Based Video Reproduction for Dichromats
* Text-Video Completion Using Structure Repair and Texture Propagation
* Touch Interface Exploiting Time-Frequency Classification Using Zak Transform for Source Localization on Solids, A
* Training Surrogate Sensors in Musical Gesture Acquisition Systems
* Two-Level Downlink Scheduling for Real-Time Multimedia Services in LTE Networks
* Unequal Error Protection Using Fountain Codes With Applications to Video Communication
* Unifying Low-Level and High-Level Music Similarity Measures
* Unsupervised Alignment of News Video and Text Using Visual Patterns and Textual Concepts
* Utilizing Related Samples to Enhance Interactive Concept-Based Video Search
* Video Inpainting on Digitized Vintage Films via Maintaining Spatiotemporal Continuity
* Virtual Contour Guided Video Object Inpainting Using Posture Mapping and Retrieval
* Web Image and Video Mining Towards Universal and Robust Age Estimator
116 for MultMed(13)
MultMed(14)
* Adaptive Workload Equalization in Multi-Camera Surveillance Systems
* Advanced Hierarchical Motion Estimation Scheme With Lossless Frame Recompression and Early-Level Termination for Beyond High-Definition Video Coding, An
* Advanced IPTV Services Personalization Through Context-Aware Content Recommendation
* Aesthetics-Based Stereoscopic Photo Cropping for Heterogeneous Displays
* Affine Model Based Motion Compensation Prediction for Zoom
* Analysis and Evaluation of Adaptive LDPC AL-FEC Codes for Content Download Services
* Analytical Framework for Improving the Quality of Streaming Over TCP
* Analytical Modeling for Delay-Sensitive Video Over WLAN
* Assessment of Stereoscopic Crosstalk Perception
* Asymmetric Coding of Multi-View Video Plus Depth Based 3-D Video for View Rendering
* Automatic Light Scene Setting Through Image-Based Sparse Light Effect Approximation
* Automatic Role Recognition in Multiparty Conversations: An Approach Based on Turn Organization, Prosody, and Conditional Random Fields
* Bayesian Visual Reranking
* Bottom-Up Saliency Detection Model Based on Human Visual Sensitivity and Amplitude Spectrum
* Bridging the Semantic Gap via Functional Brain Imaging
* Causal Flow
* Content-Based Analysis Improves Audiovisual Archive Retrieval
* Content-Based Image Compression for Arbitrary-Resolution Display Devices
* Cooperation and Coalition in Multimedia Fingerprinting Colluder Social Networks
* Coordinate Live Streaming and Storage Sharing for Social Media Content Distribution
* Correlation-Aware QoS Routing With Differential Coding for Wireless Video Sensor Networks
* Cross-Layer Framework for QoS Support in Wireless Multimedia Sensor Networks
* Delay-Cognizant Interactive Streaming of Multiview Video With Free Viewpoint Synthesis
* Depth Video Coding Using Adaptive Geometry Based Intra Prediction for 3-D Video Systems
* Design and Synthesis for Multimedia Systems Using the Targeted Dataflow Interchange Format
* Difficulty Guided Image Retrieval Using Linear Multiple Feature Embedding
* Discovering Image Semantics in Codebook Derivative Space
* Discriminating Joint Feature Analysis for Multimedia Data Understanding
* Dynamic Sub-GOP Forward Error Correction Code for Real-Time Video Applications
* Effective Codebooks for Human Action Representation and Classification in Unconstrained Videos
* Efficient and Rate-Distortion Optimal Wavelet Packet Basis Selection in JPEG2000
* Efficient Frame Concealment for Depth Image-Based 3-D Video Transmission
* Efficient Genre-Specific Semantic Video Indexing
* Efficient Parallel Framework for H.264/AVC Deblocking Filter on Many-Core Platform
* Efficient Video Coding Using Legacy Algorithmic Approaches
* Energy-Efficient Resource Allocation and Scheduling for Multicast of Scalable Video Over Wireless Networks
* Enhanced 3-D Modeling for Landmark Image Classification
* Enhanced Bag-of-Visual Word Vector Space Model to Represent Visual Content in Athletics Images, An
* Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition
* Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification
* Exploring Contextual Redundancy in Improving Object-Based Video Coding for Video Sensor Networks Surveillance
* Exploring Locality of Reference in P2P VoD Systems
* Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors, A
* Fast Dynamic Range Compression With Local Contrast Preservation Algorithm and Its Application to Real-Time Video Enhancement, A
* Fast Mode Decision for H.264/AVC Based on Rate-Distortion Clustering
* Feature Combination in Kernel Space for Distance Based Image Hashing
* Finding Celebrities in Billions of Web Images
* Frame Rate Optimization Framework for Improving Continuity in Video Streaming, A
* Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification
* Generic Framework for Video Annotation via Semi-Supervised Learning, A
* Global 1-Mbps Peer-Assisted Streaming: Fine-Grain Measurement of a Configurable Platform
* Harvesting Social Images for Bi-Concept Search
* Hidden-Concept Driven Multilabel Image Annotation and Label Ranking
* HodgeRank on Random Graphs for Subjective Video Quality Assessment
* Hybrid Algorithm for Effective Lossless Compression of Video Display Frames, A
* Interactive Video Indexing With Statistical Active Learning
* Introduction to the ICME 2011 Special Issue
* Introduction to the Special Section on Smart, Social, and Converged TV
* Investigating the Effects of Multiple Factors Towards More Accurate 3-D Object Retrieval
* Joint Demosaicing and Subpixel-Based Down-Sampling for Bayer Images: A Fast Frequency-Domain Analysis Approach
* Joint Source-Channel Coding and Optimization for Layered Video Broadcasting to Heterogeneous Devices
* Kernel Cross-Modal Factor Analysis for Information Fusion With Application to Bimodal Emotion Recognition
* Large-Scale Vehicle Detection, Indexing, and Search in Urban Surveillance Videos
* Learn to Personalized Image Search From the Photo Sharing Websites
* Learn2Dance: Learning Statistical Music-to-Dance Mappings for Choreography Synthesis
* Learn2Dance: Learning Statistical Music-to-Dance Mappings for Choreography Synthesis
* Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding
* Learning Semantics From Multimedia Web Resources: An Introduction to the Special Issue
* Long-Term Incremental Web-Supervised Learning of Visual Concepts via Random Savannas
* Low-Complexity Video Quality Assessment Using Temporal Quality Variations
* Low-Decoding-Latency Buffer Compression for Graphics Processing Units
* Low-Delay Peer-To-Peer Media Streaming Based on Network Coding Over Randomized Multicast Trees
* Low-Latency Video Streaming With Congestion Control in Mobile Ad-Hoc Networks
* Managing Digital Rights for P2P Live Broadcast and Recording on the Internet
* Matrix-Based Approach to Unsupervised Human Action Categorization, A
* Model-Based Shot Boundary Detection Technique Using Frame Transition Parameters, A
* Movie2Comics: Towards a Lively Video Content Presentation
* Moving Object Detection and Tracking Using a Spatio-Temporal Graph in H.264/AVC Bitstreams for Video Surveillance
* Multi-Camera Approach to Image-Based Rendering and 3-D/Multiview Display of Ancient Chinese Artifacts, A
* Multimodal Video Indexing and Retrieval Using Directed Information
* Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation
* Multiple Description of Coded Video for Path Diversity Streaming Adaptation
* Nonrigid Structure-From-Motion From 2-D Images Using Markov Chain Monte Carlo
* Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, A
* Normalized Energy Density-Based Forensic Detection of Resampled Images
* Novel Large-Scale Digital Forensics Service Platform for Internet Videos, A
* Novel Multiple Kernel Learning Framework for Heterogeneous Feature Fusion and Variable Selection, A
* Object Co-Segmentation Based on Shortest Path Algorithm and Saliency Model
* Optimizing Selective ARQ for H.264 Live Streaming: A Novel Method for Predicting Loss-Impact in Real Time
* P2P-Based IPTV Services: Design, Deployment, and QoE Measurement
* Parallel Lasso for Large-Scale Video Concept Detection
* Path Modeling and Retrieval in Distributed Video Surveillance Databases
* Photo Stream Alignment and Summarization for Collaborative Photo Collection and Sharing
* Preference-Aware View Recommendation System for Scenic Photos Based on Bag-of-Aesthetics-Preserving Features
* Pricing and Investment for Online TV Content Platforms
* Privacy Enabled Digital Rights Management Without Trusted Third Party Assumption
* Probabilistic Motion Diffusion of Labeling Priors for Coherent Video Segmentation
* Prototype-Based Image Search Reranking
* Purposive Hidden-Object-Game: Embedding Human Computation in Popular Game
* QoE Prediction Model and its Application in Video Quality Adaptation Over UMTS Networks
* Quality-Centric TCP-Friendly Congestion Control for Multimedia Transmission, A
* Quantitative Characterization of Semantic Gaps for Learning Complexity Estimation and Inference Model Selection
* Query Difficulty Prediction for Web Image Search
* Reading Users' Minds From Their Eyes: A Method for Implicit Image Annotation
* Real-Time Head and Hand Tracking Based on 2.5D Data
* Recommender System for Sport Videos Based on User Audiovisual Consumption
* Reducing DRAM Image Data Access Energy Consumption in Video Processing
* Rhythm of Motion Extraction and Rhythm-Based Cross-Media Alignment for Dance Videos
* Robust Face-Name Graph Matching for Movie Character Identification
* Robust Image Coding Based Upon Compressive Sensing
* Robust Watermarking of Compressed and Encrypted JPEG2000 Images
* Robustly Extracting Captions in Videos Based on Stroke-Like Edges and Spatio-Temporal Analysis
* S3-MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications
* Sampling and Ontologically Pooling Web Images for Visual Concept Learning
* Scalable Comic-Like Video Summaries and Layout Disturbance
* Search and Retrieval of Rich Media Objects Supporting Multiple Multimodal Queries
* Secure and Efficient Authentication Scheme for Access Control in Mobile Pay-TV Systems, A
* Semantic Model Vectors for Complex Video Event Recognition
* Single Image Realism Assessment and Recoloring by Color Compatibility
* Sketch-Based Annotation and Visualization in Video Authoring
* Sliding-Window Designs for Vertex-Based Shape Coding
* Sparse Ensemble Learning for Concept Detection
* Structure Tensor Series-Based Large Scale Near-Duplicate Video Retrieval
* Summarizing Rushes Videos by Motion, Object, and Event Understanding
* Tag-Based Image Retrieval Improved by Augmented Features and Group-Based Refinement
* Tennis Real Play
* Throughput Scaling of Convolution for Error-Tolerant Multimedia Applications
* Towards Cross-Version Harmonic Analysis of Music
* Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection
* Understanding Kin Relationships in a Photo
* Unsupervised Salient Object Segmentation Based on Kernel Density Estimation and Two-Phase Graph Cut
* Unsupervised Semantic Feature Discovery for Image Object Retrieval and Tag Refinement
* User-Aware Image Tag Refinement via Ternary Semantic Analysis
* Video Completion Using Bandlet Transform
* Visual Sentences for Pose Retrieval Over Low-Resolution Cross-Media Dance Collections
* Visually Summarizing Web Pages Through Internal and External Images
* Weakly Supervised Graph Propagation Towards Collective Image Parsing
* Web Image Annotation Via Subspace-Sparsity Collaborated Feature Selection
* Web Video Geolocation by Geotagged Social Resources
* Web-Based Classifiers for Human Action Recognition
* Wireless H.264 Video Quality Enhancement Through Optimal Prioritized Packet Fragmentation
141 for MultMed(14)
MultMed(15)
* Access Point-Based FEC Mechanism for Video Transmission Over Wireless LANs, An
* Active Bucket Categorization for High Recall Video Retrieval
* Adaptive Cloud Downloading Service, An
* Adaptive Mobile Cloud Computing to Enable Rich Mobile Multimedia Applications
* Aesthetic Image Enhancement by Dependence-Aware Object Recomposition
* Affective Labeling in a Content-Based Recommender System for Images
* AMES-Cloud: A Framework of Adaptive Mobile Video Streaming and Efficient Social Video Sharing in the Clouds
* Appearance-Based QR Code Beautifier
* Attribute-Based Access to Scalable Media in Cloud-Assisted Content Sharing Networks
* Automatic Training Image Acquisition and Effective Feature Selection From Community-Contributed Photos for Facial Attribute Detection
* Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information
* Bootstrapping Visual Categorization With Relevant Negatives
* Branch and Data Herding: Reducing Control and Memory Divergence for Error-Tolerant GPU Applications
* Capacity Management of Seed Servers in Peer-to-Peer Streaming Systems With Scalable Video Streams
* Casual Stereoscopic Photo Authoring
* Cloud-Based Image Coding for Mobile Devices: Toward Thousands to One Compression
* CloudMoV: Cloud-Based Mobile Social TV
* Co-Salient Object Detection From Multiple Images
* Collusion-Resistant Conditional Access System for Flexible-Pay-Per-Channel Pay-TV Broadcasting, A
* Compressing 3D Trees With Rendering Efficiency Based on Differential Data
* Compressive Video Streaming: Design and Rate-Energy-Distortion Analysis
* Connectivity, Online Social Capital, and Mood: A Bayesian Nonparametric Analysis
* Consistent Stereo Matching Under Varying Radiometric Conditions
* Content-Based Photo Quality Assessment
* Context-Aware Video Retargeting via Graph Model
* Continuous Birdsong Recognition Using Gaussian Mixture Modeling of Image Shape Features
* Cooperative Delivery Techniques to Support Video-on-Demand Service in IPTV Networks
* Correspondence Matching of Multi-View Video Sequences Using Mutual Information Based Similarity Measure
* Crowdsourcing Multimedia QoE Evaluation: A Trusted Framework
* Cube2Video: Navigate Between Cubic Panoramas in Real-Time
* Design QoS-Aware Multi-Path Provisioning Strategies for Efficient Cloud-Assisted SVC Video Streaming to Heterogeneous Clients
* Differential Coding-Based Scheduling Framework for Wireless Multimedia Sensor Networks, A
* Directive Contrast Based Multimodal Medical Image Fusion in NSCT Domain
* Discovering Video Shot Categories by Unsupervised Stochastic Graph Partition
* Downlink Power Control for Multi-User VBR Video Streaming in Cellular Networks
* Edge-Preserving Texture Suppression Filter Based on Joint Filtering Schemes
* Effective CU Size Decision Method for HEVC Encoders, An
* Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval
* Efficient Fine-Granular Scalable Coding of 3D Mesh Sequences
* Efficient Resource Provisioning and Rate Selection for Stream Mining in a Community Cloud
* Emotional Accompaniment Generation System Based on Harmonic Progression
* Empirical Model of Multiview Video Coding Efficiency for Wireless Multimedia Sensor Networks, An
* Energy and Quality-Aware Multimedia Signal Processing
* Error Tolerant Multimedia Stream Processing: There's Plenty of Room at the Top (of the System Stack)
* Example-Based Color Transfer for Gradient Meshes
* Example-Based Super-Resolution With Soft Information and Decision
* Exploiting Semantic and Visual Context for Effective Video Annotation
* Face Expression Recognition by Cross Modal Data Association
* Fairness Resource Allocation in Blind Wireless Multimedia Communications
* Fast and Efficient Transcoding Based on Low-Complexity Background Modeling and Adaptive Block Classification
* Fast Intra-Coding for H.264/AVC by Using Projection-Based Predicted Block Residuals
* FAST Rate Allocation for JPEG2000 Video Transmission Over Time-Varying Channels
* Feature Processing and Modeling for 6D Motion Gesture Recognition
* Feature Selection for Multimedia Analysis by Sharing Information Among Multiple Tasks
* Fluorescence Tomography Reconstruction With Simultaneous Positron Emission Tomography Priors
* From Logo to Object Segmentation
* Fully Automatic and Frame-Accurate Video Synchronization Using Bitrate Sequences
* Generating Visual Summaries of Geographic Areas Using Community-Contributed Images
* GPS Estimation for Places of Interest From Social Users' Uploaded Photos
* GPS/HPS-and Wi-Fi Fingerprint-Based Location Recognition for Check-In Applications Over Smartphones in Cloud-Based LBSs
* GPU-Accelerated Real-Time Tracking of Full-Body Motion With Multi-Layer Search
* Gram-Based String Paradigm for Efficient Video Subsequence Search, A
* Graph-Based Topic-Focused Retrieval in Distributed Camera Network
* Group Delay Based Methods for Speaker Segregation and its Application in Multimedia Information Retrieval
* Guest Editorial for Special Section on Multimodal Biomedical Imaging: Algorithms and Applications
* Guest Editorial: Special section on cloud-based mobile media: Infrastructure, services, and applications
* Guest Editorial: Special Section on New Software/Hardware Paradigms for Error-Tolerant Multimedia Systems
* Hessian Regularized Support Vector Machines for Mobile Image Annotation on the Cloud
* Image Re-Attentionizing
* Inferring Contexts From Facebook Interactions: A Social Publicity Scenario
* Integrating Non-Repetitive LT Encoders With Modified Distribution to Achieve Unequal Erasure Protection
* Integration of Multivariate Data Streams With Bandpower Signals
* Interaction Design for Mobile Visual Search
* Interactive Multimodal Visual Search on Mobile Device
* Interactive Multiview Video System With Low Complexity 2D Look Around at Decoder
* Interactive Schematic Summaries for Faceted Exploration of Surveillance Video
* Joint Bit Allocation and Rate Control for Coding Multi-View Video Plus Depth Based 3D Video
* Joint Multimodal Group Analysis Framework for Modeling Corticomuscular Activity, A
* Joint Social and Content Recommendation for User-Generated Videos in Online Social Network
* Joint Spatio-Temporal Alignment of Sequences
* JPIP Proxy Server With Prefetching Strategies Based on User-Navigation Model and Semantic Map
* Just Noticeable Difference Estimation for Images With Free-Energy Principle
* Kinect-Like Depth Data Compression
* Latent Mixture of Discriminative Experts
* Learning a Contextual Multi-Thread Model for Movie/TV Scene Segmentation
* Learning Crowdsourced User Preferences for Visual Summarization of Image Collections
* Learning Query-Specific Distance Functions for Large-Scale Web Image Search
* Learning Semantic Signatures for 3D Object Retrieval
* Learning to Distribute Vocabulary Indexing for Scalable Visual Search
* Learning to Photograph: A Compositional Perspective
* Learning to Produce 3D Media From a Captured 2D Video
* Learning to Reassemble Shredded Documents
* Linking Brain Responses to Naturalistic Music Through Analysis of Ongoing EEG and Stimulus Features
* Local Disparity Estimation With Three-Moded Cross Census and Advanced Support Weight
* Localization of Taps on Solid Surfaces for Human-Computer Touch Interfaces
* Low-Complexity Bit-Plane Entropy Coding and Rate Control for 3-D DWT Based Video Coding, A
* Low-Cost Eye Gaze Prediction System for Interactive Networked Video Streaming
* LP-SR: Approaching Optimal Storage and Retrieval for Video-on-Demand
* Markov Decision Process Based Energy-Efficient On-Line Scheduling for Slice-Parallel Video Decoders on Multicore Systems
* Measurement and Modeling of Video Watching Time in a Large-Scale Internet Video-on-Demand System
* Message Passing Matching Dynamics for Overlapping Point Identification
* Mixed Reality Virtual Clothes Try-On System, A
* Mode Decision-Based Algorithm for Complexity Control in H.264/AVC
* Modeling and Analysis of Skype Video Calls: Rate Control and Video Quality
* Modeling Functional Roles Dynamics in Small Group Interactions
* Modeling of Driver Behavior in Real World Scenarios Using Multiple Noninvasive Sensors
* Monitoring of Tumor Response to Au Nanorod-Indocyanine Green Conjugates Mediated Therapy With Fluorescence Imaging and Positron Emission Tomography
* MSIDX: Multi-Sort Indexing for Efficient Content-Based Image Search and Retrieval
* Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis
* Multimedia Event Detection Using A Classifier-Specific Intermediate Representation
* Multimedia Fusion With Mean-Covariance Analysis
* Multimedia Information Retrieval Based on Late Semantic Fusion Approaches: Experiments on a Wikipedia Image Collection
* Multimodal Analysis for Identification and Segmentation of Moving-Sounding Objects
* Multimodal Approach to Speaker Diarization on TV Talk-Shows, A
* Multimodal Photoacoustic Tomography
* Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention
* NetClust: A Framework for Scalable and Pareto-Optimal Media Server Placement
* Network and Device Aware QoS Approach for Cloud-Based Mobile Streaming, A
* Network Coding Meets Multimedia: A Review
* New Fast Encoding Algorithm Based on an Efficient Motion Estimation Process for the Scalable Video Coding Standard, A
* Non-Parametric Super-Resolution Using a Bi-Sensor Camera
* On a Highly Efficient RDO-Based Mode Decision Pipeline Design for AVS
* On the Investigation of Cloud-Based Mobile Media Environments with Service-Populating and QoS-Aware Mechanisms
* On-Device Mobile Visual Location Recognition by Integrating Vision and Inertial Sensors
* Online Allocation of Communication and Computation Resources for Real-Time Multimedia Services
* Optimization Framework for QoS-Enabled Adaptive Video Streaming Over OpenFlow Networks, An
* Optimizing Cloud Resources for Delivering IPTV Services Through Virtualization
* Patch-Based Image Warping for Content-Aware Retargeting
* Personal Clothing Retrieval on Photo Collections by Color and Attributes
* Preserving Motion-Tolerant Contextual Visual Saliency for Video Resizing
* Proxy-Based Multi-Stream Scalable Video Adaptation Over Wireless Networks Using Subjective Quality and Rate Models
* QoE-Driven Cache Management for HTTP Adaptive Bit Rate Streaming Over Wireless Networks
* Quantitative Model and Analysis of Information Confusion in Social Networks, A
* Quantitative Study of Music Listening Behavior in a Social and Affective Context
* Query-Adaptive Image Search With Hash Codes
* Query-Document-Dependent Fusion: A Case Study of Multimodal Music Retrieval
* Raptor Codes Based Unequal Protection for Compressed Video According to Packet Priority
* Real-Time, Full 3-D Reconstruction of Moving Foreground Objects From Multiple Consumer Depth Cameras
* Reduced-Reference Image Quality Assessment with Visual Information Fidelity
* Reversible Data Hiding With Optimal Value Transfer
* Review of Recent Advances in Registration Techniques Applied to Minimally Invasive Therapy, A
* Robust and Energy Efficient Multimedia Systems via Likelihood Processing
* Robust and Scalable Visual Category and Action Recognition System Using Kernel Discriminant Analysis With Spectral Regression, A
* Robust Part-Based Hand Gesture Recognition Using Kinect Sensor
* Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval
* Robust Technique for Motion-Based Video Sequences Temporal Alignment, A
* Saliency Detection Model Using Low-Level Features Based on Wavelet Transform, A
* Scalable Content-Based Music Retrieval Using Chord Progression Histogram and Tree-Structure LSH
* Scalable Face Image Retrieval Using Attribute-Enhanced Sparse Codewords
* Scalable Precision Analysis Framework, A
* Scalable Resource Allocation for SVC Video Streaming Over Multiuser MIMO-OFDM Networks
* Script-to-Movie: A Computational Framework for Story Movie Composition
* Segmentation and Rectification of Pictures in the Camera-Captured Images of Printed Documents
* Self-Learning Approach to Single Image Super-Resolution, A
* Sensing Trending Topics in Twitter
* Sequential Error Concealment for Video/Images by Sparse Linear Prediction
* Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion
* Simplification Resilient LDPC-Coded Sparse-QIM Watermarking for 3D-Meshes
* Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring
* Speaking Effect Removal on Emotion Recognition From Facial Expressions Based on Eigenface Conversion
* Spectral Hashing With Semantically Consistent Graph for Image Indexing
* Style Transfer Via Image Component Analysis
* Toward Blind Scheduling in Mobile Media Cloud: Fairness, Simplicity, and Asymptotic Optimality
* Towards Cross-Domain Learning for Social Video Popularity Prediction
* Tracking Human Under Occlusion Based on Adaptive Multiple Kernels With Projected Gradients
* Tracking Large-Scale Video Remix in Real-World Events
* Transcranial Ultrasound and Magnetic Resonance Image Fusion With Virtual Navigator
* Travel Recommendation by Mining People Attributes and Travel Group Types From Community-Contributed Photos
* Two-Level Hierarchical Alignment for Semi-Coupled HMM-Based Audiovisual Emotion Recognition With Temporal Course
* Understanding the Characteristics of Internet Short Video Sharing: A YouTube-Based Measurement Study
* Understanding the External Links of Video Sharing Sites: Measurement and Analysis
* Unsupervised Hierarchical Feature Learning Framework for One-Shot Image Recognition, An
* Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features
* Video Error Concealment Using a Computation-Efficient Low Saliency Prior
* Video-to-Shot Tag Propagation by Graph Sparse Group Lasso
* VideoPuzzle: Descriptive One-Shot Video Composition
* Visual Speech Synthesis Using a Variable-Order Switching Shared Gaussian Process Dynamical Model
* Visually Favorable Tone-Mapping With High Compression Performance in Bit-Depth Scalable Video Coding
* Web Multimedia Object Classification Using Cross-Domain Correlation Knowledge
* YouTube Lens: Crowdsourced Personality Impressions and Audiovisual Analysis of Vlogs, The
180 for MultMed(15)
MultMed(16)
* 3-D Interfaces to Improve the Performance of Visual Known-Item Search
* Accelerating Index-Based Audio Identification
* Acceptability-Based QoE Models for Mobile Video
* Accurate and Robust Range Image Registration Algorithm for 3D Object Modeling, An
* Adaptive Learning for Celebrity Identification With Video Context
* Adaptive Mechanism for Optimal Content Download in Wireless Networks, An
* Adaptive Thread Scheduling Mechanism With Low-Power Register File for Mobile GPUs, An
* Adaptive Watermarking and Tree Structure Based Image Quality Estimation
* Advanced Moving Object Detection Algorithm for Automatic Traffic Monitoring in Real-World Limited Bandwidth Networks, An
* Analysis and Predictive Modeling of Body Language Behavior in Dyadic Interactions From Multimodal Interlocutor Cues
* Analysis of Buffer Starvation With Application to Objective QoE Optimization of Streaming Services
* Analytical Approach for Voice Capacity Estimation Over WiFi Network Using ITU-T E-Model, An
* Assessment of Learned Score Features for Modeling Expressive Dynamics in Music, An
* Asymmetric Pruning for Learning Cascade Detectors
* Atmospheric Perspective Effect Enhancement of Landscape Photographs Through Depth-Aware Contrast Manipulation
* Audio Properties of Perceived Boundaries in Music
* Augmenting Image Descriptions Using Structured Prediction Output
* Automatic Estimation of Multiple Motion Fields From Video Sequences Using a Region Matching Based Approach
* Automatic Human Mocap Data Classification
* Bag-of-Importance Model With Locality-Constrained Coding Based Feature Learning for Video Summarization, A
* Band Codes for Energy-Efficient Network Coding With Application to P2P Mobile Streaming
* Best Practices for QoE Crowdtesting: QoE Assessment With Crowdsourcing
* BM25 With Exponential IDF for Instance Search
* Broadcasting Oneself: Visual Discovery of Vlogging Styles
* CAVVA: Computational Affective Video-in-Video Advertising
* CBM: Online Strategies on Cost-Aware Buffer Management for Mobile Video Streaming
* Channel Time Allocation PSO for Gigabit Multimedia Wireless Networks
* Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition
* Cloud Mobile Media: Reflections and Outlook
* Coding Structure and Replication Optimization for Interactive Multiview Video Streaming
* Comprehensive Study Over VLAD and Product Quantization in Large-Scale Image Retrieval, A
* Compressing Encrypted Images With Auxiliary Information
* Conceptlets: Selective Semantics for Classifying Video Events
* Concurrent Single-Label Image Classification and Annotation via Efficient Multi-Layer Group Sparse Coding
* Content-Based Prediction of Movie Style, Aesthetics, and Affect: Data Set and Baseline Experiments
* Contextual Object Detection With Spatial Context Prototypes
* Contextual Query Expansion for Image Retrieval
* Corpus Development for Affective Video Indexing
* Correlation-Aware Packet Scheduling in Multi-Camera Networks
* Corruptive Artifacts Suppression for Example-Based Color Transfer
* Creating Experts From the Crowd: Techniques for Finding Workers for Difficult Tasks
* Creating the Sydney York Morphological and Acoustic Recordings of Ears Database
* Cross-Modal Approach for Extracting Semantic Relationships Between Concepts Using Tagged Images, A
* Data-Driven Approach for Facial Expression Retargeting in Video, A
* Depth-Based Multiview Distributed Video Coding
* Depth-Discrepancy-Compensated Inter-Prediction With Adaptive Segment Management for Multiview Depth Video Coding
* Discrete Cosine Transform Locality-Sensitive Hashes for Face Retrieval
* Discriminative Soft Bag-of-Visual Phrase for Mobile Landmark Recognition
* Discriminative Structure Learning for Semantic Concept Detection With Graph Embedding
* Distortion-Fair Cross-Layer Resource Allocation for Scalable Video Transmission in OFDMA Wireless Networks
* Distributed QoS Architectures for Multimedia Streaming Over Software Defined Networks
* Distributed Rate Allocation in Inter-Session Network Coding
* Distributed Scheduling for Low-Delay and Loss-Resilient Media Streaming With Network Coding
* Dynamic Load Balancing for Real-Time Video Encoding on Heterogeneous CPU+GPU Systems
* Dynamic Request Redirection and Elastic Service Scaling in Cloud-Centric Media Networks
* Dynamic Texture Recognition Using Multiscale Binarized Statistical Image Features
* Effective Results Ranking for Mobile Query by Singing/Humming Using a Hybrid Recommendation Mechanism
* Effective Video Retargeting With Jittery Assessment
* Efficient H.264/AVC Video Coding with Adaptive Transforms
* Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters
* Efficient Multi-View Generation Method From a Single-View Video Based on Affine Geometry Information, An
* Efficient Patch-Wise Non-Uniform Deblurring for a Single Image
* Efficient Viewer-Centric Depth Adjustment Based on Virtual Fronto-Parallel Planar Projection in Stereo 3D Images
* Enabling Geometry-Based 3-D Tele-Immersion With Fast Mesh Compression and Linear Rateless Coding
* Example-Based Human Motion Extrapolation and Motion Repairing Using Contour Manifold
* Example-Based Video Stereolization With Foreground Segmentation and Depth Propagation
* Exploiting Click Constraints and Multi-view Features for Image Re-ranking
* Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss
* Extracting Primary Objects by Video Co-Segmentation
* Face Distortion Recovery Based on Online Learning Database for Conversational Video
* Fashion Parsing With Weak Color-Category Labels
* Fast HEVC Inter CU Selection Method Based on Pyramid Motion Divergence, A
* Fast Single Image Super-Resolution via Self-Example Learning and Sparse Representation
* Gaze-Based Relevance Feedback for Realizing Region-Based Image Retrieval
* Generalized Equalization Model for Image Enhancement
* Generative Model for Concurrent Image Retrieval and ROI Segmentation, A
* Glottal and Vocal Tract Characteristics of Voice Impersonators
* Guest Editorial: Special Section on Music Data Mining
* Guest Editorial: Special Section on Socio-Mobile Media Analysis and Retrieval
* H.264 High-Profile Intra-Prediction with Adaptive Selection Between the Parallel and Pipelined Executions of Prediction Modes, An
* Hire me: Computational Inference of Hirability in Employment Interviews Based on Nonverbal Behavior
* Illumination Robust Video Foreground Prediction Based on Color Recovering
* Image Alignment by Piecewise Planar Region Matching
* Image Attribute Adaptation
* Image Relevance Prediction Using Query-Context Bag-of-Object Retrieval Model
* Image Similarity Using Sparse Representation and Compression Distance
* Impact of Random and Burst Packet Losses on H.264 Scalable Video Coding
* In-Network Quality Optimization for Adaptive Video Streaming Services
* Instant Mobile Video Search With Layered Audio-Video Indexing and Progressive Transmission
* Intent-Aware Video Search Result Optimization
* Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection
* Interruption Probability of Wireless Video Streaming With Limited Video Lengths
* Investigating Redundant Internet Video Streaming Traffic on iOS Devices: Causes and Solutions
* Iterative Pricing-Based Rate Allocation for Video Streams With Fluctuating Bandwidth Availability
* Joint Sampling Rate and Bit-Depth Optimization in Compressive Video Sampling
* Kernel-Based MMSE Multimedia Signal Reconstruction and Its Application to Spatial Error Concealment
* Layered Wireless Video Relying on Minimum-Distortion Inter-Layer FEC Coding
* Learning Effective Event Models to Recognize a Large Number of Human Actions
* Learning High-Level Feature by Deep Belief Networks for 3-D Model Retrieval and Recognition
* Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks
* Loss-Resilient Coding of Texture and Depth for Free-Viewpoint Video Conferencing
* Low Complexity Adaptive View Synthesis Optimization in HEVC Based 3D Video Coding
* Low Transmission Overhead Framework of Mobile Visual Search Based on Vocabulary Decomposition, A
* Low-Complexity Packet Scheduling Algorithms for Streaming Scalable Media Based on Time Utility Function
* Mining Crowdsourced First Impressions in Online Social Video
* Mobile Landmark Search with 3D Models
* Model-Assisted Cross-Layer Design of an Energy-Efficient Mobile Video Cloud, A
* Motion Vector Recovery for Video Error Concealment by Using Iterative Dynamic-Programming Optimization
* MRF-Based Fast HEVC Inter CU Decision With the Variance of Absolute Differences
* Multi-Array Camera Disparity Enhancement
* Multi-Label Learning With Fused Multimodal Bi-Relational Graph
* Multi-Objective Optimization for Multimodal Visualization
* Multi-Source-Driven Asynchronous Diffusion Model for Video-Sharing in Online Social Networks
* Multimodal Interactive Continuous Scoring of Subjective 3D Video Quality of Experience
* Multipath Video Real-Time Streaming by Field-Based Anycast Routing
* Near-Duplicate Subsequence Matching Between the Continuous Stream and Large Video Dataset
* New Reference Frame Recompression Algorithm and Its VLSI Architecture for UHDTV Video Codec, A
* Noise Robust Face Hallucination via Locality-Constrained Representation
* Non-Blind Structure-Preserving Substitution Watermarking of H.264/CAVLC Inter-Frames
* Non-Rigid Structure-From-Motion With Uniqueness Constraint and Low Rank Matrix Fitting Factorization
* Normalized Correlation-Based Quantization Modulation for Robust Watermarking
* Novel Efficient HEVC Decoding Solution on General-Purpose Processors
* On a Hashing-Based Enhancement of Source Separation Algorithms Over Finite Fields With Network Coding Perspectives
* On Designing Paired Comparison Experiments for Subjective Multimedia Quality Assessment
* On the Quality of Service of Cloud Gaming Systems
* Online HodgeRank on Random Graphs for Crowdsourceable QoE Evaluation
* Optimized Motion Energy Estimation for Group of Pictures in Multi-Level Error Protection of H.264/AVC Video Bitstreams
* ParCast+: Parallel Video Unicast in MIMO-OFDM WLANs
* Parsing the Hand in Depth Images
* Per-Cluster Ensemble Kernel Learning for Multi-Modal Image Clustering With Group-Dependent Feature Selection
* Person Identity Label Propagation in Stereo Videos
* Personalized Geo-Specific Tag Recommendation for Photos on Social Websites
* Physical Metaphor for Streaming Media Retargeting
* PicWords: Render a Picture by Packing Keywords
* Point Cloud Encoding for 3D Building Model Retrieval
* Point of Interest Detection and Visual Distance Estimation for Sensor-Rich Video
* Post-Processing for Blocking Artifact Reduction Based on Inter-Block Correlation
* Predicting Failing Queries in Video Search
* Prior-Free Weighting Scheme for Binary Code Ranking, A
* Prototype-Based Modeling for Facial Expression Analysis
* Quaternionic Signal Processing Techniques for Automatic Evaluation of Dance Performances From MoCap Data
* Random Network Coding for Multimedia Delivery Services in LTE/LTE-Advanced
* Rate-Distortion Optimized Mode Switching for Error-Resilient Multi-View Video Plus Depth Based 3-D Video Coding
* Recursive On-Line 2D PCA and Its Application to Long-Term Background Subtraction
* Reducing Operational Costs in Cloud Social TV: An Opportunity for Cloud Cloning
* Regularity Preserved Superpixels and Supervoxels
* Relevant Window-Based Bitmap Compression in P2P Systems: Framework and Solution
* Representative Discovery of Structure Cues for Weakly-Supervised Image Segmentation
* Resource Allocation for Personalized Video Summarization
* Reversible Data Hiding in Encrypted JPEG Bitstream
* Robust Multi-Speaker Tracking via Dictionary Learning and Identity Modeling
* Robust Semi-Automatic Depth Map Generation in Unconstrained Images and Video Sequences for 2D to Stereoscopic 3D Conversion
* Scalable Mobile Visual Classification by Kernel Preserving Projection Over High-Dimensional Features
* Screen Content Coding Based on HEVC Framework
* Self-Learning Based Image Decomposition With Applications to Single Image Denoising
* Self-Sorting Map: An Efficient Algorithm for Presenting Multimedia Data in Structured Layouts
* Semi-Supervised Multiple Feature Analysis for Action Recognition
* Similarity Assessment Model for Chinese Sign Language Videos
* Simple and Efficient Re-Scrambling Scheme for DTV Programs, A
* Simple Method to Determine if a Music Information Retrieval System is a Horse, A
* Simultaneous-Speaker Voice Activity Detection and Localization Using Mid-Fusion of SVM and HMMs
* Social Image Analysis From a Non-IID Perspective
* Socialized Mobile Photography: Learning to Photograph With Social Context via Mobile Devices
* Solving a Special Type of Jigsaw Puzzles: Banknote Reconstruction From a Large Number of Fragments
* Space-Time Facet Model for Human Activity Classification
* Sparse Multi-Modal Hashing
* Sphere Image for 3-D Model Retrieval
* Sport Type Classification of Mobile Videos
* Standard-Compliant Low-Pass Temporal Filter to Reduce the Perceived Flicker Artifact
* Stationary Probability Model for Microscopic Parallelism in JPEG2000
* Systematic Evaluation of the Bag-of-Frames Representation for Music Information Retrieval, A
* Texture Modeling Using Contourlets and Finite Mixtures of Generalized Gaussian Distributions and Applications
* Topic-Sensitive Influencer Mining in Interest-Based Social Media Networks via Hypergraph Learning
* Touch Saliency: Characteristics and Prediction
* Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search
* Towards Mobile Document Image Retrieval for Digital Library
* Trace Transform Based Method for Color Image Domain Identification
* UMSM: A Traffic Reduction Method on Multi-View Video Streaming for Multiple Users
* Unified Framework of Latent Feature Learning in Social Media, A
* Unsupervised Music Structure Annotation by Time Series Structure Features and Segment Similarity
* Using Audio-Derived Affective Offset to Enhance TV Recommendation
* Using Dynamically Promoted Experts for Music Recommendation
* Variational Bayesian Methods For Multimedia Problems
* Video Activity-Based Traffic Policing: A New Paradigm
* Video Annotation via Image Groups from the Web
* Video Event Detection Using Motion Relativity and Feature Selection
* Video Object Co-Segmentation via Subspace Clustering and Quadratic Pseudo-Boolean Optimization in an MRF Framework
* Visual Protection of HEVC Video by Selective Encryption of CABAC Binstrings
* Weakly Supervised Multi-Graph Learning for Robust Image Reranking
* Weakly Supervised Photo Cropping
190 for MultMed(16)
MultMed(17)
* Accuracy of Subjects in a Quality Experiment: A Theoretical Subject Model, The
* Adaptive Optimal Shape Prior for Easy Interactive Object Segmentation
* Adaptive Prioritized Random Linear Coding and Scheduling for Layered Data Delivery From Multiple Servers
* Adaptive Scalable Video Transmission Strategy in Energy Harvesting Communication System
* Anchor View Allocation for Collaborative Free Viewpoint Video Streaming
* Asymmetric Cyclical Hashing for Large Scale Image Retrieval
* Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering
* Author Topic Model-Based Collaborative Filtering for Personalized POI Recommendations
* Automatic Recognition of Emergent Social Roles in Small Group Interactions
* Automatic Visual Concept Learning for Social Event Understanding
* Auxiliary Metadata Delivery in View Synthesis Using Depth No-Synthesis-Error Model
* Barcode Modulation Method for Data Transmission in Mobile Devices
* Battery Aware Video Delivery Techniques Using Rate Adaptation and Base Station Reconfiguration
* Beyond Multimedia Adaptation: Quality of Experience-Aware Multi-Sensorial Media Delivery
* Biased Discriminant Analysis With Feature Line Embedding for Relevance Feedback-Based Image Retrieval
* Bucket-Filling: An Asymptotically Optimal Video-on-Demand Network With Source Coding
* Characterization of SURF and BRISK Interest Point Distribution for Distributed Feature Extraction in Visual Sensor Networks
* Cloud-Assisted Live Streaming for Crowdsourced Multimedia Content
* Cloud-Based Multimedia Content Protection System
* Compact Image Fingerprint Via Multiple Kernel Hashing
* Competence-Based Song Recommendation: Matching Songs to One's Singing Skill
* Connection Discovery Using Big Data of User-Shared Images in Social Media
* Content-Aware Video2Comics With Manga-Style Layout
* Content-Based Video Quality Prediction for HEVC Encoded Videos Streamed Over Packet Networks
* Context-Adaptive Binary Arithmetic Coding With Fixed-Length Codewords
* Contextual Online Learning for Multimedia Content Aggregation
* Continuous Learning Framework for Activity Recognition Using Deep Hybrid Feature Models, A
* Control-Theoretic Approach to Adaptive Video Streaming in Dense Wireless Networks, A
* Controlling a Robotic Fish Via a Natural User Interface for Informal Science Education
* Covariance-Based Descriptors for Efficient 3D Shape Matching, Retrieval, and Classification
* CPCDN: Content Delivery Powered by Context and User Intelligence
* Cross Indexing With Grouplets
* Cross-Domain Feature Learning in Multimedia
* Cross-Layer Resource Allocation for Video Streaming Over OFDMA Cognitive Radio Networks
* Cross-OSN User Modeling by Homogeneous Behavior Quantification and Local Social Regularization
* Cross-Platform Multi-Modal Topic Modeling for Personalized Inter-Platform Recommendation
* Database Saliency for Fast Image Retrieval
* Deep Head Pose: Gaze-Direction Estimation in Multimodal Video
* Deep Learning and Music Adversaries
* Deep Multimodal Learning for Affective Analysis and Retrieval
* DeepBag: Recognizing Handbag Models
* Demonstration of OpenFlow-Controlled Network Orchestration for Adaptive SVC Video Manycast
* Depth Sensation Enhancement for Multiple Virtual View Rendering
* Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks
* Detection and Classification of Acoustic Scenes and Events
* Disparity Vector Correction for View Synthesis Prediction-Based 3-D Video Transmission
* Distributed Online Hybrid Cloud Management for Profit-Driven Multimedia Cloud Computing
* Dynamic Time Warping for Music Conducting Gestures Evaluation
* Effective Image Retrieval System Using Dot-Diffused Block Truncation Coding Features
* Efficient 3-D Scene Prefetching From Learning User Access Patterns
* Efficient Cascaded Filtering Retrieval Method for Big Audio Data, An
* Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection
* Efficient In-Loop Filtering Across Tile Boundaries for Multi-Core HEVC Hardware Decoders With 4 K/8 K-UHD Video Applications
* Efficient Inter-View Bit Allocation Methods for Stereo Image Coding
* Efficient Mining of Optimal AND/OR Patterns for Visual Recognition
* Efficient QR Code Beautification With High Quality Visual Content
* Enabling Enriched TV Shopping Experience via Computational and Temporal Aware View-Centric Multimedia Abstraction
* Energy-Efficient Coarse-Grained Reconfigurable Processing Unit for Multiple-Standard Video Decoding, An
* Energy-Efficient Coarse-Grained Reconfigurable Processing Unit for Multiple-Standard Video Decoding, An
* Energy-Efficient HTTP Adaptive Video Streaming With Networking Cost Constraint Over Heterogeneous Wireless Networks, An
* Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base
* Estimation of Signal Distortion Using Effective Sampling Density for Light Field-Based Free Viewpoint Video
* EventMask: A Game-Based Framework for Event-Saliency Identification in Images
* Exploitation and Exploration Balanced Hierarchical Summary for Landmark Images
* Exploiting the Deep-Link Commentsphere to Support Non-Linear Video Access
* Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss
* Face Recognition and Retrieval Using Cross-Age Reference Coding With Cross-Age Celebrity Dataset
* Faithful Disocclusion Filling in Depth Image Based Rendering Using Superpixel-Based Inpainting
* Fashion Parsing With Video Context
* Fast HEVC Inter CU Decision Based on Latent SAD Estimation
* Fast Image Retrieval: Query Pruning and Early Termination
* Fast Object Retrieval Using Direct Spatial Matching
* Fine-Grained Image Search
* Framework for Composition and Enforcement of Privacy-Aware and Context-Driven Authorization Mechanism for Multimedia Big Data, A
* Geolocalized Modeling for Dish Recognition
* Gestalt Rule Feature Points
* Global-Scale Location Prediction for Social Images Using Geo-Visual Ranking
* Guest Editorial Multimedia: The Biggest Big Data
* Guest Editorial: Deep Learning for Multimedia Computing
* Hash-Based Block Matching for Screen Content Coding
* Head Motion Modeling for Human Behavior Analysis in Dyadic Interaction
* Hessian Semi-Supervised Sparse Feature Selection Based on L_2,1/2 -Matrix Norm
* Heterogeneous Feature Selection With Multi-Modal Deep Neural Networks and Sparse Group LASSO
* Hybrid Mobile Visual Search System With Compact Global Signatures, A
* Improving Multimedia Content Delivery via Augmentation With Social Information: The Social Prefetcher Approach
* Intelligent Acoustic Interfaces With Multisensor Acquisition for Immersive Reproduction
* Interactive Multimodal Learning for Venue Recommendation
* Interactive Streaming of Sequences of High Resolution JPEG2000 Images
* Joint Online Transcoding and Delivery Approach for Dynamic Adaptive Streaming, A
* Joint Super Resolution and Denoising From a Single Depth Image
* Joint Time-Domain Resource Partitioning, Rate Allocation, and Video Quality Adaptation in Heterogeneous Cellular Networks
* Knowing Verb From Object: Retagging With Transfer Learning on Verb-Object Concept Images
* Landmark Classification With Hierarchical Multi-Modal Exemplar Feature
* Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition
* Large-Scale Image Retrieval Based on Compressed Camera Identification
* Learning Compact Hash Codes for Multimodal Representations Using Orthogonal Deep Structure
* Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval
* Learning Cross Space Mapping via DNN Using Large Scale Click-Through Logs
* Learning Feature Hierarchies: A Layer-Wise Tag-Embedded Approach
* Learning Representative Deep Features for Image Set Analysis
* Learning Spatial and Temporal Extents of Human Actions for Action Detection
* Learning-Based Joint Super-Resolution and Deblocking for a Highly Compressed Image
* Let Your Body Speak: Communicative Cue Extraction on Natural Interaction Using RGBD Data
* Loss Visibility Optimized Real-Time Video Transmission Over MIMO Systems
* Mining Latent Attributes From Click-Through Logs for Image Recognition
* Multi-Resolution Disparity Processing and Fusion for Large High-Resolution Stereo Image
* Multi-Task CNN Model for Attribute Prediction
* Multi-View Video Summarization Using Bipartite Matching Constrained Optimum-Path Forest Clustering
* Multifaceted Approach to Social Multimedia-Based Prediction of Elections, A
* Multimedia Summarization for Social Events in Microblog Stream
* Multimodal Multi-Channel On-Line Speaker Diarization Using Sensor Fusion Through SVM
* Multiple Emotion Tagging for Multimedia Data by Exploiting High-Order Dependencies Among Emotions
* New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video, A
* Non-Rigid Structure-From-Motion on Degenerate Deformations With Low-Rank Shape Deformation Model
* Novel Efficient HEVC Decoding Solution on General-Purpose Processors
* Novel No-Reference Video Quality Metric for Evaluating Temporal Jerkiness due to Frame Freezing, A
* Novel Traffic Rate Measurement Algorithm for Quality of Experience-Aware Video Admission Control, A
* Object Tracking With Multi-View Support Vector Machines
* On Achieving Short Channel Switching Delay and Playback Lag in IP-Based TV Systems
* On Generating Content-Oriented Geo Features for Sensor-Rich Outdoor Video Search
* On-Road Pedestrian Tracking Across Multiple Driving Recorders
* Optimized Comics-Based Storytelling for Temporal Image Sequences
* Optimized Packet Scheduling in Multiview Video Navigation Systems
* Optimizing HTTP-Based Adaptive Streaming in Vehicular Environment Using Markov Decision Process
* Partial-Duplicate Clustering and Visual Pattern Discovery on Web Scale Image Database
* Pattern-Based Near-Duplicate Video Retrieval and Localization on Web-Scale Videos
* Perceived Synchronization of Mulsemedia Services
* Perceptual Quality Assessment for 3D Triangle Mesh Based on Curvature
* PixNet: A Localized Feature Representation for Classification and Visual Search
* Predicting Eye Fixations on Webpage With an Ensemble of Early Features and High-Level Representations from Deep Network
* Predictive Texture Synthesis-Based Intra Coding Scheme for Advanced Video Coding
* Probabilistic Skimlets Fusion for Summarizing Multiple Consumer Landmark Videos
* Profit Optimization for Wireless Video Broadcasting Systems Based on Polymatroidal Analysis
* Pseudo-Multiple-Exposure-Based Tone Fusion With Local Region Adjustment
* Query Difficulty Estimation for Image Search With Query Reconstruction Error
* Query-Dependent Aesthetic Model With Deep Learning for Photo Quality Assessment
* Rate and Power Allocation for Joint Coding and Transmission in Wireless Video Chat Applications
* Rate Distortion Optimized Inter-View Frame Level Bit Allocation Method for MV-HEVC
* Rating Image Aesthetics Using Deep Learning
* Real-Time Piano Music Transcription Based on Computer Vision
* Recognition of Genuine Smiles
* Reduced Reference Stereoscopic Image Quality Assessment Based on Binocular Perceptual Information
* Relational User Attribute Inference in Social Media
* Retargeting Semantically-Rich Photos
* RGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge
* Robust Face Recognition via Multimodal Deep Face Representation
* Secure and Robust Two-Phase Image Authentication
* Semantic-Based Location Recommendation With Multimodal Venue Semantics
* Semantic-Improved Color Imaging Applications: It Is All About Context
* Simple Countermeasures to Mitigate the Effect of Pollution Attack in Network Coding-Based Peer-to-Peer Live Streaming
* Sketch-Based Image Retrieval Through Hypothesis-Driven Object Boundary Selection With HLR Descriptor
* Smart Streaming for Online Video Services
* Spatio-Temporal Video Segmentation of Static Scenes and Its Applications
* Spatio-Temporally Consistent Color and Structure Optimization for Multiview Video Color Correction
* Structure-Preserving Hybrid Digital-Analog Video Delivery in Wireless Networks
* Structured Visual Feature Learning for Classification via Supervised Probabilistic Tensor Factorization
* Structured-Patch Optimization for Dense Correspondence
* Study of Multimodal Addressee Detection in Human-Human-Computer Interaction, A
* Super Fast Event Recognition in Internet Videos
* Superpixel-Based Hand Gesture Recognition With Kinect Depth Camera
* TCD-TIMIT: An Audio-Visual Corpus of Continuous Speech
* Tennis Ball Tracking Using a Two-Layered Data Association Approach
* Topological Spatial Verification for Instance Search
* Towards Cost-Efficient Video Transcoding in Media Cloud: Insights Learned From User Viewing Patterns
* Towards Effective Image Classification Using Class-Specific Codebooks and Distinctive Local Features
* Towards Practical Self-Embedding for JPEG-Compressed Digital Images
* Transition of Visual Attention Assessment in Stereoscopic Images With Evaluation of Subjective Visual Quality and Discomfort
* Tri-Subject Kinship Verification: Understanding the Core of A Family
* Unconstrained Multimodal Multi-Label Learning
* Understanding Blooming Human Groups in Social Networks
* Uniting Keypoints: Local Visual Information Fusion for Large-Scale Image Search
* Unravelling the Impact of Temporal and Geographical Locality in Content Caching Systems
* Unreeling Xunlei Kankan: Understanding Hybrid CDN-P2P Video-on-Demand Streaming
* Unsupervised Celebrity Face Naming in Web Videos
* Unsupervised Web Topic Detection Using A Ranked Clustering-Like Pattern Across Similarity Cascades
* Uploader Intent for Online Video: Typology, Inference, and Applications
* Using Free Energy Principle For Blind Image Quality Assessment
* Utility-Based H.264/SVC Video Streaming Over Multi-Channel Cognitive Radio Networks
* Utility-Based Optimized Cross-Layer Scheme for Real-Time Video Transmission Over HSDPA
* Video Delivery Performance of a Large-Scale VoD System and the Implications on Content Delivery
* Video Object Segmentation Via Dense Trajectories
* Video Popularity Dynamics and Its Implication for Replication
* Visual Object Tracking by Structure Complexity Coefficients
* Visual Tracking Using Strong Classifier and Structural Local Sparse Descriptors
* Weakly Supervised Deep Metric Learning for Community-Contributed Image Retrieval
* Weighted Component Hashing of Binary Aggregated Descriptors for Fast Visual Search
* Wireless Video Multicast With Cooperative and Incremental Transmission of Parity Packets
* Word-of-Mouth Understanding: Entity-Centric Multimodal Aspect-Opinion Mining in Social Media
* YouTube Video Promotion by Cross-Network Association: @Britney to Advertise Gangnam Style
189 for MultMed(17)
MultMed(18)
* 3D Ear Identification Using Block-Wise Statistics-Based Features and LC-KSVD
* 6-DOF Image Localization From Massive Geo-Tagged Reference Images
* Adaptive Video Streaming With Optimized Bitstream Extraction and PID-Based Quality Control
* All-Zero Block Detection Scheme for Low-Complexity HEVC Encoders, An
* Analytics-Driven Visualization on Digital Directory via Screen-Smart Device Interactions
* Animal Detection From Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification
* Animating Still Landscape Photographs Through Cloud Motion Creation
* Audio Recapture Detection With Convolutional Neural Networks
* Audiovisual Spatial-Audio Analysis by Means of Sound Localization and Imaging: A Multimedia Healthcare Framework in Abdominal Sound Mapping
* Background Basis Selection-Based Foreground Detection Method, A
* Background Subtraction Using Background Sets With Image- and Color-Space Reduction
* Bandwidth-Efficient Packet Scheduling for Live Streaming With Network Coding
* Bi-level Protected Compressive Sampling
* Binocular Responses for No-Reference 3D Image Quality Assessment
* Blind Image Quality Assessment Using Statistical Structural and Luminance Features
* Blind Quality Assessment of Tone-Mapped Images Via Analysis of Information, Naturalness, and Structure
* Bridging Music and Image via Cross-Modal Ranking Analysis
* CCR: Clustering and Collaborative Representation for Fast Single Image Super-Resolution
* Characterization of Band Codes for Pollution-Resilient Peer-to-Peer Video Streaming
* Classification-Based Record Linkage With Pseudonymized Data for Epidemiological Cancer Registries
* Clothes Co-Parsing Via Joint Image Segmentation and Labeling With Application to Clothing Retrieval
* Clothing Cosegmentation for Shopping Images With Cluttered Background
* Cloud-Based Actor Identification With Batch-Orthogonal Local-Sensitive Hashing and Sparse Representation
* Clustering-Based Content Adaptive Tiles Under On-chip Memory Constraints
* Collaborative Wireless Freeview Video Streaming With Network Coding
* Combined Deblocking Filter and SAO Hardware Architecture for HEVC, A
* Comparison and Evaluation of Sonification Strategies for Guidance Tasks
* Complexity Control Based on a Fast Coding Unit Decision Method in the HEVC Video Coding Standard
* Compressed-Sensed-Domain L1-PCA Video Surveillance
* Computational Model for Object-Based Visual Saliency: Spreading Attention Along Gestalt Cues, A
* ConfidentCare: A Clinical Decision Support System for Personalized Breast Cancer Screening
* Consistent Coding Scheme for Single-Image Super-Resolution Via Independent Dictionaries
* Constellation Design Methodology Based on QoS and User Demand in High-Altitude Platform Broadband Networks, A
* Content-Based Guided Image Filtering, Weighted Semi-Global Optimization, and Efficient Disparity Refinement for Fast and Accurate Disparity Estimation
* Context-Aware Framework for Reducing Bandwidth Usage of Mobile Video Chats, A
* Context-Aware Hypergraph Modeling for Re-identification and Summarization
* Coping With Heterogeneous Video Contributors and Viewers in Crowdsourced Live Streaming: A Cloud-Based Approach
* Core Failure Mitigation in Integer Sum-of-Product Computations on Cloud Computing Systems
* Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation
* Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation
* Cross-Modal Retrieval via Deep and Bidirectional Representation Learning
* CSPS: An Adaptive Pooling Method for Image Classification
* DAC-Mobi: Data-Assisted Communications of Mobile Images with Cloud Computing Support
* Data Hiding Robust to Mobile Communication Vocoders
* Data-Driven Crowd Understanding: A Baseline for a Large-Scale Crowd Dataset
* Dealing With User Heterogeneity in P2P Multi-Party Video Conferencing: Layered Distribution Versus Partitioned Simulcast
* Decision-Tree-Based Perceptual Video Quality Prediction Model and Its Application in FEC for Wireless Multimedia Communications, A
* Deep Aging Face Verification With Large Gaps
* Deep Learning for Surface Material Classification Using Haptic and Visual Information
* Deep Neural Network-Driven Feature Learning Method for Multi-view Facial Expression Recognition, A
* Deep Relative Attributes
* Delay-Optimized Video Traffic Routing in Software-Defined Interdatacenter Networks
* Democratic Diffusion Aggregation for Image Retrieval
* Depth Map Down-Sampling and Coding Based on Synthesized View Distortion
* Differentially Private Online Learning for Cloud-Based Video Recommendation With Multimedia Big Data in Social Networks
* Discriminative Dictionary Learning With Common Label Alignment for Cross-Modal Retrieval
* Distance-Computation-Free Search Scheme for Binary Code Databases, A
* Do Personality and Culture Influence Perceived Video Quality and Enjoyment?
* DPcode: Privacy-Preserving Frequent Visual Patterns Publication on Cloud
* Effective Active Skeleton Representation for Low Latency Human Action Recognition
* Efficient Bit Rate Transcoding for High Efficiency Video Coding
* Efficient Cache Placement Strategy in Two-Tier Wireless Content Delivery Network
* Efficient Image Sharpness Assessment Based on Content Aware Total Variation
* Efficient Residual DPCM Using an L_1 Robust Linear Prediction in Screen Content Video Coding
* Efficient Summarization From Multiple Georeferenced User-Generated Videos
* Enabling Secure and Fast Indexing for Privacy-Assured Healthcare Monitoring via Compressive Sensing
* Energy-Aware and Bandwidth-Efficient Hybrid Video Streaming Over Mobile Networks
* Energy-Efficient Resource Allocation Optimization for Multimedia Heterogeneous Cloud Radio Access Networks
* Error Mitigation Technique for Erasure Channels Based on a Wavelet Representation of the Speech Excitation Signal, An
* Estimating 3D Gaze Directions Using Unlabeled Eye Images via Synthetic Iris Appearance Fitting
* Estimating Snow Cover From Publicly Available Images
* Exemplar-AMMs: Recognizing Crowd Movements From Pedestrian Trajectories
* Exploiting Perceptual Anchoring for Color Image Enhancement
* Face and Hair Region Labeling Using Semi-Supervised Spectral Clustering-Based Multiple Segmentations
* Factorization Algorithms for Temporal Psychovisual Modulation Display
* Fast Covariant VLAD for Image Search
* Fast Learning-Based Single Image Super-Resolution
* Filtering of Brand-Related Microblogs Using Social-Smooth Multiview Embedding
* Flickr Circles: Aesthetic Tendency Discovery by Multi-View Regularized Topic Modeling
* Folksonomy-Based Visual Ontology Construction and Its Applications
* Frame Interpolation for Cloud-Based Mobile Video Streaming
* Free-Energy Principle Inspired Video Quality Metric and Its Use in Video Coding
* Game Theoretic Resource Allocation in Media Cloud With Mobile Social Users
* GameFlow: Narrative Visualization of NBA Basketball Games
* Geometric Approach to Server Selection for Interactive Video Streaming, A
* Guest Editorial: Cloud-Based Video Processing and Content Sharing
* Guest Editorial: Multimedia-Based Healthcare
* Guest Editorial: Visual Analytics in Multimedia: Opportunities and Research Challenges
* Guided Image Contrast Enhancement Based on Retrieved Images in Cloud
* HEMS: Hierarchical Exemplar-Based Matching-Synthesis for Object-Aware Image Reconstruction
* Hierarchical Visualization of Video Search Results for Topic-Based Browsing
* High-Throughput and Multi-Parallel VLSI Architecture for HEVC Deblocking Filter, A
* High-Throughput Hardware Design of a One-Dimensional SPIHT Algorithm, A
* Higher-Order Image Co-segmentation
* Hirability in the Wild: Analysis of Online Conversational Video Resumes
* Holons Visual Representation for Image Retrieval
* Human Visual System-Based Saliency Detection for High Dynamic Range Content
* Hybrid Zero Block Detection for High Efficiency Video Coding
* Image Classification by Cross-Media Active Learning with Privileged Information
* Image Classification by Selective Regularized Subspace Learning
* Image Co-segmentation via Saliency Co-fusion
* Image Interpolation Based on Non-local Geometric Similarities and Directional Gradients
* Image Retargeting for Preserving Robust Local Feature: Application to Mobile Visual Search
* Image Sharpness Assessment by Sparse Representation
* In-Network View Synthesis for Interactive Multiview Video Systems
* Inter-Prediction Optimizations for Video Coding Using Adaptive Coding Unit Visiting Order
* Interactive Multilabel Image Segmentation via Robust Multilayer Graph Constraints
* Interactive Spiral Tape Video Summarization, An
* Joint Inference of Objects and Scenes With Efficient Learning of Text-Object-Scene Relations
* Kernel Combined Sparse Representation for Disease Recognition
* Keypoint Detection in RGBD Images Based on an Anisotropic Scale Space
* Keypoint Encoding for Improved Feature Extraction From Compressed Video at Low Bitrates
* Knowledge-Based Coding of Objects for Multisource Surveillance Video Data
* lambda-Domain Rate Control Algorithm for HEVC Scalable Extension
* Learning Blind Quality Evaluator for Stereoscopic Images Using Joint Sparse Representation
* Learning Cascaded Deep Auto-Encoder Networks for Face Alignment
* Learning Geographical Hierarchy Features via a Compositional Model
* Learning Personalized Models for Facial Expression Analysis and Gesture Recognition
* Link Adaptation for High-Quality Uncompressed Video Streaming in 60-GHz Wireless Networks
* Locality Sensitive Low-Rank Model for Image Tag Completion, A
* Looking Into Saliency Model via Space-Time Visualization
* Low-Power Video Recording System With Multiple Operation Modes for H.264 and Light-Weight Compression, A
* mDASH: A Markov Decision-Based Rate Adaptation Approach for Dynamic HTTP Streaming
* Mean-Shift and Sparse Sampling-Based SMC-PHD Filtering for Audio Informed Visual Speaker Tracking
* Media Query Processing for the Internet-of-Things: Coupling of Device Energy Consumption and Cloud Infrastructure Billing
* Modeling Dynamics of Online Video Popularity
* Monet: A System for Reliving Your Memories by Theme-Based Photo Storytelling
* MoshViz: A Detail+Overview Approach to Visualize Music Elements
* Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation
* Multi-Modal Event Topic Model for Social Event Analysis
* Multi-Perspective Cost-Sensitive Context-Aware Multi-Instance Sparse Coding and Its Application to Sensitive Video Recognition
* Multimedia Pivot Tables for Multimedia Analytics on Image Collections
* Multimodal Personality Recognition in Collaborative Goal-Oriented Tasks
* Multimodal Web Aesthetics Assessment Based on Structural SVM and Multitask Fusion Learning
* Multiple Human Identification and Cosegmentation: A Human-Oriented CRF Approach With Poselets
* Multiple Stage Residual Model for Image Classification and Vector Compression
* Multiple Video Delivery in m-Health Emergency Applications
* Multiplicative Watermark Decoder in Contourlet Domain Using the Normal Inverse Gaussian Distribution
* Multiview and 3D Video Compression Using Neighboring Block Based Disparity Vectors
* Multiview Skeletal Interaction Recognition Using Active Joint Interaction Graph
* Muscular Movement Model-Based Automatic 3D/4D Facial Expression Recognition
* Neyman-Pearson-Based Early Mode Decision for HEVC Encoding
* No-Reference Retargeted Image Quality Assessment Based on Pairwise Rank Learning
* Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion, A
* Novel UEP Fountain Coding Scheme for Scalable Multimedia Transmission, A
* Object Instance Search in Videos via Spatio-Temporal Trajectory Discovery
* On Branded Handbag Recognition
* On Constructing z -Dimensional DIBR-Synthesized Images
* On Data-Driven Delay Estimation for Media Cloud
* On Evaluating Perceptual Quality of Online User-Generated Videos
* On the Optimal Linear Network Coding Design for Information Theoretically Secure Unicast Streaming
* Optimal Incentive Design for Cloud-Enabled Multimedia Crowdsourcing
* Optimality of Greedy Algorithm for Generating Just-Noticeable Difference Surfaces
* Perceiving Graphical and Pictorial Information via Hearing and Touch
* Perceptual Annoyance Models for Videos With Combinations of Spatial and Temporal Artifacts
* Person Reidentification via Ranking Aggregation of Similarity Pulling and Dissimilarity Pushing
* PhenoTree: Interactive Visual Analytics for Hierarchical Phenotyping From Large-Scale Electronic Health Records
* Predicting the Performance in Decision-Making Tasks: From Individual Cues to Group Interaction
* Probabilistic Approach for Predicting the Size of Coding Units in the Quad-Tree Structure of the Quality and Spatial Scalable HEVC
* Pseudo 2D String Matching Technique for High Efficiency Screen Content Coding
* QoE Evaluation of Multimedia Services Based on Audiovisual Quality and User Interest
* Quadtree Degeneration for HEVC
* Quality of Experience Driven Multi-User Video Streaming in Cellular Cognitive Radio Networks With Single Channel Access
* Query-Adaptive Small Object Search Using Object Proposals and Shape-Aware Descriptors
* Rating Prediction Based on Social Sentiment From Textual Reviews
* Region-Aware 3-D Warping for DIBR
* Reliable Methodology to Collect Ground Truth Data of Image Aesthetic Appeal, A
* Resource Allocation With Video Traffic Prediction in Cloud-Based Space Systems
* Resource-Efficient Mobile Multimedia Streaming With Adaptive Network Selection
* Reversible Data Hiding in Encrypted Images by Reversible Image Transformation
* Robust DT CWT-Based DIBR 3D Video Watermarking Using Chrominance Embedding
* Robust Fingertip Detection in a Complex Environment
* Robust Latent Poisson Deconvolution From Multiple Features for Web Topic Detection
* SALIC: Social Active Learning for Image Classification
* Saliency-Guided Quality Assessment of Screen Content Images
* Scalable Video Event Retrieval by Visual State Binary Embedding
* Semantic Discriminative Metric Learning for Image Similarity Measurement
* Semi-Supervised Bi-Dictionary Learning for Image Classification With Smooth Representation-Based Label Propagation
* Sensing Matrix Optimization Based on Equiangular Tight Frames With Consideration of Sparse Representation Error
* Significance Evaluation of Video Data Over Media Cloud Based on Compressed Sensing
* Sketch-Based Image Retrieval by Salient Contour Reinforcement
* Social Diffusion Analysis With Common-Interest Model for Image Annotation
* Social Friend Recommendation Based on Multiple Network Correlation
* Sparse Kernel Reduced-Rank Regression for Bimodal Emotion Recognition From Facial Expression and Speech
* Sparse Pose Regression via Componentwise Clustering Feature Point Representation
* Spin Contour
* SSIM-Based Game Theory Approach for Rate-Distortion Optimized Intra Frame CTU-Level Bit Allocation
* Survey on Visual Analytics of Social Media Data, A
* Tag-Based Image Search by Social Re-ranking
* TagBook: A Semantic Video Representation Without Supervision for Event Detection
* Task-Driven Progressive Part Localization for Fine-Grained Object Recognition
* Tensor Manifold Discriminant Projections for Acceleration-Based Human Activity Recognition
* Tiling in Interactive Panoramic Video: Approaches and Evaluation
* Time-Domain Attribute-Based Access Control for Cloud-Based Video Content Sharing: A Cryptographic Approach
* Toward Cost-Efficient Content Placement in Media Cloud: Modeling and Analysis
* Trend-Aware Video Caching Through Online Learning
* Universal Framework for Salient Object Detection, A
* User-Service Rating Prediction by Exploring Social Users' Rating Behaviors
* View-Level Rate Distortion Model for Multi-View/3D Video, A
* Visual Analytics of Political Networks From Face-Tracking of News Video
* Visual Movie Analytics
* Visual Understanding via Multi-Feature Shared Learning With Global Consistency
* Visual Voice Activity Detection in the Wild
* Visualization-Based Active Learning for Video Annotation
* Visualizing and Analyzing Video Content With Interactive Scalable Maps
* Zero-Shot Person Re-identification via Cross-View Consistency
206 for MultMed(18)
MultMed(19)
* Accelerating Image-Domain-Warping Virtual View Synthesis on GPGPU
* Accurate Depth Extraction Method for Multiple Light-Coding-Based Depth Cameras
* Active Sampling Exploiting Reliable Informativeness for Subjective Image Quality Assessment Based on Pairwise Comparison
* Adaptive Fusion Algorithm for Visible and Infrared Videos Based on Entropy and the Cumulative Distribution of Gray Levels, An
* Adaptive LSTAR Model for Long-Range Variable Bit Rate Video Traffic Prediction
* Adaptive Video Streaming With Network Coding Enabled Named Data Networking
* Analog Coded SoftCast: A Network Slice Design for Multimedia Broadcast/Multicast
* Asymmetric Binary Coding for Image Search
* Attentive Contexts for Object Detection
* Audio Identification by Sampling Sub-fingerprints and Counting Matches
* Automated Online Exam Proctoring
* Automatic Mesh Animation Preview With User Voting-Based Refinement
* Automatic Synchronization of Multi-user Photo Galleries
* Background-Driven Salient Object Detection
* Bayesian Hierarchical Regression Models for QoE Estimation and Prediction in Audiovisual Communications
* Being a Supercook: Joint Food Attributes and Multimodal Content Modeling for Recipe Retrieval and Exploration
* Blind Image Quality Assessment Based on Rank-Order Regularized Regression
* Blind Stereo Quality Assessment Based on Learned Features From Binocular Combined Images
* Cartoon and Texture Decomposition-Based Color Transfer for Fabric Images
* Collective First-Person Vision for Automatic Gaze Analysis in Multiparty Conversations
* Color Enhancement With Adaptive Illumination Estimation for Low-Backlighted Displays
* Color-Guided Depth Recovery via Joint Local Structural and Nonlocal Low-Rank Regularization
* Compact Hash Codes for Efficient Visual Descriptors Retrieval in Large Scale Databases
* Compete or Collaborate: Architectures for Collaborative DASH Video Over Future Networks
* Comprehensive Feature-Based Robust Video Fingerprinting Using Tensor Model
* Compressed Sensing for Efficient Encoding of Dense 3D Meshes Using Model-Based Bayesian Learning
* Context-Associative Hierarchical Memory Model for Human Activity Recognition and Prediction
* Continuous Probability Distribution Prediction of Image Emotions via Multitask Shared Sparse Regression
* Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling, A
* Cost-Effective Low-Delay Design for Multiparty Cloud Video Conferencing
* Cross-Layer Resource Allocation for Scalable Video Over OFDMA Wireless Networks: Tradeoff Between Quality Fairness and Efficiency
* Cross-Modal Hashing via Rank-Order Preserving
* Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning
* Crowdsourcing Thousands of Specialized Labels: A Bayesian Active Training Approach
* CrowdTranscoding: Online Video Transcoding With Massive Viewers
* Dancelets Mining for Video Recommendation Based on Dance Styles
* DCAR: A Discriminative and Compact Audio Representation for Audio Processing
* Deep Coupled Metric Learning for Cross-Modal Matching
* Deep Multimetric Learning for Shape-Based 3D Model Retrieval
* Deep Video Hashing
* Depth-Preserving Stereo Image Retargeting Based on Pixel Fusion
* Detecting Dominant Vanishing Points in Natural Scenes with Application to Composition-Sensitive Image Retrieval
* Detecting Low-Quality Workers in QoE Crowdtesting: A Worker Behavior-Based Approach
* Dictionary Learning-Based 3D Morphable Shape Model, A
* Discrete Multimodal Hashing With Canonical Views for Robust Mobile Landmark Search
* Discriminative Multi-instance Multitask Learning for 3D Action Recognition
* Distributed Compressive Sensing for Cloud-Based Wireless Image Transmission
* Distributed Content Based Video Identification in Peer-to-Peer Networks: Requirements and Solutions
* Diversified Visual Attention Networks for Fine-Grained Object Classification
* Dynamic Adaptive Video Streaming: Towards a Systematic Comparison of ICN and TCP/IP
* Dynamic Manga: Animating Still Manga via Camera Movement
* Dynamic Topic Model and Matrix Factorization-Based Travel Recommendation Method Exploiting Ubiquitous Data, A
* Edge Caching for Layered Video Contents in Mobile Social Networks
* Efficient Unsupervised Temporal Segmentation of Motion Data
* Estimating Heart Rate and Rhythm via 3D Motion Tracking in Depth Video
* Exploiting Web Images for Dataset Construction: A Domain Robust Approach
* Exploring Viewer Gazing Patterns for Touch-Based Mobile Gamecasting
* Fast Algorithm and VLSI Architecture of Rate Distortion Optimization in H.265-HEVC
* Fast and Adaptive 3D Reconstruction With Extensively High Completeness
* Fast Image Dehazing Method Based on Linear Transformation
* Fast, Compact, and Discriminative: Evaluation of Binary Descriptors for Mobile Applications
* Focus-Plus-Context Techniques for Picoprojection-Based Interaction
* FreeScup: A Novel Platform for Assisting Sculpture Pose Design
* Frequency-Selective Mesh-to-Grid Resampling for Image Communication
* Fusion of Magnetic and Visual Sensors for Indoor Localization: Infrastructure-Free and More Effective
* Generalized Residual Vector Quantization and Aggregating Tree for Large Scale Search
* GHEVC: An Efficient HEVC Decoder for Graphics Processing Units
* GIFT: Towards Scalable 3D Shape Retrieval
* Graph PCA Hashing for Similarity Search
* Guest Editorial: Large-Scale Multimedia Data Retrieval, Classification, and Understanding
* Guest Editorial: Video Over Future Networks
* Hashing With Pairwise Correlation Learning and Reconstruction
* Hierarchical Bayesian Theme Models for Multipose Facial Expression Recognition
* Hierarchical MK Splines: Algorithm and Applications to Data Fitting
* Hierarchical Spatio-Temporal Model for Human Activity Recognition, A
* HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval
* Human Facial Age Estimation by Cost-Sensitive Label Ranking and Trace Norm Regularization
* Image Location Inference by Multisaliency Enhancement
* Image-Based Appraisal of Real Estate Properties
* Imbalance Compensation Framework for Background Subtraction, An
* Implicit Analysis of Perceptual Multimedia Experience Based on Physiological Response: A Review
* Improved Depth-Assisted Error Concealment Algorithm for 3D Video Transmission
* Inferring Emotional Tags From Social Images With User Demographics
* Instrument Learning and Sparse NMD for Automatic Polyphonic Music Transcription
* Integration of Diverse Data Sources for Spatial PM2.5 Data Interpolation
* Interactive Screen Video Streaming-Based Pervasive Mobile Workstyle
* Inverse Sparse Group Lasso Model for Robust Object Tracking
* Joint Admission Control and Routing Via Approximate Dynamic Programming for Streaming Video Over Software-Defined Networking
* Joint Compression of Near-Duplicate Videos
* Joint Deep Boltzmann Machine (jDBM) Model for Person Identification Using Mobile Phone Data, A
* Joint Image-Text News Topic Detection and Tracking by Multimodal Topic And-Or Graph
* Known-Artist Live Song Identification Using Audio Hashprints
* Large-Scale Tracking for Images With Few Textures
* Learning Efficient Binary Codes From High-Level Feature Representations for Multilabel Image Retrieval
* Learning Sparse Representation for No-Reference Quality Assessment of Multiply Distorted Stereoscopic Images
* Learning to Predict High-Quality Edge Maps for Room Layout Estimation
* Live Broadcast With Community Interactions: Bottlenecks and Optimizations
* Local Pattern Collocations Using Regional Co-occurrence Factorization
* Many Shades of Negativity, The
* Matryoshka Peek: Toward Learning Fine-Grained, Robust, Discriminative Features for Product Search
* Maximum a Posterior and Perceptually Motivated Reconstruction Algorithm: A Generic Framework
* Media Quality Assessment by Perceptual Gaze-Shift Patterns Discovery
* Methodology for Designing and Evaluating Cloud Scheduling Strategies in Distributed Videoconferencing Systems, A
* Mining Fashion Outfit Composition Using an End-to-End Deep Learning Approach on Set Data
* Mirror Mirror on the Wall... An Unobtrusive Intelligent Multisensory Mirror for Well-Being Status Self-Assessment and Visualization
* Mobile Live Video Streaming Optimization via Crowdsourcing Brokerage
* Modeling Restaurant Context for Food Recognition
* Motion Classification-Based Fast Motion Estimation for High-Efficiency Video Coding
* Motion-Homogeneous-Based Fast Transcoding Method From H.264: AVC to HEVC
* Multi-View Surveillance Video Summarization via Joint Embedding and Sparse Optimization
* Multimedia Classification Using Bipolar Relation Graphs
* Multimodal 2D+3D Facial Expression Recognition With Deep Fusion Convolutional Neural Network
* Multimodal Video-to-Near-Scene Annotation
* Multipath Cooperative Communications Networks for Augmented and Virtual Reality Transmission
* MuVi: Multiview Video Aware Transmission Over MIMO Wireless Systems
* Neighborhood Matching for Image Retrieval
* No-Reference and Robust Image Sharpness Evaluation Based on Multiscale Spatial and Spectral Features
* Nonlinear Discrete Hashing
* Nonlinear Sparse Hashing
* Nonparametric Sparse Matrix Decomposition for Cross-View Dimensionality Reduction
* Novel Data Hiding Algorithm for High Dynamic Range Images, A
* Novel Transient Wrinkle Detection Algorithm and Its Application for Expression Synthesis, A
* Novel Visual and Statistical Image Features for Microblogs News Verification
* Nuclear Norm-Based 2DLPP for Image Classification
* Object Localization Based on Proposal Fusion
* Object-Based Visual Saliency via Laplacian Regularized Kernel Regression
* Occlusion-Aware Real-Time Object Tracking
* On Market-Driven Hybrid-P2P Video Streaming
* Online MoCap Data Coding With Bit Allocation, Rate Control, and Motion-Adaptive Post-Processing
* Online Variable Coding Length Product Quantization for Fast Nearest Neighbor Search in Mobile Retrieval
* Optimal Representations for Adaptive Streaming in Interactive Multiview Video Systems
* Optimized Adaptive Streaming of Multi-video Stream Bundles
* Overlapping Community Detection for Multimedia Social Networks
* Parametric Planning Model for Video Quality Evaluation of IPTV Services Combining Channel and Video Characteristics
* Parametric Quality-Estimation Model for Adaptive-Bitrate-Streaming Services
* PBC: Polygon-Based Classifier for Fine-Grained Categorization
* Perceptual Pruning: A Context-Aware Transcoder for Immersive Video Conferencing Systems
* Personalized Egocentric Video Summarization of Cultural Tour on User Preferences Input
* Personalized Social Image Recommendation Method Based on User-Image-Tag Model
* Photo Aesthetics Analysis via DCNN Feature Encoding
* Photo Filter Recommendation by Category-Aware Aesthetic Learning
* Picking Neural Activations for Fine-Grained Recognition
* Pipeline-Based Ray-Tracing Runtime System for HSA-Compliant Frameworks, A
* PLTD: Patch-Based Low-Rank Tensor Decomposition for Hyperspectral Images
* Predicting Image Memorability Through Adaptive Transfer Learning From External Sources
* Predicting Popularity of Online Videos Using Support Vector Regression
* Privacy Preserving Cloth Try-On Using Mobile Augmented Reality
* Probabilistic Approach to People-Centric Photo Selection and Sequencing, A
* Progressive Pseudo-analog Transmission for Mobile Video Streaming
* QoS Provisionings for Device-to-Device Content Delivery in Cellular Networks
* Real-Time Correlation Filter Tracking by Efficient Dense Belief Propagation With Structure Preserving
* Recognizing and Presenting the Storytelling Video Structure With Deep Multimodal Networks
* Reducing Latency for Multimedia Broadcast Services Over Mobile Networks
* Redundancy Allocation Based on the Weighted Mismatch-Rate Slope for Multiple Description Video Coding
* Reliable Video Streaming With Strict Playout Deadline in Multihop Wireless Networks
* Resource Provisioning and Profit Maximization for Transcoding in Clouds: A Two-Timescale Approach
* Retrieval Compensated Group Structured Sparsity for Image Super-Resolution
* Retrieval From and Understanding of Large-Scale Multi-modal Medical Datasets: A Review
* Robust Generalized Low-Rank Decomposition of Multimatrices for Image Recovery
* Saliency Detection by Fully Learning a Continuous Conditional Random Field
* Saliency Detection for 3D Surface Geometry Using Semi-regular Meshes
* Saliency Prior Context Model for Real-Time Object Tracking, A
* Salient Object Segmentation via Effective Integration of Saliency and Objectness
* Scalable Image Retrieval by Sparse Product Quantization
* SDNHAS: An SDN-Enabled Architecture to Optimize QoE in HTTP Adaptive Streaming
* Segment-Based Storage and Transcoding Trade-off Strategy for Multi-version VoD Systems in the Cloud, A
* Semisupervised Online Multikernel Similarity Learning for Image Retrieval
* Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN
* Signal Dependent Transform Based on SVD for HEVC Intracoding
* Single Image Super-Resolution via Adaptive Transform-Based Nonlocal Self-Similarity Modeling and Learning-Based Gradient Regularization
* Single Image Super-Resolution via Locally Regularized Anchored Neighborhood Regression and Nonlocal Means
* Skin Segmentation Algorithm Based on Stacked Autoencoders, A
* Sleep Apnea Detection via Depth Video and Audio Feature Learning
* Social Attribute Aware Incentive Mechanism for Device-to-Device Video Distribution
* Social Force Model-Based MCMC-OCSVM Particle PHD Filter for Multiple Human Tracking
* Social-Aware Rate Based Content Sharing Mode Selection for D2D Content Sharing Scenarios
* Social-Aware Video Recommendation for Online Social Groups
* Socially Aware Energy-Efficient Mobile Edge Collaboration for Video Distribution
* Sound-Event Classification Using Robust Texture Features for Robot Hearing
* Sparse Multigraph Embedding for Multimodal Feature Representation
* Sparse Recovery-Based Error Concealment
* Sparse Representation Model Using the Complete Marginal Fisher Analysis Framework and Its Applications to Visual Recognition, A
* SRLSP: A Face Image Super-Resolution Algorithm Using Smooth Regression With Local Structure Prior
* Statistically Indifferent Quality Variation: An Approach for Reducing Multimedia Distribution Cost for Adaptive Video Streaming Services
* Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval
* Structural Variation Classification Model for Image Quality Assessment, A
* Structure-Preserving Image Super-Resolution via Contextualized Multitask Learning
* Sufficient Image Appearance Transfer Combining Color and Texture
* Supervised Local Descriptor Learning for Human Action Recognition
* Texture Plus Depth Video Coding Using Camera Global Motion Information
* Toward Encrypted Cloud Media Center With Secure Deduplication
* Toward Physiology-Aware DASH: Bandwidth-Compliant Prioritized Clinical Multimedia Communication in Ambulances
* Toward QoE-Assured 4K Video-on-Demand Delivery Through Mobile Edge Virtualization With Adaptive Prefetching
* Tradeoffs Between Cost and Performance for CDN Provisioning Based on Coordinate Transformation
* Trip Outfits Advisor: Location-Oriented Clothing Recommendation
* Two-Stage Friend Recommendation Based on Network Alignment and Series Expansion of Probabilistic Topic Model
* Two-View 3D Reconstruction for Food Volume Estimation
* Unimodal Stopping Model-Based Early SKIP Mode Decision for High-Efficiency Video Coding
* Utility-Driven Adaptive Preprocessing for Screen Content Video Compression
* Video Captioning With Attention-Based LSTM and Semantic Consistency
* Video eCommerce: Toward Large Scale Online Video Advertising
* Video Encoder Architecture for Low-Delay Live-Streaming Events
* Video Object Segmentation via Global Consistency Aware Query Strategy
* VideoWhisper: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks
* Vision-Based Fingertip Tracking Utilizing Curvature Points Clustering and Hash Model Representation
* Visual Importance and Distortion Guided Deep Image Quality Assessment Framework
* Visual Tracking via Nonnegative Multiple Coding
* Visualizing Video Sounds With Sound Word Animation to Enrich User Experience
* Voronoi-Based Compact Image Descriptors: Efficient Region-of-Interest Retrieval With VLAD and Deep-Learning-Based Descriptors
* VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products
* Wavelet-Based L_infty Semi-regular Mesh Coding
* Weakly Supervised Learning of Deformable Part-Based Models for Object Detection via Region Proposals
* Who Are Your Real Friends: Analyzing and Distinguishing Between Offline and Online Friendships From Social Multimedia Data
* Words Matter: Scene Text for Image Classification and Retrieval
214 for MultMed(19)
MultMed(20)
* 3DQoE-Oriented and Energy-Efficient 2D plus Depth Based 3D Video Streaming Over Centrally Controlled Networks
* Accessible Melanoma Detection Using Smartphones and Mobile Image Analysis
* Active Learning for Crowdsourced QoE Modeling
* AENet: Learning Deep Audio Features for Video Analysis
* Aesthetics-Driven Stereoscopic 3-D Image Recomposition With Depth Adaptation
* Analysis of Structural Characteristics for Quality Assessment of Multiply Distorted Images
* Anomaly Detection Based on Stacked Sparse Coding With Intraframe Classification Strategy
* Arbitrary-Oriented Scene Text Detection via Rotation Proposals
* Audio-Visual System for Object-Based Audio: From Recording to Listening, An
* Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression
* Background Modeling and Referencing for Moving Cameras-Captured Surveillance Video Coding in HEVC
* Bag of Surrogate Parts Feature for Visual Recognition
* Behavioral Analysis of Kinetic Telepresence for Small Symmetric Group-to-Group Meetings
* Bilevel Feature Learning for Video Saliency Detection
* Blackthorn: Large-Scale Interactive Multimodal Learning
* Blind Image Quality Assessment via Vector Regression and Object Oriented Pooling
* Blind Quality Assessment Based on Pseudo-Reference Image
* Blind Quality Index for Multiply Distorted Images Using Biorder Structure Degradation and Nonlocal Statistics
* Building Emotional Machines: Recognizing Image Emotions Through Deep Neural Networks
* Bundled Object Context for Referring Expressions
* BVI-HD: A Video Quality Database for HEVC Compressed and Texture Synthesized Content
* CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network
* Check Out This Place: Inferring Ambiance From Airbnb Photos
* Closed-Form Optimization on Saliency-Guided Image Compression for HEVC-MSP
* CNN-Based Joint Clustering and Representation Learning with Feature Drift Compensation for Large-Scale Image Data
* Coherent Deep-Net Fusion To Classify Shots In Concert Videos
* Collaborative Scheduling-Based Parallel Solution for HEVC Encoding on Multicore Platforms, A
* Collective Density Clustering for Coherent Motion Detection
* Content-Adaptive Joint Image Compression and Encryption Scheme, A
* Content-Attention Representation by Factorized Action-Scene Network for Action Recognition
* Content-Aware Delivery of Scalable Video in Network Coding Enabled Named Data Networks
* Controllable Multicast for Adaptive Scalable Video Streaming in Software-Defined Networks
* Convolutional Neural Network for Intermediate View Enhancement in Multiview Streaming
* Cooperative Bargaining Game-Based Multiuser Bandwidth Allocation for Dynamic Adaptive Streaming Over HTTP
* Cost-Constrained Video Quality Satisfaction Study on Mobile Devices, A
* Cost-Distortion Optimization and Resource Control in Pseudo-Analog Visual Communications
* Cross-Domain Collaborative Learning via Discriminative Nonparametric Bayesian Model
* Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild
* Cross-Space Distortion Directed Color Image Compression
* CTU-Level Complexity Control for High Efficiency Video Coding
* CUNet: A Compact Unsupervised Network For Image Classification
* DASH Adaptation Algorithm Based on Adaptive Forgetting Factor Estimation
* Data Analysis in Multimedia Quality Assessment: Revisiting the Statistical Tests
* Data Driven 2-D-to-3-D Video Conversion for Soccer
* Data-Driven Lightweight Interest Point Selection for Large-Scale Visual Search
* Deep Age Estimation: From Classification to Ranking
* Deep Salient Object Detection With Dense Connections and Distraction Diagnosis
* Deep Spatiotemporal Perspective for Understanding Crowd Behavior, A
* Deep Temporal Multimodal Fusion for Medical Procedure Monitoring Using Wearable Sensors
* Deep-Structured Event Modeling for User-Generated Photos
* Depth Assisted Adaptive Workload Balancing for Parallel View Synthesis
* Depth Pooling Based Large-Scale 3-D Action Recognition with Convolutional Neural Networks
* Depth-Adaptive Deep Neural Network for Semantic Segmentation
* Detecting and Removing Visual Distractors for Video Aesthetic Enhancement
* Detecting Socially Significant Music Events Using Temporally Noisy Labels
* Detecting Topic Authoritative Social Media Users: A Multilayer Network Approach
* Discovering Triangles in Portraits for Supporting Photographic Creation
* Discovery of Repeated Melodic Phrases in Folk Singing Recordings
* Discriminative Part Selection for Human Action Recognition
* Disseminating Multilayer Multimedia Content Over Challenged Networks
* Distributed Consolidation of Highly Incomplete Dynamic Point Clouds Based on Rank Minimization
* Dual-Graph Regularized Discriminative Multitask Tracker
* Dynamic Resource Allocation by Batch Optimization for Value-Added Video Services Over SDN
* Dynamic Texture Recognition Using Volume Local Binary Count Patterns With an Application to 2D Face Spoofing Detection
* Edge Computing Framework for Cooperative Video Processing in Multimedia IoT Systems
* Editorial IEEE Transactions on Multimedia Special Section on Video Analytics: Challenges, Algorithms, and Applications
* Editorial Introduction to the Special Issue on Multimedia Big Data for Extreme Events
* Efficient and Robust Image Coding and Transmission Based on Scrambled Block Compressive Sensing
* Efficient Architecture of In-Loop Filters for Multicore Scalable HEVC Hardware Decoders, An
* Efficient Audio Rendering Using Angular Region-Wise Source Enhancement for 360° Video
* EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition
* Energy-Aware Mobile Edge Computing and Routing for Low-Latency Visual Data Processing
* Event-Based Perceptual Quality Assessment for HTTP-Based Video Streaming With Playback Interruption
* Expanding-Window BATS Code for Scalable Video Multicasting Over Erasure Networks
* Explicit Shape Regression With Characteristic Number for Facial Landmark Localization
* Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder
* Exploiting Video Quality Information With Lightweight Network Coordination for HTTP-Based Adaptive Video Streaming
* Exploiting Web Images for Video Highlight Detection With Triplet Deep Ranking
* Extracting Key Segments of Videos for Event Detection by Learning From Web Sources
* F-DES: Fast and Deep Event Summarization
* Fast Forgery Detection Algorithm Based on Exponential-Fourier Moments for Video Region Duplication, A
* Fast Uyghur Text Detector for Complex Background Images, A
* Fast-PADMA: Rapidly Adapting Facial Affect Model From Similar Individuals
* Feature Descriptor Based on Local Normalized Difference for Real-World Texture Classification, A
* Field-of-Experts Filters Guided Tensor Completion
* Foveation-Based Wireless Soft Image Delivery
* Free-Viewpoint Television System for Horizontal Virtual Navigation, A
* Full-Reference Objective Quality Assessment of Tone-Mapped Images
* Fully Convolutional Network for Multiscale Temporal Action Proposals
* Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks
* General Knowledge Embedded Image Representation Learning
* Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval
* Geo-Distinctive Visual Element Matching for Location Estimation of Images
* Geodesic Path-Based Diffusion Acceleration for Image Denoising
* GLA: Global-Local Attention for Image Description
* Grab, Pay, and Eat: Semantic Food Detection for Smart Restaurants
* Group-Sensitive Triplet Embedding for Vehicle Reidentification
* H.264 and H.265 Video Bandwidth Prediction
* HEVC Selective Encryption Using RC6 Block Cipher Technique
* Hierarchical Parsing Net: Semantic Scene Parsing From Global Scene to Objects
* High-Quality Soft Video Delivery With GMRF-Based Overhead Reduction
* Highly Accurate Image Reconstruction for Multimodal Noise Suppression Using Semisupervised Learning on Big Data
* Hole Filling With Multiple Reference Views in DIBR View Synthesis
* Holographic Data Coding: Benchmarking and Extending HEVC With Adapted Transforms
* Hybrid Digital-Analog Video Delivery With Shannon-Kotel'nikov Mapping
* Hybrid Intraprediction Based on Local and Nonlocal Correlations
* IF-MCA: Importance Factor-Based Multiple Correspondence Analysis for Multimedia Data Analytics
* Image Style Classification Based on Learnt Deep Correlation Features
* Impact Localization on Rigid Surfaces Using Hermitian Angle Distribution for Human-Computer Interface Applications
* Improved Image-Based Localization Using SFM and Modified Coordinate System Transfer
* Improving Existing Collaborative Filtering Recommendations via Serendipity-Based Algorithm
* Improving Multipath Video Transmission With Raptor Codes in Heterogeneous Wireless Networks
* Improving Video Saliency Detection via Localized Estimation and Spatiotemporal Refinement
* Information Bottleneck Approach to Optimize the Dictionary of Visual Data, An
* Intelligent Detail Enhancement for Exposure Fusion
* Interactive Image Segmentation Using Semi-transparent Wearable Glasses
* Interpreting Video Recommendation Mechanisms by Mining View Count Traces
* Iterative Feedback Control-Based Salient Object Segmentation
* Iterative Framework of Cascaded Deblocking and Superresolution for Compressed Images, An
* Joint Coding-Transmission Optimization for a Video Surveillance System With Multiple Cameras
* Joint Dynamic Rate Control and Transmission Scheduling for Scalable Video Multirate Multicast Over Wireless Networks
* Joint Intra and Multiple Description Coding for Packet Loss Resilient Video Transmission
* Joint Latent Dirichlet Allocation for Social Tags
* Joint Optimization of Radio and Virtual Machine Resources With Uncertain User Demands in Mobile Cloud Computing
* Joint Sponsor Scheduling in Cellular and Edge Caching Networks for Mobile Video Delivery
* JPEG Image Encryption With Improved Format Compatibility and File Size Preservation
* Key-Frame-Based Background Sprite Generation for Hole Filling in Depth Image-Based Rendering
* Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation
* Label Distribution-Based Facial Attractiveness Computation by Deep Residual Learning
* Large Margin Learning in Set-to-Set Similarity Comparison for Person Reidentification
* Learning Deep Spatio-Temporal Dependence for Semantic Video Segmentation
* Learning From Cross-Domain Media Streams for Event-of-Interest Discovery
* Learning From Hierarchical Spatiotemporal Descriptors for Micro-Expression Recognition
* Leveraging Structural Context Models and Ranking Score Fusion for Human Interaction Prediction
* Light Field Coding With Field-of-View Scalability and Exemplar-Based Interlayer Prediction
* Local Wavelet Acoustic Pattern: A Novel Time-Frequency Descriptor for Birdsong Recognition
* Lossless Compression of Color Filter Array Mosaic Images With Visualization via JPEG 2000
* Low-Rank Linear Embedding for Image Recognition
* Maya Codical Glyph Segmentation: A Crowdsourcing Approach
* Measuring Crowd Collectiveness by Macroscopic and Microscopic Motion Consistencies
* MixedEmotions: An Open-Source Toolbox for Multimodal Emotion Analysis
* Mobile Instant Video Clip Sharing With Screen Scrolling: Measurement and Enhancement
* Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification
* Multigranular Event Recognition of Personal Photo Albums
* Multilabel Image Classification With Regional Latent Semantic Dependencies
* Multimodal Framework for Analyzing the Affect of a Group of People
* Multimodal Recurrent Neural Networks With Information Transfer Layers for Indoor Scene Labeling
* Multiple Speaker Tracking in Spatial Audio via PHD Filtering and Depth-Audio Fusion
* Multiscale Deep Alternative Neural Network for Large-Scale Video Classification
* Multisensor Image Fusion and Enhancement in Spectral Total Variation Domain
* Multistage Object Detection With Group Recursive Learning
* Multistage Pooling for Blind Quality Prediction of Asymmetric Multiply-Distorted Stereoscopic Images
* Multiview Label Sharing for Visual Representations and Classifications
* Multiview Multimodal System for Monitoring Patient Sleep, A
* Multiview Video Transmission Over Underwater Acoustic Path
* Music Popularity: Metrics, Characteristics, and Audio-Based Prediction
* Naturalness Preserved Nonuniform Illumination Estimation for Image Enhancement Based on Retinex
* New Model-Based Method for Multi-View Human Body Tracking and Its Application to View Transfer in Image-Based Rendering, A
* No-Reference Image Quality Assessment Using Orthogonal Color Planes Patterns
* No-Reference Image Sharpness Assessment Based on Maximum Gradient and Variability of Gradients
* No-Reference View Synthesis Quality Prediction for 3-D Videos Based on Color-Depth Interactions
* Noncoverage Field Model for Improving the Rendering Quality of Virtual Views, A
* Nonnegative OPLS for Supervised Design of Filter Banks: Application to Image and Audio Feature Extraction
* Novel Digital Watermarking Based on General Non-Negative Matrix Factorization, A
* Novel No-Reference Metric for Estimating the Impact of Frame Freezing Artifacts on Perceptual Quality of Streamed Videos, A
* Object Detection and Tracking Under Occlusion for Object-Level RGB-D Video Segmentation
* On Influential Trends in Interactive Video Retrieval: Video Browser Showdown 2015-2017
* On the Minimization of Glass-to-Glass and Glass-to-Algorithm Delay in Video Communication
* Online Modeling of Esthetic Communities Using Deep Perception Graph Analytics
* Online Multimodal Multiexpert Learning for Social Event Tracking
* Optimal Transmission Estimation via Fog Density Perception for Efficient Single Image Defogging
* Optimal Transmission Topology Construction and Secure Linear Network Coding Design for Virtual-Source Multicast With Integral Link Rates
* Optimized Data Representation for Interactive Multiview Navigation
* Optimizing Multistage Discriminative Dictionaries for Blind Image Quality Assessment
* Optimizing Quality of Experience for Adaptive Bitrate Streaming via Viewer Interest Inference
* Parallax-Tolerant Image Stitching Based on Robust Elastic Warping
* Pedestrian Detection via Body Part Semantic and Contextual Information With DNN
* Perceptual Quality Maximization for Video Calls With Packet Losses by Optimizing FEC, Frame Rate, and Quantization
* Personalized Classifier for Food Image Recognition
* Photo Stylistic Brush: Robust Style Transfer via Superpixel-Based Bipartite Graph
* PQTable: Nonexhaustive Fast Search for Product-Quantized Codes Using Hash Tables
* Predicting Microblog Sentiments via Weakly Supervised Multimodal Deep Learning
* Predicting Visual Features From Text for Image and Video Caption Retrieval
* Prediction of the Leadership Style of an Emergent Leader Using Audio and Visual Nonverbal Features
* PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance
* QoE-Driven Mobile Edge Caching Placement for Adaptive Video Streaming
* Quality Assessment of DIBR-Synthesized Images by Measuring Local Geometric Distortions and Global Sharpness
* Quality of Experience in a Stereoscopic Multiview Environment
* Quality-Guided Fusion-Based Co-Saliency Estimation for Image Co-Segmentation and Colocalization
* Quasi-Homography Warps in Image Stitching
* Query Adaptive Multiview Object Instance Search and Localization Using Sketches
* Query-Adaptive Image Retrieval by Deep-Weighted Hashing
* Query-Free Clothing Retrieval via Implicit Relevance Feedback
* Ranking-Preserving Low-Rank Factorization for Image Annotation With Missing Labels
* Real-Time Long-Term Tracking With Prediction-Detection-Correction
* Real-Time, Curvature-Sensitive Surface Simplification Using Depth Images
* Recognition of Emotions in User-Generated Videos With Kernelized Features
* Recurrent Spatial Pyramid CNN for Optical Flow Estimation
* Reduced-Reference Image Quality Assessment in Free-Energy Principle and Sparse Representation
* Region-Based Multiple Description Coding for Multiview Video Plus Depth Video
* Regularized Semi-non-negative Matrix Factorization for Hashing
* Reliable and Reversible Image Privacy Protection Based on False Colors, A
* Removing Haze Particles From Single Image via Exponential Inference With Support Vector Data Description
* RETRIEVAL: An Online Performance Evaluation Tool for Information Retrieval Methods
* Reversible Data Hiding in Encrypted Three-Dimensional Mesh Models
* Robust 3-D Human Detection in Complex Environments With a Depth Camera
* Robust 3D Action Recognition Through Sampling Local Appearances and Global Distributions
* Robust Coverless Image Steganography Based on DCT and LDA Topic Classification
* Robust Detection of Extreme Events Using Twitter: Worldwide Earthquake Monitoring
* Robust Multiview Synthesis for Wide-Baseline Camera Arrays
* Robust Sparse and Dense Nonrigid Structure From Motion
* Robust Tracking and Redetection: Collaboratively Modeling the Target and Its Context
* Robust Visual Tracking via Smooth Manifold Kernel Sparse Learning
* Saliency Detection in Face Videos: A Data-Driven Approach
* Scale-Aware Edge-Preserving Image Filtering via Iterative Global Optimization
* Scale-Aware Fast R-CNN for Pedestrian Detection
* Scale-Aware Fast R-CNN for Pedestrian Detection
* Scene Text Detection Using Superpixel-Based Stroke Feature Transform and Deep Learning Based Region Classification
* SeaShips: A Large-Scale Precisely Annotated Dataset for Ship Detection
* Seeds-Based Part Segmentation by Seeds Propagation and Region Convexity Decomposition
* Semi-Supervised Image Classification With Self-Paced Cross-Task Networks
* Server Allocation Problem for Session-Based Multiplayer Cloud Gaming, The
* Single Image Dehazing Using Ranking Convolutional Neural Network
* Snowflake Removal for Videos via Global and Local Low-Rank Decomposition
* SNR-Constrained Heuristics for Optimizing the Scaling Parameter of Robust Audio Watermarking
* Social-Aware Movie Recommendation via Multimodal Network Learning
* Spatio-Temporal Disocclusion Filling Using Novel Sprite Cells
* Spatio-Temporal Saliency Networks for Dynamic Saliency Prediction
* Spatiotemporal Saliency Estimation by Spectral Foreground Detection
* Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
* Spherical Superpixel Segmentation
* SPIHT Algorithm With Adaptive Selection of Compression Ratio Depending on DWT Coefficients
* Spring-Electric Graph Model for Socialized Group Photography, A
* Statistical Study of View Preferences for Online Videos With Cross-Platform Information
* Step Count and Pulse Rate Detection Based on the Contactless Image Measurement Method
* Structure-Guided Image Inpainting Using Homography Transformation
* Summarization of User-Generated Sports Video by Using Deep Action Recognition Features
* Super Resolution by Comprehensively Exploiting Dependencies of Wavelet Coefficients
* Superpixel-Based Single Nighttime Image Haze Removal
* Supervised Distributed Hashing for Large-Scale Multimedia Retrieval
* SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals
* Text2Video: An End-to-end Learning Framework for Expressing Text With Videos
* Thin-Feature-Aware Transport-Velocity Formulation for SPH-Based Liquid Animation
* Three-Dimensional Attention-Based Deep Ranking Model for Video Highlight Detection
* Toward Intelligent Product Retrieval for TV-to-Online (T2O) Application: A Transfer Metric Learning Approach
* Toward Rendering-Latency Reduction for Composable Web Services via Priority-Based Object Caching
* Towards Individual QoE for Multiparty Videoconferencing
* Traffic-Optimized Data Placement for Social Media
* Twitter100k: A Real-World Dataset for Weakly Supervised Cross-Media Retrieval
* Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length
* Ultrasonic Communication Using Consumer Hardware
* Understanding Dynamic Cross-OSN Associations for Cold-Start Recommendation
* Unequal Error Protection for Scalable Video Storage in the Cloud
* Universal String Matching Approach to Screen Content Coding, A
* Unsupervised Discovery of Character Dictionaries in Animation Movies
* Unsupervised Salient Object Detection via Inferring from Imperfect Saliency Models
* Variational Fusion of Time-of-Flight and Stereo Data for Depth Estimation Using Edge-Selective Joint Filtering
* Visual Sentiment Prediction Based on Automatic Discovery of Affective Regions
* Vocabulary for Growth: Topic Modeling of Content Popularity Evolution, A
* Worst Case Driven Display Frame Compression for Energy-Efficient Ultra-HD Display Processing
* You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis
261 for MultMed(20)
MultMed(21)
* 3-D Reconstruction of Human Body Shape From a Single Commodity Depth Camera
* AccAnn: A New Subjective Assessment Methodology for Measuring Acceptability and Annoyance of Quality of Experience
* Adaptive Convolution for Object Detection
* Adaptive Cyclopean Image-Based Stereoscopic Image-Quality Assessment Using Ensemble Learning
* Adaptive Hypergraph Embedded Semi-Supervised Multi-Label Image Annotation
* Adaptive Label Propagation for Facial Appearance Transfer
* Adaptive RD Optimal Sparse Coding With Quantization for Image Compression
* Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval
* Adaptive Triangular Partition Algorithm for Digital Images, An
* Adjusted Non-Local Regression and Directional Smoothness for Image Restoration
* Adversarially Approximated Autoencoder for Image Generation and Manipulation
* Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks
* Attention-Based Multiview Re-Observation Fusion Network for Skeletal Action Recognition
* Attentive Spatial-Temporal Summary Networks for Feature Learning in Irregular Gait Recognition
* Auto-Embedding Generative Adversarial Networks For High Resolution Image Synthesis
* Automatic Curation of Sports Highlights Using Multimodal Excitement Features
* Automatic Depression Analysis Using Dynamic Facial Appearance Descriptor and Dirichlet Process Fisher Encoding
* Automatic Video Object Segmentation Based on Visual and Motion Saliency
* Auxiliary Classifier Generative Adversarial Network With Soft Labels in Imbalanced Acoustic Event Detection
* Bayesian DeNet: Monocular Depth Prediction and Frame-Wise Fusion With Synchronized Uncertainty
* Benchmark of DIBR Synthesized View Quality Assessment Metrics on a New Database for Immersive Media Applications, A
* Bidirectional Convolutional Recurrent Sparse Network (BCRSN): An Efficient Model for Music Emotion Recognition
* Blind Quality Assessment of 3-D Synthesized Views Based on Hybrid Feature Classes
* Blind Quality Assessment of Camera Images Based on Low-Level and High-Level Statistical Features
* BLTRCNN-Based 3-D Articulatory Movement Prediction: Learning Articulatory Synchronicity From Both Text and Audio Inputs
* Boosting Positive and Unlabeled Learning for Anomaly Detection With Multi-Features
* BranchGAN: Unsupervised Mutual Image-to-Image Transfer With A Single Encoder and Dual Decoders
* Cache Less for More: Exploiting Cooperative Video Caching and Delivery in D2D Communications
* Can Categories and Attributes Be Learned in a Multi-Task Way?
* Channel-Dependent Statistical Watermark Detector for Color Images, A
* Co-Recognition of Multiple Fingertips for Tabletop Human-Projector Interaction
* COCO-CN for Cross-Lingual Image Tagging, Captioning, and Retrieval
* Codebook-Free Compact Descriptor for Scalable Visual Search
* COMIC: Toward A Compact Image Captioning Model With Attention
* Computational Model for Stereoscopic Visual Saliency Prediction, A
* Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition
* Content Popularity Prediction Towards Location-Aware Mobile Edge Caching
* Content-Based Adaptive SHVC Mode Decision Algorithm
* Context-Aware Three-Dimensional Mean-Shift With Occlusion Handling for Robust Object Tracking in RGB-D Videos
* Continuous Gesture Segmentation and Recognition Using 3DCNN and Convolutional LSTM
* Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications
* Cross-Modality Bridging and Knowledge Transferring for Image Understanding
* Cross-Modality Microblog Sentiment Prediction via Bi-Layer Multimodal Hypergraph Learning
* Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation
* Deep Alignment Network Based Multi-Person Tracking With Occlusion and Motion Reasoning
* Deep Binary Reconstruction for Cross-Modal Hashing
* Deep Feature Aggregation and Image Re-Ranking With Heat Diffusion for Image Retrieval
* Deep Hierarchical Encoder-Decoder Network for Image Captioning
* Deep Learning for Single Image Super-Resolution: A Brief Review
* Deep Memory Network for Cross-Modal Retrieval
* Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation
* Deep Neural Framework for Continuous Sign Language Recognition by Iterative Training, A
* Deep Objective Quality Assessment Driven Single Image Super-Resolution
* Deep Progressive Hashing for Image Retrieval
* Deep Universal Generative Adversarial Compression Artifact Removal
* Detail Preserved Single Image Dehazing Algorithm Based on Airlight Refinement
* Differential Privacy Oriented Distributed Online Learning for Mobile Social Video Prefetching
* Differentially-Private and Trustworthy Online Social Multimedia Big Data Retrieval in Edge Computing
* Discovering Latent Discriminative Patterns for Multi-Mode Event Representation
* Distortion Design for Secure Adaptive 3-D Mesh Steganography
* Distributed and Efficient Object Detection via Interactions Among Devices, Edge, and Cloud
* Distribution-Oriented Aesthetics Assessment With Semantic-Aware Hybrid Network
* Double-Bit Quantization and Index Hashing for Nearest Neighbor Search
* DRFN: Deep Recurrent Fusion Network for Single-Image Super-Resolution with Large Factors
* Dual Pursuit for Subspace Learning
* Dynamic Cross-Layer Signaling Exchange for Real-Time and On-Demand Multimedia Streams
* Dynamic Difficulty Awareness Training for Continuous Emotion Prediction
* Dynamic Texture Classification Using Unsupervised 3D Filter Learning and Local Binary Encoding
* Effective 3-D Shape Retrieval by Integrating Traditional Descriptors and Pointwise Convolution
* Effective Image Retrieval via Multilinear Multi-Index Fusion
* Efficient Estimation of View Synthesis Distortion for Depth Coding Optimization
* Emotion-Aware Multimedia Systems Security
* Enabling Trusted and Privacy-Preserving Healthcare Services in Social Media Health Networks
* End-to-End Automatic Image Annotation Based on Deep CNN and Multi-Label Data Augmentation
* Energy-Efficient Multipath TCP for Quality-Guaranteed Video Over Heterogeneous Wireless Networks
* Enhancing Image Watermarking With Adaptive Embedding Parameter and PSNR Guarantee
* Enhancing the Robustness of Neural Collaborative Filtering Systems Under Malicious Attacks
* EVM-CNN: Real-Time Contactless Heart Rate Estimation From Facial Video
* Exploiting Mid-Level Semantics for Large-Scale Complex Video Classification
* Exploiting Recurrent Neural Networks and Leap Motion Controller for the Recognition of Sign Language and Semaphoric Hand Gestures
* Exploiting Web Images for Weakly Supervised Object Detection
* Exploring Users' Internal Influence from Reviews for Social Recommendation
* Extracting Multiple Visual Senses for Web Learning
* Face Alignment With Expression- and Pose-Based Adaptive Initialization
* Face Hallucination via Coarse-to-Fine Recursive Kernel Regression Structure
* Facial Expression Recognition Using Hierarchical Features With Deep Comprehensive Multipatches Aggregation Convolutional Neural Networks
* Facial Landmark Machines: A Backbone-Branches Architecture With Progressive Representation Learning
* Fast H.264 to HEVC Transcoding: A Deep Learning Method
* Fast Similarity Matrix Profile for Music Analysis and Exploration
* Feature Affinity-Based Pseudo Labeling for Semi-Supervised Person Re-Identification
* Fine Granularity Object-Level Representation for Event Detection and Recounting, A
* Fine-Grained Land Use Classification at the City Scale Using Ground-Level Images
* First-Person Action Recognition With Temporal Pooling and Hilbert-Huang Transform
* FIVR: Fine-Grained Incident Video Retrieval
* Format-Compliant Selective Secret 3-D Object Sharing Scheme
* FreeCast: Graceful Free-Viewpoint Video Delivery
* FuseGAN: Learning to Fuse Multi-Focus Image via Conditional Generative Adversarial Network
* Gated Peripheral-Foveal Convolutional Neural Network for Unified Image Aesthetic Prediction, A
* Generating Video Descriptions With Latent Topic Guidance
* Geometry and Topology Preserving Hashing for SIFT Feature
* GLAD: Global-Local-Alignment Descriptor for Scalable Person Re-Identification
* GPU-Based Hierarchical Motion Estimation for High Efficiency Video Coding
* Gradient Prior-Aided CNN Denoiser With Separable Convolution-Based Optimization of Feature Dimension
* Graph-Based Static 3D Point Clouds Geometry Coding
* Guest Editorial Trustworthiness in Social Multimedia Analytics and Delivery
* Heterogeneous Hashing Network for Face Retrieval Across Image and Video Domains
* Hierarchical Approach for Associating Body-Worn Sensors to Video Regions in Crowded Mingling Scenarios, A
* Hierarchical Concept Score Postprocessing and Concept-Wise Normalization in CNN-Based Video Event Recognition
* Hierarchy-Dependent Cross-Platform Multi-View Feature Learning for Venue Category Prediction
* High-Efficiency Compressed Sensing-Based Terminal-to-Cloud Video Transmission System, A
* High-Quality Image Captioning With Fine-Grained and Semantic-Guided Visual Attention
* HSCS: Hierarchical Sparsity Based Co-saliency Detection for RGBD Images
* Hybrid Deep-Learning-Based Anomaly Detection Scheme for Suspicious Flow Detection in SDN: A Social Multimedia Perspective
* Image Completion Using Low Tensor Tree Rank and Total Variation Minimization
* Image Decolorization Combining Local Features and Exposure Features
* Improved Low-Bitrate HEVC Video Coding Using Deep Learning Based Super-Resolution and Adaptive Block Patching
* Improved Performance Measures for Video Quality Assessment Algorithms Using Training and Validation Sets
* Improving Object Retrieval Quality by Integration of Similarity Propagation and Query Expansion
* Improving Saliency Detection Based on Modeling Photographer's Intention
* Incremental Re-Identification by Cross-Direction and Cross-Ranking Adaption
* Inferring Emotions From Large-Scale Internet Voice Data
* Instance Segmentation Enabled Hybrid Data Association and Discriminative Hashing for Online Multi-Object Tracking
* Integrating Image and Textual Information in Human-Robot Interactions for Children With Autism Spectrum Disorder
* Iterative Image Dehazing Method With Polarization, An
* Iterative Manifold Embedding Layer Learned by Incomplete Data for Large-Scale Image Retrieval
* Joint CRF and Locality-Consistent Dictionary Learning for Semantic Segmentation
* Joint Deep and Depth for Object-Level Segmentation and Stereo Tracking in Crowds
* Joint Texture/Depth Power Allocation for 3-D Video SoftCast
* Know More Say Less: Image Captioning Based on Scene Graphs
* Labeled Multiple Canonical Correlation Analysis for Information Fusion, The
* Learning a Joint Affinity Graph for Multiview Subspace Clustering
* Learning Attentional Recurrent Neural Network for Visual Tracking
* Learning Composite Latent Structures for 3D Human Action Representation and Recognition
* Learning Deep Conditional Neural Network for Image Segmentation
* Learning Descriptors With Cube Loss for View-Based 3-D Object Retrieval
* Learning Linear Regression via Single-Convolutional Layer for Visual Object Tracking
* Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval
* Learning Semantic Text Features for Web Text-Aided Image Classification
* Learning-Based Tone Mapping Operator for Efficient Image Matching
* Locally Joint Sparse Marginal Embedding for Feature Extraction
* Long Activity Video Understanding Using Functional Object-Oriented Network
* Low-Cost Four-Dimensional Experience Theater Using Home Appliances
* Makeup Removal via Bidirectional Tunable De-Makeup Network
* MC-SSM: Nonparametric Semantic Image Segmentation With the ICM Algorithm
* MEC-Assisted Panoramic VR Video Streaming Over Millimeter Wave Mobile Networks
* Modified Just Noticeable Depth Difference Model Built in Perceived Depth Space, A
* Motion-Based Rate Adaptation in WebRTC Videoconferencing Using Scalable Video Coding
* Multi-Channel Decomposition in Tandem With Free-Energy Principle for Reduced-Reference Image Quality Assessment
* Multi-Correlation Filters With Triangle-Structure Constraints for Object Tracking
* Multi-Grained Parallel Solution for HEVC Encoding on Heterogeneous Platforms, A
* Multi-Kernel Coupled Projections for Domain Adaptive Dictionary Learning
* Multi-Level Cooperative Fusion of GM-PHD Filters for Online Multiple Human Tracking
* Multi-Modal and Multi-Domain Embedding Learning for Fashion Retrieval and Analysis
* Multi-Person Pose Estimation Using Bounding Box Constraint and LSTM
* Multi-Scale Interpretation Model for Convolutional Neural Networks: Building Trust Based on Hierarchical Interpretation
* Multi-Speaker Tracking From an Audio-Visual Sensing Device
* Multilevel Model for Video Object Segmentation Based on Supervision Optimization
* Multimodal Learning for Human Action Recognition Via Bimodal/Multimodal Hybrid Centroid Canonical Correlation Analysis
* Multitask Learning for Cross-Domain Image Captioning
* Naturalness-Aware Deep No-Reference Image Quality Assessment
* Neural Task Planning With AND-OR Graph Representations
* New Hole-Filling Method Using Extrapolated Spatio-Temporal Background Information for a Synthesized Free-View
* New Modality: Emoji Challenges in Prediction, Anticipation, and Retrieval
* New Rate-Complexity-Distortion Model for Fast Motion Estimation Algorithm in HEVC, A
* No-Reference Quality Assessment for Screen Content Images Based on Hybrid Region Features Fusion
* No-Reference Quality Evaluator of Transparently Encrypted Images
* Non-Local Texture Optimization With Wasserstein Regularization Under Convolutional Neural Network
* Novel Projective-Consistent Plane Based Image Stitching Method, A
* Novel Segmentation Based Depth Map Up-Sampling, A
* Novel Sign Language Recognition Framework Using Hierarchical Grassmann Covariance Matrix, A
* Optimizing Stored Video Delivery for Wireless Networks: The Value of Knowing the Future
* Pairwise-Comparison-Based Rank Learning for Benchmarking Image Restoration Algorithms
* Personalized Recommendation of Social Images by Constructing a User Interest Tree With Deep Features and Tag Trees
* Polar Transformation on Image Features for Orientation-Invariant Representations
* Pre-Attention and Spatial Dependency Driven No-Reference Image Quality Assessment
* Predicting Stereoscopic Image Quality via Stacked Auto-Encoders Based on Stereopsis Formation
* Predicting the Top-N Popular Videos via a Cross-Domain Hybrid Model
* Probabilistic Reasoning for Unique Role Recognition Based on the Fusion of Semantic-Interaction and Spatio-Temporal Features
* Probabilistic Semantic Retrieval for Surveillance Videos With Activity Graphs
* Progressive Spatial Recurrent Neural Network for Intra Prediction
* Quality Assessment for Video With Degradation Along Salient Trajectories
* Quality Evaluation of Image Dehazing Methods Using Synthetic Hazy Images
* Quality-Aware Unpaired Image-to-Image Translation
* QuatNet: Quaternion-Based Head Pose Estimation With Multiregression Loss
* Real-Time Dense Monocular SLAM With Online Adapted Depth Prediction Network
* Real-Time Head Pose Estimation and Face Modeling From a Depth Image
* Real-Time Visual-Inertial SLAM Based on Adaptive Keyframe Selection for Mobile AR Applications
* Reduced-Complexity Intra Block Copy (IntraBC) Mode With Early CU Splitting and Pruning for HEVC Screen Content Coding
* Refinet: A Deep Segmentation Assisted Refinement Network for Salient Object Detection
* Region-Based Context Enhanced Network for Robust Multiple Face Alignment
* Regression-Based Three-Dimensional Pose Estimation for Texture-Less Objects
* Robust Object Tracking Using Manifold Regularized Convolutional Neural Networks
* RotateView: A Video Composition System for Interactive Product Display
* S-MDP: Streaming With Markov Decision Processes
* Saliency Detection via Multi-Scale Global Cues
* Saliency Integration: An Arbitrator Model
* Salient Object Detection Using Cascaded Convolutional Neural Networks and Adversarial Learning
* Salient Object Detection via Fuzzy Theory and Object-Level Enhancement
* Scalable Access Control For Privacy-Aware Media Sharing
* Self-Learning Super-Resolution Using Convolutional Principal Component Analysis and Random Matching
* Separable and Reversible Data Hiding in Encrypted Images Using Parametric Binary Tree Labeling
* Sequential Data Analysis Approach to Detect Emergent Leaders in Small Groups, A
* Shape Basis Interpretation for Monocular Deformable 3-D Reconstruction
* Shape-Optimizing and Illumination-Smoothing Image Stitching
* Show and Tell in the Loop: Cross-Modal Circular Correlation Learning
* Single Image Haze Removal via Region Detection Network
* SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning
* SketchHelper: Real-Time Stroke Guidance for Freehand Sketch Retrieval
* Socially Aware Trust Framework for Multimedia Delivery in D2D Cooperative Communication
* Software-Defined Multimedia Streaming System Aided By Variable-Length Interval In-Network Caching
* Sparse Coding Guided Spatiotemporal Feature Learning for Abnormal Event Detection in Large Videos
* Spatiotemporal Symmetric Convolutional Neural Network for Video Bit-Depth Enhancement
* SSIM-Based Global Optimization for CTU-Level Rate Control in HEVC
* SSPA-LBS: Scalable and Social-Friendly Privacy-Aware Location-Based Services
* Statistical Model-Based Detector via Texture Weight Map: Application in Re-Sampling Authentication
* Stochastic Analysis of DASH-Based Video Service in High-Speed Railway Networks
* Structure-Constrained Motion Sequence Generation
* Study of High Frame Rate Video Formats, A
* Stylized Aesthetic QR Code
* Superpixel Segmentation Based on Square-Wise Asymmetric Partition and Structural Approximation
* Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
* SwapGAN: A Multistage Generative Approach for Person-to-Person Fashion Style Transfer
* SynBF: A New Bilateral Filter for Postremoval of Noise From Synthesis Views in 3-D Video
* Synthesis of Realistic Facial Expressions Using Expression Map
* Temporal Action Localization in Untrimmed Videos Using Action Pattern Trees
* Texture Relative Superpixel Generation With Adaptive Parameters
* TPCKT: Two-Level Progressive Cross-Media Knowledge Transfer
* Track, Attend, and Parse (TAP): An End-to-End Framework for Online Handwritten Mathematical Expression Recognition
* Trust Assessment in Vehicular Social Network Based on Three-Valued Subjective Logic
* Trust-Based Privacy-Preserving Photo Sharing in Online Social Networks
* Trust-Based Video Management Framework for Social Multimedia Networks
* Two-Stage Clustering Based 3D Visual Saliency Model for Dynamic Scenarios, A
* Unified Spatio-Temporal Attention Networks for Action Recognition in Videos
* Universal Optical Flow Based Real-Time Low-Latency Omnidirectional Stereo Video System, A
* Unsupervised and Semi-Supervised Image Classification With Weak Semantic Consistency
* Unsupervised Learning of Human Pose Distance Metric via Sparsity Locality Preserving Projections
* Unsupervised Universal Attribute Modeling for Action Recognition
* Very Low Bitrate Semantic Compression of Airplane Cockpit Screen Content
* Video Big Data Retrieval Over Media Cloud: A Context-Aware Online Learning Approach
* Video Saliency Detection via Graph Clustering With Motion Energy and Spatiotemporal Objectness
* Weakly Semantic Guided Action Recognition
* Weakly Supervised Dual Learning for Facial Action Unit Recognition
* Weakly-Supervised Visual Instrument-Playing Action Detection in Videos
* Where-and-When to Look: Deep Siamese Attention Networks for Video-Based Person Re-Identification
* Which Has Better Visual Quality: The Clear Blue Sky or a Blurry Animal?
* XMAS: An Efficient Mobile Adaptive Streaming Scheme Based on Traffic Shaping
* YogaNet: 3-D Yoga Asana Recognition Using Joint Angular Displacement Maps With ConvNets
247 for MultMed(21)
MultMed(22)
* 2-D Skeleton-Based Action Recognition via Two-Branch Stacked LSTM-RNNs
* 2D Pose-Based Real-Time Human Action Recognition With Occlusion-Handling
* 3D Room Layout Estimation From a Single RGB Image
* Accurate and Robust Video Saliency Detection via Self-Paced Diffusion
* Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration
* Adaptation-Oriented Feature Projection for One-Shot Action Recognition
* Adaptive Image Sampling Using Deep Learning and Its Application on X-Ray Fluorescence Image Reconstruction
* Adaptive Single Image Dehazing Using Joint Local-Global Illumination Adjustment
* Adversarial Attribute-Text Embedding for Person Search With Natural Language Query
* Affective Video Content Analysis With Adaptive Fusion Recurrent Network
* Asymmetric Joint GANs for Normalizing Face Illumination From a Single Image
* ATMFN: Adaptive-Threshold-Based Multi-Model Fusion Network for Compressed Face Hallucination
* Attentive Sequence to Sequence Translator for Localizing Video Clips by Natural Language, An
* Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking
* Automated Colorization of a Grayscale Image With Seed Points Propagation
* Bidirectional Attention-Recognition Model for Fine-Grained Object Classification
* Blind Night-Time Image Quality Assessment: Subjective and Objective Approaches
* Blind Watermarking for 3-D Printed Objects by Locally Modifying Layer Thickness
* CGR-GAN: CG Facial Image Regeneration for Antiforensics Based on Generative Adversarial Network
* Character-Oriented Video Summarization With Visual and Textual Cues
* CI-GNN: Building a Category-Instance Graph for Zero-Shot Video Classification
* CKD: Cross-Task Knowledge Distillation for Text-to-Image Synthesis
* Co-Prediction-Based Compression Scheme for Correlated Images, A
* Coarse-to-Fine Localization of Temporal Action Proposals
* Collaborative Content Placement Among Wireless Edge Caching Stations With Time-to-Live Cache
* Compact Hash Code Learning With Binary Deep Neural Network
* Concentrated Local Part Discovery With Fine-Grained Part Representation for Person Re-Identification
* Content-Based Light Field Image Compression Method With Gaussian Process Regression
* Contextualized CNN for Scene-Aware Depth Estimation From Single RGB Image
* Convolutional Networks With Channel and STIPs Attention Model for Action Recognition in Videos
* Cuboid CNN Model with an Attention Mechanism for Skeleton-Based Action Recognition, A
* Cycle-IR: Deep Cyclic Image Retargeting
* Deep Co-Saliency Detection via Stacked Autoencoder-Enabled Fusion and Self-Trained CNNs
* Deep Dual-Channel Neural Network for Image-Based Smoke Detection
* Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification
* Deep Gesture Video Generation With Learning on Regions of Interest
* Deep Manifold-to-Manifold Transforming Network for Skeleton-Based Action Recognition
* Deep Metric Learning With Density Adaptivity
* Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
* Deep Multi-Scale Context Aware Feature Aggregation for Curved Scene Text Detection
* Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
* Deep Position-Sensitive Tracking
* Deep Reference Generation With Multi-Domain Hierarchical Constraints for Inter Prediction
* Deep Reinforcement Learning for Image Hashing
* Deep Top-k Ranking for Image-Sentence Matching
* Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging
* DeepFacade: A Deep Learning Approach to Facade Parsing With Symmetric Loss
* DeepQoE: A Multimodal Learning Framework for Video Quality of Experience (QoE) Prediction
* Design of Compressed Sensing System With Probability-Based Prior Information
* Detecting Social Signals in User-Shared Images for Connection Discovery Using Deep Learning
* Dilated Inception Network for Visual Saliency Prediction, A
* Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition
* Distance-Driven Alliance for a P2P Live Video System, A
* Distinct Feature Extraction for Video-Based Gait Phase Classification
* Dual Convolutional LSTM Network for Referring Image Segmentation
* Dynamic Objectives Learning for Facial Expression Recognition
* Dynamic Spectrum Access for Multimedia Transmission Over Multi-User, Multi-Channel Cognitive Radio Networks
* EEG-Based Study on Perception of Video Distortion Under Various Content Motion Conditions, An
* Efficient and Secure Image Communication System Based on Compressed Sensing for IoT Monitoring Applications
* Efficient Mobile Video Streaming via Context-Aware RaptorQ-Based Unequal Error Protection
* Efficient NVoD Scheme Using Implicit Error Correction and Subchannels for Wireless Networks, An
* Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search
* Energy Compaction-Based Image Compression Using Convolutional AutoEncoder
* Enhanced Intra Prediction for Video Coding by Using Multiple Neural Networks
* Enhancing the Quality of Image Tagging Using a Visio-Textual Knowledge Base
* Ensemble Tracking Based on Diverse Collaborative Framework With Multi-Cue Dynamic Fusion
* Equalized Margin Loss for Face Recognition, An
* Exploiting Vulnerabilities of Deep Neural Networks for Privacy Protection
* Exploring Discriminative Representations for Image Emotion Recognition With CNNs
* Exploring Global and Local Linguistic Representations for Text-to-Image Synthesis
* Fast Depth and Inter Mode Prediction for Quality Scalable High Efficiency Video Coding
* Fast FoV-Switching DASH System Based on Tiling Mechanism for Practical Omnidirectional Video Services, A
* Fast User-Guided Single Image Reflection Removal via Edge-Aware Cascaded Networks
* Feature Matching With Intra-Group Sparse Model
* Feature-Flow Interpretation of Deep Convolutional Neural Networks
* FFTMI: Features Fusion for Natural Tone-Mapped Images Quality Evaluation
* Fine-Grained Classification of Internet Video Traffic From QoS Perspective Using Fractal Spectrum
* Flexible Deep CNN Framework for Image Restoration, A
* Flexibly Connectable Light Field System For Free View Exploration
* Flickr Image Community Analytics by Deep Noise-Refined Matrix Factorization
* Food Recommendation: Framework, Existing Solutions, and Challenges
* Frame Augmented Alternating Attention Network for Video Question Answering
* Fuzzy Least Squares Support Vector Machine With Adaptive Membership for Object Tracking
* GAIM: Graph Attention Interaction Model for Collective Activity Recognition
* Generative Adversarial Network-Based Intra Prediction for Video Coding
* Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition
* GENPass: A Multi-Source Deep Learning Model for Password Guessing
* Gestures In-The-Wild: Detecting Conversational Hand Gestures in Crowded Scenes Using a Multimodal Fusion of Bags of Video Trajectories and Body Worn Acceleration
* GLNet: Global Local Network for Weakly Supervised Action Localization
* Guide to Match: Multi-Layer Feature Matching With a Hybrid Gaussian Mixture Model
* Hierarchical Attention Network for Visually-Aware Food Recommendation
* Hierarchical Coding of Convolutional Features for Scene Recognition
* Hierarchical Context Features Embedding for Object Detection
* Hierarchical Prototype Learning for Zero-Shot Recognition
* How Do We Experience Crossmodal Correspondent Mulsemedia Content?
* Illumination-Adaptive Person Re-Identification
* Image Compression Based on Compressive Sensing: End-to-End Comparison With JPEG
* Image Retargetability
* Image Vectorization With Real-Time Thin-Plate Spline
* Importance of Context When Recommending TV Content: Dataset and Algorithms, The
* Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval
* Improved Reversible Data Hiding in Encrypted Images Using Parametric Binary Tree Labeling, An
* Incentive Mechanism for Cooperative Scalable Video Coding (SVC) Multicast Based on Contract Theory
* Interact as You Intend: Intention-Driven Human-Object Interaction Detection
* Interpretable Fast Multi-Scale Deep Decoder for the Standard HEVC Bitstreams, The
* Intra Coding Strategy for Video Error Resiliency: Behavioral Analysis
* Iterative Deep Neural Network Quantization With Lipschitz Constraint
* iWave: CNN-Based Wavelet-Like Transform for Image Compression
* Joint Deep Learning of Facial Expression Synthesis and Recognition
* Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition
* Jointly Learning Kernel Representation Tensor and Affinity Matrix for Multi-View Clustering
* Jointly Sparse Locality Regression for Image Feature Extraction
* Kernel-Based Mixture Mapping for Image and Text Association
* Kernelized Fuzzy Modal Variation for Local Change Detection From Video Scenes
* Knowledge-Augmented Multimodal Deep Regression Bayesian Networks for Emotion Video Tagging
* Knowledge-Based Topic Model for Multi-Modal Social Event Analysis
* Latency-Aware Adaptive Video Summarization for Mobile Edge Clouds
* Learning Discriminative and Generative Shape Embeddings for Three-Dimensional Shape Retrieval
* Learning How to Smile: Expression Video Generation With Conditional Adversarial Recurrent Nets
* Learning Local Quality-Aware Structures of Salient Regions for Stereoscopic Images via Deep Neural Networks
* Learning Non-Locally Regularized Compressed Sensing Network With Half-Quadratic Splitting
* Learning Normal Patterns via Adversarial Attention-Based Autoencoder for Abnormal Event Detection in Videos
* Learning Reliable Visual Saliency For Model Explanations
* Learning Scene Attribute for Scene Recognition
* Learning the Traditional Art of Chinese Calligraphy via Three-Dimensional Reconstruction and Assessment
* Learning-Based User Clustering and Link Allocation for Content Recommendation Based on D2D Multicast Communications
* Leveraging Virtual and Real Person for Unsupervised Person Re-Identification
* Light Field Super-Resolution Using Edge-Preserved Graph-Based Regularization
* Locally Confined Modality Fusion Network With a Global Perspective for Multimodal Human Affective Computing
* Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval
* Low-Light Image Enhancement With Semi-Decoupled Decomposition
* Low-Rank Regularized Multi-Representation Learning for Fashion Compatibility Prediction
* Massive-Scale Genre Communities Learning Using a Noise-Tolerant Deep Architecture
* MLC STT-MRAM-Aware Memory Subsystem for Smart Image Applications
* Mobile Streaming of Live 360-Degree Videos
* Moving Cast Shadows Segmentation Using Illumination Invariant Feature
* MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution
* MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution
* Multi-Attribute Blind Quality Evaluator for Tone-Mapped Images, A
* Multi-Direction Dictionary Learning Based Depth Map Super-Resolution With Autoregressive Modeling
* Multi-Focus Image Fusion by Hessian Matrix Based Decomposition
* Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval
* Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning
* Multi-Party WebRTC Services Using Delay and Bandwidth Aware SDN-Assisted IP Multicasting of Scalable Video Over 5G Networks
* Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval
* Multi-Scale Based Context-Aware Net for Action Detection
* Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information
* Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization, A
* Multi-View Saliency Guided Deep Neural Network for 3-D Object Retrieval and Classification
* Multimedia Intelligence: When Multimedia Meets Artificial Intelligence
* Multiscale Superpixel-Based Hyperspectral Image Classification Using Recurrent Neural Networks With Stacked Autoencoders
* Neighborhood Pyramid Preserving Hashing
* Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking
* New Method and Benchmark for Detecting Co-Saliency Within a Single Image, A
* No-Reference Quality Evaluation of Stereoscopic Video Based on Spatio-Temporal Texture
* Novel Convolutional Neural Network for Image Steganalysis With Shared Normalization, A
* Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval
* Online Robust Principal Component Analysis With Change Point Detection
* Optimizing Fixation Prediction Using Recurrent Neural Networks for 360° Video Streaming in Head-Mounted Virtual Reality
* Oriented Spatial Transformer Network for Pedestrian Detection Using Fish-Eye Camera
* Part-Aware Fine-Grained Object Categorization Using Weakly Supervised Part Detection Network
* Partition-Aware Adaptive Switching Neural Networks for Post-Processing in HEVC
* Patch-Based Image Hallucination for Super Resolution With Detail Reconstruction From Similar Sample Images
* Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition
* PDR-Net: Perception-Inspired Single Image Dehazing Network With Refinement
* PixelRL: Fully Convolutional Network With Reinforcement Learning for Image Processing
* PointHop: An Explainable Machine Learning Method for Point Cloud Classification
* Prediction of Saliency Map for Head and Eye Movements in 360 Degree Images, The
* Pruning 3D Filters For Accelerating 3D ConvNets
* PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark
* QoE Analysis of Dense Multiview Video With Head-Mounted Devices
* Radiance-Reflectance Combined Optimization and Structure-Guided L_0-Norm for Single Image Dehazing
* Rate Constrained Multiple-QP Optimization for HEVC
* Rate-Distortion Optimal Joint Texture and Depth Map Coding for 3-D Video Streaming
* Realistic Facial Expression Reconstruction for VR HMD Users
* Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval
* Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver
* Recall What You See Continually Using GridLSTM in Image Captioning
* Reduced Reference Stereoscopic Image Quality Assessment Using Sparse Representation and Natural Scene Statistics
* Referring Image Segmentation by Generative Adversarial Learning
* Refined TV-L1 Optical Flow Estimation Using Joint Filtering
* Relation Attention for Temporal Action Localization
* Representing Modifiable and Reusable Musical Content on the Web With Constrained Multi-Hierarchical Structures
* Reversible Data Hiding in Encrypted Images Based on Multi-MSB Prediction and Huffman Coding
* RGB-T Image Saliency Detection via Collaborative Graph Learning
* Rich Features Embedding for Cross-Modal Retrieval: A Simple Baseline
* Robust QoE-Driven DASH Over OFDMA Networks
* Robust Visual Tracking via Constrained Multi-Kernel Correlation Filters
* Role of the Input in Natural Language Video Description, The
* Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking
* Salient Object Detection via Multiple Instance Joint Re-Learning
* Screen Content Compression Based on Enhanced Soft Context Formation
* SDN-Based Caching Decision Policy for Video Caching in Information-Centric Networking, An
* Semantic Segmentation Guided Pixel Fusion for Image Retargeting
* Semi-Supervised Cross-Modal Retrieval With Label Prediction
* Sensor-Augmented Neural Adaptive Bitrate Video Streaming on UAVs
* Sentiment Recognition for Short Annotated GIFs Using Visual-Textual Fusion
* Show, Tell, and Polish: Ruminant Decoding for Image Captioning
* Similarity-Aware and Variational Deep Adversarial Learning for Robust Facial Age Estimation
* Single-Image Super-Resolution Method Based on Progressive-Iterative Approximation, A
* Sketch-Based Shape Retrieval via Best View Selection and a Cross-Domain Similarity Measure
* Snapshot High Dynamic Range Imaging via Sparse Representations and Feature Learning
* Spatio-Temporal Attention Networks for Action Recognition and Detection
* Spatio-Temporal VLAD Encoding of Visual Events Using Temporal Ordering of the Mid-Level Deep Semantics
* Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions
* Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions
* STAT: Spatial-Temporal Attention Mechanism for Video Captioning
* STAT: Spatial-Temporal Attention Mechanism for Video Captioning
* Statistical Learning Based Congestion Control for Real-Time Video Communication
* Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding
* Steganographic Security Analysis From Side Channel Steganalysis and Its Complementary Attacks
* Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending
* STNReID: Deep Convolutional Networks With Pairwise Spatial Transformer Networks for Partial Person Re-Identification
* Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification, A
* Study on 2D Feature-Based Hash Learning
* Style-Controlled Synthesis of Clothing Segments for Fashion Image Manipulation
* Tamper-Proofing Video With Hierarchical Attention Autoencoder Hashing on Blockchain
* Tile-Based Joint Caching and Delivery of 360° Videos in Heterogeneous Networks
* Toward Making Unsupervised Graph Hashing Discriminative
* Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm
* Towards Improving Robustness of Deep Neural Networks to Adversarial Perturbations
* Training Objective Image and Video Quality Estimators Using Multiple Databases
* Two-Stage Triplet Network Training Framework for Image Retrieval, A
* Ultra-Low Complexity and High Efficiency Approach for Lossless Alpha Channel Coding, An
* Uni-and-Bi-Directional Video Prediction via Learning Object-Centric Transformation
* Unified Deep Metric Representation for Mesh Saliency Detection and Non-Rigid Shape Matching, A
* Unmanned Aircraft System Aided Adaptive Video Streaming: A Joint Optimization Approach
* Unsupervised Real-Time Framework of Human Pose Tracking From Range Image Sequences, An
* Unsupervised Variational Video Hashing With 1D-CNN-LSTM Networks
* Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks
* Using Blockchain for Improved Video Integrity Verification
* Using Cell Phone Pictures of Sheet Music To Retrieve MIDI Passages
* Vabis: Video Adaptation Bitrate System for Time-Critical Live Streaming
* Variational Single Image Dehazing for Enhanced Visualization
* Vibrotactile Quality Assessment: Hybrid Metric Design Based on SNR and SSIM
* Video Anomaly Detection and Localization Based on an Adaptive Intra-Frame Classification Network
* Video Storytelling: Textual Summaries for Events
* VINet: A Visually Interpretable Image Diagnosis Network
* Visual Font Pairing
* Visual Relationship Embedding Network for Image Paragraph Generation
* Visual-Texual Emotion Analysis With Deep Coupled Video and Danmu Neural Networks
* WeGAN: Deep Image Hashing With Weighted Generative Adversarial Networks
* Weighted and Class-Specific Maximum Mean Discrepancy for Unsupervised Domain Adaptation
* What Image Features Boost Housing Market Predictions?
* WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild
* WSCNet: Weakly Supervised Coupled Networks for Visual Sentiment Classification and Detection
246 for MultMed(22)
MultMed(23)
* 3-D Human Behavior Understanding Using Generalized TS-LSTM Networks
* 3D Face Reconstruction From A Single Image Assisted by 2D Face Images in the Wild
* 3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval
* 3D Skeletal Gesture Recognition via Discriminative Coding on Time-Warping Invariant Riemannian Trajectories
* 460 GOPS/W Improved Mnemonic Descent Method-Based Hardwired Accelerator for Face Alignment, A
* Accurate and Efficient Image Super-Resolution via Global-Local Adjusting Dense Network
* Accurate, Robust Visual Odometry and Detail-Preserving Reconstruction System, An
* Acoustic Room Modelling Using 360 Stereo Cameras
* Adaptive and Robust Partition Learning for Person Retrieval With Policy Gradient
* Adaptive Arbitrary Multiresolution Decomposition for Multiscale Geometric Analysis, An
* Adaptive Deep Metric Learning for Affective Image Retrieval and Classification
* Adaptive Graph Completion Based Incomplete Multi-View Clustering
* Adaptive Multi-Feature Reliability Re-Determinative Correlation Filter for Visual Tracking
* Adaptive Partial Multi-View Hashing for Efficient Social Image Retrieval
* Adaptively Clustering-Driven Learning for Visual Relationship Detection
* Adversarial 3D Convolutional Auto-Encoder for Abnormal Event Detection in Videos
* Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition
* Adversarial Learning for Personalized Tag Recommendation
* Adversarial Multimodal Network for Movie Story Question Answering
* Adversarial Network With Multiple Classifiers for Open Set Domain Adaptation
* AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection
* Aggregating Global and Local Visual Representation for Vehicle Re-IDentification
* Anisotropic Graph Convolutional Network for Semi-Supervised Learning
* Approximation Algorithm to Maximize User Capacity for an Auto-Scaling VoD System, An
* APSE: Attention-Aware Polarity-Sensitive Embedding for Emotion-Based Image Retrieval
* Arbitrarily-Oriented Text Detection in Low Light Natural Scene Images
* Artificial Intelligence-Based System to Assess Nutrient Intake for Hospitalised Patients, An
* Attention-Based Deep Reinforcement Learning for Virtual Cinematography of 360° Videos
* Attention-Based Unsupervised Adversarial Model for Movie Review Spam Detection, An
* AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks
* Attentive Composite Residual Network for Robust Rain Removal from Single Images
* Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection
* Attribute-Aware Pedestrian Detection in a Crowd
* Attribute-Guided Feature Learning for Few-Shot Image Recognition
* Augmented Adversarial Training for Cross-Modal Retrieval
* AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent Videos With Deep Learning
* Automated and Robust Image Watermarking Scheme Based on Deep Neural Networks, An
* Benchmarking Image Retrieval Diversification Techniques for Social Media
* Beyond Vision: A Multimodal Recurrent Attention Convolutional Neural Network for Unified Image Aesthetic Prediction Tasks
* Blind 3D-Printing Watermarking Using Moment Alignment and Surface Norm Distribution
* Blind Image Clustering for Camera Source Identification via Row-Sparsity Optimization
* Blind Image Denoising via Dynamic Dual Learning
* Blind Quality Assessment for Tone-Mapped Images by Analysis of Gradient and Chromatic Statistics
* Blind Quality Assessment of Screen Content Images Via Macro-Micro Modeling of Tensor Domain Dictionary
* Boosting Temporal Binary Coding for Large-Scale Video Search
* Bottom-Up and Top-Down Integration Framework for Online Object Tracking, A
* Brain-Media Deep Framework Towards Seeing Imaginations Inside Brains, A
* BR^2 Net: Defocus Blur Detection Via a Bidirectional Channel Attention Residual Refining Network
* Building High-Fidelity Human Body Models From User-Generated Data
* BVI-SynTex: A Synthetic Video Texture Dataset for Video Compression and Quality Assessment
* C-GCN: Correlation Based Graph Convolutional Network for Audio-Video Emotion Recognition
* CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification
* CaptionNet: A Tailor-made Recurrent Neural Network for Generating Image Descriptions
* Capturing Relevant Context for Visual Tracking
* CAT: Corner Aided Tracking With Deep Regression Network
* Character Detection in Animated Movies Using Multi-Style Adaptation and Visual Attention
* cmSalGAN: RGB-D Salient Object Detection With Cross-View Generative Adversarial Networks
* Co-Saliency Detection via a General Optimization Model and Adaptive Graph Learning
* Coarse-to-Fine CNN for Image Super-Resolution
* Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism, A
* CoLEAP: Cooperative Learning-Based Edge Scheme With Caching and Prefetching for DASH Video Delivery
* Collaborative Image Relevance Learning for Visual Re-Ranking
* Collaborative Social-Aware and QoE-Driven Video Caching and Adaptation in Edge Network
* COMO: Efficient Deep Neural Networks Expansion With COnvolutional MaxOut
* Comparative Perceptual Assessment of Visual Signals Using Free Energy Features
* Constant Size Point Cloud Clustering: A Compact, Non-Overlapping Solution
* Context-Dependent Propagating-Based Video Recommendation in Multimodal Heterogeneous Information Networks
* Controlling P2P-CDN Live Streaming Services at SDN-Enabled Multi-Access Edge Datacenters
* Data-Driven Bandwidth Prediction Models and Automated Model Selection for Low Latency
* DCR: A Unified Framework for Holistic/Partial Person ReID
* Deep Battery Saver: End-to-End Learning for Power Constrained Contrast Enhancement
* Deep Collaborative Discrete Hashing With Semantic-Invariant Structure Construction
* Deep Hashing with Weighted Spatial Importance
* Deep Image Coding Scheme With Generative Network to Learn From Correlated Images, A
* Deep Loss Driven Multi-Scale Hashing Based on Pyramid Connected Network
* Deep Multi-Patch Matching Network for Visible Thermal Person Re-Identification
* Deep Multi-View Subspace Clustering With Unified and Discriminative Learning
* Deep Reinforcement Polishing Network for Video Captioning
* Deep Semantic Parsing of Freehand Sketches With Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning
* Deep Single Image Deraining via Modeling Haze-Like Effect
* Deep Texture Exemplar Extraction Based on Trimmed T-CNN
* Deep Unsupervised Binary Descriptor Learning Through Locality Consistency and Self Distinctiveness
* Deep Unsupervised Self-Evolutionary Hashing for Image Retrieval
* DeepDance: Music-to-Dance Motion Choreography With Adversarial Learning
* DENet: A Universal Network for Counting Crowd With Varying Densities and Scales
* Dense Video Captioning Using Graph-Based Sentence Summarization
* Density-Aware Multi-Task Learning for Crowd Counting
* Discriminative Region Mining for Object Detection
* Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition
* DLGAN: Depth-Preserving Latent Generative Adversarial Network for 3D Reconstruction
* Domain Adaptation for Food Intake Classification With Teacher/Student Learning
* Domain-Oriented Semantic Embedding for Zero-Shot Learning
* Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes
* Driver Yawning Detection Based on Subtle Facial Action Recognition
* DSLR: Deep Stacked Laplacian Restorer for Low-Light Image Enhancement
* Dynamic Motion Estimation and Evolution Video Prediction Network
* Dynamic Point Cloud Inpainting via Spatial-Temporal Graph Learning
* Edge-Cloud Collaboration Enabled Video Service Enhancement: A Hybrid Human-Artificial Intelligence Scheme
* Efficient Design and Control for Network-Assisted Device-to-Device Content Delivery Network
* Efficient Projected Frame Padding for Video-Based Point Cloud Compression
* Emotion Attention-Aware Collaborative Deep Reinforcement Learning for Image Cropping
* Emotion Knowledge Driven Video Highlight Detection
* Enabling Artistic Control Over Pattern Density and Stroke Strength
* End-to-End Audiovisual Speech Recognition System With Multitask Learning
* Enhancing Underexposed Photos Using Perceptually Bidirectional Similarity
* Environmental Sound Classification Using Local Binary Pattern and Audio Features Collaboration
* Estimation of Quality Scores From Subjective Tests-Beyond Subjects' MOS
* Exploiting Local Degradation Characteristics and Global Statistical Properties for Blind Quality Assessment of Tone-Mapped HDR Images
* Explore Video Clip Order With Self-Supervised and Curriculum Learning for Video Applications
* Exploring the Representativity of Art Paintings
* Expression-Aware Face Reconstruction via a Dual-Stream Network
* Factorized Tensor Dictionary Learning for Visual Tensor Data Completion
* Fast Multi-Type Tree Partitioning for Versatile Video Coding Using a Lightweight Neural Network
* Fast Nearest Subspace Search via Random Angular Hashing
* Fast Non-Local Adaptive In-Loop Filter Optimization on GPU
* Fine Granularity Access in Interactive Compression of 360-Degree Images Based on Rate-adaptive Channel Codes
* Fine-Grained Image Captioning With Global-Local Discriminative Objective
* Fine-Grained Visual Categorization by Localizing Object Parts With Single Image
* Frame-Wise Detection of Double HEVC Compression by Learning Deep Spatio-Temporal Representations in Compression Domain
* Frequency-Dependent Depth Map Enhancement via Iterative Depth-Guided Affine Transformation and Intensity-Guided Refinement
* From Edge to Keypoint: An End-to-End Framework For Indoor Layout Estimation
* GAC-GAN: A General Method for Appearance-Controllable Human Video Motion Transfer
* Global Manifold Learning for Interactive Image Segmentation
* Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features
* Graph Regularized Encoder-Decoder Networks for Image Representation Learning
* Group Re-Identification With Group Context Graph Neural Networks
* Guest Editorial Special Section on Hybrid Human-Artificial Intelligence for Multimedia Computing
* Handling Outliers by Robust M-Estimation in Blind Image Deblurring
* HAPGN: Hierarchical Attentive Pooling Graph Network for Point Cloud Segmentation
* Hard Pixel Mining for Depth Privileged Semantic Segmentation
* Hardness-Aware Dictionary Learning: Boosting Dictionary for Recognition
* Heterogeneous Community Question Answering via Social-Aware Multi-Modal Co-Attention Convolutional Matching
* Hierarchical Group-Level Emotion Recognition
* Hierarchical Reasoning Network for Pedestrian Attribute Recognition
* Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition
* Hierarchical Visual Feature-Based Approach For Image Sonification, A
* High Capacity Reversible Data Hiding in Encrypted Image Based on Intra-Block Lossless Compression
* High-Fidelity Reversible Image Watermarking Based on Effective Prediction Error-Pairs Modification
* Human Memory Update Strategy: A Multi-Layer Template Update Mechanism for Remote Visual Monitoring
* Hybrid Approach for Detecting Prerequisite Relations in Multi-Modal Food Recipes, A
* Hybrid Refinement-Correction Heatmaps for Human Pose Estimation
* Hybrid-Attention Enhanced Two-Stream Fusion Network for Video Venue Prediction
* Image Denoising Using Superpixel-Based PCA
* Image Quality Assessment Using Kernel Sparse Coding
* Image-Only Real-Time Incremental UAV Image Mosaic for Multi-Strip Flight
* Image-Text Multimodal Emotion Classification via Multi-View Attentional Network
* Improving Driver Gaze Prediction With Reinforced Attention
* Improving Generative Modelling in VAEs Using Multimodal Prior
* Improving Student Learning Satisfaction by Using an Innovative DASH-Based Multiple Sensorial Media Delivery Solution
* Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval
* Integrating Part of Speech Guidance for Image Captioning
* Intelligibility Enhancement Via Normal-to-Lombard Speech Conversion With Long Short-Term Memory Network and Bayesian Gaussian Mixture Model
* Interactive Video Retrieval in the Age of Deep Learning: Detailed Evaluation of VBS 2019
* Interclass-Relativity-Adaptive Metric Learning for Cross-Modal Matching and Beyond
* Intermittent Contextual Learning for Keyfilter-Aware UAV Object Tracking Using Deep Convolutional Feature
* IPTV Channel Zapping Recommendation With Attention Mechanism
* Iterative Knowledge Distillation for Automatic Check-Out
* Joint Cross-Modal and Unimodal Features for RGB-D Salient Object Detection
* Joint Input and Output Space Learning for Multi-Label Image Classification
* Joint Intermediate Domain Generation and Distribution Alignment for 2D Image-Based 3D Objects Retrieval
* Kernelized Multiview Subspace Analysis By Self-Weighted Learning
* KTransGAN: Variational Inference-Based Knowledge Transfer for Unsupervised Conditional Generative Learning
* Large Factor Image Super-Resolution With Cascaded Convolutional Neural Networks
* LAST: Location-Appearance-Semantic-Temporal Clustering Based POI Summarization
* Latent Representation Learning Model for Multi-Band Images Fusion via Low-Rank and Sparse Embedding
* LCSegNet: An Efficient Semantic Segmentation Network for Large-Scale Complex Chinese Character Recognition
* LD-MAN: Layout-Driven Multimodal Attention Network for Online News Sentiment Recognition
* Learned Multi-Resolution Variable-Rate Image Compression With Octave-Based Residual Blocks
* Learned Resolution Scaling Powered Gaming-as-a-Service at Scale
* Learning Adaptive Neighborhood Graph on Grassmann Manifolds for Video/Image-Set Subspace Clustering
* Learning and Fusing Multiple User Interest Representations for Micro-Video and Movie Recommendations
* Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval
* Learning Compact Multifeature Codes for Palmprint Recognition From a Single Training Image per Palm
* Learning Crisp Boundaries Using Deep Refinement Network and Adaptive Weighting Loss
* Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking
* Learning Diverse Fashion Collocation by Neural Graph Filtering
* Learning Dual-Pooling Graph Neural Networks for Few-Shot Video Classification
* Learning Face Image Super-Resolution Through Facial Semantic Attribute Transformation and Self-Attentive Structure Enhancement
* Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data
* Learning Fundamental Visual Concepts Based on Evolved Multi-Edge Concept Graph
* Learning Localized Representations of Point Clouds With Graph-Convolutional Generative Adversarial Networks
* Learning Low-Rank Sparse Representations With Robust Relationship Inference for Image Memorability Prediction
* Learning Representations for High-Dynamic-Range Image Color Transfer in a Self-Supervised Way
* Learning Spatial-Temporal Representations Over Walking Tracklet for Long-Term Person Re-Identification in the Wild
* Learning Specific and General Realm Feature Representations for Image Fusion
* Learning the Relation Between Interested Objects and Aesthetic Region for Image Cropping
* Learning to Generate Multi-Exposure Stacks With Cycle Consistency for High Dynamic Range Imaging
* Learning to Hash With Dimension Analysis Based Quantizer for Image Retrieval
* Learning to Segment Video Object With Accurate Boundaries
* Learning to Visualize Music Through Shot Sequence for Automatic Concert Video Mashup
* Less is (Just as Good as) More: An Investigation of Odor Intensity and Hedonic Valence in Mulsemedia QoE using Heart Rate and Eye Tracking
* Light Field Image Coding Using VVC Standard and View Synthesis Based on Dual Discriminator GAN
* Low-Cost Anti-Copying 2D Barcode by Exploiting Channel Noise Characteristics
* Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification
* Luminance-Aware Pyramid Network for Low-Light Image Enhancement
* M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval
* MaD-DLS: Mean and Deviation of Deep and Local Similarity for Image Quality Assessment
* Manifold Transfer Learning via Discriminant Regression Analysis
* Mask Cross-Modal Hashing Networks
* Matrix Factorization Based Framework for Fusion of Physical and Social Sensors, A
* Mesh Convolution: A Novel Feature Extraction Method for 3D Nonrigid Object Classification
* Metadata Connector: Exploiting Hashtag and Tag for Cross-OSN Event Search
* Model-Based Joint Bit Allocation Between Geometry and Color for Video-Based 3D Point Cloud Compression
* Modeling Fashion Influence From Photos
* Modeling QoE for Buffered Video Streaming in Interference-Limited Cellular Networks
* Monocular 3D Facial Expression Features for Continuous Affect Recognition
* Motion Blur Removal With Quality Assessment Guidance
* Motion Compensated Virtual View Synthesis Using Novel Particle Cell
* Multi-Channel Deep Networks for Block-Based Image Compressive Sensing
* Multi-Encoder Towards Effective Anomaly Detection in Videos
* Multi-FoV Viewport-Based Visual Saliency Model Using Adaptive Weighting Losses for 360° Images, A
* Multi-Stream Graph Convolutional Networks-Hidden Conditional Random Field Model for Skeleton-Based Action Recognition, A
* Multimodal Disentangled Domain Adaption for Social Media Event Rumor Detection
* Mutually Attentive Co-Training Framework for Semi-Supervised Recognition, A
* MVANet: Multi-Task Guided Multi-View Attention Network for Chinese Food Recognition
* NDN-MMRA: Multi-Stage Multicast Rate Adaptation in Named Data Networking WLAN
* Neural Style Palette: A Multimodal and Interactive Style Transfer From a Single Style Image
* New Approach for Character Recognition of Multi-Style Vehicle License Plates, A
* New Image Compression Algorithm Based on Non-Uniform Partition and U-System, A
* New Multihypothesis-Based Compressed Video Sensing Reconstruction System, A
* Novel Depth and Color Feature Fusion Framework for 6D Object Pose Estimation, A
* Novel Image Representation Method Under a Non-Standard Positional Numeral System, A
* Novel Perspective to Zero-Shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion, A
* Object Cosegmentation in Noisy Videos With Multilevel Hypergraph
* Object-Aware Multimodal Named Entity Recognition in Social Media Posts With Adversarial Learning
* On Reliable Multi-View Affinity Learning for Subspace Clustering
* One-Shot Texture Retrieval Using Global Grouping Metric
* Online Hashing With Bit Selection for Image Retrieval
* OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection
* Optimal Wireless Streaming of Multi-Quality 360 VR Video By Exploiting Natural, Relative Smoothness-Enabled, and Transcoding-Enabled Multicast Opportunities
* Optimizing Video QoE for Mobile eMBMS Users in Cellular Networks
* Panoramic Video Quality Assessment Based on Non-Local Spherical CNN
* Parameter Sharing Exploration and Hetero-Center Triplet Loss for Visible-Thermal Person Re-Identification
* Parametric Shape Estimation of Human Body Under Wide Clothing
* Part-aware Progressive Unsupervised Domain Adaptation for Person Re-Identification
* Patch Based Video Summarization With Block Sparse Representation
* Perceptual Image Hashing With Texture and Invariant Vector Distance for Copy Detection
* Person Re-Identification in Aerial Imagery
* Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning
* PFAN++: Bi-Directional Image-Text Retrieval With Position Focused Attention Network
* Physiology-Based QoE Comparison of Interactive Augmented Reality, Virtual Reality and Tablet-Based Applications, A
* Pixel-Level Non-local Image Smoothing With Objective Evaluation
* Point Cloud Rendering After Coding: Impacts on Subjective and Objective Quality
* Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking
* Pose-Normalized and Appearance-Preserved Street-to-Shop Clothing Image Generation and Feature Learning
* Predicting the Perceptual Quality of Point Cloud: A 3D-to-2D Projection-Based Exploration
* Predicting User Quitting Ratio in Adaptive Bitrate Video Streaming
* Predictive Adaptive Streaming to Enable Mobile 360-Degree and VR Experiences
* Privacy-Preserving In-Home Fall Detection Using Visual Shielding Sensing and Private Information-Embedding
* Progressive Bilateral-Context Driven Model for Post-Processing Person Re-Identification
* Progressive Learning of Low-Precision Networks for Image Classification
* Progressive Unsupervised Person Re-Identification by Tracklet Association With Spatio-Temporal Regularization
* Prominent Local Representation for Dynamic Textures Based on High-Order Gaussian-Gradients
* PVC-SLP: Perceptual Vibrotactile-Signal Compression Based-on Sparse Linear Prediction
* QoE-driven HAS Live Video Channel Placement in the Media Cloud
* QoE-Driven UAV-Enabled Pseudo-Analog Wireless Video Broadcast: A Joint Optimization of Power and Trajectory
* QoS-Aware Multicast for Scalable Video Streaming in Software-Defined Networks
* Quality Evaluation for Image Retargeting With Instance Semantics
* Quality Index for View Synthesis by Measuring Instance Degradation and Global Appearance
* Query Reconstruction Network for Referring Expression Image Segmentation
* R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection
* Rank-Consistency Deep Hashing for Scalable Multi-Label Image Search
* Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC
* Re-Synchronization Using the Hand Preceding Model for Multi-Modal Fusion in Automatic Continuous Cued Speech Recognition
* Re-Visiting Discriminator for Blind Free-Viewpoint Image Quality Assessment
* Real-world Cross-modal Retrieval via Sequential Learning
* RealVAD: A Real-World Dataset and A Method for Voice Activity Detection by Body Motion Analysis
* Recurrent Generative Adversarial Network for Face Completion
* Recursive Reversible Data Hiding in Encrypted Images Method With a Very High Payload, A
* Redundancy and Optimization of tANS Entropy Encoders
* Referring Expression Comprehension: A Survey of Methods and Datasets
* Resource-Efficient Parallel Connected Component Labeling Algorithm and Its Hardware Implementation, A
* Risk Optimization for Revenue-Driven Wireless Video Broadcasting Systems: A Copula-Based Framework
* Robust and Efficient RGB-D SLAM in Dynamic Environments
* Robust CAPTCHAs Towards Malicious OCR
* Robust Coding of Encrypted Images via 2D Compressed Sensing
* SAL: Selection and Attention Losses for Weakly Supervised Semantic Segmentation
* Saliency Detection Using Deep Features and Affinity-Based Robust Background Subtraction
* Salient Object Detection by Fusing Local and Global Contexts
* Salient Object Detection in Stereoscopic 3D Images Using a Deep Convolutional Residual Autoencoder
* SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries
* Self-Adaptive Neural Module Transformer for Visual Question Answering
* Self-Supervised Deep TripleNet for Video Object Segmentation
* Semantic Context Encoding for Accurate 3D Point Cloud Segmentation
* Semantic Example Guided Image-to-Image Translation
* Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval
* Semi-Reference Sonar Image Quality Assessment Based on Task and Visual Perception
* Separable Reversible Data Hiding for Encrypted Three-Dimensional Models Based on Spatial Subdivision and Space Encoding
* Serial Image Copy-Move Forgery Localization Scheme With Source/Target Distinguishment, A
* Siamese Tracking Network With Informative Enhanced Loss
* Single Shot Video Object Detector
* Snowball: Iterative Model Evolution and Confident Sample Discovery for Semi-Supervised Learning on Very Small Labeled Datasets
* Soft Video Multicasting Using Adaptive Compressed Sensing
* Solving Jigsaw Puzzles via Nonconvex Quadratic Programming With the Projected Power Method
* SPA-GAN: Spatial Attention GAN for Image-to-Image Translation
* Sparkle: User-Aware Viewport Prediction in 360-Degree Video Streaming
* SparseFusion: Dynamic Human Avatar Modeling From Sparse RGBD Images
* Spatial Pyramid Attention for Deep Convolutional Neural Networks
* Spatial-Temporal Cascade Autoencoder for Video Anomaly Detection in Crowded Scenes
* Speaker Clustering by Co-Optimizing Deep Representation Learning and Cluster Estimation
* Spectrum Characteristics Preserved Visible and Near-Infrared Image Fusion Algorithm
* Speech Personality Recognition Based on Annotation Classification Using Log-Likelihood Distance and Extraction of Essential Audio Features
* SRD: A Tree Structure Based Decoder for Online Handwritten Mathematical Expression Recognition
* Stacked U-Shape Network With Channel-Wise Attention for Salient Object Detection
* Staged Sketch-to-Image Synthesis via Semi-Supervised Generative Adversarial Networks
* STGL: Spatial-Temporal Graph Representation and Learning for Visual Tracking
* Story-driven Video Editing
* String Prediction for 4:2:0 Format Screen Content Coding and Its Implementation in AVS3
* StyleGuide: Zero-Shot Sketch-Based Image Retrieval Using Style-Guided Image Generation
* Subjective and Objective Quality Assessment for Stereoscopic Image Retargeting
* Supervised Pixel-Wise GAN for Face Super-Resolution
* Tag Propagation and Cost-Sensitive Learning for Music Auto-Tagging
* TBEFN: A Two-Branch Exposure-Fusion Network for Low-Light Image Enhancement
* TCLiVi: Transmission Control in Live Video Streaming Based on Deep Reinforcement Learning
* Temporal Action Localization Using Long Short-Term Dependency
* Temporal Constraint Background-Aware Correlation Filter With Saliency Map
* Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks
* Toward Multi-Modal Conditioned Fashion Image Translation
* Towards Coding for Human and Machine Vision: Scalable Face Image Coding
* Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection
* Transformer Encoder With Multi-Modal Multi-Head Attention for Continuous Affect Recognition
* TTL-IQA: Transitive Transfer Learning Based No-Reference Image Quality Assessment
* Understanding More About Human and Machine Attention in Deep Neural Networks
* Universal Chosen-Ciphertext Attack for a Family of Image Encryption Schemes
* Universal Cross-Domain 3D Model Retrieval
* Universal-to-Specific Framework for Complex Action Recognition
* Unsupervised Adversarial Instance-Level Image Retrieval
* Unsupervised Moving Object Detection in Complex Scenes Using Adversarial Regularizations
* Unsupervised Multi-View Clustering by Squeezing Hybrid Knowledge From Cross View and Each View
* Unsupervised Video Summarization via Relation-Aware Assignment Learning
* User Identity Linkage Across Social Media via Attentive Time-Aware User Modeling
* Utilizing Two-Phase Processing With FBLS for Single Image Deraining
* V-Eye: A Vision-Based Navigation System for the Visually Impaired
* Vector-Based Feature Representations for Speech Signals: From Supervector to Latent Vector
* VehicleNet: Learning Robust Visual Representation for Vehicle Re-Identification
* Viewpoint Recommendation Based on Object-Oriented 3D Scene Reconstruction
* Viewport-Dependent Saliency Prediction in 360° Video
* Virtual Try-on Network With Attribute Transformation and Local Rendering
* Visual Question Answering With Dense Inter- and Intra-Modality Interactions
* Weakly Supervised Emotion Intensity Prediction for Recognition of Emotions in Images
* Weighted Adaptive Image Super-Resolution Scheme Based on Local Fractal Feature and Image Roughness
* Wide Color Gamut Image Content Characterization: Method, Evaluation, and Applications
* Wildfish++: A Comprehensive Fish Benchmark for Multimedia Research
* YuvConv: Multi-Scale Non-Uniform Convolution Structure Based on YUV Color Model
344 for MultMed(23)
MultMed(24)
* 3D Mesh-Based Lifting-and-Projection Network for Human Pose Transfer, A
* 3DBodyNet: Fast Reconstruction of 3D Animatable Human Body Shape From a Single Commodity Depth Camera
* Accurate Scene Text Detection Via Scale-Aware Data Augmentation and Shape Similarity Constraint
* Action Coherence Network for Weakly-Supervised Temporal Action Localization
* Active Gradual Domain Adaptation: Dataset and Approach
* AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting
* Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal Retrieval
* Adaptive Semantic-Spatio-Temporal Graph Convolutional Network for Lip Reading
* Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching
* Affinity Fusion Graph-Based Framework for Natural Image Segmentation
* AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs
* Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval
* Align R-CNN: A Pairwise Head Network for Visual Relationship Detection
* Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning
* Alleviating Modality Bias Training for Infrared-Visible Person Re-Identification
* Amorphous Region Context Modeling for Scene Recognition
* AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation
* Annealing Genetic GAN for Imbalanced Web Data Learning
* Annular-Graph Attention Model for Personalized Sequential Recommendation
* Anti-Forensics for Face Swapping Videos via Adversarial Training
* APMC: Adjacent Pixels Based Measurement Coding System for Compressively Sensed Images
* Apparel-Invariant Feature Learning for Person Re-Identification
* Approximate k-NN Graph Construction: A Generic Online Approach
* Associated Spatio-Temporal Capsule Network for Gait Recognition
* Asymmetric Correlation Quantization Hashing for Cross-Modal Retrieval
* Attribute Restoration Framework for Anomaly Detection
* Attribute-Aware Feature Encoding for Object Recognition and Segmentation
* Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning
* Audio Matters in Video Super-Resolution by Implicit Semantic Guidance
* Audio-Visual Tracking of Concurrent Speakers
* Automatic Tagging by Leveraging Visual and Annotated Features in Social Media
* AVN: An Adversarial Variation Network Model for Handwritten Signature Verification
* A^3-FKG: Attentive Attribute-Aware Fashion Knowledge Graph for Outfit Preference Prediction
* Bal-R2CNN: High Quality Recurrent Object Detection With Balance Optimization
* Beyond Triplet Loss: Meta Prototypical N-Tuple Loss for Person Re-identification
* Beyond Triplet Loss: Person Re-Identification With Fine-Grained Difference-Aware Pairwise Loss
* Bilateral Weighted Regression Ranking Model With Spatial-Temporal Correlation Filter for Visual Tracking
* Blind Color Separation Model for Faithful Palette-Based Image Recoloring, A
* Blind Stereoscopic Image Quality Evaluator Based on Binocular Semantic and Quality Channels
* Boundary Information Progressive Guidance Network for Salient Object Detection
* Boundary-Aware Arbitrary-Shaped Scene Text Detector With Learnable Embedding Network
* Bridging the Gap Between Semantic Segmentation and Instance Segmentation
* Broad-to-Narrow Registration and Identification of 3D Objects in Partially Scanned and Cluttered Point Clouds
* Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media
* BVI-DVC: A Training Database for Deep Video Compression
* CariMe: Unpaired Caricature Generation With Multiple Exaggerations
* Catching the Moment With LoL^+ in Twitch-Like Low-Latency Live Streaming Platforms
* CCAFNet: Crossflow and Cross-Scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images
* CDFKD-MFS: Collaborative Data-Free Knowledge Distillation via Multi-Level Feature Sharing
* Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise
* COLA-Net: Collaborative Attention Network for Image Restoration
* Collaborative Learning With a Multi-Branch Framework for Feature Enhancement
* Combining Retargeting Quality and Depth Perception Measures for Quality Evaluation of Retargeted Stereopairs
* Commonality Modeling Framework for Enhanced Video Coding Leveraging on the Cuboidal Partitioning Based Representation of Frames, A
* Comprehensive Study on Deep Learning-Based Methods for Sign Language Recognition, A
* Conditional Sentence Generation and Cross-Modal Reranking for Sign Language Translation
* Confidence-Based 6D Object Pose Estimation
* Consensus Feature Network for Scene Parsing
* Consensus Graph Learning for Multi-View Clustering
* Consolidated Dataset and Metrics for High-Dynamic-Range Image Quality
* Constrained Tensor Representation Learning for Multi-View Semi-Supervised Subspace Clustering
* Contrastive Attention for Video Anomaly Detection
* Controllable Facial Caricaturization With Localized Deformation and Personalized Semantic Attentions
* Convolutional Neural Network-Based Occupancy Map Accuracy Improvement for Video-Based Point Cloud Compression
* Correlation Graph Convolutional Network for Pedestrian Attribute Recognition
* CRF-Based Framework for Tracklet Inactivation in Online Multi-Object Tracking, A
* CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition
* Cross Parallax Attention Network for Stereo Image Super-Resolution
* Cross View Capture for Stereo Image Super-Resolution
* Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query
* Cross-Modal Food Retrieval: Learning a Joint Embedding of Food Images and Recipes With Semantic Consistency and Attention Mechanism
* Cross-Modal Semantic Matching Generative Adversarial Networks for Text-to-Image Synthesis
* Cross-Modality Fusion and Progressive Integration Network for Saliency Prediction on Stereoscopic 3D Images
* CrossNet: Detecting Objects as Crosses
* Crowd Counting Via Perspective-Guided Fractional-Dilation Convolution
* Cryptanalysis of Reversible Data Hiding in Encrypted Images by Block Permutation and Co-Modulation
* DBDnet: A Deep Boosting Strategy for Image Denoising
* Decoupled Representation Learning for Character Glyph Synthesis
* Deep Arbitrary HDRI: Inverse Tone Mapping With Controllable Exposure Changes
* Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition
* Deep Co-Image-Label Hashing for Multi-Label Image Retrieval
* Deep Continual Learning for Emerging Emotion Recognition
* Deep Domain Adaptation Based Multi-Spectral Salient Object Detection
* Deep Enhanced Weakly-Supervised Hashing With Iterative Tag Refinement
* Deep Generative Model for Image Inpainting With Local Binary Pattern Learning and Spatial Attention
* Deep Light Field Super-Resolution Using Frequency Domain Analysis and Semantic Prior
* Deep Metric Learning With Manifold Class Variability Analysis
* Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding
* Deep RGB-D Saliency Detection Without Depth
* Deep Shape-Aware Person Re-Identification for Overcoming Moderate Clothing Changes
* Deep View Synthesis via Self-Consistent Generative Network
* Deep-IRTarget: An Automatic Target Detector in Infrared Imagery Using Dual-Domain Feature Extraction and Allocation
* Deep-PCAC: An End-to-End Deep Lossy Compression Framework for Point Cloud Attributes
* Deeper Look at Image Salient Object Detection: Bi-Stream Network With a Small Training Dataset
* Deformable Template Network (DTN) for Object Detection
* Design and Analysis of MEC- and Proactive Caching-Based 360° Mobile VR Video Streaming
* Detecting 3D Points of Interest Using Projective Neural Networks
* Differentiable Spatial Regression: A Novel Method for 3D Hand Pose Estimation
* Discover Micro-Influencers for Brands via Better Understanding
* Discriminative Invariant Alignment for Unsupervised Domain Adaptation
* Discriminative Siamese Complementary Tracker With Flexible Update
* Discriminative Vectorial Framework for Multi-Modal Feature Representation, A
* Disentangled Feature Networks for Facial Portrait and Caricature Generation
* Disentangled Representation Learning for Cross-Modal Biometric Matching
* Disentangling Semantic-to-Visual Confusion for Zero-Shot Learning
* Distribution-Preserving-Based Automatic Data Augmentation for Deep Image Steganalysis
* Drift-Proof Tracking With Deep Reinforcement Learning
* Dual Attention on Pyramid Feature Maps for Image Captioning
* DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering
* Dynamic Emotion Modeling With Learnable Graphs and Graph Inception Network
* Dynamic Training Data Dropout for Robust Deep Face Recognition
* E-Commerce Storytelling Recommendation Using Attentional Domain-Transfer Network and Adversarial Pre-Training
* Efficient and Accurate Multi-Scale Topological Network for Single Image Dehazing
* EFRNet: Efficient Feature Reconstructing Network for Real-Time Scene Parsing
* Emotion Expression With Fact Transfer for Video Description
* Employing Bilinear Fusion and Saliency Prior Information for RGB-D Salient Object Detection
* End-to-End Rain Removal Network Based on Progressive Residual Detail Supplement
* Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation
* Enhancing Mixture-of-Experts by Leveraging Attention for Fine-Grained Recognition
* Enhancing Neural Machine Translation With Dual-Side Multimodal Awareness
* Ensemble Learning With Manifold-Based Data Splitting for Noisy Label Correction
* Entity-Oriented Multi-Modal Alignment and Fusion Network for Fake News Detection
* Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning
* Exploiting Informative Video Segments for Temporal Action Localization
* Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples
* Exploiting Web Images for Fine-Grained Visual Recognition via Dynamic Loss Correction and Global Sample Selection
* Exploring Pairwise Relationships Adaptively From Linguistic Context in Image Captioning
* Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes
* Extended Feature Pyramid Network for Small Object Detection
* Facial Chirality: From Visual Self-Reflection to Robust Facial Feature Learning
* Families in Wild Multimedia: A Multimodal Database for Recognizing Kinship
* Fast Adaptive Meta-Learning for Few-Shot Image Generation
* Fast Intra Mode Decision Algorithm for Versatile Video Coding
* Fast Video Saliency Detection via Maximally Stable Region Motion and Object Repeatability
* FCNN-Based Super-Resolution Mmwave Radar Framework for Contactless Musical Instrument Interface, An
* Feature Estimations Based Correlation Distillation for Incremental Image Retrieval
* Fine-Grained Attention and Feature-Sharing Generative Adversarial Networks for Single Image Super-Resolution
* Fine-Grained Categorization From RGB-D Images
* Focus Your Attention: A Focal Attention for Multimodal Learning
* ForestDet: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation
* Frame-Wise Cross-Modal Matching for Video Moment Retrieval
* FVV Live: A Real-Time Free-Viewpoint Video System With Consumer Electronics Hardware
* Gait Recognition Based on Local Graphical Skeleton Descriptor With Pairwise Similarity Network
* Game Theory Based Dynamic Adaptive Video Streaming for Multi-Client Over NDN
* Gated SwitchGAN for Multi-Domain Facial Image Translation
* Generalized Large Margin kNN for Partial Label Learning
* Generalized Zero-Shot Learning Via Multi-Modal Aggregated Posterior Aligning Neural Network
* Generate and Purify: Efficient Person Data Generation for Re-Identification
* Geometric Back-Projection Network for Point Cloud Classification
* Geometry-Constrained Scale Estimation for Monocular Visual Odometry
* GeoPose: Dense Reconstruction Guided 6D Object Pose Estimation With Geometric Consistency
* Global-Local Label Correlation for Partial Multi-Label Learning
* GPS2Vec: Pre-Trained Semantic Embeddings for Worldwide GPS Coordinates
* Graph-Based Multimodal Sequential Embedding for Sign Language Translation
* Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition
* Haptic Signal Reconstruction for Cross-Modal Communications
* Harmonious Textual Layout Generation Over Natural Images via Deep Aesthetics Learning
* Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations
* Heterogeneous Hierarchical Feature Aggregation Network for Personalized Micro-Video Recommendation
* Hierarchical User Intent Graph Network for Multimedia Recommendation
* High Capacity Reversible Data Hiding in Encrypted Image Based on Adaptive MSB Prediction
* High-Performance CNN-Applied HEVC Steganography Based on Diamond-Coded PU Partition Modes, A
* HoloCast+: Hybrid Digital-Analog Transmission for Graceful Point Cloud Delivery With Graph Fourier Transform
* Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition
* Horizontal-to-Vertical Video Conversion
* Human Action Recognition by Discriminative Feature Pooling and Video Segment Attention Model
* I-GCN: Incremental Graph Convolution Network for Conversation Emotion Detection
* Identity-Aware Facial Expression Recognition Via Deep Metric Learning Based on Synthesized Images
* IDHashGAN: Deep Hashing With Generative Adversarial Nets for Incomplete Data Retrieval
* Image Co-Saliency Detection and Instance Co-Segmentation Using Attention Graph Clustering Based Graph Convolutional Network
* Image Difference Captioning With Instance-Level Fine-Grained Feature Representation
* Image-to-Image Translation: Methods and Applications
* Improving Robustness of DASH Against Unpredictable Network Variations
* Informative Feature Disentanglement for Unsupervised Domain Adaptation
* Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism
* Infrared and Visible Image Fusion Based on Deep Decomposition Network and Saliency Analysis
* Instance GNN: A Learning Framework for Joint Symbol Segmentation and Recognition in Online Handwritten Diagrams
* Inter-Domain Adaptation Label for Data Augmentation in Vehicle Re-Identification
* Interaction Relational Network for Mutual Action Recognition
* Intra-Domain Consistency Enhancement for Unsupervised Person Re-Identification
* Iterative Network for Image Super-Resolution
* Joint Contrast Enhancement and Exposure Fusion for Real-World Image Dehazing
* Joint Contrast Enhancement and Noise Reduction of Low Light Images Via JND Transform
* Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection
* Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos
* Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis
* LAG-Net: Multi-Granularity Network for Person Re-Identification via Local Attention System
* LAGA-Net: Local-and-Global Attention Network for Skeleton Based Action Recognition
* Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance
* Learning Representation on Optimized High-Order Manifold for Visual Classification
* Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition
* Learning Temporal-Correlated and Channel- Decorrelated Siamese Networks for Visual Tracking
* Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition
* Learning to Recognize Human Actions From Noisy Skeleton Data Via Noise Adaptation
* Learning to Simulate Complex Scenes for Street Scene Segmentation
* Learning-Based Quality Assessment for Image Super-Resolution
* Learning-Based Scalable Image Compression With Latent-Feature Reuse and Prediction
* LensCast: Robust Wireless Video Transmission Over MmWave MIMO With Lens Antenna Array
* Leveraging Multiple Relations for Fashion Trend Forecasting Based on Social Media
* List-Wise Rank Learning for Stereoscopic Image Retargeting Quality Assessment
* Low Bitrate Light Field Compression With Geometry and Content Consistency
* Low Quality and Recognition of Image Content
* Low-Latency Network-Adaptive Error Control for Interactive Streaming
* Low-Light Image Restoration With Short- and Long-Exposure Raw Pairs
* LR-GCN: Latent Relation-Aware Graph Convolutional Network for Conversational Emotion Recognition
* LR-SVM+: Learning Using Privileged Information with Noisy Labels
* MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation
* MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
* MIEGAN: Mobile Image Enhancement via a Multi-Module Cascade Neural Network
* MIG-Net: Multi-Scale Network Alternatively Guided by Intensity and Gradient Features for Depth Map Super-Resolution
* Mixed Dish Recognition With Contextual Relation and Domain Alignment
* Modality Disentangled Discriminator for Text-to-Image Synthesis
* Model May Fit You: User-Generalized Cross-Modal Retrieval, The
* Modeling Instant User Intent and Content-Level Transition for Sequential Fashion Recommendation
* Modeling Sequential Listening Behaviors With Attentive Temporal Point Process for Next and Next New Music Recommendation
* Motion Estimation and Coding Structure for Inter-Prediction of LiDAR Point Cloud Geometry
* Multi-Classes and Motion Properties for Concurrent Visual SLAM in Dynamic Environments
* Multi-Density Sketch-to-Image Translation Network
* Multi-Focus Image Fusion Based on Multi-Scale Gradients and Image Matting
* Multi-Level Temporal Dilated Dense Prediction for Action Recognition
* Multi-Localized Sensitive Autoencoder-Attention-LSTM For Skeleton-based Action Recognition
* Multi-Modal Context Propagation for Person Re-Identification With Wireless Positioning
* Multi-Modal Meta Multi-Task Learning for Social Media Rumor Detection
* Multi-Modal Variational Graph Auto-Encoder for Recommendation Systems
* Multi-Scale Grid Network for Image Deblurring With High-Frequency Guidance
* Multi-Scale Sparse Graph Convolutional Network For the Assessment of Parkinsonian Gait
* Multi-Task Center-of-Pressure Metrics Estimation With Graph Convolutional Network
* Multiframe-to-Multiframe Network for Video Denoising
* Multilevel Anomaly Detection Through Variational Autoencoders and Bayesian Models for Self-Aware Embodied Agents
* Multimodal Cross-Layer Bilinear Pooling for RGBT Tracking
* Multimodal Learning for Temporally Coherent Talking Face Generation With Articulator Synergy
* Multimodal Marketing Intent Analysis for Effective Targeted Advertising
* Novel Rank Learning Based No-Reference Image Quality Assessment Method, A
* Objective Quality Assessment of Lenslet Light Field Image Based on Focus Stack
* One-Shot Image-to-Image Translation via Part-Global Learning With a Multi-Adversarial Framework
* Online Residual Quantization Via Streaming Data Correlation Preserving
* Orthogonal Low-Rank Projection Learning for Robust Image Feature Extraction
* Pasadena: Perceptually Aware and Stealthy Adversarial Denoise Attack
* Personalized Image Recoloring for Color Vision Deficiency Compensation
* PH-GCN: Person Retrieval With Part-Based Hierarchical Graph Convolutional Network
* PiSLTRc: Position-Informed Sign Language Transformer With Content-Aware Convolution
* PR-RL: Portrait Relighting Via Deep Reinforcement Learning
* Prior-Guided Multi-View 3D Head Reconstruction
* Prior-Induced Information Alignment for Image Matting
* Probability-Based Framework to Fuse Temporal Consistency and Semantic Information for Background Segmentation
* Progress and Opportunities in Modelling Just-Noticeable Difference (JND) for Multimedia
* Projective Multiple Kernel Subspace Clustering
* Push & Pull: Transferable Adversarial Examples With Attentive Attack
* Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach
* Quality Evaluation of Holographic Images Coded With Standard Codecs
* Quaternion-Based Dictionary Learning and Saturation-Value Total Variation Regularization for Color Image Restoration
* Raw Image Deblurring
* Real-Time and Accurate UAV Pedestrian Detection for Social Distancing Monitoring in COVID-19 Pandemic
* Real-Time Semi-Supervised Deep Tone Mapping Network, A
* Recurrent Exposure Generation for Low-Light Face Detection
* Region-Based Dehazing via Dual-Supervised Triple-Convolutional Network
* Regularized Two Granularity Loss Function for Weakly Supervised Video Moment Retrieval
* Reinforcement Learning for Logic Recipe Generation: Bridging Gaps From Images to Plans
* Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline, A
* Relation-Aware Compositional Zero-Shot Learning for Attribute-Object Pair Recognition
* RGB-D DSO: Direct Sparse Odometry With RGB-D Cameras for Indoor Scenes
* Rhythm-Aware Sequence-to-Sequence Learning for Labanotation Generation With Gesture-Sensitive Graph Convolutional Encoding
* Robust Audio Patch Attacks Using Physical Sample Simulation and Adversarial Patch Noise Generation
* Robust Character Labeling in Movie Videos: Data Resources and Self-Supervised Feature Adaptation
* Robust Label Rectifying With Consistent Contrastive-Learning for Domain Adaptive Person Re-Identification
* Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition
* Robust Visual Object Tracking Via Adaptive Attribute-Aware Discriminative Correlation Filters
* SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition
* Sampling and Re-Weighting: Towards Diverse Frame Aware Unsupervised Video Person Re-Identification
* Scene Recognition Mechanism for Service Robot Adapting Various Families: A CNN-Based Approach Using Multi-Type Cameras
* SEcure Similar Image Matching (SESIM): An Improved Privacy Preserving Image Retrieval Protocol over Encrypted Cloud Database
* Seek Common Ground While Reserving Differences: A Model-Agnostic Module for Noisy Domain Adaptation
* Self-Attention-Based Multiscale Feature Learning Optical Flow With Occlusion Feature Map Prediction
* Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection
* Self-Paced Enhanced Low-Rank Tensor Kernelized Multi-View Subspace Clustering
* Self-Supervised Face Image Manipulation by Conditioning GAN on Face Decomposition
* Self-Supervised Graph Convolutional Network for Multi-View Clustering
* Semantic Regularized Class-Conditional GANs for Semi-Supervised Fine-Grained Image Synthesis
* Semantically Meaningful Class Prototype Learning for One-Shot Image Segmentation
* Semi-Supervised Temporal Action Proposal Generation via Exploiting 2-D Proposal Map
* Show, Price and Negotiate: A Negotiator With Online Value Look-Ahead
* SiamCorners: Siamese Corner Networks for Visual Tracking
* Signal-Dependent Noise Estimation for a Real-Camera Model via Weight and Shape Constraints
* Single-Image Specular Highlight Removal via Real-World Dataset Construction
* SmsNet: A New Deep Convolutional Neural Network Model for Adversarial Example Detection
* Social Condition-Enhanced Network for Recognizing Power Distance Using Expressive Prosody and Intrinsic Brain Connectivity, A
* Soft Warping Based Unsupervised Domain Adaptation for Stereo Matching
* Spatial-Temporal Action Localization With Hierarchical Self-Attention
* Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval
* Spatial-Temporal Multi-Cue Network for Sign Language Recognition and Translation
* Spatio-Temporal Pain Estimation Network With Measuring Pseudo Heart Rate Gain
* Spatiotemporal Dilated Convolution With Uncertain Matching for Video-Based Crowd Estimation
* Spatiotemporal Saliency Representation Learning for Video Action Recognition
* Speaker-Independent Speech Animation Using Perceptual Loss Functions and Synthetic Data
* Speech Driven Talking Face Generation From a Single Image and an Emotion Condition
* SPG-VTON: Semantic Prediction Guidance for Multi-Pose Virtual Try-on
* SPGNet: Serial and Parallel Group Network
* SRDRL: A Blind Super-Resolution Framework With Degradation Reconstruction Loss
* Structure-Guided Arbitrary Style Transfer for Artistic Image and Video
* Structured Attention Network for Referring Image Segmentation
* Style Normalization and Restitution for Domain Generalization and Adaptation
* Subjective Assessment Experiments That Recruit Few Observers With Repetitions (FOWR)
* Subjective Evaluation of Visual Quality and Simulator Sickness of Short 360° Videos: ITU-T Rec. P.919
* Suppressing Biased Samples for Robust VQA
* TANet: Target Attention Network for Video Bit-Depth Enhancement
* TaoHighlight: Commodity-Aware Multi-Modal Video Highlight Detection in E-Commerce
* Targeted Attack of Deep Hashing Via Prototype-Supervised Adversarial Networks
* TC-Net: Detecting Noisy Labels Via Transform Consistency
* Tear the Image Into Strips for Style Transfer
* Temporal Cross-Layer Correlation Mining for Action Recognition
* Temporal Self-Ensembling Teacher for Semi-Supervised Object Detection
* Tensor Product and Tensor-Singular Value Decomposition Based Multi-Exposure Fusion of Images
* TERA: Screen-to-Camera Image Code With Transparency, Efficiency, Robustness and Adaptability
* Texture Preserving Photo Style Transfer Network
* ToF and Stereo Data Fusion Using Dynamic Search Range Stereo Matching
* Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes
* Total Variation With Joint Norms For Infrared and Visible Image Fusion, A
* Towards Analysis-Friendly Face Representation With Scalable Feature and Texture Compression
* Towards Fast and Robust Real Image Denoising With Attentive Neural Network and PID Controller
* Towards Multi-Domain Face Synthesis Via Domain-Invariant Representations and Multi-Level Feature Parts
* Tripartite Graph Regularized Latent Low-Rank Representation for Fashion Compatibility Prediction
* TWGAN: Twin Discriminator Generative Adversarial Networks
* Two Exposure Fusion Using Prior-Aware Generative Adversarial Network
* Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection
* Underwater Image Enhancement With Lightweight Cascaded Network
* Underwater Image Quality Assessment: Subjective and Objective Methods
* Unimodal Representation Learning and Recurrent Decomposition Fusion Structure for Utterance-Level Multimodal Embedding Learning, A
* Unpaired Image Captioning With semantic-Constrained Self-Learning
* Unreliable-to-Reliable Instance Translation for Semi-Supervised Pedestrian Detection
* Unsupervised Image and Text Fusion for Travel Information Enhancement
* Unsupervised Image-to-Image Translation via Pre-Trained StyleGAN2 Network
* Unsupervised Monocular Depth Estimation Using Attention and Multi-Warp Reconstruction
* V-SVR+: Support Vector Regression With Variational Privileged Information
* Video Frame Interpolation via Generalized Deformable Convolution
* Video Quality Assessment With Serial Dependence Modeling
* Video-Based Point Cloud Compression Artifact Removal
* View-Invariant Human Action Recognition Via View Transformation Network (VTN)
* Viewport-Aware Deep Reinforcement Learning Approach for 360° Video Caching
* Visual Perception Based Algorithm for Fast Depth Intra Coding of 3D-HEVC
* Voxel Structure-Based Mesh Reconstruction From a 3D Point Cloud
* WAFP-Net: Weighted Attention Fusion Based Progressive Residual Learning for Depth Map Super-Resolution
* Weakly Supervised Temporal Adjacent Network for Language Grounding
* Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data
* Zero-Shot Learning Based on Quality-Verifying Adversarial Network
* Zero-Shot Single-Microphone Sound Classification and Localization in a Building Via the Synthesis of Unseen Features
* Zero-Shot Video Event Detection With High-Order Semantic Concept Discovery and Matching
* Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services
347 for MultMed(24)
MultMed(25)
* 3D Face Reconstruction and Gaze Tracking in the HMD for Virtual Interaction
* 3D Holoscopic Image Compression Based on Gaussian Mixture Model
* 3D Human Pose and Shape Reconstruction From Videos via Confidence-Aware Temporal Feature Aggregation
* 3D-Gradient Guided Rate Control Model for Screen Content Video Coding
* 3D3M: 3D Modulated Morphable Model for Monocular Face Reconstruction
* A2SPPNet: Attentive Atrous Spatial Pyramid Pooling Network for Salient Object Detection
* Abstractive Summarization for Video: A Revisit in Multistage Fusion Network With Forget Gate
* Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network
* ACE-MEF: Adaptive Clarity Evaluation-Guided Network With Illumination Correction for Multi-Exposure Image Fusion
* Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning
* Adaptive Coding and Ordered-Index Extended Scrambling Based RDH in Encrypted Images
* Adaptive Group-Wise Consistency Network for Co-Saliency Detection
* Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding
* Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval
* Adaptive Multi-Hypergraph Convolutional Networks for 3D Object Classification
* Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization
* Adaptive Recurrent Forward Network for Dense Point Cloud Completion
* AdaZoom: Towards Scale-Aware Large Scene Object Detection
* Adjustable Model Compression Using Multiple Genetic Algorithm
* ADNet: Rethinking the Shrunk Polygon-Based Approach in Scene Text Detection
* Adversarial and Isotropic Gradient Augmentation for Image Retrieval With Text Feedback
* Adversarial Learning Guided Task Relatedness Refinement for Multi-Task Deep Learning
* Adversarial Meta-Training Framework for Cross-Domain Few-Shot Learning, An
* Adversarial Mixup Ratio Confusion for Unsupervised Domain Adaptation
* Aesthetic Photo Collage With Deep Reinforcement Learning
* AlignVE: Visual Entailment Recognition Based on Alignment Relations
* ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction
* ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis
* AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences
* AMS-Net: Adaptive Multi-Scale Network for Image Compressive Sensing
* AnANet: Association and Alignment Network for Modeling Implicit Relevance in Cross-Modal Correlation Classification
* Anet: A Deep Neural Network for Automatic 3D Anthropometric Measurement Extraction
* Angel's Girl for Blind Painters: An Efficient Painting Navigation System Validated by Multimodal Evaluation Approach
* Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking
* AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Persons
* APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation
* Apprenticeship Learning Approach for Adaptive Video Streaming Based on Chunk Quality and User Preference, An
* Arbitrary-Shape Scene Text Detection via Visual-Relational Rectification and Contour Approximation
* Asymmetric Cross-Scale Alignment for Text-Based Person Search
* Asymmetric Modality Translation for Face Presentation Attack Detection
* Asymmetric Training in RealnessGAN
* Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation
* ATF: An Alternating Training Framework for Weakly Supervised Face Alignment
* Attention Map Guided Transformer Pruning for Occluded Person Re-Identification on Edge Device
* Attention-Driven Appearance-Motion Fusion Network for Action Recognition
* Attribute-Guided Multiple Instance Hashing Network for Cross-Modal Zero-Shot Hashing
* Attribute-Modulated Generative Meta Learning for Zero-Shot Learning
* Audio Retrieval With Natural Language Queries: A Benchmark Study
* Audio-Driven Talking Face Video Generation With Dynamic Convolution Kernels
* Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention
* Audiovisual Dependency Attention for Violence Detection in Videos
* Augmented Multi-Scale Spatiotemporal Inconsistency Magnifier for Generalized DeepFake Detection
* Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding
* AutoMA: Towards Automatic Model Augmentation for Transferable Adversarial Attacks
* Automated Segmentation of Prohibited Items in X-Ray Baggage Images Using Dense De-Overlap Attention Snake
* Automatic Shadow Generation via Exposure Fusion
* Average Gradient-Based Adversarial Attack
* Background Scene Recovery From an Image Looking Through Colored Glass
* Bayesian Filtering Framework for Continuous Affect Recognition From Facial Images, A
* Beyond Word Embeddings: Heterogeneous Prior Knowledge Driven Multi-Label Image Classification
* BGTracker: Cross-Task Bidirectional Guidance Strategy for Multiple Object Tracking
* Bi-Criteria Approximation for a Multi-Origin Multi-Channel Auto-Scaling Live Streaming Cloud
* Bi-RSTU: Bidirectional Recurrent Upsampling Network for Space-Time Video Super-Resolution
* Bias-Correction Feature Learner for Semi-Supervised Instance Segmentation
* Bidirectional Maximum Entropy Training With Word Co-Occurrence for Video Captioning
* Bidirectional Translation Between UHD-HDR and HD-SDR Videos
* Bilateral Fast Low-Rank Representation With Equivalent Transformation for Subspace Clustering
* Bilaterally Slimmable Transformer for Elastic and Efficient Visual Question Answering
* BL-JUNIPER: A CNN-Assisted Framework for Perceptual Video Coding Leveraging Block-Level JND
* Blind Dehazed Image Quality Assessment: A Deep CNN-Based Approach
* Blind Image Quality Assessment via Cross-View Consistency
* Blind Image Restoration Based on Cycle-Consistent Network
* Blind JPEG Compression Artifacts Removal by Integrating Channel Regulation With Exit Strategy
* Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition
* Boosting Generic Visual-Linguistic Representation With Dynamic Contexts
* Boosting Robust Learning Via Leveraging Reusable Samples in Noisy Web Data
* Boundary-Aware Network for Shadow Removal, A
* Bullet-Time Video Synthesis Based on Virtual Dynamic Target Axis
* Caching in Dynamic Environments: A Near-Optimal Online Learning Approach
* Calibration-Free Cross-Camera Target Association Using Interaction Spatiotemporal Consistency
* Camera Invariant Feature Learning for Unsupervised Person Re-Identification
* Can Machines Generate Personalized Music? A Hybrid Favorite-Aware Method for User Preference Music Transfer
* Caption-Aided Product Detection via Collaborative Pseudo-Label Harmonization
* Cascade Transformer Decoder Based Occluded Pedestrian Detection With Dynamic Deformable Convolution and Gaussian Projection Channel Attention Mechanism
* Category-Aware Multimodal Attention Network for Fashion Compatibility Modeling
* Causal Interventional Training for Image Recognition
* Cellular Binary Neural Network for Accurate Image Classification and Semantic Segmentation
* CenterTube: Tracking Multiple 3D Objects With 4D Tubelets in Dynamic Point Clouds
* CFPNet: A Denoising Network for Complex Frequency Band Signal Processing
* Character-Aware Sampling and Rectification for Scene Text Recognition
* ChestXRayBERT: A Pretrained Language Model for Chest Radiology Report Summarization
* ChildPredictor: A Child Face Prediction Framework With Disentangled Learning
* Clicking Matters: Towards Interactive Human Parsing
* CNDesc: Cross Normalization for Local Descriptors Learning
* CNN-Based Framework for Enhancing 360° VR Experiences With Multisensorial Effects, A
* Co-Saliency Detection Guided by Group Weakly Supervised Learning
* Coarse-to-Fine Feedback Guidance Based Stereo Image Quality Assessment Considering Dominant Eye Fusion
* Coarse-to-Fine Framework for Automatic Video Unscreen, A
* Coherent Image Animation Using Spatial-Temporal Correspondence
* Collaborative Multilingual Continuous Sign Language Recognition: A Unified Framework
* Combining Deep Convolutional Neural Networks with Stochastic Ensemble Weight Optimization for Facial Expression Recognition in the Wild
* Composition-Guided Neural Network for Image Cropping Aesthetic Assessment
* Compound Projection Learning for Bridging Seen and Unseen Objects
* Compressed Geometric Arrays for Point Cloud Processing
* Compression of Plenoptic Point Cloud Attributes Using 6-D Point Clouds and 6-D Transforms
* Confidence-Aware Active Feedback for Interactive Instance Search
* Confidence-Aware Fusion Using Dempster-Shafer Theory for Multispectral Pedestrian Detection
* Consistency Preservation and Feature Entropy Regularization for GAN Based Face Editing
* Consistent Discrepancy Learning for Intra-Camera Supervised Person Re-Identification
* Consistent Multiple Graph Embedding for Multi-View Clustering
* Consistent Video Inpainting Using Axial Attention-Based Style Transformer
* Constructing Immunized Stego-Image for Secure Steganography via Artificial Immune System
* Context-Aware 3D Point Cloud Semantic Segmentation With Plane Guidance
* Context-Patch Representation Learning With Adaptive Neighbor Embedding for Robust Face Image Super-Resolution
* Contextual Attention Network for Emotional Video Captioning
* Continual Attentive Fusion for Incremental Learning in Semantic Segmentation
* Contour-Aware Equipotential Learning for Semantic Segmentation
* Contrastive 3D Human Skeleton Action Representation Learning via CrossMoCo With Spatiotemporal Occlusion Mask Data Augmentation
* Contrastive JS: A Novel Scheme for Enhancing the Accuracy and Robustness of Deep Models
* Contrastive Multi-Level Graph Neural Networks for Session-Based Recommendation
* Correspondence Attention Transformer: A Context-Sensitive Network for Two-View Correspondence Learning
* COutfitGAN: Learning to Synthesize Compatible Outfits Supervised by Silhouette Masks and Fashion Styles
* CPG3D: Cross-Modal Priors Guided 3D Object Reconstruction
* CR-LDSO: Direct Sparse LiDAR-Assisted Visual Odometry With Cloud Reusing
* Cross Modal Video Representations for Weakly Supervised Active Speaker Localization
* Cross-Class Bias Rectification for Point Cloud Few-Shot Segmentation
* Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation
* Cross-Domain Image-Object Retrieval Based on Weighted Optimal Transport
* Cross-Domain Recommendation Via User-Clustering and Multidimensional Information Fusion
* Cross-Image Region Mining With Region Prototypical Network for Weakly Supervised Segmentation
* Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic
* Cross-Mix Monitoring for Medical Image Segmentation With Limited Supervision
* Cross-Modal Data Augmentation for Tasks of Different Modalities
* Cross-Modal Enhancement Network for Multimodal Sentiment Analysis
* Cross-Modal Variational Auto-Encoder for Content-Based Micro-Video Background Music Recommendation
* Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification
* Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures
* Cross-View Panorama Image Synthesis
* Crowd Counting via Unsupervised Cross-Domain Feature Adaptation
* CVCNet: Learning Cost Volume Compression for Efficient Stereo Matching
* Cycle Consistency Based Pseudo Label and Fine Alignment for Unsupervised Domain Adaptation
* Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning
* C^2 DFNet: Criss-Cross Dynamic Filter Network for RGB-D Salient Object Detection
* D-LIOM: Tightly-Coupled Direct LiDAR-Inertial Odometry and Mapping
* DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction
* DBiased-P: Dual-Biased Predicate Predictor for Unbiased Scene Graph Generation
* DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution
* De-END: Decoder-Driven Watermarking Network
* Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition
* Decoder-Side Cross Resolution Synthesis for Video Compression Enhancement
* Decomposable Causal View of Compositional Zero-Shot Learning, A
* Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement
* Decoupled Multi-Task Network for Shadow Removal, A
* Deep Cross-Attention Network for Crowdfunding Success Prediction
* Deep Cross-Modal Hashing Based on Semantic Consistent Ranking
* Deep Graph Convolutional Quantization Networks for Image Retrieval
* Deep Label Prior: Pre-Training-Free Salient Object Detection Network Based on Label Learning
* Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition
* Deep Object Co-Segmentation and Co-Saliency Detection via High-Order Spatial-Semantic Network Modulation
* Deep Online Video Stabilization Using IMU Sensors
* Deep Reinforcement Clustering
* Deep Residual Weight-Sharing Attention Network With Low-Rank Attention for Visual Question Answering
* Deep Robust Low Rank Correlation With Unifying Clustering Structure for Cross Domain Adaptation
* Deep SR-HDR: Joint Learning of Super-Resolution and High Dynamic Range Imaging for Dynamic Scenes
* Deep Texton-Coherence Network for Camouflaged Object Detection
* Delving Deep Into One-Shot Skeleton-Based Action Recognition With Diverse Occlusions
* Dense Modality Interaction Network for Audio-Visual Event Localization
* Dense Video Captioning With Early Linguistic Information Fusion
* Depth-Aware and Semantic Guided Relational Attention Network for Visual Question Answering
* Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations
* Depth-Distilled Multi-Focus Image Fusion
* Depth-Induced Gap-Reducing Network for RGB-D Salient Object Detection: An Interaction, Guidance and Refinement Approach
* Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery
* DETA: A Point-Based Tracker With Deformable Transformer and Task-Aligned Learning
* Develop Then Rival: A Human Vision-Inspired Framework for Superimposed Image Decomposition
* Device-Edge-Cloud Collaborative Acceleration Method Towards Occluded Face Recognition in High-Traffic Areas
* Differential Weight Quantization for Multi-Model Compression
* DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
* Discriminator-Quality Evaluation GAN
* Disentangled Multimodal Representation Learning for Recommendation
* Distortion Map-Guided Feature Rectification for Efficient Video Semantic Segmentation
* Diversity-Boosted Generalization-Specialization Balancing for Zero-Shot Learning
* DMEF: Multi-Exposure Image Fusion Based on a Novel Deep Decomposition Method
* DOC: Text Recognition via Dual Adaptation and Clustering
* Does Thermal Really Always Matter for RGB-T Salient Object Detection?
* Domain Adaptive Transformer Tracking Under Occlusions
* Domain Generalization Via Encoding and Resampling in a Unified Latent Space
* Domain-Class Correlation Decomposition for Generalizable Person Re-Identification
* DREAMT: Diversity Enlarged Mutual Teaching for Unsupervised Domain Adaptive Person Re-Identification
* Dual Cross-Attention for Video Object Segmentation via Uncertainty Refinement
* Dual Fusion-Propagation Graph Neural Network for Multi-View Clustering
* Dual Relation Network for Scene Text Recognition
* Dual Structural Knowledge Interaction for Domain Adaptation
* Dual Transformer for Point Cloud Analysis
* Dual-Gradients Localization Framework With Skip-Layer Connections for Weakly Supervised Object Localization
* Dual-Level Adaptive and Discriminative Knowledge Transfer for Cross-Domain Recognition
* Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning
* DualGNN: Dual Graph Neural Network for Multimedia Recommendation
* Dynamic Contrastive Distillation for Image-Text Retrieval
* Dynamic Residual Filtering With Laplacian Pyramid for Instance Segmentation
* Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution, A
* D^3K: Dynastic Data-Free Knowledge Distillation
* EAPT: Efficient Attention Pyramid Transformer for Image Processing
* Edge-Assisted Massive Video Delivery Over Cell-Free Massive MIMO
* Effective End-to-End Vision Language Pretraining With Semantic Visual Loss
* Efficient and Differentiable Low-Rank Matrix Completion With Back Propagation
* Efficient Geometry Surface Coding in V-PCC
* Efficient Light Field Angular Super-Resolution With Sub-Aperture Feature Learning and Macro-Pixel Upsampling
* Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition
* Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation
* EgoFish3D: Egocentric 3D Pose Estimation From a Fisheye Camera via Self-Supervised Learning
* Enabling Trimap-Free Image Matting With a Frequency-Guided Saliency-Aware Network via Joint Learning
* Encoded Feature Enhancement in Watermarking Network for Distortion in Real Scenes
* End-to-End Blind Video Quality Assessment Based on Visual and Memory Attention Modeling
* Energy-Based Temporal Summarized Attentive Network for Zero-Shot Action Recognition
* Enhancing Style-Guided Image-to-Image Translation via Self-Supervised Metric Learning
* Equal Value String and Copy Above String Based String Prediction for SCC in AVS3
* Estimating Human Weight From a Single Image
* Estimating the Secret Key of Spread Spectrum Watermarking Based on Equivalent Keys
* Explainable and Generalizable Blind Image Quality Assessment via Semantic Attribute Reasoning
* Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment Distributions
* Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification
* Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation
* Exploring Action Centers for Temporal Action Localization
* Exploring Kernel-Based Texture Transfer for Pose-Guided Person Image Generation
* Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding
* Exposing Deepfake Face Forgeries With Guided Residuals
* Extrinsic Self-Calibration of the Surround-View System: A Weakly Supervised Approach
* F-TPE: Flexible Thumbnail-Preserving Encryption Based on Multi-Pixel Sum-Preserving Encryption
* Face De-Occlusion With Deep Cascade Guidance Learning
* Facial Expression Guided Diagnosis of Parkinson's Disease via High-Quality Data Augmentation
* Fast and Robust Online Handwritten Chinese Character Recognition With Deep Spatial and Contextual Information Fusion Network
* Fast Human Pose Estimation in Compressed Videos
* Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement
* FBSNet: A Fast Bilateral Symmetrical Network for Real-Time Semantic Segmentation
* FedLive: A Federated Transmission Framework for Panoramic Livecast With Reinforced Variational Inference
* Few-Shot Learning for Fine-Grained Emotion Recognition Using Physiological Signals
* Few-Shot Segmentation for Prohibited Items Inspection With Patch-Based Self-Supervised Learning and Prototype Reverse Validation
* Few-Shot Segmentation With Optimal Transport Matching and Message Flow
* Few-Shot Speaker Identification Using Lightweight Prototypical Network With Feature Grouping and Interaction
* FFFN: Frame-By-Frame Feedback Fusion Network for Video Super-Resolution
* FGDNet: Fine-Grained Detection Network Towards Face Anti-Spoofing
* FI-WSOD: Foreground Information Guided Weakly Supervised Object Detection
* Fidelity-driven Optimization Reconstruction and Details Preserving Guided Fusion for Multi-Modality Medical Image
* Fine-Grained Face Editing via Personalized Spatial-Aware Affine Modulation
* Fine-Grained Image Classification by Class and Image-Specific Decomposition With Multiple Views
* Fine-Grained Visual Classification via Internal Ensemble Learning Transformer
* Focal Inverse Distance Transform Maps for Crowd Localization
* Focal Stack Image Compression Based on Basis-Quadtree Representation
* Focus and Align: Learning Tube Tokens for Video-Language Pre-Training
* FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos
* Forgetting to Remember: A Scalable Incremental Learning Framework for Cross-Task Blind Image Quality Assessment
* Format Compliant Encryption Method for 3D Objects Allowing Hierarchical Decryption, A
* FP-AGL: Filter Pruning With Adaptive Gradient Learning for Accelerating Deep Convolutional Neural Networks
* Frame-Level Rate Control for Geometry-Based LiDAR Point Cloud Compression
* Free^3 Net: Gliding Free, Orientation Free, and Anchor Free Network for Oriented Object Detection
* Frequency-Domain Deep Guided Image Denoising
* From Collective Attribute Association of Groups to Precise Attribute Association of Individuals
* From External to Internal: Structuring Image for Text-to-Image Attributes Manipulation
* From Front to Rear: 3D Semantic Scene Completion Through Planar Convolution and Attention-Based Network
* FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting
* Full-Scene Defocus Blur Detection With DeFBD+ via Multi-Level Distillation Learning
* FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation
* GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition
* General Framework to Reversible Data Hiding for JPEG Images With Multiple Two-Dimensional Histograms
* Generalized Score Distribution: A Two-Parameter Discrete Distribution Accurately Describing Responses From Quality of Experience Subjective Experiments
* Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks Via Learned Weights Statistics, A
* Generating Music With Emotions
* Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval
* GeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition
* Global Memory and Local Continuity for Video Object Detection
* Global Representation Guided Adaptive Fusion Network for Stable Video Crowd Counting
* Global Temporal Difference Network for Action Recognition
* GLRT-Based Multi-Pixel Target Detector in Hyperspectral Imagery, A
* Graph Complemented Latent Representation for Few-Shot Image Classification
* Graph Contrastive Partial Multi-View Clustering
* Graph Convolutional Network With Unknown Class Number
* Graph Neural Networks With Triple Attention for Few-Shot Learning
* GraphIQA: Learning Distortion Graph Representations for Blind Image Quality Assessment
* Grouping by Center: Predicting Centripetal Offsets for the Bottom-up Human Pose Estimation
* GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot Learning
* Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual Recognition
* Hash Bit Selection With Reinforcement Learning for Image Retrieval
* Heterogeneous Feature Alignment and Fusion in Cross-Modal Augmented Space for Composed Image Retrieval
* Heterogeneous Graph Contrastive Learning Network for Personalized Micro-Video Recommendation
* HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval
* HHF: Hashing-Guided Hinge Function for Deep Hashing Retrieval
* Hierarchical Model Compression via Shape-Edge Representation of Feature Maps: An Enlightenment From the Primate Visual System
* Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval
* High Efficiency Vibrotactile Codec Based on Gate Recurrent Network
* Hole Inpainting Algorithm for Half-Organized Point Cloud Obtained by Structured-Light Section System
* HoloSync: Frame Synchronisation for Multi-Source Holographic Teleportation Applications
* HRNeXt: High-Resolution Context Network for Crowd Pose Estimation
* Human Body-Aware Feature Extractor Using Attachable Feature Corrector for Human Pose Estimation
* Human Parsing With Part-Aware Relation Modeling
* Human Pose and Shape Estimation from Single Polarization Images
* Hybrid Contrastive Learning for Unsupervised Person Re-Identification
* Hybrid Motion Representation Learning for Prediction From Raw Sensor Data
* I3N: Intra- and Inter-Representation Interaction Network for Change Captioning
* IFSepR: A General Framework for Image Fusion Based on Separate Representation Learning
* Illumination Guided Attentive Wavelet Network for Low-Light Image Enhancement
* Image Captioning With Novel Topics Guidance and Retrieval-Based Topics Re-Weighting
* Image Compressed Sensing Using Non-Local Neural Network
* Image Hazing and Dehazing: From the Viewpoint of Two-Way Image Translation With a Weakly Supervised Framework
* Image Manipulation Localization Using Multi-Scale Feature Fusion and Adaptive Edge Supervision
* Image Operation Chain Detection with Machine Translation Framework
* Image Stitching With Manifold Optimization
* Impact of Black Edge Artifact on QoE of the FOV-Based Cloud VR Services, The
* Importance-Aware Information Bottleneck Learning Paradigm for Lip Reading
* Improving Color Constancy Using Chromaticity-Line Prior
* Improving Disentangled Representation Learning for Gait Recognition Using Group Supervision
* Improving Person Re-Identification With Multi-Cue Similarity Embedding and Propagation
* Incorporating Linear Regression Problems Into an Adaptive Framework With Feasible Optimizations
* InDecGAN: Learning to Generate Complex Images From Captions via Independent Object-Level Decomposition and Enhancement
* Indoor Camera Pose Estimation From Room Layouts and Image Outer Corners
* Information Maximizing Adaptation Network With Label Distribution Priors for Unsupervised Domain Adaptation
* Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning
* Instance-Aware Deep Graph Learning for Multi-Label Classification
* Instance-Specific Feature Propagation for Referring Segmentation
* Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure
* Intelligent Virtual Standard Patient for Medical Students Training Based on Oral Knowledge Graph, An
* Intelligent Vision-Based Nutritional Assessment Method for Handheld Food Items, An
* Inter-Intra Modal Representation Augmentation With DCT-Transformer Adversarial Network for Image-Text Matching
* Interaction Transformer for Human Reaction Generation
* Interaction-Matrix Based Personalized Image Aesthetics Assessment
* Interpretable Graph Convolutional Network for Multi-View Semi-Supervised Learning
* Interpretable Multi-Modal Stacking-Based Ensemble Learning Method for Real Estate Appraisal
* InterREC: An Interpretable Method for Referring Expression Comprehension
* Intra- and Inter-Class Induced Discriminative Deep Dictionary Learning for Visual Recognition
* Intra-Class Adaptive Augmentation With Neighbor Correction for Deep Metric Learning
* Intra-Inter View Interaction Network for Light Field Image Super-Resolution
* Intrinsic and Complete Structure Learning Based Incomplete Multiview Clustering
* ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation
* Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion
* JDSR-GAN: Constructing an Efficient Joint Learning Network for Masked Face Super-Resolution
* Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation
* Joint Wavelet Sub-Bands Guided Network for Single Image Super-Resolution
* Joint-Bone Fusion Graph Convolutional Network for Semi-Supervised Skeleton Action Recognition
* JPEG Image Encryption With Adaptive DC Coefficient Prediction and RS Pair Permutation
* Knowing What it is: Semantic-Enhanced Dual Attention Transformer
* Knowledge Distillation Hashing for Occluded Face Retrieval
* Knowledge-Guided Blind Image Quality Assessment With Few Training Samples
* LA-HDR: Light Adaptive HDR Reconstruction Framework for Single LDR Image Considering Varied Light Conditions
* Label-Affinity Self-Adaptive Central Similarity Hashing for Image Retrieval
* Language-Based Image Manipulation Built on Language-Guided Ranking
* Language-Guided Face Animation by Recurrent StyleGAN-Based Generator
* Language-Guided Multi-Granularity Context Aggregation for Temporal Sentence Grounding
* Late Fusion Multiple Kernel Clustering With Local Kernel Alignment Maximization
* Latent Domain Generation for Unsupervised Domain Adaptation Object Counting
* Latent Feature Pyramid Network for Object Detection
* Latent Heterogeneous Graph Network for Incomplete Multi-View Learning
* LCCStyle: Arbitrary Style Transfer With Low Computational Complexity
* Learning a Compact Spatial-Angular Representation for Light Field
* Learning Adaptive Patch Generators for Mask-Robust Image Inpainting
* Learning by Imagination: A Joint Framework for Text-Based Image Manipulation and Change Captioning
* Learning Cross-Channel Representations for Semantic Segmentation
* Learning Detail-Structure Alternative Optimization for Blind Super-Resolution
* Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification
* Learning Dual Low-Rank Representation for Multi-Label Micro-Video Classification
* Learning Dual-Level Deep Representation for Thermal Infrared Tracking
* Learning Dual-Routing Capsule Graph Neural Network for Few-Shot Video Classification
* Learning Efficient GANs for Image Translation via Differentiable Masks and Co-Attention Distillation
* Learning Fashion Compatibility With Context Conditioning Embedding
* Learning Generalized Knowledge From a Single Domain on Urban-Scene Segmentation
* Learning Localization-Aware Target Confidence for Siamese Visual Tracking
* Learning MLatent Representations for Generalized Zero-Shot Learning
* Learning Personalized Image Aesthetics From Subjective and Objective Attributes
* Learning Relation Models to Detect Important People in Still Images
* Learning Relative Feature Displacement for Few-Shot Open-Set Recognition
* Learning Scene-Aware Spatio-Temporal GNNs for Few-Shot Early Action Prediction
* Learning Sparse and Discriminative Multimodal Feature Codes for Finger Recognition
* Learning Stage-Wise GANs for Whistle Extraction in Time-Frequency Spectrograms
* Learning to Learn With Variational Inference for Cross-Domain Image Classification
* Learning to Minimize the Remainder in Supervised Learning
* LgNet: A Local-Global Network for Action Recognition and Beyond
* Lifelong Age Transformation With a Deep Generative Prior
* Light Field Compression With Graph Learning and Dictionary-Guided Sparse Coding
* LIQA: Lifelong Blind Image Quality Assessment
* Live 360 Degree Video Delivery Based on User Collaboration in a Streaming Flock
* LiveSR: Enabling Universal HD Live Video Streaming With Crowdsourced Online Learning
* LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering
* Local Correspondence-Aware Hybrid CNN-GCN Model for Single-Image Human Body Reconstruction, A
* Localized Sparse Incomplete Multi-View Clustering
* Look&listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
* Looking and Hearing Into Details: Dual-Enhanced Siamese Adversarial Network for Audio-Visual Matching
* Low-Light Image Enhancement Using the Cell Vibration Model
* Low-Light Image Enhancement via Self-Reinforced Retinex Projection Model
* Low-Light Stereo Image Enhancement
* LTReID: Factorizable Feature Generation With Independent Components for Long-Tailed Person Re-Identification
* L_1-Regularized Reconstruction Model for Edge-Preserving Filtering
* M2P2: Multimodal Persuasion Prediction Using Adaptive Fusion
* Machine Learning Solution for Video Delivery to Mitigate Co-Tier Interference in 5G HetNets, A
* Manifold Regularized Joint Transfer for Open Set Domain Adaptation
* Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval
* MAVEN: A Memory Augmented Recurrent Approach for Multimodal Fusion
* MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking
* Micro-Influencer Recommendation by Multi-Perspective Account Representation Learning
* Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck
* MIGN: Multiscale Image Generation Network for Remote Sensing Image Semantic Segmentation
* Mixer-Based Semantic Spread for Few-Shot Learning
* MLNet: A Multi-Domain Lightweight Network for Multi-Focus Image Fusion
* MLP-JCG: Multi-Layer Perceptron With Joint-Coordinate Gating for Efficient 3D Human Pose Estimation
* Modality-Oriented Graph Learning Toward Outfit Compatibility Modeling
* Modeling Both Intra- and Inter-Modality Uncertainty for Multimodal Fake News Detection
* MorphText: Deep Morphology Regularized Accurate Arbitrary-Shape Scene Text Detection
* MotionVideoGAN: A Novel Video Generator Based on the Motion Space Learned From Image Pairs
* Moving Towards Centers: Re-Ranking With Attention and Memory for Re-Identification
* MPPM: A Mobile-Efficient Part Model for Object re-ID
* MSAFF-Net: Multiscale Attention Feature Fusion Networks for Single Image Dehazing and Beyond
* Multi-Agent Trajectory Prediction With Spatio-Temporal Sequence Fusion
* Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning
* Multi-Channel HEVC Steganography by Minimizing IPM Steganographic Distortions
* Multi-Channel Weight-Sharing Autoencoder Based on Cascade Multi-Head Attention for Multimodal Emotion Recognition
* Multi-Clue Reconstruction of Sharing Chains for Social Media Images
* Multi-Dimensional Attention With Similarity Constraint for Weakly-Supervised Temporal Action Localization
* Multi-Label Speech Emotion Recognition via Inter-Class Difference Loss Under Response Residual Network
* Multi-Level Second-Order Few-Shot Learning
* Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
* Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval
* Multi-Modality Sensing and Data Fusion for Multi-Vehicle Detection
* Multi-Panda Tracking
* Multi-Range View Aggregation Network With Vision Transformer Feature Fusion for 3D Object Retrieval
* Multi-Scale Fine-Grained Alignments for Image and Sentence Matching
* Multi-Source Multi-Label Learning for User Profiling in Online Games
* Multi-Sourced Knowledge Integration for Robust Self-Supervised Facial Landmark Tracking
* Multi-Stage Automatic Evaluation System for Sight-Singing, A
* Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-Identification
* Multi-Stage Visual Tracking With Siamese Anchor-Free Proposal Network
* Multi-Stream Dense View Reconstruction Network for Light Field Image Compression
* Multimodal Affective Computing With Dense Fusion Transformer for Inter- and Intra-Modality Interactions
* Multimodal Core Tensor Factorization and its Applications to Low-Rank Tensor Completion
* Multimodal Emotion Classification With Multi-Level Semantic Reasoning Network
* Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations
* Multimodal Pre-Training Based on Graph Attention Network for Document Understanding
* Multimodal Sentiment Analysis With Image-Text Interaction Network
* Multimodal Topic Modeling by Exploring Characteristics of Short Text Social Media
* Multimodal-Based and Aesthetic-Guided Narrative Video Summarization
* Multiple Description Coding for Best-Effort Delivery of Light Field Video Using GNN-Based Compression
* Multiple Instance Detection Networks With Adaptive Instance Refinement
* Multiple Relational Learning Network for Joint Referring Expression Comprehension and Segmentation
* Multisample-Based Contrastive Loss for Top-K Recommendation
* Multiscale Emotion Representation Learning for Affective Image Recognition
* Natural Image Stitching With Layered Warping Constraint
* Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes
* Neighborhood Contrastive Transformer for Change Captioning
* Neuromorphic Similarity Measurement of Tactile Stimuli in Human-Machine Interface
* No-Reference Bitstream-Layer Model for Perceptual Quality Assessment of V-PCC Encoded Point Clouds
* No-Reference Light Field Image Quality Assessment Using Four-Dimensional Sparse Transform
* Noise-Sensitive Adversarial Learning for Weakly Supervised Salient Object Detection
* Non-Aligned Multi-View Multi-Label Classification via Learning View-Specific Labels
* Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization, A
* Novel Encryption-Then-Lossy-Compression Scheme of Color Images Using Customized Residual Dense Spatial Network, A
* Novel Human Image Sequence Synthesis Method by Pose-Shape-Content Inference, A
* Novel Mix-Normalization Method for Generalizable Multi-Source Person Re-Identification, A
* Novel Video Stabilization Model With Motion Morphological Component Priors, A
* Object Detection Made Simpler by Eliminating Heuristic NMS
* Occluded Visible-Infrared Person Re-Identification
* OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup
* OMGH: Online Manifold-Guided Hashing for Flexible Cross-Modal Retrieval
* Omnidirectional Image Super-Resolution via Latitude Adaptive Network
* Optical Flow Computation for Video Under the Dynamic Illumination
* Optimal Partition Assignment for Universal Object Detection
* Optimal Transport-Based Patch Matching for Image Style Transfer
* Optimal Volumetric Video Streaming With Hybrid Saliency Based Tiling
* OSANet: Object Semantic Attention Network for Visual Sentiment Analysis
* OTFace: Hard Samples Guided Optimal Transport Loss for Deep Face Representation
* ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning
* Partial Multi-Modal Hashing via Neighbor-Aware Completion Learning
* Path-Analysis-Based Reinforcement Learning Algorithm for Imitation Filming
* Perception-and-Regulation Network for Salient Object Detection
* Perception-Aware Cross-Modal Signal Reconstruction: From Audio-Haptic to Visual
* Perceptual Quality Assessment of Cartoon Images
* Person Search by a Bi-Directional Task-Consistent Learning Model
* Personalized Fashion Recommendation With Discrete Content-Based Tensor Factorization
* PGDENet: Progressive Guided Fusion and Depth Enhancement Network for RGB-D Indoor Scene Parsing
* PhotoHelper: Portrait Photographing Guidance Via Deep Feature Retrieval and Fusion
* Plenoptic Point Cloud Compression Using Multiview Extension of High Efficiency Video Coding
* Pluralistic Face Inpainting With Transformation of Attribute Information
* Point Cloud Soft Multicast for Untethered XR Users
* Point-Supervised Video Temporal Grounding
* Positional Attention Guided Transformer-Like Architecture for Visual Question Answering
* PRAM: Penalized Resource Allocation Method for Video Services
* Predicting Visual Attention in Graphic Design Documents
* Privacy-Preserving Image Acquisition for Neural Vision Systems
* Probabilistic Contrastive Framework for Semi-Supervised Learning, A
* Progressive Context-Aware Graph Feature Learning for Target Re-Identification
* Progressive Local Filter Pruning for Image Retrieval Acceleration
* Progressive Motion Boosting for Video Frame Interpolation
* Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention, A
* Prototype Learning for Automatic Check-Out
* Prototype-Based Intent Perception
* Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition
* Purifying Low-Light Images via Near-Infrared Enlightened Image
* Pyramid Feature Aggregation for Hierarchical Quality Prediction of Stitched Panoramic Images
* P^2-GAN: Efficient Stroke Style Transfer Using Single Style Image
* QoE-Driven Adaptive Streaming for Point Clouds
* QoE-Oriented Mobile Virtual Reality Game in Distributed Edge Networks
* Quality-Aware Network for Human Parsing
* Quality-Aware Part Models for Occluded Person Re-Identification
* Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework
* Quantitative Comparison of Point Cloud Compression Algorithms With PCC Arena
* Quaternion Relation Embedding for Scene Graph Generation
* Quitting Ratio-Based Bitrate Ladder Selection Mechanism for Adaptive Bitrate Video Streaming
* Radio-Assisted Human Detection
* RAM360: Robust Adaptive Multi-Layer 360° Video Streaming With Lyapunov Optimization
* Ranked Similarity Weighting and Top-nk Sampling in Deep Metric Learning
* Rate-Distortion Optimized Geometry Compression for Spinning LiDAR Point Cloud
* RAV: Learning-Based Adaptive Streaming to Coordinate the Audio and Video Bitrate Selections
* Real-Time 3D Single Object Tracking With Transformer
* Real-World Image Super-Resolution by Exclusionary Dual-Learning
* RealVR: Efficient, Economical, and Quality-of-Experience-Driven VR Video System Based on MPEG OMAF
* Recaptured Screen Image Demoiréing in Raw Domain
* Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning
* Recognition of Handwritten Chinese Text by Segmentation: A Segment-Annotation-Free Approach
* Recognition-Oriented Image Compressive Sensing With Deep Learning
* Reconstruction-Based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval, A
* Recursive Multi-Relational Graph Convolutional Network for Automatic Photo Selection
* Refined Knowledge Transfer for Language-Based Person Search
* Refining Noisy Labels With Label Reliability Perception for Person Re-Identification
* Reflection Removal With NIR and RGB Image Feature Fusion
* Region Separable Stereo Matching
* Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion
* Regression-Selective Feature-Adaptive Tracker for Visual Object Tracking
* Reinforcement Shrink-Mask for Text Detection
* RelationTrack: Relation-Aware Multiple Object Tracking With Decoupled Representation
* Residual Learning Approach to Deblur and Generate High Frame Rate Video With an Event Camera, A
* Residual Quantization for Low Bit-Width Neural Networks
* RESTEP Into the Future: Relational Spatio-Temporal Learning for Multi-Person Action Forecasting
* Rethinking and Improving Few-Shot Segmentation From a Contour-Aware Perspective
* Retinex-Based Variational Framework for Low-Light Image Enhancement and Denoising
* Reversible Data Hiding for JPEG Images With Adaptive Multiple Two-Dimensional Histogram and Mapping Generation
* Reversible Data Hiding in Encrypted Images Based on Time-Varying Huffman Coding Table
* RFGAN: RF-Based Human Synthesis
* RFMask: A Simple Baseline for Human Silhouette Segmentation With Radio Signals
* RGBT Salient Object Detection: A Large-Scale Dataset and Benchmark
* RIVIE: Robust Inherent Video Information Embedding
* Robust Coverless Image Steganography Based on Neglected Coverless Image Dataset Construction
* Robust Frequency-Domain-Based Graph Adaptive Network for Parkinson's Disease Detection From Gait Data, A
* Robust Image Hashing With Isomap and Saliency Map for Copy Detection
* Robust Local Texture Descriptor in the Parametric Space of the Weibull Distribution, A
* Robust Multi-Drone Multi-Target Tracking to Resolve Target Occlusion: A Benchmark
* Robust Multimodal Sentiment Analysis via Tag Encoding of Uncertain Missing Modalities
* Robust Shape-Aware Rib Fracture Detection and Segmentation Framework With Contrastive Learning, A
* Robust Video-Text Retrieval Via Noisy Pair Calibration
* RQNet: Residual Quaternion CNN for Performance Enhancement in Low Complexity and Device Robust Acoustic Scene Classification
* RSNet: Relation Separation Network for Few-Shot Similar Class Recognition
* RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
* RV-TMO: Large-Scale Dataset for Subjective Quality Assessment of Tone Mapped Images
* RZSR: Reference-Based Zero-Shot Super-Resolution With Depth Guided Self-Exemplars
* Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation
* SCAN++: Enhanced Semantic Conditioned Adaptation for Domain Adaptive Object Detection
* Scene Graph Refinement Network for Visual Question Answering
* Scene-Text Oriented Referring Expression Comprehension
* SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution
* Seeing Through Darkness: Visual Localization at Night via Weakly Supervised Learning of Domain Invariant Features
* Self-Consistent Contrastive Attributed Graph Clustering With Pseudo-Label Prompt
* Self-Ensembling GAN for Cross-Domain Semantic Segmentation
* Self-Sufficient Feature Enhancing Networks for Video Salient Object Detection
* Self-Supervised Correlation Learning for Cross-Modal Retrieval
* Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for Visual-Audio Separation
* Self-Supervised Learning for Heterogeneous Audiovisual Scene Analysis
* Self-Supervised Learning for Multimedia Recommendation
* Self-Supervised Learning for Semi-Supervised Temporal Language Grounding
* Self-Supervised Learning With Data-Efficient Supervised Fine-Tuning for Crowd Counting
* Self-Supervised Masking for Unsupervised Anomaly Detection and Localization
* Self-Supervised Monocular Depth Estimation with Frequency-Based Recurrent Refinement
* Self-Supervised Point Cloud Representation Learning via Separating Mixed Shapes
* Self-Supervised Scene-Debiasing for Video Representation Learning via Background Patching
* Self-Weighted Anchor Graph Learning for Multi-View Clustering
* Semantic Point Cloud Upsampling
* Semantic Relevance Learning for Video-Query Based Video Moment Retrieval
* Semantic-Aware Noise Driven Portrait Synthesis and Manipulation
* Semantic-Aware Transmission With Adaptive Control Scheme for Volumetric Video Service, A
* Semantic-Aware Triplet Loss for Image Classification
* Semantic-Supervised Infrared and Visible Image Fusion Via a Dual-Discriminator Generative Adversarial Network
* Semantics-Preserving Sketch Embedding for Face Generation
* Semi-Fragile Reversible Watermarking for Authenticating 3D Models Based on Virtual Polygon Projection and Double Modulation Strategy, A
* Semi-Supervised Authentically Distorted Image Quality Assessment with Consistency-Preserving Dual-Branch Convolutional Neural Network
* Semi-Supervised Contrastive Learning With Similarity Co-Calibration
* Semi-Supervised Knowledge Distillation for Cross-Modal Hashing
* sfHxL3: Optimized Delivery Architecture for HTTP Low-Latency Live Streaming
* Show, Tell and Rephrase: Diverse Video Captioning via Two-Stage Progressive Training
* Siamese Alignment Network for Weakly Supervised Video Moment Retrieval
* Siamese Graph Learning for Semi-Supervised Age Estimation
* Simple but Effective Method for Balancing Detection and Re-Identification in Multi-Object Tracking, A
* Simultaneously Training and Compressing Vision-and-Language Pre-Training Model
* Single Image Deraining With Continuous Rain Density Estimation
* Single Person Dense Pose Estimation via Geometric Equivariance Consistency
* Skeleton-Based Action Recognition Through Contrasting Two-Stream Spatial-Temporal Networks
* Skeleton-Based Action Recognition with Select-Assemble-Normalize Graph Convolutional Networks
* Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition
* Skill-Based Hierarchical Reinforcement Learning for Target Visual Navigation
* SMNet: Synchronous Multi-Scale Low Light Enhancement Network With Local and Global Concern
* Source Identification of 3D Printer Based on Layered Texture Encoders
* Sparse Representation Classifier Guided Grassmann Reconstruction Metric Learning With Applications to Image Set Analysis
* Spatial-Channel Enhanced Transformer for Visible-Infrared Person Re-Identification
* Spatial-Temporal Exclusive Capsule Network for Open Set Action Recognition
* Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement
* Spatio-Temporal Self-Attention Network for Video Saliency Prediction
* Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking
* SPN2D-GAN: Semantic Prior Based Night-to-Day Image-to-Image Translation
* SRRNet: A Semantic Representation Refinement Network for Image Segmentation
* SRRV: A Novel Document Object Detector Based on Spatial-Related Relation and Vision
* STAM: A SpatioTemporal Attention Based Memory for Video Prediction
* Starting Point Selection and Multiple-Standard Matching for Video Object Segmentation With Language Annotation
* State Graph Reasoning for Multimodal Conversational Recommendation
* Steformer: Efficient Stereo Image Super-Resolution With Transformer
* STNet: Scale Tree Network With Multi-Level Auxiliator for Crowd Counting
* StrokeNet: Stroke Assisted and Hierarchical Graph Reasoning Networks
* Strong and Robust Skeleton-Based Gait Recognition Method with Gait Periodicity Priors, A
* StrongSORT: Make DeepSORT Great Again
* Structure-Aware Graph Convolution Network for Point Cloud Parsing
* Structure-Enriched Topology Learning For Cross-Domain Multi-Person Pose Estimation
* Subjective Functionality and Comfort Prediction for Apartment Floor Plans and Its Application to Intuitive Online Property Search
* Super-Resolution Flexible Video Coding Solution for Improving Live Streaming Quality, A
* Superframe-Based Temporal Proposals for Weakly Supervised Temporal Action Detection
* Survey on Video Action Recognition in Sports: Datasets, Methods and Applications, A
* S^2-Net:Semantic and Saliency Attention Network for Person Re-Identification
* S^3 Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection
* T-Net: Deep Stacked Scale-Iteration Network for Image Dehazing
* Tackling Micro-Expression Data Shortage via Dataset Alignment and Active Learning
* Task-Driven Video Compression for Humans and Machines: Framework Design and Optimization
* Teacher-Student Learning: Efficient Hierarchical Message Aggregation Hashing for Cross-Modal Retrieval
* Temporal Action Segmentation With High-Level Complex Activity Labels
* Temporal Attention-Pyramid Pooling for Temporal Action Detection
* Temporal Context Mining for Learned Video Compression
* Temporal Speciation Network for Few-Shot Object Detection
* Text Growing on Leaf
* Text-Guided Generation and Refinement Model for Image Captioning, A
* TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask
* TextFace: Text-to-Style Mapping Based Face Generation and Manipulation
* Textual Context-Aware Dense Captioning With Diverse Words
* Textural and Directional Information Based Offset In-Loop Filtering in AVS3
* Theme Transformer: Symbolic Music Generation With Theme-Conditioned Transformer
* They are Not Completely Useless: Towards Recycling Transferable Unlabeled Data for Class-Mismatched Semi-Supervised Learning
* Timely and Accurate Bitrate Switching in HTTP Adaptive Streaming With Date-Driven I-Frame Prediction
* Toward Intelligent Design: An AI-Based Fashion Designer Using Generative Adversarial Networks Aided by Sketch and Rendering Generators
* Towards Adaptive Consensus Graph: Multi-View Clustering via Graph Collaboration
* Towards Comprehensive Monocular Depth Estimation: Multiple Heads are Better Than One
* Towards Handling Sudden Changes in Feature Maps During Depth Estimation
* Towards Real-Time Video Caching at Edge Servers: A Cost-Aware Deep Q-Learning Solution
* Towards Task-Generic Image Compression: A Study of Semantics-Oriented Metrics
* Towards Unbiased Multi-Label Zero-Shot Learning With Pyramid and Semantic Attention
* Tracking With Mutual Attention Network
* Transferable Adversarial Belief Attack With Salient Region Perturbation Restriction, A
* Transferable Self-Supervised Instance Learning for Sleep Recognition
* Transferring Image-CLIP to Video-Text Retrieval via Temporal Relations
* Transformer Based Conditional GAN for Multimodal Image Fusion
* Transformer-Based Efficient Salient Instance Segmentation Networks With Orientative Query
* Tree-Structure Analysis Network on Handwritten Chinese Character Error Correction, A
* Trustable Co-Label Learning From Multiple Noisy Annotators
* TSFNet: Triple-Steam Image Captioning
* TSINIT: A Two-Stage Inpainting Network for Incomplete Text
* Tube-Embedded Transformer for Pixel Prediction
* Two-Level Rectification Attention Network for Scene Text Recognition, A
* Two-Stream Prototype Learning Network for Few-Shot Face Recognition Under Occlusions
* Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments
* Uncertainty-Aware Clustering for Unsupervised Domain Adaptive Object Re-Identification
* Uncertainty-Guided Semi-Supervised Few-Shot Class-Incremental Learning With Knowledge Distillation
* Underwater Adaptive Video Transmissions Using MIMO-Based Software-Defined Acoustic Modems
* Underwater Image Quality Assessment Metric, An
* Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching
* Unified Low-Rank Tensor Learning and Spectral Embedding for Multi-View Subspace Clustering
* Unified Multi-Weather Visibility Restoration
* Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition
* Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval
* Unsupervised Domain Adaptation for Person Re-Identification Via Individual-Preserving and Environmental-Switching Cyclic Generation
* Unsupervised Embedding Learning with Mutual-Information Graph Convolutional Networks
* Unsupervised Learning-Based Framework for Deepfake Video Detection
* Unsupervised Multi-Subclass Saliency Classification for Salient Object Detection
* Unsupervised Single-Image Reflection Removal
* Unsupervised Underexposed Image Enhancement via Self-Illuminated and Perceptual Guidance
* User Navigation Modeling, Rate-Distortion Analysis, and End-to-End Optimization for Viewport-Driven 360° Video Streaming
* User-Generated Video Quality Assessment: A Subjective and Objective Study
* User-Guided Personalized Image Aesthetic Assessment Based on Deep Reinforcement Learning
* USID-Net: Unsupervised Single Image Dehazing Network via Disentangled Representations
* VCGAN: Video Colorization With Hybrid Generative Adversarial Network
* Very Lightweight Photo Retouching Network With Conditional Sequential Modulation
* Video Denoising for Scenes With Challenging Motion: A Comprehensive Analysis and a New Framework
* Video Instance Segmentation by Instance Flow Assembly
* Video-to-Music Recommendation Using Temporal Alignment of Segments
* View-Aware Salient Object Detection for 360° Omnidirectional Image
* Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID
* Virtual Try-On With Garment Self-Occlusion Conditions
* Visibility and Distortion Measurement for No-Reference Dehazed Image Quality Assessment via Complex Contourlet Transform
* Visible-Infrared Person Re-Identification via Cross-Modality Interaction Transformer
* Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks
* Visual Interaction Perceptual Network for Blind Image Quality Assessment
* Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning
* VPFNet: Improving 3D Object Detection With Virtual Point Based LiDAR and Stereo Data Fusion
* VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment
* Vulnerability of Feature Extractors in 2D Image-Based 3D Object Retrieval
* W-Net: Structure and Texture Interaction for Image Inpainting
* Wavelet Transform-Assisted Adaptive Generative Modeling for Colorization
* Weakly Supervised Audio-Visual Violence Detection
* Weakly Supervised Distribution Discrepancy Minimization Learning With State Information for Person Re-Identification
* Weakly Supervised Few-Shot Segmentation via Meta-Learning
* Weakly Supervised Few-Shot Semantic Segmentation via Pseudo Mask Enhancement and Meta Learning
* Weakly Supervised Instance Segmentation by Exploring Entire Object Regions
* Weakly Supervised Object Detection With Class Prototypical Network
* Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition
* Weakly Supervised Semantic Segmentation via Progressive Patch Learning
* Weakly-Supervised 3D Human Pose Estimation With Cross-View U-Shaped Graph Convolutional Network
* Weakly-Supervised Video Object Grounding via Learning Uni-Modal Associations
* What Happens in Crowd Scenes: A New Dataset About Crowd Scenes for Image Captioning
* YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer
* Zero-Shot Predicate Prediction for Scene Graph Parsing
707 for MultMed(25)
MultMed(26)
* 3D Object Segmentation Using Cross-Window Point Transformer With Latent Semantic Boundary Guidance
* 3D Scene Graph Generation From Point Clouds
* 3DTA: No-Reference 3D Point Cloud Quality Assessment With Twin Attention
* AAMT: Adversarial Attack-Driven Mutual Teaching for Source-Free Domain-Adaptive Person Reidentification
* Abnormal Ratios Guided Multi-Phase Self-Training for Weakly-Supervised Video Anomaly Detection
* Achieving QoE Fairness in Bitrate Allocation of 360° Video Streaming
* Achieving the Optimum Rate for Cross-Modal Source Coding
* Action-Semantic Consistent Knowledge for Weakly-Supervised Action Localization
* Activating More Information in Arbitrary-Scale Image Super-Resolution
* Adaptive Activation Network for Weakly Supervised Semantic Segmentation
* Adaptive Dual Selective Transformer for Temporal Action Localization, An
* Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition
* Adaptive HEVC Video Steganography With High Performance Based on Attention-Net and PU Partition Modes
* Adaptive Sample Assignment Network for Tiny Object Detection, An
* Adaptive Stage-Aware Assessment Skill Transfer for Skill Determination
* Adaptive Structure and Texture Similarity Metric for Image Quality Assessment and Optimization
* Adaptive Teaching for Cross-Domain Crowd Counting
* Adaptive Weight Generator for Multi-Task Image Recognition by Task Grouping Prompt
* ADMNet: Attention-Guided Densely Multi-Scale Network for Lightweight Salient Object Detection
* Adversarial Obstacle Generation Against LiDAR-Based 3D Object Detection
* AdvST: Generating Unrestricted Adversarial Images via Style Transfer
* AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model
* Align and Retrieve: Composition and Decomposition Learning in Image Retrieval With Text Feedback
* Alignment-Free RGBT Salient Object Detection: Semantics-Guided Asymmetric Correlation Network and a Unified Benchmark
* Alleviating Over-Fitting in Hashing-Based Fine-Grained Image Retrieval: From Causal Feature Learning to Binary-Injected Hash Learning
* AMatFormer: Efficient Feature Matching via Anchor Matching Transformer
* Anchor Graph-Based Feature Selection for One-Step Multi-View Clustering
* AnimeDiff: Customized Image Generation of Anime Characters Using Diffusion Model
* Anti-Compression Contrastive Facial Forgery Detection
* APCAFlow: All-Pairs Cost Volume Aggregation for Optical Flow Estimation
* Arbitrary Shape Text Detection via Boundary Transformer
* ARES: On Adversarial Robustness Enhancement for Image Steganographic Cost Learning
* Art Image Inpainting With Style-Guided Dual-Branch Inpainting Network
* Attacking Defocus Detection With Blur-Aware Transformation for Defocus Deblurring
* Attentive Snippet Prompting for Video Retrieval
* ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries
* Audio-Driven Talking Video Frame Restoration
* Audio-Visual Contrastive and Consistency Learning for Semi-Supervised Action Recognition
* Auto-Points: Automatic Learning for Point Cloud Analysis with Neural Architecture Search
* Autoencoder-Based Collaborative Attention GAN for Multi-Modal Image Synthesis
* Automatic Generation of Interactive Nonlinear Video for Online Apparel Shopping Navigation
* Automatic Hypergraph Generation for Enhancing Recommendation With Sparse Optimization
* Automatic Identification of Human Subgroups in Time-Dependent Pedestrian Flow Networks
* Automatic Point Cloud Registration for 3D Virtual-to-Real Registration Using Macro and Micro Structures
* A^2Pt: Anti-Associative Prompt Tuning for Open Set Visual Recognition
* Balanced Classification: A Unified Framework for Long-Tailed Object Detection
* BASICS: Broad Quality Assessment of Static Point Clouds in a Compression Scenario
* BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge
* Beauty of Repetition: An Algorithmic Composition Model With Motif-Level Repetition Generator and Outline-to-Music Generator in Symbolic Music Generation, The
* Benchmark Dataset and Pair-Wise Ranking Method for Quality Evaluation of Night-Time Image Enhancement
* Benchmark for Controllable Text-Image-to-Video Generation, A
* Beyond Instance Discrimination: Relation-Aware Contrastive Self-Supervised Learning
* BI-CAM: Generating Explanations for Deep Neural Networks Using Bipolar Information
* Bidirectional Knowledge Reconfiguration for Lightweight Point Cloud Analysis
* Bilateral Interaction for Local-Global Collaborative Perception in Low-Light Image Enhancement
* Bilateral Knowledge Interaction Network for Referring Image Segmentation
* Binary Similarity Few-Shot Object Detection With Modeling of Hard Negative Samples
* Binocular Image Dehazing via a Plain Network Without Disparity Estimation
* Bio-Inspired Multi-Scale Contourlet Attention Networks
* Bipartite Graph-Based Projected Clustering With Local Region Guidance for Hyperspectral Imagery
* Bit-Plane Based Reversible Data Hiding in Encrypted Images Using Multi-Level Blocking With Quad-Tree
* Blind Image Quality Assessment Based on Perceptual Comparison
* Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token
* Blind Image Quality Index With Cross-Domain Interaction and Cross-Scale Integration
* Blind Quality Enhancement for Compressed Video
* Blind Quality Evaluator of Light Field Images by Group-Based Representations and Multiple Plane-Oriented Perceptual Characteristics
* Boosting Adversarial Training with Hardness-Guided Attack Strategy
* Boosting Adversarial Transferability With Learnable Patch-Wise Masks
* Boosting Entity-Aware Image Captioning With Multi-Modal Knowledge Graph
* Boundary-Guided Lightweight Semantic Segmentation With Multi-Scale Semantic Context
* Bounding Box Vectorization for Oriented Object Detection With Tanimoto Coefficient Regression
* BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-Term Pose Forecasting
* Building Multimodal Knowledge Bases With Multimodal Computational Sequences and Generative Adversarial Networks
* C2ANet: Cross-Scale and Cross-Modality Aggregation Network for Scene Depth Super-Resolution
* Calligraphy Font Generation via Explicitly Modeling Location-Aware Glyph Component Deformations
* Camera Topology Graph Guided Vehicle Re-Identification
* CAMF: An Interpretable Infrared and Visible Image Fusion Network Based on Class Activation Mapping
* CARE: Cloudified Android With Optimized Rendering Platform
* Category-Adaptive Label Discovery and Noise Rejection for Multi-Label Recognition With Partial Positive Labels
* Category-Aware Curriculum Learning for Data-Free Knowledge Distillation, A
* CATNet: A Cascaded and Aggregated Transformer Network for RGB-D Salient Object Detection
* CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
* CAU: A Causality Attention Unit for Spatial-Temporal Sequence Forecast
* CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses
* CBDMoE: Consistent-but-Diverse Mixture of Experts for Domain Generalization
* CCANet: A Collaborative Cross-Modal Attention Network for RGB-D Crowd Counting
* CDCM: ChatGPT-Aided Diversity-Aware Causal Model for Interactive Recommendation
* CDINet: Content Distortion Interaction Network for Blind Image Quality Assessment
* CDKM: Common and Distinct Knowledge Mining Network With Content Interaction for Dense Captioning
* Centralized Error Distribution-Preserving Adaptive Steganography for HEVC
* CFENet: Boosting Few-Shot Semantic Segmentation With Complementary Feature-Enhanced Network
* CGLF-Net: Image Emotion Recognition Network by Combining Global Self-Attention Features and Local Multiscale Features
* Class Enhancement Losses With Pseudo Labels for Open-Vocabulary Semantic Segmentation
* Class-Aware Dual-Supervised Aggregation Network for Video Object Detection
* Class-Wise Contrastive Prototype Learning for Semi-Supervised Classification Under Intersectional Class Mismatch
* CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding
* CLIPREC: Graph-Based Domain Adaptive Network for Zero-Shot Referring Expression Comprehension
* Clothes-Changing Person Re-Identification via Universal Framework With Association and Forgetting Learning
* Cluster-Instance Normalization: A Statistical Relation-Aware Normalization for Generalizable Person Re-Identification
* CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
* CMAT: Integrating Convolution Mixer and Self-Attention for Visual Tracking
* CMCF-Net: An End-to-End Context Multiscale Cross-Fusion Network for Robust Copy-Move Forgery Detection
* CMNet: Component-Aware Matching Network for Few-Shot Point Cloud Classification
* CMVDE: Consistent Multi-View Video Depth Estimation via Geometric-Temporal Coupling Approach
* Coarse-to-Fine Cross-View Interaction Based Accurate Stereo Image Super-Resolution Network
* Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention
* Coarse-to-Fine Image Aesthetics Assessment With Dynamic Attribute Selection
* Coarse-to-Fine Nutrition Prediction
* CodedBGT: Code Bank-Guided Transformer for Low-Light Image Enhancement
* CoLive: Edge-Assisted Clustered Learning Framework for Viewport Prediction in 360° Live Streaming
* Collaborative Multi-Agent Video Fast-Forwarding
* Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading
* Color Enhanced Cross Correlation Net for Image Sentiment Analysis
* Comment-Context Dual Collaborative Masked Transformer Network for Fake News Detection
* Commonsense Knowledge Prompting for Few-Shot Action Recognition in Videos
* Commonsense-Guided Semantic and Relational Consistencies for Image-Text Retrieval
* Completed Part Transformer for Person Re-Identification
* Compressive Sensing Based Image Codec With Partial Pre-Calculation
* Conditional Consistency Regularization for Semi-Supervised Multi-Label Image Classification
* Conditional Video Diffusion Network for Fine-Grained Temporal Sentence Grounding
* ConGMC: Consistency-Guided Multimodal Clustering via Mutual Information Maximin
* Consistent GT-Proposal Assignment for Challenging Pedestrian Detection
* Constrained Bipartite Graph Learning for Imbalanced Multi-Modal Retrieval
* Content-Adaptive Rate-Distortion Modeling for Frame-Level Rate Control in Versatile Video Coding
* Context Matters: Distilling Knowledge Graph for Enhanced Object Detection
* Context-Aware Interaction Network for RGB-T Semantic Segmentation
* Context-Guided Black-Box Attack for Visual Tracking
* Contextualized Relation Predictive Model for Self-Supervised Group Activity Representation Learning
* Continual All-in-One Adverse Weather Removal With Knowledge Replay on a Unified Network Structure
* Continuous Emotion-Based Image-to-Music Generation
* Contrastive Multi-View Learning for 3D Shape Clustering
* Controllable Video Generation With Text-Based Instructions
* Cooperative Bargaining Game Based Adaptive Video Multicast Over Mobile Edge Networks
* Cooperative Separation of Modality Shared-Specific Features for Visible-Infrared Person Re-Identification
* Correlation-Guided Distribution and Geometry Alignments for Heterogeneous Domain Adaptation
* Counterfactual Visual Dialog: Robust Commonsense Knowledge Learning From Unbiased Training
* Covariant Peak Constraint for Accurate Keypoint Detection and Keypoint-Specific Descriptor Learning
* CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning
* CRADA: Cross Domain Object Detection With Cyclic Reconstruction and Decoupling Adaptation
* CRNet: Context-guided Reasoning Network for Detecting Hard Objects
* CroMIC-QA: The Cross-Modal Information Complementation Based Question Answering
* Cross Modal Compression With Variable Rate Prompt
* Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA
* Cross-Aware Early Fusion With Stage-Divided Vision and Language Transformer Encoders for Referring Image Segmentation
* Cross-Domain Detection Transformer Based on Spatial-Aware and Semantic-Aware Token Alignment
* Cross-Domain Low-Dose CT Image Denoising With Semantic Preservation and Noise Alignment
* Cross-Domain Sample Relationship Learning for Facial Expression Recognition
* Cross-Domain Scene Unsupervised Learning Segmentation With Dynamic Subdomains
* Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval
* Cross-Modal Quantization for Co-Speech Gesture Generation
* Cross-Modality Knowledge Calibration Network for Video Corpus Moment Retrieval
* Cross-Modality Proposal-Guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection
* Cross-Modality Vessel Re-Identification With Deep Alignment Decomposition Network
* Cross-Receptive Focused Inference Network for Lightweight Image Super-Resolution
* Cross-Task Multimodal Reinforcement for Long Tail Next POI Recommendation
* Crossmodal Translation Based Meta Weight Adaption for Robust Image-Text Sentiment Analysis
* Crowd Descriptors and Interpretable Gathering Understanding
* CrowdCaption++: Collective-Guided Crowd Scenes Captioning
* CS-IntroVAE: Cauchy-Schwarz Divergence-Based Introspective Variational Autoencoder
* CTE-Net: Contextual Texture Enhancement Network for Image Super-Resolution
* Cycle-Retinex: Unpaired Low-Light Image Enhancement via Retinex-Inline CycleGAN
* DACOD360: Deadline-Aware Content Delivery for 360-Degree Video Streaming Over MEC Networks
* DanceComposer: Dance-to-Music Generation Using a Progressive Conditional Music Generator
* Dataset and Benchmark for 3D Scene Plausibility Assessment, A
* DCMSTRD: End-to-end Dense Captioning via Multi-Scale Transformer Decoding
* DCRP: Class-Aware Feature Diffusion Constraint and Reliable Pseudo-Labeling for Imbalanced Semi-Supervised Learning
* DDOD: Dive Deeper into the Disentanglement of Object Detector
* Deconfounding Causal Inference for Zero-Shot Action Recognition
* Decoupling and Integration Network for Camouflaged Object Detection
* Deep Conditional HDRI: Inverse Tone Mapping via Dual Encoder-Decoder Conditioning Method
* Deep Counterfactual Representation Learning for Visual Recognition Against Weather Corruptions
* Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech
* Deep Hashing Network With Hybrid Attention and Adaptive Weighting for Image Retrieval
* Deep Neighborhood Structure-Preserving Hashing for Large-Scale Image Retrieval
* Deep Neighborhood-Preserving Hashing With Quadratic Spherical Mutual Information for Cross-Modal Retrieval
* Deep Progressive Asymmetric Quantization Based on Causal Intervention for Fine-Grained Image Retrieval
* Deep Rank-N Decomposition Network for Image Fusion
* Deep Ranking Distribution Preserving Hashing for Robust Multi-Label Cross-Modal Retrieval
* Deep Unfolding Network for Image Compressed Sensing by Content-Adaptive Gradient Updating and Deformation-Invariant Non-Local Modeling
* Deep Unrestricted Document Image Rectification
* Deepfake Detection Fighting Against Noisy Label Attack
* Deeply Hybrid Contrastive Learning Based on Semantic Pseudo-Label for Salient Object Detection in Optical Remote Sensing Images
* DeepSpoof: Deep Reinforcement Learning-Based Spoofing Attack in Cross-Technology Multimedia Communication
* Degradation-Aware Dynamic Fourier-Based Network for Spectral Compressive Imaging
* Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution
* Delving Into Important Samples of Semi-Supervised Old Photo Restoration: A New Dataset and Method
* Depth-Guided Deep Video Inpainting
* DFR-Net: Density Feature Refinement Network for Image Dehazing Utilizing Haze Density Difference
* DGFNet: Depth-Guided Cross-Modality Fusion Network for RGB-D Salient Object Detection
* Difference-Aware Distillation for Semantic Segmentation
* DiffFashion: Reference-Based Fashion Design With Structure-Aware Transfer by Diffusion Models
* DIMGNet: A Transformer-Based Network for Pedestrian Reidentification With Multi-Granularity Information Mutual Gain
* Discrepancy and Structure-Based Contrast for Test-Time Adaptive Retrieval
* Discriminative Identity-Feature Exploring and Differential Aware Learning for Unsupervised Person Re-Identification
* DiscrimLoss: A Universal Loss for Hard Samples and Incorrect Samples Discrimination
* Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation With Interpretability
* Disentangled Representation Learning for Controllable Person Image Generation
* Disguised Heterogeneous Face Generation With Iterative-Adversarial Style Unification
* Disjoint Masking With Joint Distillation for Efficient Masked Image Modeling
* Distortion-Aware Self-Supervised Indoor 360° Depth Estimation via Hybrid Projection Fusion and Structural Regularities
* DMAP: Decoupling-Driven Multi-Level Attribute Parsing for Interpretable Outfit Collocation
* DMF-GAN: Deep Multimodal Fusion Generative Adversarial Networks for Text-to-Image Synthesis
* DMH-CL: Dynamic Model Hardness Based Curriculum Learning for Complex Pose Estimation
* Domain Adaptive LiDAR Point Cloud Segmentation With 3D Spatial Consistency
* Domain Complementary Adaptation by Leveraging Diversity and Discriminability From Multiple Sources
* Domain Prompt Tuning via Meta Relabeling for Unsupervised Adversarial Adaptation
* Domain-Adaptive Energy-Based Models for Generalizable Face Anti-Spoofing
* Domain-Aware Graph Network for Bridging Multi-Source Domain Adaptation
* Domain-Consistent and Uncertainty-Aware Network for Generalizable Gaze Estimation
* Domain-Oriented Knowledge Transfer for Cross-Domain Recommendation
* Dominant SIngle-Modal SUpplementary Fusion (SIMSUF) for Multimodal Sentiment Analysis
* Double-Domain Adaptation Semantics for Retrieval-Based Long-Term Visual Localization
* Downstream-Pretext Domain Knowledge Traceback for Active Learning
* DPHANet: Discriminative Parallel and Hierarchical Attention Network for Natural Language Video Localization
* DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition
* DRAN: Detailed Region-Adaptive Normalization for Conditional Image Synthesis
* Drawlody: Sketch-Based Melody Creation With Enhanced Usability and Interpretability
* DropQueries: A Simple Way to Discover Comprehensive Segment Representations
* DSIS-DPR: Structured Instance Segmentation and Diffusion Prior Refinement for Dental Anatomy Learning
* DSS-Net: Dynamic Self-Supervised Network for Video Anomaly Detection
* Dual Knowledge Distillation on Multiview Pseudo Labels for Unsupervised Person Re-Identification
* Dual Masked Modeling for Weakly-Supervised Temporal Boundary Discovery
* Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model
* Dual Noise Elimination and Dynamic Label Correlation Guided Partial Multi-Label Learning
* Dual Reinforcement Learning Framework for Weakly Supervised Phrase Grounding, A
* Dual Self-Paced Hashing for Image Retrieval
* Dual-Domain Aligned Deep Hierarchical Matrix Factorization Method for Micro-Video Multi-Label Classification
* Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation
* Dual-Perspective Fusion Network for Aspect-Based Multimodal Sentiment Analysis
* Dual-Stage Uncertainty Modeling for Unsupervised Cross-Domain 3D Model Retrieval
* Dual-Stream Contrastive Learning for Compositional Zero-Shot Recognition
* Dynamic Confidence Sampling and Label Semantic Guidance Learning for Domain Adaptive Retrieval
* Dynamic Pathway for Query-Aware Feature Learning in Language-Driven Action Localization
* Dynamic Template Updating Using Spatial-Temporal Information in Siamese Trackers
* Dynamic View Aggregation for Multi-View 3D Shape Recognition
* Dynamically Shifting Multimodal Representations via Hybrid-Modal Attention for Multimodal Sentiment Analysis
* E-MLB: Multilevel Benchmark for Event-Based Camera Denoising
* Each Performs Its Functions: Task Decomposition and Feature Assignment for Audio-Visual Segmentation
* EdgeVision: Towards Collaborative Video Analytics on Distributed Edges for Performance Maximization
* EDMC: Efficient Multi-View Clustering via Cluster and Instance Space Learning
* Effective and Robust Adversarial Training Against Data and Label Corruptions
* Efficient Anchor Graph Factorization for Multi-View Clustering
* Efficient Attribute-Preserving Framework for Face Swapping, An
* Efficient Hybrid Feature Interaction Network for Stereo Image Super-Resolution
* Efficient Latent Style Guided Transformer-CNN Framework for Face Super-Resolution, An
* Efficient Unsupervised Video Hashing With Contextual Modeling and Structural Controlling
* EHPE: Skeleton Cues-Based Gaussian Coordinate Encoding for Efficient Human Pose Estimation
* EISNet: A Multi-Modal Fusion Network for Semantic Segmentation With Events and Images
* Embedded Heterogeneous Attention Transformer for Cross-Lingual Image Captioning
* EMERSK: Explainable Multimodal Emotion Recognition With Situational Knowledge
* EmoMusicTV: Emotion-Conditioned Symbolic Music Generation With Hierarchical Transformer VAE
* EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation
* End-to-End Distortion Modeling for Error-Resilient Screen Content Video Coding
* End-to-End Instance-Level Human Parsing by Segmenting Persons
* End-to-End Video Scene Graph Generation With Temporal Propagation Transformer
* Enhance Composed Image Retrieval via Multi-Level Collaborative Localization and Semantic Activeness Perception
* Enhanced Context Mining and Filtering for Learned Video Compression
* Enhanced Temporal Consistency for Global Patch Allocation in Video-Based Point Cloud Compression
* Enhancing Unsupervised Semantic Segmentation Through Context-Aware Clustering
* EPM-Net: Efficient Feature Extraction, Point-Pair Feature Matching for Robust 6-D Pose Estimation
* ESC-Net: Alleviating Triple Sparsity on 3D LiDAR Point Clouds for Extreme Sparse Scene Completion
* Estimating the Semantics via Sector Embedding for Image-Text Retrieval
* Event-Aware Retrospective Learning for Knowledge-Based Image Captioning
* Event-Based Low-Illumination Image Enhancement
* Explainability of Speech Recognition Transformers via Gradient-Based Attention Visualization
* Exploit the Best of Both End-to-End and Map-Based Methods for Multi-Focus Image Fusion
* Exploiting Multi-Scale Parallel Self-Attention and Local Variation via Dual-Branch Transformer-CNN Structure for Face Super-Resolution
* Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution
* Exploiting Substitution Box for Cryptanalyzing Image Encryption Schemes With DNA Coding and Nonlinear Dynamics
* Exploiting Temporal Correlations for 3D Human Pose Estimation
* Exploring Accurate Invariants on Polar Harmonic Fourier Moments in Polar Coordinates for Robust Image Watermarking
* Exploring Rich Semantics for Open-Set Action Recognition
* Exploring Spatial Frequency Information for Enhanced Video Prediction Quality
* Exploring the Applicability of Spectral Recovery in Semantic Segmentation of RGB Images
* Extensible Max-Min Collaborative Retention for Online Mini-Batch Learning Hash Retrieval
* FaceRefiner: High-Fidelity Facial Texture Refinement With Differentiable Rendering-Based Style Transfer
* FARP-Net: Local-Global Feature Aggregation and Relation-Aware Proposals for 3D Object Detection
* Fast and Effective: Progressive Hierarchical Fusion Classification for Remote Sensing Images
* Fast Fourier Inception Networks for Occluded Video Prediction
* Feature Completion Transformer for Occluded Person Re-Identification
* Feature Distribution Representation Learning Based on Knowledge Transfer for Long-Tailed Classification
* Feature First: Advancing Image-Text Retrieval Through Improved Visual Features
* Feature Reconstruction With Disruption for Unsupervised Video Anomaly Detection
* Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization
* Federated Adversarial Domain Hallucination for Privacy-Preserving Domain Generalization
* FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification
* Few-Shot Class-Incremental Audio Classification Using Dynamically Expanded Classifier With Self-Attention Modified Prototypes
* Few-Shot Contrastive Transfer Learning With Pretrained Model for Masked Face Verification
* Few-Shot Fine-Grained Image Classification via Multi-Frequency Neighborhood and Double-Cross Modulation
* Few-Shot Generative Model Adaptation via Style-Guided Prompt
* Find Gold in Sand: Fine-Grained Similarity Mining for Domain-Adaptive Crowd Counting
* Fine-Grained Representation Alignment for Zero-Shot Domain Adaptation
* Fine-Tuning for Few-Shot Image Classification by Multimodal Prototype Regularization
* Flexible Alignment Super-Resolution Network for Multi-Contrast Magnetic Resonance Imaging
* Flow Guidance Deformable Compensation Network for Video Frame Interpolation
* FMSA-SC: A Fine-Grained Multimodal Sentiment Analysis Dataset Based on Stock Comment Videos
* Focus Relationship Perception for Unsupervised Multi-Focus Image Fusion
* Focusing on Subtle Differences: A Feature Disentanglement Model for Series Photo Selection
* Frame-Padded Multiscale Transformer for Monocular 3D Human Pose Estimation
* FreqAlign: Excavating Perception-Oriented Transferability for Blind Image Quality Assessment From a Frequency Perspective
* Frequency-Aware Multi-Modal Fine-Tuning for Few-Shot Open-Set Remote Sensing Scene Classification
* Frequency-Based Matcher for Long-Tailed Semantic Segmentation
* From Appearance to Inherence: A Hyperspectral Image Dataset and Benchmark of Material Classification for Surveillance
* From Observation to Concept: A Flexible Multi-View Paradigm for Medical Report Generation
* Fusion-Embedding Siamese Network for Light Field Salient Object Detection
* Gait Recognition With Drones: A Benchmark
* Gait Recognition With Multi-Level Skeleton-Guided Refinement
* GaitParsing: Human Semantic Parsing for Gait Recognition
* Gap-Closing Matters: Perceptual Quality Evaluation and Optimization of Low-Light Image Enhancement
* Gated Multi-Scale Transformer for Temporal Action Localization
* GCSANet: Arbitrary Style Transfer With Global Context Self-Attentional Network
* GCVC: Graph Convolution Vector Distribution Calibration for Fish Group Activity Recognition
* General Deformable RoI Pooling and Semi-Decoupled Head for Object Detection
* Generative Essential Graph Convolutional Network for Multi-View Semi-Supervised Classification
* GIN: Generative INvariant Shape Prior for Amodal Instance Segmentation
* GLFF: Global and Local Feature Fusion for AI-Synthesized Image Detection
* Global and Local Spatio-Temporal Encoder for 3D Human Pose Estimation
* Global-Shared Text Representation Based Multi-Stage Fusion Transformer Network for Multi-Modal Dense Video Captioning
* Glow in the Dark: Low-Light Image Enhancement With External Memory
* Going the Extra Mile in Face Image Quality Assessment: A Novel Database and Model
* GPT-Based Knowledge Guiding Network for Commonsense Video Captioning
* Gradient-Semantic Compensation for Incremental Semantic Segmentation
* Graph Convolution Based Efficient Re-Ranking for Visual Retrieval
* Graph-Based Discriminator Architecture for Multi-Attribute Facial Image Editing, A
* Graph-Based Multimodal Topic Modeling With Word Relations and Object Relations
* Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition
* GraphCFC: A Directed Graph Based Cross-Modal Feature Complementation Approach for Multimodal Conversational Emotion Recognition
* Group Multi-View Transformer for 3D Shape Analysis With Spatial Encoding
* Guest Editorial Introduction to the Issue on Pre-Trained Models for Multi-Modality Understanding
* Guided Image-to-Image Translation by Discriminator-Generator Communication
* Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection
* HCL: Hierarchical Consistency Learning for Webly Supervised Fine-Grained Recognition
* HCM: Online Action Detection With Hard Video Clip Mining
* Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos
* Hiding Multiple Images into a Single Image Using Up-Sampling
* Hierarchical Camera-Aware Contrast Extension for Unsupervised Person Re-Identification
* Hierarchical Consensus Hashing for Cross-Modal Retrieval
* Hierarchical Dynamic Masks for Visual Explanation of Neural Networks
* Hierarchical Equalization Loss for Long-Tailed Instance Segmentation
* Hierarchical Forgery Classifier on Multi-Modality Face Forgery Clues
* Hierarchical Independent Coding Scheme for Varifocal Multiview Images Based on Angular-Focal Joint Prediction
* Hierarchical Local-Global Transformer for Temporal Sentence Grounding
* Hierarchical Locality-Aware Deep Dictionary Learning for Classification
* High Fidelity Face-Swapping With Style ConvTransformer and Latent Space Selection
* HINT: High-Quality INpainting Transformer With Mask-Aware Encoding and Enhanced Attention
* HitFusion: Infrared and Visible Image Fusion for High-Level Vision Tasks Using Transformer
* HODN: Disentangling Human-Object Feature for HOI Detection
* How to Cache Important Contents for Multi-Modal Service in Dynamic Networks: A DRL-Based Caching Scheme
* How to Improve Immersive Experience?
* Human Activity Discovery With Automatic Multi-Objective Particle Swarm Optimization Clustering With Gaussian Mutation and Game Theory
* Human Gait Recognition Based on Frontal-View Sequences Using Gait Dynamics and Deep Learning
* Human-Centric Behavior Description in Videos: New Benchmark and Model
* Hybrid Graph Reasoning With Dynamic Interaction for Visual Dialog
* IBFusion: An Infrared and Visible Image Fusion Method Based on Infrared Target Mask and Bimodal Feature Extraction Strategy
* IcoCap: Improving Video Captioning by Compounding Images
* IGReg: Image-Geometry-Assisted Point Cloud Registration via Selective Correlation Fusion
* Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding
* Illumination Distillation Framework for Nighttime Person Re-Identification and a New Benchmark
* Image Aesthetics Assessment Based on Hypernetwork of Emotion Fusion
* Image Dehazing Assessment: A Real-World Dataset and a Haze Density-Aware Criteria
* Image-Based Structured Vehicle Behavior Analysis Inspired by Interactive Cognition
* Imbalanced Open Set Domain Adaptation via Moving-Threshold Estimation and Gradual Alignment
* Implicit Compositional Generative Network for Length-Variable Co-Speech Gesture Synthesis
* Implicit-Explicit Motion Learning for Video Camouflaged Object Detection
* Improving Adaptive Real-Time Video Communication via Cross-Layer Optimization
* Improving Cross-Modal Constraints: Text Attribute Person Search With Graph Attention Networks
* Improving Deepfake Detection Generalization by Invariant Risk Minimization
* Improving Fine-Grained Image Classification With Multimodal Information
* Improving Handwritten Mathematical Expression Recognition via Similar Symbol Distinguishing
* Improving Multi-Person Pose Tracking With a Confidence Network
* Improving Pre-Trained Model-Based Speech Emotion Recognition From a Low-Level Speech Feature Perspective
* Improving the Conditional Fine-Grained Image Generation With Part Perception
* Incomplete Multi-View Clustering via Correntropy and Complement Consensus Learning
* Inexactly Matched Referring Expression Comprehension With Rationale
* InfoUCL: Learning Informative Representations for Unsupervised Continual Learning
* Integrating Language Guidance Into Image-Text Matching for Correcting False Negatives
* Integration of Global and Local Knowledge for Foreground Enhancing in Weakly Supervised Temporal Action Localization
* Inter- and Intra-Domain Potential User Preferences for Cross-Domain Recommendation
* Inter-Modal Masked Autoencoder for Self-Supervised Learning on Point Clouds
* Invisible Intruders: Label-Consistent Backdoor Attack Using Re-Parameterized Noise Trigger
* IRVR: A General Image Restoration Framework for Visual Recognition
* iSCMIS:Spatial-Channel Attention Based Deep Invertible Network for Multi-Image Steganography
* Iterative Adversarial Attack on Image-Guided Story Ending Generation
* Joint Correcting and Refinement for Balanced Low-Light Image Enhancement
* Joint Identity-Aware Mixstyle and Graph-Enhanced Prototype for Clothes-Changing Person Re-Identification
* Joint Intra & Inter-Grained Reasoning: A New Look Into Semantic Consistency of Image-Text Retrieval
* Joint Rate-Distortion Optimization for Video Coding and Learning-Based In-Loop Filtering
* Joint-Limb Compound Triangulation With Co-Fixing for Stereoscopic Human Pose Estimation
* Joints-Centered Spatial-Temporal Features Fused Skeleton Convolution Network for Action Recognition
* Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering
* KNLConv: Kernel-Space Non-Local Convolution for Hyperspectral Image Super-Resolution
* Knowledge Enhanced Vision and Language Model for Multi-Modal Fake News Detection
* Knowledge-Based Hierarchical Causal Inference Network for Video Action Recognition, A
* Knowledge-Enhanced Causal Reinforcement Learning Model for Interactive Recommendation
* Label-Guided Dynamic Spatial-Temporal Fusion for Video-Based Facial Expression Recognition
* Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking
* Language-Guided Dual-Modal Local Correspondence for Single Object Tracking
* LARNet: Towards Lightweight, Accurate and Real-Time Salient Object Detection
* Learnable Tensor Graph Fusion Framework for Natural Image Segmentation
* Learned Focused Plenoptic Image Compression With Microimage Preprocessing and Global Attention
* Learned Video Compression via Heterogeneous Deformable Compensation Network
* Learning 3D Face Reconstruction From the Cycle-Consistency of Dynamic Faces
* Learning 3D Shape Latent for Point Cloud Completion
* Learning a Novel Ensemble Tracker for Robust Visual Tracking
* Learning Deep Representations for Photo Retouching
* Learning Feature Semantic Matching for Spatio-Temporal Video Grounding
* Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching
* Learning Label Semantics for Weakly Supervised Group Activity Recognition
* Learning Monocular Regression of 3D People in Crowds via Scene-Aware Blending and De-Occlusion
* Learning Multi-Expert Distribution Calibration for Long-Tailed Video Classification
* Learning Multi-Layer Attention Aggregation Siamese Network for Robust RGBT Tracking
* Learning Music-Dance Representations Through Explicit-Implicit Rhythm Synchronization
* Learning Mutually Exclusive Part Representations for Fine-Grained Image Classification
* Learning Representations by Contrastive Spatio-Temporal Clustering for Skeleton-Based Action Recognition
* Learning Robust Point Representation for 3D Non-Rigid Shape Retrieval
* Learning Semantic Polymorphic Mapping for Text-Based Person Retrieval
* Learning Semantics-Guided Representations for Scoring Figure Skating
* Learning Shape-Biased Representations for Infrared Small Target Detection
* Learning Structured Relation Embeddings for Fine-Grained Fashion Attribute Recognition
* Learning Temporal Dynamics in Videos With Image Transformer
* Learning to Disentangle the Colors, Textures, and Shapes of Fashion Items: A Unified Framework
* Learning to Evaluate the Artness of AI-Generated Images
* Learning to Hallucinate Face in the Dark
* Learning to Hallucinate Face in the Dark
* Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression
* Learning to Supervise Knowledge Retrieval Over a Tree Structure for Visual Question Answering
* Learning With Imbalanced Noisy Data by Preventing Bias in Sample Selection
* Learning-Based Auction for Matching Demand and Supply of Holographic Digital Twin Over Immersive Communications
* Leveraging the Video-Level Semantic Consistency of Event for Audio-Visual Event Localization
* LHNetV2: A Balanced Low-Cost Hybrid Network for Single Image Dehazing
* LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation
* Lightweight Adaptive Feature De-Drifting for Compressed Image Classification
* Lightweight Model Pre-Training via Language Guided Knowledge Distillation
* Lightweight Multiperson Pose Estimation With Staggered Alignment Self-Distillation
* Lightweight Text-Driven Image Editing With Disentangled Content and Attributes
* Lightweight Video-Based Respiration Rate Detection Algorithm: An Application Case on Intensive Care
* Lightweight Voice Spoofing Detection Using Improved One-Class Learning and Knowledge Distillation
* Limb-Aware Virtual Try-On Network With Progressive Clothing Warping
* Linker: Learning Long Short-term Associations for Robust Visual Tracking
* Live 360° Video Streaming to Heterogeneous Clients in 5G Networks
* Local Patch AutoAugment With Multi-Agent Collaboration
* Localized Linear Temporal Dynamics for Self-Supervised Skeleton Action Recognition
* Locate Before Answering: Answer Guided Question Localization for Video Question Answering
* Logit Variated Product Quantization Based on Parts Interaction and Metric Learning With Knowledge Distillation for Fine-Grained Image Retrieval
* LOIS: Looking Out of Instance Semantics for Visual Question Answering
* Long Dialogue Emotion Detection Based on Commonsense Knowledge Graph Guidance
* Low-Light Enhancement Method Based on a Retinex Model for Structure Preservation
* Low-Light Image Enhancement With SAM-Based Structure Priors and Guidance
* Low-Rank Correlation Learning for Unsupervised Domain Adaptation
* Low-Rate Feature Compression for Collaborative Intelligence: Reducing Redundancy in Spatial and Statistical Levels
* M2FNet: Mask-Guided Multi-Level Fusion for RGB-T Pedestrian Detection
* M3ANet: Multi-Modal and Multi-Attention Fusion Network for Ship License Plate Recognition
* MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval
* Manifold-Based Incomplete Multi-View Clustering via Bi-Consistency Guidance
* MAR: Masked Autoencoders for Efficient Action Recognition
* MCDAN: A Multi-Scale Context-Enhanced Dynamic Attention Network for Diffusion Prediction
* MCS-GAN: A Different Understanding for Generalization of Deep Forgery Detection
* Memory-Based Augmentation Network for Video Captioning
* Meta Noise Adaption Framework for Multimodal Sentiment Analysis With Feature Noise
* MFFNet: Multi-Modal Feature Fusion Network for V-D-T Salient Object Detection
* MFNet: Real-Time Motion Focus Network for Video Frame Interpolation
* MHRN: A Multimodal Hierarchical Reasoning Network for Topic Detection
* Mining Semantic Information With Dual Relation Graph Network for Multi-Label Image Classification
* MMGInpainting: Multi-Modality Guided Image Inpainting Based on Diffusion Models
* MobiRFPose: Portable RF-Based 3D Human Pose Camera
* Model-Guided Generative Adversarial Networks for Unsupervised Fine-Grained Image Generation
* Modeling Inner- and Cross-Task Contrastive Relations for Continual Image Classification
* Modeling Multiple Aesthetic Views for Series Photo Selection
* Modeling Subject Scoring Behaviors in Subjective Experiments Based on a Discrete Quality Scale
* MorphNeRF: Text-Guided 3D-Aware Editing via Morphing Generative Neural Radiance Fields
* MosaicMVS: Mosaic-Based Omnidirectional Multi-View Stereo for Indoor Scenes
* Motion Deblur by Learning Residual From Events
* Motion Distillation Framework for Video Frame Interpolation, A
* MPCT: Multiscale Point Cloud Transformer With a Residual Network
* MsgFusion: Medical Semantic Guided Two-Branch Network for Multimodal Brain Image Fusion
* MSViT: Training Multiscale Vision Transformers for Image Retrieval
* MuJo-SF: Multimodal Joint Slot Filling for Attribute Value Prediction of E-Commerce Commodities
* Multi-Domain Adaptation for Motion Deblurring
* Multi-Facet Weighted Asymmetric Multi-Modal Hashing Based on Latent Semantic Distribution
* Multi-Granularity Matching Transformer for Text-Based Person Search
* Multi-Label Continual Learning Using Augmented Graph Convolutional Network
* Multi-Layer Decoupling Attention Network for Weakly Supervised Object Localization
* Multi-Level Label Correction by Distilling Proximate Patterns for Semi-Supervised Semantic Segmentation
* Multi-Level Objective Alignment Transformer for Fine-Grained Oral Panoramic X-Ray Report Generation
* Multi-Level Pixel-Wise Correspondence Learning for 6DoF Face Pose Estimation
* Multi-Level Transitional Contrast Learning for Personalized Image Aesthetics Assessment
* Multi-Modal Structure-Embedding Graph Transformer for Visual Commonsense Reasoning
* Multi-Scale Contourlet Knowledge Guide Learning Segmentation
* Multi-Scale Spatiotemporal Feature Fusion Network for Video Saliency Prediction
* Multi-Semantics Aggregation Network Based on the Dynamic-Attention Mechanism for 3D Human Motion Prediction
* Multi-Sentence Complementarily Generation for Text-to-Image Synthesis
* Multi-Source and Multi-Target Domain Adaptation Based on Dynamic Generator with Attention
* Multi-Source Style Transfer via Style Disentanglement Network
* Multi-Space Point Geometry Compression With Progressive Relation-Aware Transformer
* Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement
* Multi-Task Paired Masking With Alignment Modeling for Medical Vision-Language Pre-Training
* Multi-Vehicle Multi-Camera Tracking With Graph-Based Tracklet Features
* Multi-View MERA Subspace Clustering
* Multimodal Boosting: Addressing Noisy Modalities and Identifying Modality Contribution
* Multimodal Progressive Modulation Network for Micro-Video Multi-Label Classification
* Multimodal Reaction: Information Modulation for Cross-Modal Representation Learning
* Multimodality Self-distillation for Fast Inference of Vision and Language Pretrained Models
* Multiscale Cross-Modal Homogeneity Enhancement and Confidence-Aware Fusion for Multispectral Pedestrian Detection
* Music-Driven Choreography Based on Music Feature Clusters and Dynamic Programming
* Muti-Modal Emotion Recognition via Hierarchical Knowledge Distillation
* Mutual Distillation Learning for Person Re-Identification
* Mutual Dual-Task Generator With Adaptive Attention Fusion for Image Inpainting
* Mutual Filter Teaching for Open-Set Semi-Supervised Learning
* Mutually Textual and Visual Refinement Network for Image-Text Matching, A
* Narrowing Domain Gaps With Bridging Samples for Generalized Face Forgery Detection
* Natural and Adversarial Bokeh Rendering via Circle-of-Confusion Predictive Network
* NDELS: A Novel Approach for Nighttime Dehazing, Low-Light Enhancement, and Light Suppression
* Near-Lossless Compression of Point Cloud Attribute Using Quantization Parameter Cascading and Rate-Distortion Optimization
* Negative Label and Noise Information Guided Disambiguation for Partial Multi-Label Learning
* Negative-Driven Training Pipeline for Siamese Visual Tracking
* Negative-Sensitive Framework With Semantic Enhancement for Composed Image Retrieval
* Neighborhood-Aware Mutual Information Maximization for Source-Free Domain Adaptation
* Neural Logic Vision Language Explainer
* New Data Augmentation Method Based on Mixup and Dempster-Shafer Theory, A
* NiteDR: Nighttime Image De-Raining With Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes
* Noise Imitation Based Adversarial Training for Robust Multimodal Sentiment Analysis
* Noise-Tolerant Learning for Audio-Visual Action Recognition
* Non-Maximum Suppression Guided Label Assignment for Object Detection in Crowd Scenes
* Non-Orthogonal Multiple Access Enhanced Scalable 360-Degree Video Multicast
* Non-Subsampled Contourlet Transform and Ground-Truth Score Generation Based Quality Assessment for DIBR-Synthesized Views
* OARNet: Object-Attribute-Relation Network for Predicting Soccer Events
* Object-Preserving Siamese Network for Single-Object Tracking on Point Clouds
* Occlusion-Aware Feature Recover Model for Occluded Person Re-Identification
* OFPF-MEF: An Optical Flow Guided Dynamic Multi-Exposure Image Fusion Network With Progressive Frequencies Learning
* Omnidirectional Video Super-Resolution Using Deep Learning
* One-pass View-unaligned Clustering
* One-Stream Vision-Language Memory Network for Object Tracking
* Online Handwritten Chinese Character Recognition Based on 1-D Convolution and Two-Streams Transformers
* Online Low-Light Sand-Dust Video Enhancement Using Adaptive Dynamic Brightness Correction and a Rolling Guidance Filter
* Online Video Sparse Noise Removing via Nonlocal Robust PCA
* Opinion-Unaware Blind Image Quality Assessment Using Multi-Scale Deep Feature Statistics
* Orientation-Aware Pedestrian Attribute Recognition Based on Graph Convolution Network
* PanorAMS: Automatic Annotation for Detecting Objects in Urban Context
* Parameter-Efficient and Student-Friendly Knowledge Distillation
* Part-Aware Correlation Networks for Few-Shot Learning
* Partial-Tuning Based Mixed-Modal Prototypes for Few-Shot Classification
* PCL: Point Contrast and Labeling for Weakly Supervised Point Cloud Semantic Segmentation
* Pedestrian Trajectory Prediction Based on Social Interactions Learning With Random Weights
* Perception-and-Cognition-Inspired Quality Assessment for Sonar Image Super-Resolution
* Perception-Driven Deep Underwater Image Enhancement Without Paired Supervision
* Perceptual Decoupling With Heterogeneous Auxiliary Tasks for Joint Low-Light Image Enhancement and Deblurring
* Perceptual Image Hashing Using Feature Fusion of Orthogonal Moments
* Perceptual Quality Analysis in Deep Domains Using Structure Separation and High-Order Moments
* Perceptual Quality Assessment of Face Video Compression: A Benchmark and An Effective Method
* Perceptual Quality Assessment of Retouched Face Images
* Perceptual Quality Improvement in Videoconferencing Using Keyframes-Based GAN
* Personalized Representation With Contrastive Loss for Recommendation Systems
* PersonMAE: Person Re-Identification Pre-Training With Masked AutoEncoders
* PG-VTON: A Novel Image-Based Virtual Try-On Method via Progressive Inference Paradigm
* PGCN: Pyramidal Graph Convolutional Network for EEG Emotion Recognition
* PhotoStyle60: A Photographic Style Dataset for Photo Authorship Attribution and Photographic Style Transfer
* Pixel Distribution Remapping and Multi-Prior Retinex Variational Model for Underwater Image Enhancement, A
* PMSNet: Parallel Multi-Scale Network for Accurate Low-Light Light-Field Image Enhancement
* Point Clouds are Specialized Images: A Knowledge Transfer Approach for 3D Understanding
* Point-LGMask: Local and Global Contexts Embedding for Point Cloud Pre-Training With Multi-Ratio Masking
* PointGL: A Simple Global-Local Framework for Efficient Point Cloud Analysis
* PointGT: A Method for Point-Cloud Classification and Segmentation Based on Local Geometric Transformation
* Polarimetric Inverse Rendering for Transparent Shapes Reconstruction
* Pose-Guided Attention Learning for Cloth-Changing Person Re-Identification
* Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network
* Post-Distillation via Neural Resuscitation
* PPM-SEM: A Privacy-Preserving Mechanism for Sharing Electronic Patient Records and Medical Images in Telemedicine
* Predicting Radiologists' Gaze With Computational Saliency Models in Mammogram Reading
* Print-Camera Resistant Image Watermarking With Deep Noise Simulation and Constrained Learning
* Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image Restoration, A
* Prior-Aware Cross Modality Augmentation Learning for Continuous Sign Language Recognition
* PriorNet: Two Deep Prior Cues for Salient Object Detection
* Privileged Modality Learning via Multimodal Hallucination
* Progressive Bidirectional Feature Extraction and Enhancement Network for Quality Evaluation of Night-Time Images
* Progressive Channel-Shrinking Network
* Progressive Diversity Generation for Single Domain Generalization
* Progressive Fourier Adversarial Domain Adaptation for Object Classification and Retrieval
* Progressive Graph Reasoning-Based Social Relation Recognition
* Progressive Learning Model for Big Data Analysis Using Subnetwork and Moore-Penrose Inverse
* Progressive Negative Enhancing Contrastive Learning for Image Dehazing and Beyond
* Progressive Placeholder Learning Network for Multimodal Zero-Shot Learning, A
* Progressive Similarity Preservation Learning for Deep Scalable Product Quantization
* Progressive Source-Aware Transformer for Generalized Source-Free Domain Adaptation
* Progressive Stereo Image Dehazing Network via Cross-View Region Interaction
* Prompt Guided Transformer for Multi-Task Dense Prediction
* Prompt-Based Learning for Unpaired Image Captioning
* Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation
* Provably Secure Robust Image Steganography
* Pseudo Label Fusion With Uncertainty Estimation for Semi-Supervised Cropping Box Regression
* Pseudo Light Field Image and 4D Wavelet-Transform-Based Reduced-Reference Light Field Image Quality Assessment
* Pyramid Fusion Transformer for Semantic Segmentation
* QoE Physiological Measure of VR With Vibrotactile Feedback Based on Frontal Lobe Power Asymmetry, A
* QSAM-Net: Rain Streak Removal by Quaternion Neural Network With Self-Attention Module
* Quad-Tree Structure-Preserving Adaptive Steganography for HEVC
* Quality Assessment for Stitched Panoramic Images via Patch Registration and Bidimensional Feature Aggregation
* Quality Assessment of Tone-Mapped Images Using Fundamental Color and Structural Features
* Query-Guided Prototype Evolution Network for Few-Shot Segmentation
* RaFPN: Relation-Aware Feature Pyramid Network for Dense Image Prediction
* Rate-Adaptive Neural Network for Image Compressive Sensing
* Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network
* Realistic Depth Image Synthesis for 3D Hand Pose Estimation
* Reality3DSketch: Rapid 3D Modeling of Objects From Single Freehand Sketches
* Reason Generation for Point of Interest Recommendation Via a Hierarchical Attention-Based Transformer Model
* Reciprocal Teacher-Student Learning via Forward and Feedback Knowledge Distillation
* Reconstructed Graph Constrained Auto-Encoders for Multi-View Representation Learning
* Recovering Generalization via Pre-Training-Like Knowledge Distillation for Out-of-Distribution Visual Question Answering
* Recurrent Affine Transformation for Text-to-Image Synthesis
* Refining Uncertain Features With Self-Distillation for Face Recognition and Person Re-Identification
* Reflection Intensity Guided Single Image Reflection Removal and Transmission Recovery
* Region-Aware Portrait Retouching With Sparse Interactive Guidance
* Reinforcement Learning Based Markov Edge Decoupled Fusion Network for Fusion Classification of Hyperspectral and LiDAR
* Relation-Aware Distribution Representation Network for Person Clustering With Multiple Modalities
* Relation-Aware Weight Sharing in Decoupling Feature Learning Network for UAV RGB-Infrared Vehicle Re-Identification
* Relation-Preserving Feature Embedding for Unsupervised Person Re-Identification
* Relational Experience Replay: Continual Learning by Adaptively Tuning Task-Wise Relationship
* Relational Network via Cascade CRF for Video Language Grounding
* Representation Learning Meets Optimization-Derived Networks: From Single-View to Multi-View
* Resolving Zero-Shot and Fact-Based Visual Question Answering via Enhanced Fact Retrieval
* ReSParser: Fully Convolutional Multiple Human Parsing With Representative Sets
* Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer Based Approach
* Rethinking Few-Shot Class-Incremental Learning With Open-Set Hypothesis in Hyperbolic Geometry
* Rethinking Graph Contrastive Learning: An Efficient Single-View Approach via Instance Discrimination
* Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation
* Reversible Data Hiding for Encrypted 3D Mesh Models With Secret Sharing Over Galois Field
* Reversible Data Hiding in Encrypted Images Using Global Compression of Zero-Valued High Bit-Planes and Block Rearrangement
* Reversible Data Hiding in Encrypted Images With Adaptive Huffman Code Based on Dynamic Prediction Axes
* Reversible Data Hiding in Encrypted Images With Asymmetric Coding and Bit-Plane Block Compression
* Reversible Data Hiding-Based Contrast Enhancement With Multi-Group Stretching for ROI of Medical Image
* RISTRA: Recursive Image Super-Resolution Transformer With Relativistic Assessment
* Robust Adaptive Steganography Based on Adaptive STC-ECC
* Robust Coverless Video Steganography Based on the Similarity of Inter-Frames, A
* Robust Feature Matching via Graph Neighborhood Motion Consensus
* Robust Geometry-Dependent Attack for 3D Point Clouds
* Robust Image Classification With Noisy Labels by Negative Learning and Feature Space Renormalization
* Robust Multi-Model Visual Tracking With Distractor-Aware Template-Coupled Correlation Filters Joint Learning
* Robust Saliency-Aware Distillation for Few-Shot Fine-Grained Visual Recognition
* Robust Secret Image Sharing Resistant to JPEG Recompression Based on Stable Block Condition
* Robust Tensor Recovery for Incomplete Multi-View Clustering
* Robust Tracking via Bidirectional Transduction With Mask Information
* RPM: RF-Based Pose Machines
* rPPG-MAE: Self-Supervised Pretraining With Masked Autoencoders for Remote Physiological Measurements
* RUN: Rethinking the UNet Architecture for Efficient Image Restoration
* Runge-Kutta Guided Feature Augmentation for Few-Sample Learning
* SADCMF: Self-Attentive Deep Consistent Matrix Factorization for Micro-Video Multi-Label Classification
* Sample Weighting with Hierarchical Equalization Loss for Dense Object Detection
* Say No to Redundant Information: Unsupervised Redundant Feature Elimination for Active Learning
* Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal Retrieval
* SCFANet: Semantics and Context Feature Aggregation Network for 360° Salient Object Detection
* Screen-Shooting Resistant Watermarking With Grayscale Deviation Simulation
* SCSP: An Unsupervised Image-to-Image Translation Network Based on Semantic Cooperative Shape Perception
* SD-NeRF: Towards Lifelike Talking Head Animation via Spatially-Adaptive Dual-Driven NeRFs
* SDPDet: Learning Scale-Separated Dynamic Proposals for End-to-End Drone-View Detection
* SeIF: Semantic-Constrained Deep Implicit Function for Single-Image 3D Head Reconstruction
* Self-Labeling Framework for Open-Set Domain Adaptation With Few Labeled Samples
* Self-Mining the Confident Prototypes for Source-Free Unsupervised Domain Adaptation in Image Segmentation
* Self-Paced Relational Contrastive Hashing for Large-Scale Image Retrieval
* Self-Similarity Prior Distillation for Unsupervised Remote Physiological Measurement
* Self-Supervised Generative-Contrastive Learning of Multi-Modal Euclidean Input for 3D Shape Latent Representations: A Dynamic Switching Approach
* Self-Supervised Intra-Modal and Cross-Modal Contrastive Learning for Point Cloud Understanding
* Self-Supervised Temporal Sensitive Hashing for Video Retrieval
* Self-Supervised Video Representation Learning by Serial Restoration With Elastic Complexity
* Self-Weighted Contrastive Fusion for Deep Multi-View Clustering
* Semantic Distance Adversarial Learning for Text-to-Image Synthesis
* Semantic Image Segmentation by Dynamic Discriminative Prototypes
* Semantic-Enhanced Proxy-Guided Hashing for Long-Tailed Image Retrieval
* Semi-Supervised Adversarial Learning for Attribute-Aware Photo Aesthetic Assessment
* Semi-Supervised Domain Adaptation for Major Depressive Disorder Detection
* Semi-Supervised Domain Adaptation via Joint Transductive and Inductive Subspace Learning
* Semi-Supervised Learning of Perceptual Video Quality by Generating Consistent Pairwise Pseudo-Ranks
* Semi-Supervised Medical Report Generation via Graph-Guided Hybrid Feature Consistency
* Semi-Supervised Single-Image Dehazing Network via Disentangled Meta-Knowledge
* Semi-Supervised Underexposed Image Enhancement Network With Supervised Context Attention and Multi-Exposure Fusion, A
* Separable Reversible Data Hiding for Encrypted 3D Mesh Models Based on Octree Subdivision and Multi-MSB Prediction
* SGDM: An Adaptive Style-Guided Diffusion Model for Personalized Text to Image Generation
* SGIR: Star Graph-Based Interaction for Efficient and Robust Multimodal Representation
* SGSR-Net: Structure Semantics Guided LiDAR Super-Resolution Network for Indoor LiDAR SLAM
* SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification
* Shared Coupling-Bridge Scheme for Weakly Supervised Local Feature Learning
* Show Me a Video: A Large-Scale Narrated Video Dataset for Coherent Story Illustration
* Sign Language Recognition Framework Based on Cross-Modal Complementary Information Fusion, A
* Similarity- and Quality-Guided Relation Learning for Joint Detection and Tracking
* Single-Shot and Multi-Shot Feature Learning for Multi-Object Tracking
* Size Invariant Visual Cryptography Schemes With Evolving Threshold Access Structures
* Skeleton-Based Gesture Recognition With Learnable Paths and Signature Features
* SmartSit: Sitting Posture Recognition Through Acoustic Sensing on Smartphones
* Snippet-to-Prototype Contrastive Consensus Network for Weakly Supervised Temporal Action Localization
* Soft Weight Pruning for Cross-Domain Few-Shot Learning With Unlabeled Target Data
* Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation
* SP-Det: Leveraging Saliency Prediction for Voxel-Based 3D Object Detection in Sparse Point Cloud
* SPACE: Self-Supervised Dual Preference Enhancing Network for Multimodal Recommendation
* Spatial-Temporal Inter-Layer Reference Frame Generation Network for Spatial SHVC
* Spatiotemporal Orthogonal Projection Capsule Network for Incremental Few-Shot Action Recognition
* Spectrum-Driven Mixed-Frequency Network for Hyperspectral Salient Object Detection
* Split Computing With Scalable Feature Compression for Visual Analytics on the Edge
* SPMHand: Segmentation-Guided Progressive Multi-Path 3D Hand Pose and Shape Estimation
* Spreading Mosaic: An Image Restoration-Inspired Social Rumor Propagation Model
* SSPNet: Predicting Visual Saliency Shifts
* SSRR: Structural Semantic Representation Reconstruction for Visible-Infrared Person Re-Identification
* StableSwap: Stable Face Swapping in a Shared and Controllable Latent Space
* STAT: Multi-Object Tracking Based on Spatio-Temporal Topological Constraints
* Stealthy Physical Masked Face Recognition Attack via Adversarial Style Optimization
* Stereo Superpixel Segmentation via Decoupled Dynamic Spatial-Embedding Fusion Network
* STFE: A Comprehensive Video-Based Person Re-Identification Network Based on Spatio-Temporal Feature Enhancement
* Stitched Wide Field of View Light Field Image Quality Assessment: Benchmark Database and Objective Metric
* Structure Aware Multi-Graph Network for Multi-Modal Emotion Recognition in Conversations
* Structure Similarity Preservation Learning for Asymmetric Image Retrieval
* Structure-Preserving and Illumination-Consistent Cycle Framework for Image Harmonization, A
* Style Interleaved Learning for Generalizable Person Re-Identification
* Style-Agnostic Representation Learning for Visible-Infrared Person Re-Identification
* Subjective Media Quality Recovery From Noisy Raw Opinion Scores: A Non-Parametric Perspective
* Successor Feature-Based Transfer Reinforcement Learning for Video Rate Adaptation With Heterogeneous QoE Preferences
* Support Vector Regression-Based Reduced- Reference Perceptual Quality Model for Compressed Point Clouds
* Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension, A
* SVGC-AVA: 360-Degree Video Saliency Prediction With Spherical Vector-Based Graph Convolution and Audio-Visual Attention
* SVT-AVS3: An Open-Source High-Performance AVS3 Encoder With Scalable Video Technology
* Symbolic Music Generation From Graph-Learning-Based Preference Modeling and Textual Queries
* Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object Detection
* SYRER: Synergistic Relational Reasoning for RGB-D Cross-Modal Re-Identification
* TA2V: Text-Audio Guided Video Generation
* Taking a Closer Look at Factor Disentanglement: Dual-Path Variational Autoencoder Learning for Domain Generalization
* Taking a Closer Look At Visual Relation: Unbiased Video Scene Graph Generation With Decoupled Label Learning
* Temporal Action Proposal Generation With Action Frequency Adaptive Network
* Temporal Decoupling Graph Convolutional Network for Skeleton-Based Gesture Recognition
* Temporally Language Grounding With Multi-Modal Multi-Prompt Tuning
* Tensor Low-Rank Graph Embedding and Learning for One-Step Incomplete Multi-View Clustering
* Tensorized Scaled Simplex Representation for Multi-View Clustering
* Test-Time Model Adaptation for Visual Question Answering With Debiased Self-Supervisions
* Text-Guided Eyeglasses Manipulation With Spatial Constraints
* Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network
* TextAdapter: Self-Supervised Domain Adaptation for Cross-Domain Text Recognition
* Textual Enhanced Adaptive Meta-Fusion for Few-Shot Visual Recognition
* TFRNet: Semantic Segmentation Network with Token Filtration and Refinement Method
* TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation
* TIF: Threshold Interception and Fusion for Compact and Fine-Grained Visual Attribution
* Toward Efficient Video Compression Artifact Detection and Removal: A Benchmark Dataset
* Toward General Cross-Modal Signal Reconstruction for Robotic Teleoperation
* Toward Interactive Image Inpainting via Robust Sketch Refinement
* Towards 3D Colored Mesh Saliency: Database and Benchmarks
* Towards a Complete and Detail-Preserved Salient Object Detection
* Towards Adaptive Multi-Scale Intermediate Domain via Progressive Training for Unsupervised Domain Adaptation
* Towards Automated Infographic Authoring From Natural Language Statement With Multiple Proportional Facts
* Towards Continual Egocentric Activity Recognition: A Multi-Modal Egocentric Activity Dataset for Continual Learning
* Towards Discriminative Feature Generation for Generalized Zero-Shot Learning
* Towards Effective Collaborative Learning in Long-Tailed Recognition
* Towards Fast and Accurate Image-Text Retrieval With Self-Supervised Fine-Grained Alignment
* Towards High-Quality Photorealistic Image Style Transfer
* Towards Robust Person Re-Identification by Adversarial Training With Dynamic Attack Strategy
* Towards Specific Domain Prompt Learning via Improved Text Label Optimization
* Towards Temporal Event Detection: A Dataset, Benchmarks and Challenges
* TPE-ADE: Thumbnail-Preserving Encryption Based on Adaptive Deviation Embedding for JPEG Images
* Transformer Fusion and Pixel-Level Contrastive Learning for RGB-D Salient Object Detection
* Transformer-Based High-Fidelity Facial Displacement Completion for Detailed 3D Face Reconstruction
* Tri-Level Modality-Information Disentanglement for Visible-Infrared Person Re-Identification
* Triple Consistency for Transparent Cheating Problem in Light Field Depth Estimation
* Trunk Pruning: Highly Compatible Channel Pruning for Convolutional Neural Networks Without Fine-Tuning
* Trusted Semi-Supervised Multi-View Classification with Contrastive Learning
* TTS: Hilbert Transform-Based Generative Adversarial Network for Tattoo and Scene Text Spotting
* Two-Stage Personalized Virtual Try-On Framework With Shape Control and Texture Guidance, A
* Two-Stage Watermark Removal Framework for Spread Spectrum Watermarking
* Two-Step Discrete Hashing for Cross-Modal Retrieval
* Two-Stream Hybrid Convolution-Transformer Network Architecture for Clothing-Change Person Re-Identification, A
* U2D2Net: Unsupervised Unified Image Dehazing and Denoising Network for Single Hazy Image Enhancement
* UCM-Net: A U-Net-Like Tampered-Region-Related Framework for Copy-Move Forgery Detection
* UEDG:Uncertainty-Edge Dual Guided Camouflage Object Detection
* UIERL: Internal-External Representation Learning Network for Underwater Image Enhancement
* UIQI: A Comprehensive Quality Evaluation Index for Underwater Images
* Unbiased Visual Question Answering by Leveraging Instrumental Variable
* Uncertain Facial Expression Recognition via Multi-Task Assisted Correction
* Uncertainty-Aware Deep Video Compression With Ensembles
* Underwater Color Correction Network With Knowledge Transfer
* Underwater Image Quality Assessment: Benchmark Database and Objective Method
* UniDCP: Unifying Multiple Medical Vision-Language Tasks via Dynamic Cross-Modal Learnable Prompts
* Unified Audio-Visual Saliency Model for Omnidirectional Videos With Spatial Audio
* Unified Open-Vocabulary Dense Visual Prediction
* Unified Transformer Framework for Group-Based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection, A
* UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequences
* Unit Correlation With Interactive Feature for Robust and Effective Tracking
* UniTR: A Unified TRansformer-Based Framework for Co-Object and Multi-Modal Saliency Detection
* Unleashing Knowledge Potential of Source Hypothesis for Source-Free Domain Adaptation
* Unsupervised Domain Adaptation via Risk-Consistent Estimators
* Unsupervised Dual Hashing Coding (UDC) on Semantic Tagging and Sample Content for Cross-Modal Retrieval
* Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss
* Unsupervised Point Cloud Co-Part Segmentation via Co-Attended Superpoint Generation and Aggregation
* Upward Robust Steganography Based on Overflow Alleviation
* URCDC-Depth: Uncertainty Rectified Cross-Distillation With CutFlip for Monocular Depth Estimation
* USD: Uncertainty-Based One-Phase Learning to Enhance Pseudo-Label Reliability for Semi-Supervised Object Detection
* Utilizing Greedy Nature for Multimodal Conditional Image Synthesis in Transformers
* UTLNet: Uncertainty-Aware Transformer Localization Network for RGB-Depth Mirror Segmentation
* Variational Mixture of Stochastic Experts Auto-Encoder for Multi-Modal Recommendation
* Variational Neuron Shifting for Few-Shot Image Classification Across Domains
* Video Compressed Sensing Reconstruction via an Untrained Network with Low-Rank Regularization
* Video Compression Artifacts Removal With Spatial-Temporal Attention-Guided Enhancement
* Video Demoiréing With Deep Temporal Color Embedding and Video-Image Invertible Consistency
* Video Frame Interpolation With Stereo Event and Intensity Cameras
* Video Violence Rating: A Large-Scale Public Database and A Multimodal Rating Model
* VideoXum: Cross-Modal Visual and Textural Summarization of Videos
* VirPNet: A Multimodal Virtual Point Generation Network for 3D Object Detection
* Vision-and-Language Navigation via Latent Semantic Alignment Learning
* Visual and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond
* Visual Correspondence Learning and Spatially Attentive Synthesis via Transformer for Exemplar-Based Anime Line Art Colorization
* Visual Object Tracking With Mutual Affinity Aligned to Human Intuition
* Visual-Textual Attribute Learning for Class-Incremental Facial Expression Recognition
* Visualization Comparison of Vision Transformers and Convolutional Neural Networks
* ViTA: Video Transformer Adaptor for Robust Video Depth Estimation
* VMemNet: A Deep Collaborative Spatial-Temporal Network With Attention Representation for Video Memorability Prediction
* VOLTER: Visual Collaboration and Dual-Stream Fusion for Scene Text Recognition
* VVC Intra Rate Control With Small Bit Fluctuations Using a Lagrange Multiplier Adjustment, A
* VWP: An Efficient DRL-Based Autonomous Driving Model
* Wasserstein Embedding Learning for Deep Clustering: A Generative Approach
* Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation
* WaveDM: Wavelet-Based Diffusion Models for Image Restoration
* Weighted Graph-Structured Semantics Constraint Network for Cross-Modal Retrieval
* When Channel Correlation Meets Sparse Prior: Keeping Interpretability in Image Compressive Sensing
* Where Does the Devil Lie?: Multimodal Multitask Collaborative Revision Network for Trusted Road Segmentation
* Width-Adaptive CNN: Fast CU Partition Prediction for VVC Screen Content Coding
* Zero-Shot Single-View Point Cloud Reconstruction via Cross-Category Knowledge Transferring
* Zero-Shot Video Moment Retrieval With Angular Reconstructive Text Embeddings
817 for MultMed(26)
MultMed(4)
* Image-Based Virtual World Generation
* Virtualized Reality: Constructing Virtual Worlds From Real Scenes
* Visually Searching the Web for Content
MultMed(5)
* Classifying color edges in video into shadow-geometry, highlight, or material transitions
* Similarity Retrieval of Trademark Images
MultMed(6)
* Isolated regions in video coding
MultMed(7)
* Detection and Representation of Scenes in Videos
MultMed(9)
* 3-D Head Model Retrieval Using a Single Face View Query
* Active Rearranged Capturing of Image-Based Rendering Scenes: Theory and Practice
* Adaptive Media-Aware Retransmission Timeout Estimation Method for Low-Delay Packet Video, An
* Adding Semantics to Detectors for Video Retrieval
* Audio-Visual Affect Recognition
* Audio-Visual Event Recognition in Surveillance Video Sequences
* Automatic Meeting Segmentation Using Dynamic Bayesian Networks
* Automatically-Determined Region of Interest in JPEG 2000
* Bayesian Approach for Morphology-Based 2-D Human Motion Capture
* Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News
* Combination of Warping Robust Elastic Graph Matching and Kernel-Based Projection Discriminant Analysis for Face Recognition
* Comments on An SVD-Based Watermarking Scheme for Protecting Rightful Ownership
* Content-Based Copy Retrieval Using Distortion-Based Probabilistic Similarity Search
* Content-Based Image Retrieval by Feature Adaptation and Relevance Feedback
* Content-Based Retrieval of 3-D Objects Using Spin Image Signatures
* Delay-Distortion Optimization for Content-Adaptive Video Streaming
* Digital Image Tracing by Sequential Multiple Watermarking
* Discrete Wavelet Transform on Consumer-Level Graphics Hardware
* Edge Potential Functions (EPF) and Genetic Algorithms (GA) for Edge-Based Matching of Visual Objects
* Efficient Mode Decision Algorithm for H.264/AVC Encoding Optimization, An
* Efficient Short Video Repeat Identification With Application to News Video Structure Analysis
* Encoding of Affine Motion Vectors
* End-to-End Embedded Approach for Multicast/Broadcast of Scalable Video over Multiuser CDMA Wireless Networks, An
* Enhanced Eigen-Audioframes for Audiovisual Scene Change Detection
* Face Modeling and Animation Language for MPEG-4 XMT Framework
* Generic Framework for Efficient 2-D and 3-D Facial Expression Analogy, A
* Head-Size Equalization for Improved Visual Perception in Video Conferencing
* Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video
* Hybrid Model to Detect Zero Quantized DCT Coefficients in H.264
* Image Collection Organization and Its Application to Indexing, Browsing, Summarization, and Semantic Retrieval
* Incorporating Concept Ontology for Hierarchical Video Classification, Annotation, and Visualization
* Joint Design of Source Rate Control and QoS-Aware Congestion Control for Video Streaming Over the Internet
* Learned Lexicon-Driven Paradigm for Interactive Video Retrieval, A
* Learning Personal Preference From Viewer's Operations for Browsing and Its Application to Baseball Video Retrieval and Summarization
* Lecture Video Enhancement and Editing by Integrating Posture, Gesture, and Text
* Major Cast Detection in Video Using Both Speaker and Face Information
* Modeling and Mining of Users' Capture Intention for Home Videos
* Modeling Human Judgment of Digital Imagery for Multimedia Retrieval
* Motion Flow-Based Video Retrieval
* Moving Cast Shadows Detection Using Ratio Edge
* Moving-Object Detection, Association, and Selection in Home Videos
* Multistreaming of 3-D Scenes With Optimized Transmission and Rendering Scalability
* Near-Duplicate Keyframe Identification With Interest Point Matching and Pattern Learning
* New Model-Based Digital Halftoning and Data Hiding Designed With LMS Optimization, A
* Novel 4-D Perceptual Quantization Modeling for H.264 Bit-Rate Control, A
* Novel Point-Oriented Inner Searches for Fast Block Motion Estimation
* On Transcoding a B-Frame to a P-Frame in the Compressed Domain
* Optimized Content-Aware Authentication Scheme for Streaming JPEG-2000 Images Over Lossy Networks, An
* Pattern-Based Data Hiding for Binary Image Authentication by Connectivity-Preserving
* Perceptual Temporal Quality Metric for Compressed Video
* Perceptually Optimized 3-D Transmission Over Wireless Networks
* Quad-Tree Motion Estimation in the Frequency Domain Using Gradient Correlation
* Real-Time Motion Trajectory-Based Indexing and Retrieval of Video Sequences
* Real-Time Whiteboard Capture and Processing Using a Video Camera for Remote Collaboration
* Robust Biometric Person Identification Using Automatic Classifier Fusion of Speech, Mouth, and Face Experts
* Rule Based Technique for Extraction of Visual Attention Regions Based on Real-Time Clustering, A
* Scalable, Wavelet-Based Video: From Server to Hardware-Accelerated Client
* Scene Parsing Using Region-Based Generative Models
* Scene-Change Aware Dynamic Bandwidth Allocation for Real-Time VBR Video Transmission Over IEEE 802.15.3 Wireless Home Networks
* Security and Robustness Enhancement for Image Data Hiding
* Semantic Image and Video Indexing in Broad Domains
* Shape Indexing and Recognition Based on Regional Analysis
* Spatiotemporal Visual Considerations for Video Coding
* Summarization of Visual Content in Instructional Videos
* Super-Resolution of Face Images Using Kernel PCA-Based Prior
* Target Tracking Using a Joint Acoustic Video System
* Two-Dimensional Channel Coding Scheme for MCTF-Based Scalable Video Coding
* Video Packet Selection and Scheduling for Multipath Streaming
* Video Segmentation via Temporal Pattern Classification
* Virtual Viewpoint Replay for a Soccer Match by View Interpolation From Multiple Cameras
* Visual Salience-Guided Mesh Decomposition
* Watermarked 3-D Mesh Quality Assessment
* Watermarking Digital 3-D Volumes in the Discrete Fourier Transform Domain
* Word-Level Parallel Architecture of JPEG 2000 Embedded Block Coding Decoder
74 for MultMed(9)
MultMedMag
* *IEEE MultiMedia Magazine
* MPEG-21 and Its Interoperability with Rights-Information Standards
* SignTutor: An Interactive System for Sign Language Tutoring
MultMedMag(12)
* MPEG Standard for Rich Media Services, An
* MPEG-A: Multimedia Application Formats
* VERL: An Ontology Framework for Representing and Annotating Video Events
* What's New with MPEG?
MultMedMag(15)
* Dynamic Video Transcoding in Mobile Environments
MultMedMag(16)
* Dynamic Pictorially Enriched Ontologies for Digital Video Libraries
* Ecosystem for Semantics, An
* Folk Song Retrieval System with a Gesture-Based Interface, A
* Hybrid Tagging and Browsing Approaches for Efficient Manual Image Annotation
* Learning Video Preferences Using Visual Features and Closed Captions
* Multimedia at Work: Harvesting Resources for Recording Concurrent Videoconferences
* Novel Approach to Steganography in High- Dynamic-Range Images, A
* Standards: The MPEG Open Access Application Format
8 for MultMedMag(16)
MultMedMag(17)
* AR-Immersive Cinema at the Aula Natura Visitors Center
* Archive and Preservation of Media Content Using MPEG-A
* Cross-Modal Approach to Cleansing Weakly Tagged Images, A
* Crowdsourcing What Is Where: Community-Contributed Photos as Volunteered Geographic Information
* Data-Driven Approaches to Community-Contributed Video Applications
* Hiding Multitone Watermarks in Halftone Images
* Intelligent Multimedia Presentation in Ubiquitous Multidevice Scenarios
* Keyframe-Based Video Summary Using Visual Attention Clues
* Landscaping Future Interaction: Special issue on Mobile and Ubiquitous Multimedia
* Local Wavelet Features for Statistical Object Classification and Localization
* Mobile Multimedia Technology to Aid Those with Alzheimer's Disease, A
* Mobility Management for Video Streaming on Heterogeneous Networks
* Modeling Media Synchronization with Semiotic Agents
* New Paradigm for Content Producers, A
* Optimal Rate Allocation for Video Transmission over Wireless Ad Hoc Networks
* Picture Context Capturing for Mobile Databases
* Platform for Context-Aware and Digital Rights Management-Enabled Content Adaptation, A
* Question Answering over Community-Contributed Web Videos
* Social Surroundings: Bridging the Virtual and Physical Divide
* System Concept for Socially Enriched Access to Soccer Video Collections, A
* Video Annotation and Retrieval Using Ontologies and Rule Learning
* Video in the Web: Technical Challenges and Standardization
* Visual Navigation for Mobile Devices
23 for MultMedMag(17)
MultMedMag(18)
* Augmenting Live Broadcast Sports with 3D Tracking Information
* Cluster-Based Landmark and Event Detection for Tagged Photo Collections
* Converting 2D Video to 3D: An Efficient Path to a 3D Experience
* Data-Hiding in Halftone Images Using Adaptive Noise-Balanced Error Diffusion
* Discovering the Thematic Object in Commercial Videos
* Enhancing Bag-of-Words Models with Semantics-Preserving Metric Learning
* Film Analysis of Archived Documentaries
* Implementation and Analysis of a Peer-to-Peer Retransmissions System for Live Video Services
* Large-Scale Multimedia Retrieval and Mining
* Mining Event Structures from Web Videos
* Mixed-Reality System for Broadcasting Sports Video to Mobile Devices, A
* Mobile Visual Search: Architectures, Technologies, and the Emerging MPEG Standard
* MPEG-DASH Standard for Multimedia Streaming Over the Internet, The
* Music Generation with Markov Models
* Naming People in News Videos with Label Propagation
* Online Video Recommendation through Tag-Cloud Aggregation
* Personalized Coverage of Large Athletic Events
* Preserving Wayang Kulit for Future Generations
* Real-Time Video Copy-Location Detection in Large-Scale Repositories
* Semantic Annotation Architecture for Accessible Multimedia Resources
* Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People
* Visual Content Identification and Search
* Visual Reranking: From Objectives to Strategies
* Visual Rhythm Detection and Its Applications in Interactive Multimedia
* Vocabulary Hierarchy Optimization and Transfer for Scalable Image Search
* Web-Scale Multimedia Analysis: Does Content Matter?
* Weighted Subspace Filtering and Ranking Algorithms for Video Concept Retrieval
* You Can Judge an Artist by an Album Cover: Using Images for Music Annotation
28 for MultMedMag(18)
MultMedMag(19)
* Anatomy of an Optical Biopsy Semantic Retrieval System, The
* Boosting, Sparsity- Constrained Bilinear Model for Object Recognition, A
* Building Reliable and Reusable Test Collections for Image Retrieval: The Wikipedia Task at ImageCLEF
* Collecting Large, Richly Annotated Facial-Expression Databases from Movies
* Combining Face and Eye Detectors in a High- Performance Face-Detection System
* Current Developments and Future Trends in Audio Authentication
* Digital Image Scrambling Using 2D Cellular Automata
* Efficient Image Copy Detection Using Multiscale Fingerprints
* Face Matching and Retrieval in Forensics Applications
* Finding Information in Multimedia Meeting Records
* Image Retrieval in Forensics: Tattoo Image Database Application
* Immersive Environment: An Emerging Future of Telecommunications
* Indexing Large Online Multimedia Repositories Using Semantic Expansion and Visual Analysis
* Microsoft Kinect Sensor and Its Effect
* Mobile Media in Action: Remote Target Localization and Tracking
* Posterity Logging of Face Imagery for Video Surveillance
* Profiling Online Auction Sellers Using Image-Editing Styles
* Real-Time Compressed- Domain Video Watermarking Resistance to Geometric Distortions
* Threefold Dataset for Activity and Workflow Recognition in Complex Industrial Environments, A
* Using Texture Analysis for Medical Diagnosis
* Where Is the User in Multimedia Retrieval?
21 for MultMedMag(19)
MultMedMag(20)
* 3D Imaging Techniques and Multimedia Applications: Guest editor's introduction
* Affect in Media: Embodied Media Interaction in Performance and Public Art
* Applications of Face Analysis and Modeling in Media Production
* Character Behavior Planning and Visual Simulation in Virtual 3D Space
* Classification and Analysis of 3D Teleimmersive Activities
* Depth Sensing for 3DTV: A Survey
* Immersive 3D Holoscopic Video System
* In-Kernel Relay for Scalable One-to-Many Streaming
* JPEG's JPSearch Standard: Harmonizing Image Management and Search
* Large Visual Repository Search with Hash Collision Design Optimization
* Large-Scale Image Phylogeny: Tracing Image Ancestral Relationships
* Large-Scale Near-Duplicate Web Video Retrieval: Challenges and Approaches
* Learning to Rerank Web Images
* MMT: An Emerging MPEG Standard for Multimedia Delivery over the Internet
* New Writing Experience: Finger Writing in the Air Using a Kinect Sensor, A
* Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching
* Scalable Media Coding Enabling Content-Aware Networking
* Scalable Mobile Video Retrieval with Sparse Projection Learning and Pseudo Label Mining
* Securing Multimedia Content Using Joint Compression and Encryption
* Software-Based Solution for Distributing and Displaying 3D UHD Films, A
* Standards-Based Architectures for Content Management
* Unified Access to Media Metadata on the Web
* Video Copy-Detection and Localization with a Scalable Cascading Framework
* Video Delivery Challenges and Opportunities in 4G Networks
* Viewport: A Distributed, Immersive Teleconferencing System with Infrared Dot Pattern
* Walking in Colors: Human Gait Recognition Using Kinect and CBIR
* Web-Scale Image Retrieval Using Compact Tensor Aggregation of Visual Descriptors
* Web-Scale Near-Duplicate Search: Techniques and Applications
28 for MultMedMag(20)
MultMedMag(21)
* Clustering Faces in Movies Using an Automatically Constructed Social Network
* Compact Descriptors for Visual Search
* Context-Adaptive Modeling for Wavelet-Domain Distributed Video Coding
* Critical Multimedia
* Efficient BOF Generation and Compression for On-Device Mobile Visual Location Recognition
* Fashion Analysis: Current Techniques and Future Directions
* Finding the Needle in the Image Stack: Performance Metrics for Big Data Image Analysis
* Future of Smart Photography, The
* Graph-Based Residence Location Inference for Social Media Users
* How Many Visual Concepts?
* Joint Video and Text Parsing for Understanding Events and Answering Queries
* Large-Scale Geosocial Multimedia
* Latent Subspace Projection Pursuit with Online Optimization for Robust Visual Tracking
* Local Stereo Matching with Improved Matching Cost and Disparity Refinement
* Memory-Efficient Image Databases for Mobile Visual Search
* Mobile Photo Recommendation and Logbook Generation Using Context-Tagged Images
* Multimedia Grand Challenge 2013, The
* Multimedia Semantic Retrieval Mobile System Based on HCFGs, A
* Multimodal Feature Fusion for 3D Shape Recognition and Retrieval
* Multimodal Spatio-Temporal Theme Modeling for Landmark Analysis
* New Paradigm for Querying Blobs in Vehicular Networks, A
* Next-Generation 3D Formats with Depth Map Support
* Objective Self
* Online Learning a High-Quality Dictionary and Classifier Jointly for Multitask Object Tracking
* Projected Residual Vector Quantization for ANN Search
* Real-Time Gaze Estimation with Online Calibration
* Scalable Extensions of HEVC for Ultra-High-Definition Video Delivery, The
* Self-Recognized Image Protection Technique that Resists Large-Scale Cropping
* Standardization of Biometric Template Protection
* Toward Experiential Mobile Media Processing
* Toward Haptic Cinematography: Enhancing Movie Experiences with Camera-Based Haptic Effects
* Toward Multiscreen Social TV with Geolocation-Aware Social Sense
* Training Quality-Aware Filters for No-Reference Image Quality Assessment
* User-Centric Media Retrieval Competition: The Video Browser Showdown 2012-2014, A
* View-Based 3D Object Retrieval: Challenges and Approaches
* Visions for Augmented Cultural Heritage Experience
36 for MultMedMag(21)
MultMedMag(22)
* Bidirectional Mesh-Based Frame Rate Up-Conversion
* CitySensing: Fusing City Data for Visual Storytelling
* Cross-Platform Social Event Detection
* Data-Driven Scene Understanding with Adaptively Retrieved Exemplars
* Designing an Interactive Audio Interface for Climate Science
* Effects of Auditory Feedback on Menu Selection in Hand-Gesture Interfaces
* Effects of Ecological Auditory Feedback on Rhythmic Walking Interaction, The
* Emerging Multimedia Research and Applications
* Emotional and Social Signals: A Neglected Frontier in Multimedia Computing?
* Experiments with Distributed Theatre
* Green Metadata Standard for Energy-Efficient Video Consumption, The
* Integrating Multimedia into Autism Intervention
* Interactive Sonification in Rowing: Acoustic Feedback for On-Water Training
* Interleaved Time Bases in Hypermedia Synchronization
* Let's Share a Story: Socially Enhanced Multimedia Storytelling
* Let's Weave the Visual Web
* Machine Intelligence Approach to Virtual Ballet Training, A
* Manipulating Ultra-High Definition Video Traffic
* Multimedia Big Data
* Multimedia Big Data Computing
* Multimedia Search: From Relevance to Usefulness
* Novel Markov Logic Rule Induction Strategy for Characterizing Sports Video Footage, A
* Optimizing the Perceptual Quality of Real-Time Multimedia Applications
* Photos to Remember, Photos to Forget
* Saliency-Guided Deep Framework for Image Quality Assessment
* Social Multimedia and Storytelling
* Sonic Trampoline: How Audio Feedback Impacts the User's Experience of Jumping
* Sonification of Surface Tapping Changes Behavior, Surface Perception, and Emotion
* Survey of Current YouTube Video Characteristics, A
* Syncing Shared Multimedia through Audiovisual Bimodal Segmentation
* Teaching Privacy: Multimedia Making a Difference
* Variable Markov Oracle: Algorithms for Human Gesture Applications, The
* Viewpoint Sequence Recommendation Based on Contextual Information for Multiview Video
* Wearable Auditory Biofeedback Device for Blind and Sighted Individuals
34 for MultMedMag(22)
MultMedMag(23)
* Collaborative Sparse Coding for Multiview Action Recognition
* Computational Modeling of Affective Qualities of Abstract Paintings
* Example-Based Image Textural Style Transfer
* Expressive Modulation of Neutral Visual Speech
* Extended Guided Filtering for Depth Map Upsampling
* Eye-Controlled Interfaces for Multimedia Interaction
* Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional, and Quality Clues
* Fusing Incomplete Multisensor Heterogeneous Data to Estimate Urban Traffic
* Guest editors' introduction: Perception, Aesthetics, and Emotion in Multimedia Quality Modeling
* Image Encryption Algorithm Based on Autoblocking and Electrocardiography, An
* JPEG Pleno: Toward an Efficient Representation of Visual Reality
* JPEG XT: A New Family of JPEG Backward-Compatible Standards
* Multimedia Hashing and Networking
* Multimedia Memory Cues for Augmenting Human Memory
* Multimodal Ensemble Fusion for Disambiguation and Retrieval
* Nonlocal In-Loop Filter: The Way Toward Next-Generation Video Coding?
* Nonparametric Quality Assessment of Natural Images
* Novel Semi-Supervised Dimensionality Reduction Framework, A
* Person-Centered Multimedia Computing: A New Paradigm Inspired by Assistive and Rehabilitative Applications
* Planogram Compliance Checking Based on Detection of Recurring Patterns
* Scale-Aware Spatially Guided Mapping
* Selecting Interesting Image Regions to Automatically Create Cinemagraphs
* Ubiquitous Multimedia: Emerging Research on Multimedia Computing
* Unsupervised Speaker Identification for TV News
* Visual Attention Retargeting
25 for MultMedMag(23)
MultMedMag(24)
* Audience Behavior Mining: Integrating TV Ratings with Multimedia Content
* Augmented Reality in Reality
* Benchmarking Initiative for Multimedia Evaluation: MediaEval 2016, The
* Beyond 1 Million Nodes: A Crowdsourced Video Content Delivery Network
* ChildGuard: A Child-Safety Monitoring System
* Continuing Reinvention of Content-Based Retrieval: Multimedia Is Not Dead, The
* Crowdsensing Multimedia Data: Security and Privacy Issues
* Cryptanalyzing an Image-Scrambling Encryption Algorithm of Pixel Bits
* Deep Learning Triggers a New Era in Industrial Robotics
* Dynamic Deployment and Optimization of Virtual Content Delivery Networks
* Evaluating Responsive Web Design's Impact on Blind Users
* Extreme-Dynamic-Range Sensing: Real-Time Adaptation to Extreme Signals
* Flow Watermarking for Antinoise and Multistream Tracing in Anonymous Networks
* Future of Multimedia Distribution: An Interview with Baochun Li, Diego R. Lopez, and Christian Timmerer, The
* JPEG at 25: Still Going Strong
* Latest Multimedia Research from ISM 2016, The
* Light-Field Journey to Virtual Reality, A
* Multimedia Content Delivery with Network Function Virtualization: The Energy Perspective
* Multimedia Technologies for Enriched Music Performance, Production, and Consumption
* Multisensory Experiences in HCI
* Network Function Virtualization and Software-Defined Networking: Advancing Multimedia Distribution
* NFV-Based Video Quality Assessment Method over 5G Small Cell Networks, An
* Nonlinear Discrete Cross-Modal Hashing for Visual-Textual Data
* Object-Detection-Based Video Compression for Wireless Surveillance Systems
* Pooling-Based Quantitative Approach to Evaluating Binarization Algorithms
* Price-Based Controller for Utility-Aware HTTP Adaptive Streaming
* QoE-Aware Bandwidth Allocation for Video Traffic Using Sigmoidal Programming
* Querying Users as Oracles in Tag Engines for Personalized Image Tagging
* Selective Privacy-Preserving Approach for Multimedia Data, A
* vCache: Supporting Cost-Efficient Adaptive Bitrate Streaming
* When Cloud Media Meet Network Function Virtualization: Challenges and Applications
* Word of Mouth Mobile Crowdsourcing: Increasing Awareness of Physical, Cyber, and Social Interactions
32 for MultMedMag(24)
MultMedMag(25)
* 360-Degree Virtual-Reality Cameras for the Masses
* Adding a New Dimension to HTTP Adaptive Streaming Through Multiple-Source Capabilities
* Behavior Analysis through Multimodal Sensing for Care of Parkinson's and Alzheimer's Patients
* Biometrics: In Search of Identity and Security (Q & A)
* Clustering of Musical Pieces Through Complex Networks: An Assessment Over Guitar Solos
* Crossmodal Approach to Multimodal Fusion in Video Hyperlinking, A
* Cryptanalyzing an Image Encryption Algorithm Based on Autoblocking and Electrocardiography
* Deep Medical Image Computing in Preventive and Precision Medicine
* Discovering Latent Aspects for Diversity-Induced Image Retrieval
* Generalized Multi-Instance Control Mapping for Interactive Media Systems
* Health Media: From Multimedia Signals to Personal Health Insights
* Image and Video Captioning with Augmented Neural Architectures
* Integrating Vision and Language for First-Impression Personality Analysis
* Multimedia for Disaster Information Management
* Multiview Cross-Media Hashing with Semantic Consistency
* Non-uniform Watermark Sharing Based on Optimal Iterative BTC for Image Tampering Recovery
* pDisVPL: Probabilistic Discriminative Visual Part Learning for Image Classification
* Rhythm: A Unified Measurement Platform for Human Organizations
* Sensing Technologies for Monitoring Serious Mental Illnesses
* Social Relationship Labeling Based on Multimodal Behaviors and Social Interactions
* Technical Evaluation of HoloLens for Multimedia: A First Look
* Toward Real-Time Delivery of Immersive Sports Content
* Vision and Language Integration Meets Multimedia Fusion
* Visual Nonverbal Behavior Analysis: The Path Forward
* Watermarking Mechanism With High Capacity for Three-Dimensional Mesh Objects Using Integer Planning, A
25 for MultMedMag(25)
MultMedMag(26)
* 3-D Scene Management Method Based on the Triangular Mesh for Large-Scale Web3D Scenes, A
* AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond
* Arbitrary Screen-Aware Manga Reading Framework with Parameter-Optimized Panel Extraction
* Cloud Resource Optimization for Processing Multiple Streams of Visual Data
* Compact Descriptors for Video Analysis: The Emerging MPEG Standard
* Coping With the Challenges of Delivering Multiple Sensorial Media
* Discovering Latent Topics With Saliency-Weighted LDA for Image Scene Understanding
* Edge Caching and Computing in 5G for Mobile AR/VR and Tactile Internet
* Emotion-Aware Video QoE Assessment Via Transfer Learning
* Enhancing Video QoE Over High-Speed Train Using Segment-Based Prefetching and Caching
* Gender Differences in Multimodal Contact-Free Deception Detection
* Hierarchical Deep Cosegmentation of Primary Objects in Aerial Videos
* Multi-Bitrate Video Caching for D2D-Enabled Cellular Networks
* Multimedia for Autonomous Driving
* Multipoint Cooperative Transmission for Virtual Reality in 5G New Radio
* Person Reidentification by Deep Structured Prediction: A Fully Parameterized Approach
* QoE-Oriented Multimedia Assessment: A Facial Expression Recognition Approach
* Rank-Based Encoding Features for Stereo Matching
* Residual-Based Post-Processing for HEVC
* Retrieval System of Medicine Molecules Based on Graph Similarity, A
* Smart Media Transport: A Burgeoning Intelligent System for Next Generation Multimedia Convergence Service Over Heterogeneous Networks in China
* ToothPic: Camera-Based Image Retrieval on Large Scales
* Towards a QoE Model to Evaluate Holographic Augmented Reality Devices
* Ubiquitous Intelligent Cameras: Between Legal Nightmare and Social Empowerment
* Who is the Film's Director? Authorship Recognition Based on Shot Features
25 for MultMedMag(26)
MultMedMag(27)
* Adversarial Learning-Based Semantic Correlation Representation for Cross-Modal Retrieval
* Artificial Intelligence Fights Crime and Terrorism at a New Level
* Attribute-Guided Feature Learning Network for Vehicle Reidentification
* Building a Manga Dataset Manga109 With Annotations for Multimedia Applications
* Compression-Then-Encryption-Based Secure Watermarking Technique for Smart Healthcare System
* Deep Residual Split Directed Graph Convolutional Neural Networks for Action Recognition
* Detecting Disaster-Related Tweets Via Multimodal Adversarial Neural Network
* Do I Smell Coffee? The Tale of a 360° Mulsemedia Experience
* Domain Adaptation With Foreground/Background Cues and Gated Discriminators
* Effective Approach for Nonrigid Structure From Motion With Complex Deformation, An
* End-to-End Framework for Clothing Collocation Based on Semantic Feature Fusion, An
* Glasses-Free 3-D and Augmented Reality Display Advances: From Theory to Implementation
* Image Retrieval via Gated Multiscale NetVLAD for Social Media Applications
* Joint Watermarking-Encryption-ECC for Patient Record Security in Wavelet Domain
* Key-Point Sequence Lossless Compression for Intelligent Video Analysis
* Learning Quintuplet Loss for Large-Scale Visual Geolocalization
* Legal and Ethical Challenges in Multimedia Research
* Leveraging Smart Devices for Scene Text Preserved Image Stylization: A Deep Gaming Approach
* Leveraging Smart Devices for Scene Text Preserved Image Stylization: A Deep Gaming Approach
* Metric Learning-Based Multimodal Audio-Visual Emotion Recognition
* Multilabel Text Classification With Incomplete Labels: A Safe Generative Model With Label Manifold Regularization and Confidence Constraint
* Multimedia and the Tactile Internet
* Multimedia Data Privacy Against Machines
* PGAN: Part-Based Nondirect Coupling Embedded GAN for Person Reidentification
* Style Transfer of Urban Road Images Using Generative Adversarial Networks With Structural Details
* Toward Sensing Emotions With Deep Visual Analysis: A Long-Term Psychological Modeling Approach
* Urban Multimedia Computing: Emerging Methods in Multimedia Computing for Urban Data Analysis and Applications
* Wall Screen: An Ultra-High Definition Video-Card for the Internet of Things
* WarpClothingOut: A Stepwise Framework for Clothes Translation From the Human Body to Tiled Images
* Wavelet-Based Quality-Constrained ECG Data Compression System Without Decoding Process
30 for MultMedMag(27)
MultMedMag(28)
* AffectiveNet: Affective-Motion Feature Learning for Microexpression Recognition
* Characteristic Analysis of 2D Lag-Complex Logistic Map and Its Application in Image Encryption
* Class-Balanced Text to Image Synthesis With Attentive Generative Adversarial Network
* Destruction and Reconstruction Learning for Facial Expression Recognition
* EGGAN: Learning Latent Space for Fine-Grained Expression Manipulation
* Emotion Detection for Conversations Based on Reinforcement Learning Framework
* End-to-End Learning for Multimodal Emotion Recognition in Video With Adaptive Loss
* Enhancing QoE for Viewport-Adaptive 360-Degree Video Streaming: Perception Analysis and Implementation
* Facial Expression Recognition With Multiscale Graph Convolutional Networks
* Feature-Guided Spatial Attention Upsampling for Real-Time Stereo Matching Network
* From Semantic to Spatial Awareness: Vehicle Reidentification With Multiple Attention Mechanisms
* Generalized Face Antispoofing by Learning to Fuse Features From High- and Low-Frequency Domains
* Gradient-Based Intraprediction Fusion for Video Coding
* Implicit Emotion Relationship Mining Based on Optimal and Majority Synthesis From Multimodal Data Prediction
* Improved Speaker and Navigator for Vision-and-Language Navigation
* Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data
* Large Dataset With a New Framework for Abandoned Object Detection in Complex Scenarios, A
* Learning-Based Satisfied User Ratio Prediction for Symmetrically and Asymmetrically Compressed Stereoscopic Images
* LFI-Augmenter: Intelligent Light Field Image Editing With Interleaved Spatial-Angular Convolution
* Magnitude and Angle Combined Optical Flow Feature for Microexpression Spotting, A
* Modeling Incongruity between Modalities for Multimodal Sarcasm Detection
* Multichannel Steganography in Digital Images for Multiple Receivers
* Multimedia in Virtual Reality and Augmented Reality
* Multimodal and Context-Aware Emotion Perception Model With Multiplicative Fusion
* Multimodal Event-Aware Network for Sentiment Analysis in Tourism
* Multimodal Political Deception Detection
* Multimodal Semantics-Based Supervised Latent Dirichlet Allocation for Event Classification
* Neighborhood Adaptive Loss Function for Deep Learning-Based Point Cloud Coding With Implicit and Explicit Quantization
* No-Reference Nonuniform Distorted Video Quality Assessment Based on Deep Multiple Instance Learning
* On the User-Centric Comparative Remote Evaluation of Interactive Video Search Systems
* Prediction With Multicross Component for Future Video Coding
* Real Testbed for Autonomous Anomaly Detection in Power Grid Using Low-Cost Unmanned Aerial Vehicles and Aerial Imaging
* Semantic Place Prediction With User Attribute in Social Media
* Sentiment-Aware Emoji Insertion Via Sequence Tagging
* Single Image Dehazing Via Region Adaptive Two-Shot Network
* State Representation Learning With Adjacent State Consistency Loss for Deep Reinforcement Learning
* Survey on Facial Expression Recognition: History, Applications, and Challenges
* Video Compression With CNN-Based Postprocessing
38 for MultMedMag(28)
MultMedMag(29)
* Comprehensive Framework of Early and Late Fusion for Image-Sentence Retrieval
* Context- and Knowledge-Aware Graph Convolutional Network for Multimodal Emotion Recognition
* Deep Multigraph Hierarchical Enhanced Semantic Representation for Cross-Modal Retrieval
* Detection of Risky Situations for Frail Adults With Hybrid Neural Networks on Multimodal Health Data
* DHNet: Double MPEG-4 Compression Detection via Multiple DCT Histograms
* DIBR Zero-Watermarking Based on Invariant Feature and Geometric Rectification
* Dual Expression Fusion: A Universal Microexpression Recognition Framework
* Efficient Low-Complexity Convolutional Neural Network Filter, An
* Efficient Multimedia Frame-Skipping Architecture Using Deep Learning in Vehicular Networks
* Emotion Recognition With Multimodal Transformer Fusion Framework Based on Acoustic and Lexical Information
* Enhanced Local and Global Learning for Rotation-Invariant Point Cloud Representation
* Exploiting the Structure Information of Suppositional Mesh for Unsupervised Multiview Stereo
* Fast Skin Segmentation on Low-Resolution Grayscale Images for Remote PhotoPlethysmoGraphy
* FLeak-Seg: Automated Fundus Fluorescein Leakage Segmentation via Cross-Modal Attention Learning
* Garment Style Creator: Using StarGAN for Image-to-Image Translation of Multidomain Garments
* Generating Dance Videos Using Pose Transfer Generative Adversarial Network With Multiple Scale Region Extractor and Learnable Region Normalization
* Integrity of Multimedia and Multimodal Data: From Capture to Use
* LIMAN: Local Information-Based Multiattention Network for 3D Shape Recognition
* MPEG Immersive Video Standard: Current Status and Future Outlook, The
* Multimedia Monitoring System of Obstructive Sleep Apnea via a Deep Active Learning Model
* Multimodal Fusion-Based Deep Learning Network for Effective Diagnosis of Alzheimer's Disease
* Next Frontier For MPEG-5 LCEVC: From HDR and Immersive Video to the Metaverse, The
* Novel Security Framework for Medical Data in IoT Ecosystems, A
* Postgraduate Student Depression Assessment by Multimedia Gait Analysis
* Privacy-Preserving Image Classification Using an Isotropic Network
* Privacy-Preserving Video Fall Detection via Chaotic Compressed Sensing and GAN-Based Feature Enhancement
* Robust Image Denoising Method With Multiview Texture-Aware Convolutional Neural Networks, A
* Scene-Adaptive Instance Modification for Semisupervised Pedestrian Detection
* Self-Supervised Cross-Modal Distillation for Thermal Infrared Tracking
* Transferring Deep Gaussian Denoiser for Compressed Sensing MRI Reconstruction
* Translational Symmetry-Aware Facade Parsing for 3-D Building Reconstruction
* Unpaired Image-to-Image Translation Using Negative Learning for Noisy Patches
* Views Meet Labels: Personalized Relation Refinement Network for Multiview Multilabel Learning
* Visual Surveillance for Human Fall Detection in Healthcare IoT
* Why Accuracy is Not Enough: The Need for Consistency in Object Detection
* Why VR Games Sickness? An Empirical Study of Capturing and Analyzing VR Games Head Movement Dataset
36 for MultMedMag(29)
MultMedMag(30)
* Anchor-Free Tracker Based on Space-Time Memory Network
* Artistic Line Drawing Rendering With Priors of Depth and Edge Density
* Bandwidth-Aware High-Efficiency Video Coding Design Scheme on a Multiprocessor System on Chip
* CADW: CGAN-Based Attack on Deep Robust Image Watermarking
* Content-Aware Latent Semantic Direction Fusion for Multi-Attribute Editing
* Could Head Motions Affect Quality When Viewing 360° Videos?
* Deep Blind Chest X-Ray Image Quality Assessment With Region-of-Interest-Guided Attention
* Distributed Architecture for an Elderly Accompaniment Service Based on IoT Devices, AI, and Cloud Services
* Edge Distraction-aware Salient Object Detection
* Edge Intelligence-Empowered Immersive Media
* Edge-Assisted Virtual Viewpoint Generation for Immersive Light Field
* Enabling Manageable and Secure Hybrid P2P-CDN Video-on-Demand Streaming Services Through Coordinating Blockchain and Zero Knowledge
* Encoding of Media Value Chain Processes Through Blockchains and MPEG-21 Smart Contracts for Media
* Improved Interaction Estimation and Optimization Method for Surveillance Video Synopsis, An
* JPEG AI Standard: Providing Efficient Human and Machine Visual Data Consumption, The
* Learning 3-D Face Shape From Diverse Sources With Cross-Domain Face Synthesis
* Learning From Coding Features: High Efficiency Rate Control for AOMedia Video 1
* Multiview Language Bias Reduction for Visual Question Answering
* Novel Learning Dictionary for Sparse Coding-Based Key Point Detection, A
* Optimizing Multidimensional Perceptual Quality in Online Interactive Multimedia
* Passthrough Mixed Reality With Oculus Quest 2: A Case Study on Learning Piano
* Perceptual Authentication Hashing for Digital Images With Contrastive Unsupervised Learning
* PP8K: A New Dataset for 8K UHD Video Compression and Processing
* Recent Advances in Immersive Multimedia
* Reversible Modal Conversion Model for Thermal Infrared Tracking
* Reviving Standard-Dynamic-Range Videos for High-Dynamic-Range Devices: A Learning Paradigm With Hybrid Attention Mechanisms
* Short-Long-Term Propagation-Based Video Inpainting
* Specular Detection and Rendering for Immersive Multimedia
* VR2Gather: A Collaborative, Social Virtual Reality System for Adaptive, Multiparty Real-Time Communication
29 for MultMedMag(30)
MultMedMag(31)
* Adaptive Detachable Partition-Based Reference Frame Recompression for Video Coding
* aVCSR: Adaptive Video Compressive Sensing Using Region-of-Interest Detection in the Compressed Domain
* ConvNet-HIDE: Deep-Learning-Based Dual Watermarking for Health-Care Images
* Convolutional Neural Network Ensemble for Video Source Camera Forensics, A
* Cryptanalyzing an Image Encryption Algorithm Underpinned by 2-D Lag-Complex Logistic Map
* Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network
* Exploiting Illumination Knowledge in the Real World for Low-Light Image Enhancement
* Feature Fusion-Based Data Augmentation Method for Small Object Detection
* Generative Adversarial Networks for Biomedical Imaging
* Generative AI for 3-D Point Clouds
* Hyperspectral Anomaly Detection Based on a Beta Wavelet Graph Neural Network
* Image-Relevant Entities Knowledge-Aware News Image Captioning
* Multimodal Integration of an Enhanced Novel Pulmonary Auscultation Real-Time Diagnostic System
* On Perceived AV Synchronization in 360° Multimedia
* Perceptual Hashing With Deep and Texture Features
* Retinex-Guided Channel Grouping-Based Patch Swap for Arbitrary Style Transfer
* Robust Color Image Hashing With Nonnegative Matrix Factorization and Saliency Map for Copy Detection
* Robust Image Registration for Power Equipment Using Large-Gap Fracture Contours
* S5: Sketch-to-Image Synthesis via Scene and Size Sensing
* Self-Supervised Cross-Modal Distillation for Thermal Infrared Tracking
* Software-Defined Networking-Driven Reliable Transmission Architecture for Enhancing Real-Time Video Streaming Quality, A
* Spatiotemporal Feature Fusion for Video Summarization
* Uncertainty-Guided Different Levels of Pseudolabels for Semisupervised Medical Image Segmentation
* Vehicle Reidentification Based on Convolution and Vision Transformer Feature Fusion
* You-Only-Look-Once Multiple-Strategy Printed Circuit Board Defect Detection Model
25 for MultMedMag(31)
MultMedMag(9)
* Applications of video-content analysis and retrieval
MultSys( Vol No. )
* *Multimedia Systems
MultSys(1)
* Automatic partitioning of full-motion video
MultSys(5)
* Advances in Fractal Compression for Multimedia Applications
* Alive System: Wireless, Full-Body Interaction with Autonomous Agents, The
* Towards Video-Based Immersive Environments
MultSys(7)
* Curvature Scale Space Image in Shape Similarity Retrieval
* feature-based algorithm for detecting and classifying production effects, A
MultSys(8)
* Relevance Feedback for Image Retrieval: A Comprenhensive Review
MultToolApp( Vol No. )
* *Multimedia Tools and Applications
MultToolApp(14)
* Audio Partitioning and Transcription for Broadcast Data Indexation
* Fixed Queries Array: A Fast and Economical Data Structure for Proximity Searching
* Guest Editorial: Content-Based Multimedia Indexing and Retrieval
* Multi-Modal Dialog Scene Detection Using Hidden Markov Models for Content-Based Multimedia Indexing
* Regions-of-Interest and Spatial Layout for Content-Based Image Retrieval
* Shot Change Detection Using Scene-Based Constraint
* ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents, The
7 for MultToolApp(14)
MultToolApp(27)
* survey of MPEG-1 audio, video and semantic analysis techniques, A
MultToolApp(3)
* Content-Based Representation and Retrieval of Visual Media: A State-of-the-Art Review
* Content-Based Retrieval for Trademark Registration
* Fractal-Based Clustering Approach in Large Visual Database-Systems, A
* Introduction to Special Issue on Representation and Retrieval of Visual Media in Multimedia Systems
MultToolApp(4)
* Application of Video Semantics and Theme Representation in Automated Video Editing, The
* Automatic Video Database Indexing and Retrieval
* Introduction to Special Issue on Representation and Retrieval of Visual Media in Multimedia Systems II
* Supporting Content-Based Retrieval in Large Image Database-Systems
* Techniques for Fast Partitioning of Compressed and Uncompressed Video
* VIMS: A Video Information Management System
MultToolApp(41)
* Independent query refinement and feature re-weighting using positive and negative examples for content-based image retrieval
MultToolApp(5)
* Annotation Engine for Supporting Video Database Population, An
* Similarity Is a Geometer
MultToolApp(7)
* Approach to a Content Based Retrieval of Multimedia Data, An
* Conceptual Modeling and Querying in Multimedia Databases