19.4.5.6.1 Video Captioning

Chapter Contents (Back)
Video Captioning.
See also Annotation, Captioning, Image Captioning.

Qiu, Z.F.[Zhao-Fan], Yao, T.[Ting], Mei, T.[Tao],
Learning Deep Spatio-Temporal Dependence for Semantic Video Segmentation,
MultMed(20), No. 4, April 2018, pp. 939-949.
IEEE DOI 1804
BibRef
Earlier:
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks,
ICCV17(5534-5542)
IEEE DOI 1802
3D from 2D nets. Computer architecture, Image segmentation, Semantics, Streaming media, video segmentation convolution, feature extraction, image classification, image recognition, image representation, Visualization BibRef

Qiu, Z.F.[Zhao-Fan], Yao, T.[Ting], Ngo, C.W.[Chong-Wah], Tian, X.[Xinmei], Mei, T.[Tao],
Learning Spatio-Temporal Representation With Local and Global Diffusion,
CVPR19(12048-12057).
IEEE DOI 2002
BibRef

Yao, T., Pan, Y., Li, Y., Qiu, Z., Mei, T.,
Boosting Image Captioning with Attributes,
ICCV17(4904-4912)
IEEE DOI 1802
BibRef
And: A2, A1, A3, A5, Only:
Video Captioning with Transferred Semantic Attributes,
CVPR17(984-992)
IEEE DOI 1711
computer vision, image representation, learning (artificial intelligence), Semantics. Computer architecture, Natural languages, Probability distribution, Recurrent neural networks, Visualization BibRef

Zhao, B., Li, X., Lu, X.,
CAM-RNN: Co-Attention Model Based RNN for Video Captioning,
IP(28), No. 11, November 2019, pp. 5552-5565.
IEEE DOI 1909
Visualization, Task analysis, Logic gates, Recurrent neural networks, Dogs, Semantics, Decoding, recurrent neural network BibRef

Yan, C., Tu, Y., Wang, X., Zhang, Y., Hao, X., Zhang, Y., Dai, Q.,
STAT: Spatial-Temporal Attention Mechanism for Video Captioning,
MultMed(22), No. 1, January 2020, pp. 229-241.
IEEE DOI 2001
BibRef
And: Corrections: MultMed(22), No. 3, March 2020, pp. 830-830.
IEEE DOI 2003
Video captioning, spatial-temporal attention mechanism, encoder-decoder neural networks. Mechatronics, Automation, Streaming media BibRef

Aafaq, N.[Nayyer], Mian, A.[Ajmal], Liu, W.[Wei], Gilani, S.Z.[Syed Zulqarnain], Shah, M.[Mubarak],
Video Description: A Survey of Methods, Datasets, and Evaluation Metrics,
Surveys(52), No. 6, October 2019, pp. xx-yy.
DOI Link 2001
video to text, Video description, video captioning, language in vision BibRef

Zhang, Z., Xu, D., Ouyang, W., Tan, C.,
Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization,
CirSysVideo(30), No. 9, September 2020, pp. 3130-3139.
IEEE DOI 2009
Proposals, Visualization, Image segmentation, Feature extraction, Semantics, Decoding, Task analysis, Dense video captioning, hierarchical attention mechanism BibRef

Zhang, W.[Wei], Wang, B.R.[Bai-Rui], Ma, L.[Lin], Liu, W.[Wei],
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning,
PAMI(42), No. 12, December 2020, pp. 3088-3101.
IEEE DOI 2011
Decoding, Image reconstruction, Semantics, Training data, Visualization, Video sequences, Video captioning, backward information BibRef

Lee, S.[Sujin], Kim, I.[Incheol],
DVC-Net: A deep neural network model for dense video captioning,
IET-CV(15), No. 1, 2021, pp. 12-23.
DOI Link 2106
BibRef

Qi, S.S.[Shan-Shan], Yang, L.X.[Lu-Xi],
Video captioning via a symmetric bidirectional decoder,
IET-CV(15), No. 4, 2021, pp. 283-296.
DOI Link 2106
BibRef


Yang, B.[Bang], Zou, Y.[Yuexian],
Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning,
ICPR21(188-195)
IEEE DOI 2105
Visualization, Semantics, Natural languages, Benchmark testing, Feature extraction, Encoding, Data mining BibRef

Perez-Martin, J.[Jesus], Bustos, B.[Benjamin], Pérez, J.[Jorge],
Attentive Visual Semantic Specialized Network for Video Captioning,
ICPR21(5767-5774)
IEEE DOI 2105
Visualization, Adaptation models, Video description, Semantics, Computer architecture, Logic gates, Syntactics, video captioning BibRef

Lu, M.[Min], Li, X.[Xueyong], Liu, C.[Caihua],
Context Visual Information-based Deliberation Network for Video Captioning,
ICPR21(9812-9818)
IEEE DOI 2105
Visualization, Semantics, Coherence, Benchmark testing, Pattern recognition, Decoding BibRef

Olivastri, S., Singh, G., Cuzzolin, F.,
End-to-End Video Captioning,
HVU19(1474-1482)
IEEE DOI 2004
convolutional neural nets, decoding, image recognition, learning (artificial intelligence), recurrent neural nets, BibRef

Li, L., Gong, B.,
End-to-End Video Captioning With Multitask Reinforcement Learning,
WACV19(339-348)
IEEE DOI 1904
computer vision, convolutional neural nets, learning (artificial intelligence), recurrent neural nets, Hardware BibRef

Wang, B., Ma, L., Zhang, W., Liu, W.,
Reconstruction Network for Video Captioning,
CVPR18(7622-7631)
IEEE DOI 1812
Decoding, Semantics, Image reconstruction, Video sequences, Visualization, Feature extraction, Natural languages BibRef

Li, Y., Yao, T., Pan, Y., Chao, H., Mei, T.,
Jointly Localizing and Describing Events for Dense Video Captioning,
CVPR18(7492-7500)
IEEE DOI 1812
Proposals, Dogs, Complexity theory, Task analysis, Training, Optimization BibRef

Wang, J., Jiang, W., Ma, L., Liu, W., Xu, Y.,
Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning,
CVPR18(7190-7198)
IEEE DOI 1812
Proposals, Visualization, Task analysis, Video sequences, Fuses, Semantics, Feature extraction BibRef

Wu, X., Li, G., Cao, Q., Ji, Q., Lin, L.,
Interpretable Video Captioning via Trajectory Structured Localization,
CVPR18(6829-6837)
IEEE DOI 1812
Trajectory, Feature extraction, Decoding, Visualization, Semantics, Recurrent neural networks BibRef

Wang, X., Chen, W., Wu, J., Wang, Y., Wang, W.Y.,
Video Captioning via Hierarchical Reinforcement Learning,
CVPR18(4213-4222)
IEEE DOI 1812
Task analysis, Semantics, Dogs, Neural networks, Portable computers BibRef

Zhou, L., Zhou, Y., Corso, J.J., Socher, R., Xiong, C.,
End-to-End Dense Video Captioning with Masked Transformer,
CVPR18(8739-8748)
IEEE DOI 1812
Proposals, Decoding, Encoding, Hidden Markov models, Feeds, Training, Visualization BibRef

Yang, D., Yuan, C.,
Hierarchical Context Encoding for Events Captioning in Videos,
ICIP18(1288-1292)
IEEE DOI 1809
Videos, Proposals, Task analysis, Mathematical model, Computational modeling, Decoding, Measurement, Video captioning, video summarization BibRef

Shen, Z.Q.[Zhi-Qiang], Li, J.G.[Jian-Guo], Su, Z.[Zhou], Li, M.J.[Min-Jun], Chen, Y.R.[Yu-Rong], Jiang, Y.G.[Yu-Gang], Xue, X.Y.[Xiang-Yang],
Weakly Supervised Dense Video Captioning,
CVPR17(5159-5167)
IEEE DOI 1711
Motion segmentation, Neural networks, Training, Visualization, Vocabulary BibRef

Baraldi, L., Grana, C., Cucchiara, R.,
Hierarchical Boundary-Aware Neural Encoder for Video Captioning,
CVPR17(3185-3194)
IEEE DOI 1711
Computer architecture, Encoding, Logic gates, Microprocessors, Motion pictures, Streaming media, Visualization BibRef

Pan, P.B.[Ping-Bo], Xu, Z.W.[Zhong-Wen], Yang, Y.[Yi], Wu, F.[Fei], Zhuang, Y.T.[Yue-Ting],
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning,
CVPR16(1029-1038)
IEEE DOI 1612
video captioning where temporal information plays a crucial role. BibRef

Yu, H.N.[Hao-Nan], Wang, J.[Jiang], Huang, Z.H.[Zhi-Heng], Yang, Y.[Yi], Xu, W.[Wei],
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks,
CVPR16(4584-4593)
IEEE DOI 1612
Generating one or multiple sentences to describe a realistic video BibRef

Shin, A.[Andrew], Ohnishi, K.[Katsunori], Harada, T.[Tatsuya],
Beyond caption to narrative: Video captioning with multiple sentences,
ICIP16(3364-3368)
IEEE DOI 1610
Feature extraction BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Video Summarization, Abstract, MPEG Based, AVC, H264, MPEG Metadata .


Last update:Nov 30, 2021 at 22:19:38