19.4.5.7 Video Analysis -- Captions, Text and Audio

Chapter Contents (Back)
Video Analysis. Captions. Video Indexing.

Kim, H.K.,
Efficient Automatic Text Location Method and Content-Based Indexing and Structuring of Video Database,
JVCIR(7), No. 4, December 1996, pp. 336-344. 9704 BibRef

Jain, A.K.[Anil K.], Yu, B.[Bin],
Automatic Text Location in Images and Video Frames,
PR(31), No. 12, December 1998, pp. 2055-2076. BibRef 9812
Earlier:
WWW Version. ICPR98(Vol II: 1497-1499).
IEEE DOI may work or IEEE-CS DOI may work. 9808 BibRef

Viswanathan, M.[Mahesh], Beigi, H.S.M.[Homayoon S.M.], Dharanipragada, S.[Satya], Maali, F.[Fereydoun], Tritschler, A.[Alain],
Multimedia Document Retrieval Using Speech and Speaker Recognition,
IJDAR(2), No. 4, 1999, pp. xx-yy. 0008 BibRef

Li, H.P.[Hui-Ping], Doermann, D.[David], Kia, O.[Omid],
Automatic Text Detection and Tracking in Digital Video,
IP(9), No. 1, January 2000, pp. 147-156.
IEEE DOI may work or IEEE-CS DOI may work. 0001 BibRef
And: UMD--TR3962, December 1998. Neural Networks and Wavelets.
WWW Version.
WWW Version. BibRef

Doermann, D.[David], Li, H.P.[Hui-Ping],
Automatic Identification of Text in Digital Video Key Frames,
ICPR98(Vol I: 129-132).
IEEE DOI may work or IEEE-CS DOI may work. 9808 BibRef

Wu, V.[Victor], Manmatha, R.[Raghavan], Riseman, E.M.[Edward M.],
TextFinder: An Automatic System to Detect and Recognize Text in Images,
PAMI(21), No. 11, November 1999, pp. 1224-1229.
IEEE Abstract. IEEE Top Reference.
WWW Version. 9912 BibRef
And:
TextFinder,
UMassCS TR 99-40, June, 1999.
Postscript Version. Extraction of the text for images (i.e. ads). BibRef

Wu, V., Manmatha, R.,
Extracting Text From Greyscale Images,
UMassCS TR 95-88, November, 1995.
Postscript Version. BibRef 9511

Wu, V., Manmatha, R., Riseman, E.M.,
Finding Text In Images,
UMassCS TR 97-09, February, 1997
Postscript Version. BibRef 9702

Zhong, Y.[Yu], Zhang, H.J.[Hong-Jiang], Jain, A.K.[Anil K.],
Automatic Caption Localization in Compressed Video,
PAMI(22), No. 4, April 2000, pp. 385-392.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0006 BibRef
Earlier: ICIP99(II:96-100).
IEEE Abstract. IEEE Top Reference. BibRef

Gauvain, J.L.[Jean-Luc], Lamel, L.[Lori], Adda, G.[Gilles],
Audio Partitioning and Transcription for Broadcast Data Indexation,
MultToolApp(14), No. 2, June 2001, pp. 187-200.
WWW Version. 0106 BibRef

Saraceno, C.[Caterina], Leonardi, R.[Riccardo],
Indexing audiovisual databases through joint audio and video processing,
IJIST(9), No. 5, 1999, pp. 320-331. BibRef 9900
Earlier:
Identification of Successive Correlated Camera Shots Using Audio and Video Information,
ICIP97(III: 166-169).
IEEE DOI may work or IEEE-CS DOI may work. BibRef
And:
Audio-visual processing for scene change detection,
CIAP97(II: 124-131).
WWW Version. 9709 BibRef

Kim, K.I.[Kwang In], Jung, K.C.[Kee-Chul], Park, S.H.[Se Hyun], Kim, H.J.[Hang Joon],
Support vector machine-based text detection in digital video,
PR(34), No. 2, February 2001, pp. 527-529.
WWW Version. 0011 BibRef

Lee, C.W.[Chang Woo], Jung, K.C.[Kee-Chul], Kim, H.J.[Hang Joon],
Automatic text detection and removal in video sequences,
PRL(24), No. 15, November 2003, pp. 2607-2623.
WWW Version. 0308 See also Text scanner with text detection technology on image sequences. BibRef

Welsh, S.[Stephen], Conway, D.[Damian],
Encoding Video Narration as Text,
RealTimeImg(6), No. 5, October 2000, pp. 391-405. 0011 BibRef

Li, D.G.[Dong-Ge], Sethi, I.K.[Ishwar K.], Dimitrova, N.[Nevenka], McGee, T.[Tom],
Classification of general audio data for content-based retrieval,
PRL(22), No. 5, April 2001, pp. 533-544.
HTML Version. 0105 BibRef

Tsekeridou, S.[Sofia], Pitas, I.[Ioannis],
Content-based video parsing and indexing based on audio-visual interaction,
CirSysVideo(11), No. 4, April 2001, pp. 522-535.
IEEE Top Reference. 0104 BibRef
Earlier:
Speaker dependent video indexing based on audio-visual interaction,
ICIP98(I: 358-362).
IEEE DOI may work or IEEE-CS DOI may work. 9810 BibRef

Tsekeridou, S.[Sofia], Krinidis, S.[Stelios], Pitas, I.[Ioannis],
Scene Change Detection Based on Audio-Visual Analysis and Interaction,
WTRCV01(214). 0103 BibRef

Amir, A.[Arnon], Srinivasan, S.[Savitha], Efrat, A.[Alon],
Search the Audio, Browse the Video: A Generic Paradigm for Video Collections,
JASP(2003), No. 2, February 2003, pp. 209.
HTML Version. 0304 BibRef

Syeda-Mahmood, T.F., Srinivasan, S., Amir, A., Ponceleon, D., Blanchard, B., Petkovic, D.,
CueVideo: a system for cross-modal search and browse of video databases,
CVPR00(II: 786-787).
IEEE Abstract. IEEE Top Reference.
WWW Version. 0403 BibRef

Adams, W.H., Iyengar, G.[Giridharan], Lin, C.Y.[Ching-Yung], Naphade, M.R.[Milind Ramesh], Neti, C.[Chalapathy], Nock, H.J.[Harriet J.], Smith, J.R.[John R.],
Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues,
JASP(2003), No. 2, February 2003, pp. 170.
HTML Version. 0304 BibRef

Beal, M.J.[Matthew J.], Jojic, N.[Nebojsa], Attias, H.[Hagai],
A graphical model for audiovisual object tracking,
PAMI(25), No. 7, July 2003, pp. 828-836.
IEEE Abstract. IEEE Top Reference. 0307 BibRef
Earlier: A1, A3, A2:
Audio-Video Sensor Fusion with Probabilistic Graphical Models,
ECCV02(I: 736 ff.).
HTML Version. 02052 microphones and a camera. Track the moving object with clutter and noise. BibRef

Li, Y.[Ying], Narayanan, S.S.[Shrikanth S.], Kuo, C.C.J.[C.C. Jay],
Adaptive Speaker Identification with Audio-Visual Cues for Movie Content Analysis,
PRL(25), No. 7, May 2004, pp. 777-791.
WWW Version. 0405 BibRef

Li, Y.[Ying], Narayanan, S.S.[Shrikanth S.], Kuo, C.C.J.[C.C. Jay],
Content-Based Movie Analysis and Indexing Based on Audio-Visual Cues,
CirSysVideo(14), No. 8, August 2004, pp. 1073-1085.
IEEE Abstract. IEEE Top Reference. 0409 BibRef
Earlier:
Movie Content Analysis, Indexing and Skimming Via Multimodal Information,
VideoMining03(Chapter 5). BibRef

Li, Y.[Ying], Kuo, C.C.J.[C.C. Jay],
A robust video scene extraction approach to movie content abstraction,
IJIST(13), No. 5, 2003, pp. 236-244.
WWW Version. 0312 BibRef

Lyu, M.R., Song, J.[Jiqiang], Cai, M.[Min],
A comprehensive method for multilingual video text detection, localization, and extraction,
CirSysVideo(15), No. 2, February 2005, pp. 243-255.
IEEE Abstract. IEEE Top Reference. 0501 BibRef

Kiranyaz, S., Gabbouj, M.,
Generic content-based audio indexing and retrieval framework,
VISP(153), No. 3, June 2006, pp. 285-297.
WWW Version. 0608 See also Novel multimedia retrieval technique: progressive query (why wait?). BibRef

de Jong, F.M.G., Westerveld, T., de Vries, A.P.,
Multimedia Search Without Visual Analysis: The Value of Linguistic and Contextual Information,
CirSysVideo(17), No. 3, March 2007, pp. 365-371.
IEEE DOI may work or IEEE-CS DOI may work. 0703 BibRef

Huang, Y.P.[Yo-Ping], Hsu, L.W.[Liang-Wei], Sandnes, F.E.[Frode-Eika],
An Intelligent Subtitle Detection Model for Locating Television Commercials,
SMC-B(37), No. 2, April 2007, pp. 485-492.
IEEE DOI may work or IEEE-CS DOI may work. 0704 BibRef

Monaci, G., Jost, P., Vandergheynst, P., Mailhe, B., Lesage, S., Gribonval, R.,
Learning Multimodal Dictionaries,
IP(16), No. 9, September 2007, pp. 2272-2283.
IEEE DOI may work or IEEE-CS DOI may work. 0709Integrating audio-visual info. BibRef

Covell, M., Baluja, S., Fink, M.,
Detecting Ads in Video Streams Using Acoustic and Visual Cues,
Computer(39), No. 12, December 2006, pp. 135-137.
IEEE DOI may work or IEEE-CS DOI may work. 0612 BibRef

Dimitrova, N.[Nevenka], Agnihotri, L.[Lalitha], Wei, G.[Gang],
Video Classification Using Object Tracking,
IJIG(1), No. 3, July 2001, pp. 487-505. 0107 BibRef

Wei, G.[Gang], Agnihotri, L.[Lalitha], Dimitrova, N.[Nevenka],
TV Program Classification Based on Face and Text Processing,
ICME00(III: 1345-1348). 0007 BibRef

Agnihotri, L., Dimitrova, N.,
Text Detection for Video Analysis,
CBAIVL99(xx-yy). BibRef 9900


Aytar, Y.[Yusuf], Shah, M.[Mubarak], Luo, J.B.[Jie-Bo],
Utilizing semantic word similarity measures for video retrieval,
CVPR08(1-8).
IEEE DOI may work or IEEE-CS DOI may work. 0806 BibRef

Guillaumin, M.[Matthieu], Mensink, T.[Thomas], Verbeek, J.[Jakob], Schmid, C.[Cordelia],
Automatic face naming with caption-based supervision,
CVPR08(1-8).
IEEE DOI may work or IEEE-CS DOI may work. 0806 BibRef

Mathe, S.[Stefan], Fazly, A.[Afsaneh], Dickinson, S.[Sven], Stevenson, S.[Suzanne],
Learning the abstract motion semantics of verbs from captioned videos,
SLAM08(1-8).
IEEE DOI may work or IEEE-CS DOI may work. 0806 BibRef

Stone, Z.[Zak], Zickler, T.[Todd], Darrell, T.[Trevor],
Autotagging Facebook: Social network context improves photo annotation,
InterNet08(1-8).
IEEE DOI may work or IEEE-CS DOI may work. 0806 BibRef

Wachenfeld, S.[Steffen], Fleischer, S.[Stefan], Jiang, X.Y.[Xiao-Yi],
A Multiple Classifier Approach for the Recognition of Screen-Rendered Text,
CAIP07(921-928).
WWW Version. 0708 BibRef

Wang, Y.[Yaowei], Su, L.M.[Li-Min], Ye, Q.X.[Qi-Xiang],
A Robust Caption Detecting Algorithm on MPEG Compressed Video,
MCAM07(195-202).
WWW Version. 0706 BibRef

Quattoni, A.[Ariadna], Collins, M.[Michael], Darrell, T.J.[Trevor J.],
Transfer learning for image classification with sparse prototype representations,
CVPR08(1-8).
IEEE DOI may work or IEEE-CS DOI may work. 0806 BibRef
Earlier:
Learning Visual Representations using Images with Captions,
CVPR07(1-8).
IEEE DOI may work or IEEE-CS DOI may work. 0706 BibRef

Goldmann, L., Samour, A., Karaman, M., Sikora, T.,
Extracting High Level Semantics by Means of Speech, Audio, and Image Primitives in Surveillance Applications,
ICIP06(2397-2400). 0610
IEEE DOI may work or IEEE-CS DOI may work. BibRef

Wang, Y.K.[Yuan-Kai], Chen, J.M.[Jian-Ming],
Detecting Video Texts Using Spatial-Temporal Wavelet Transform,
ICPR06(IV: 754-757).
WWW Version. 0609 BibRef

Wachenfeld, S.[Steffen], Klein, H.U.[Hans-Ulrich], Jiang, X.Y.[Xiao-Yi],
Recognition of Screen-Rendered Text,
ICPR06(II: 1086-1089).
WWW Version. 0609 BibRef

Ravulapalli, S.[Sunil], Sarkar, S.[Sudeep],
Association of Sound to Motion in Video using Perceptual Organization,
ICPR06(I: 1216-1219).
WWW Version. 0609 BibRef

Cristani, M.[Marco], Bicego, M.[Manuele], Murino, V.[Vittorio],
Audio-Visual Foreground Extraction for Event Characterization,
SLAM06(116).
IEEE DOI may work or IEEE-CS DOI may work. 0609 BibRef
Earlier:
Audio-Video Integration for Background Modelling,
ECCV04(Vol II: 202-213).
WWW Version. 0405 BibRef

Velivelli, A.[Atulya], Huang, T.S.[Thomas S.],
Automatic Video Annotation by Mining Speech Transcripts,
SLAM06(115).
IEEE DOI may work or IEEE-CS DOI may work. 0609 BibRef

Su, Y.M.[Yih-Ming], Hsieh, C.H.[Chaur-Heh],
A Novel Caption Extraction Scheme for Various Sports Captions,
ICPR06(II: 1054-1057).
WWW Version. 0609 BibRef

Jamieson, M.[Michael], Dickinson, S.[Sven], Stevenson, S.[Suzanne], Wachsmuth, S.[Sven],
Using Language to Drive the Perceptual Grouping of Local Image Features,
CVPR06(II: 2102-2109).
IEEE DOI may work or IEEE-CS DOI may work. 0606Learning using features and captions. BibRef

Misra, C.[Chinmaya], Sural, S.[Shamik],
Content Based Image and Video Retrieval Using Embedded Text,
ACCV06(II:111-120).
WWW Version. 0601 BibRef

Natarajan, P., Elmieh, B., Schwartz, R., Makhoul, J.,
Videotext OCR using hidden Markov models,
ICDAR01(947-951).
IEEE DOI may work or IEEE-CS DOI may work. 0109 BibRef

Lefevre, S., Vincent, N.,
Caption localisation in video sequences by fusion of multiple detectors,
ICDAR05(I: 106-110).
IEEE DOI may work or IEEE-CS DOI may work. 0508 BibRef

Miyamori, H., Nakamura, S., Tanaka, K.,
Automatic Indexing of Broadcast Content Using its Live Chat on the Web,
ICIP05(III: 1248-1251).
IEEE DOI may work or IEEE-CS DOI may work. 0512 BibRef

Kidron, E.[Einat], Schechner, Y.Y.[Yoav Y.], Elad, M.[Michael],
Pixels that Sound,
CVPR05(I: 88-95).
IEEE DOI may work or IEEE-CS DOI may work. 0507Combine images with the sounds. Not just talking faces. BibRef

Zeng, Z.H.[Zhi-Hong], Tu, J.L.[Ji-Lin], Pianfetti, B.[Brian], Liu, M.[Ming], Zhang, T.[Tong], Zhang, Z.Q.[Zhen-Qiu], Huang, T.S.[Thomas S.], Levinson, S.[Stephen],
Audio-Visual Affect Recognition through Multi-Stream Fused HMM for HCI,
CVPR05(II: 967-972).
IEEE DOI may work or IEEE-CS DOI may work. 0507 BibRef

Xie, L., Kennedy, L., Chang, S.F., Divakarun, A., Sun, H., Lin, C.Y.,
Discovering meaningful multimedia patterns with audio-visual concepts and associated text,
ICIP04(IV: 2383-2386).
IEEE DOI may work or IEEE-CS DOI may work. 0505 BibRef

Kutics, A., Nakagawa, A., Arai, S., Tanaka, H., Ohtsuka, S.,
Relating words and image segments on multiple layers for effective browsing and retrieval,
ICIP04(IV: 2203-2206).
IEEE DOI may work or IEEE-CS DOI may work. 0505 BibRef

Nakagawa, A., Kutics, A., Tanaka, K., Nakajima, M.,
Combining words and object-based visual features in image retrieval,
CIAP03(354-359).
IEEE Abstract. IEEE Top Reference. 0310 BibRef

Kutics, A., Nakagawa, A., Nakajima, M.,
Image retrieval via connecting words to salient objects,
ICIP03(III: 17-20).
IEEE Abstract. IEEE Top Reference. 0312 BibRef

Declerck, T.[Thierry], Kuper, J.[Jan], Saggion, H.[Horacio], Samiotou, A.[Anna], Wittenburg, P.[Peter], Contreras, J.[Jesus],
Contribution of NLP to the Content Indexing of Multimedia Documents,
CIVR04(610-618).
WWW Version. 0505 BibRef

Wang, R.R.[Rong-Rong], Jin, W.[Wanjun], Wu, L.D.[Li-De],
A novel video caption detection approach using multi-frame integration,
ICPR04(I: 449-452).
IEEE DOI may work or IEEE-CS DOI may work. 0409 BibRef

Schauer, C., Gross, H.M.,
A Computational Model of Early Auditory-Visual Integration,
DAGM03(362-369).
HTML Version. 0310 BibRef

Fu, T.Y.[Tie-Yan], Liu, X.X.[Xiao Xing], Liang, L.H.[Lu Hong], Pi, X.B.[Xiao-Bo], Nefian, A.V.,
A audio-visual speaker identification using coupled hidden Markov models,
ICIP03(III: 29-32).
IEEE Abstract. IEEE Top Reference. 0312 BibRef

Yemez, Y.[Yücel], Kanak, A., Erzin, E., Tekalp, A.M.,
Multimodal speaker identification with audio-video processing,
ICIP03(III: 5-8).
IEEE Abstract. IEEE Top Reference. 0312 BibRef

Nakamura, A., Yamamoto, K.,
Caption text recognition in video frames by MAP matching,
ICDAR03(650-655).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Sugano, M., Isaksson, R., Nakajima, Y., Yanagihara, H.,
Shot genre classification using compressed audio-visual features,
ICIP03(II: 17-20).
IEEE Abstract. IEEE Top Reference. 0312 BibRef

Luo, B.[Bo], Tang, X.[Xiaoou], Liu, J.Z.[Jian-Zhuang], Zhang, H.J.[Hong-Jiang],
Video caption detection and extraction using temporal information,
ICIP03(I: 297-300).
IEEE Abstract. IEEE Top Reference. 0312 BibRef

Hauptmann, A.G., Jin, R., and Ng, T.D.,
Multi-modal information retrieval from broadcast video using OCR and speech recognition,
JCDL02(160-161); BibRef 0200

Aradhye, H., and Dorai, C.,
Augmented Edit Distance Based Temporal Contiguity Analysis for Improved Videotext Recognition,
MMSP01(xx-yy). BibRef 0100

Dorai, C., Aradhye, H., and Shim, J.C.,
End-to-End Videotext Recognition for Multimedia Content Analysis,
ICME01(xx-yy)
PDF Version. BibRef 0100

Aradhye, H., Dorai, C., Shim, J.C.,
Study of Embedded Font Context and Kernel Space Methods for Improved Videotext Recognition,
ICIP01(II: 825-828).
IEEE Abstract. IEEE Top Reference. 0108 BibRef

Shim, J.C.[Jae-Chang], Dorai, C.[Chitra], Bolle, R.M.[Ruud M.],
Automatic Text Extraction from Video for Content-Based Annotation and Retrieval,
ICPR98(Vol I: 618-620).
IEEE DOI may work or IEEE-CS DOI may work. 9808 BibRef

Wachsmuth, S., Sagerer, G.,
Integrated analysis of speech and images as a probabilistic decoding process,
ICPR02(II: 588-592).
IEEE DOI may work or IEEE-CS DOI may work. 0211 BibRef

Kulesh, V., Petrushin, V.A., Sethi, I.K.,
Video clip recognition using joint audio-visual processing model,
ICPR02(I: 500-503).
IEEE DOI may work or IEEE-CS DOI may work. 0211 BibRef

Sung, S.H.[Si-Hun], Chun, W.S.[Woo-Sung],
Knowledge-based numeric open caption recognition for live sportscast,
ICPR02(II: 822-825).
IEEE DOI may work or IEEE-CS DOI may work. 0211 BibRef

Miyamori, H.,
Improving accuracy in behaviour identification for content-based retrieval by using audio and video information,
ICPR02(II: 826-830).
IEEE DOI may work or IEEE-CS DOI may work. 0211 BibRef

de Santo, M., Percannella, G., Sansone, C., Vento, M.,
Classifying audio of movies by a multi-expert system,
CIAP01(386-391).
IEEE Top Reference. 0210 BibRef

Albiol, A., Torres, L., Delp, E.J.,
Video preprocessing for audiovisual indexing,
Southwest02(57-61).
IEEE Top Reference. 0208 BibRef

Bakker, E.M.[Erwin M.], Lew, M.S.[Michael S.],
Semantic Video Retrieval Using Audio Analysis,
CIVR02(271-277).
HTML Version. 0208 BibRef

Kim, K.[Kyungsu], Choi, J.[Junho], Kim, N.[Namjung], Kim, P.K.[Pan-Koo],
Extracting Semantic Information from Basketball Video Based on Audio-Visual Features,
CIVR02(278-288).
HTML Version. 0208 BibRef

Fisher, J.W.[John W.], Darrell, T.J.[Trevor J.],
Probabalistic Models and Informative Subspaces for Audiovisual Correspondence,
ECCV02(III: 592 ff.).
HTML Version. 0205 BibRef

Chu, S.M.[Stephen M.], Huang, T.S.[Thomas S.],
Audio-Visual Speech Fusion Using Coupled Hidden Markov Models,
MSCSAS07(1-2).
IEEE DOI may work or IEEE-CS DOI may work. 0706 BibRef

Naphade, M.R.[Milind R.], Garg, A.[Ashutosh], Huang, T.S.[Thomas S.],
Audio-Visual Event Detection using Duration Dependent Input Output Markov Models,
CBAIVL01(30).
IEEE DOI may work or IEEE-CS DOI may work. 0110 BibRef

Alatan, A.A.,
Automatic Multi-modal Dialogue Scene Indexing,
ICIP01(III: 374-377).
IEEE Abstract. IEEE Top Reference. 0108 BibRef

Smith, M.A.[Michael A.], Kanade, T.[Takeo],
Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques,
CVPR97(775-781).
IEEE Abstract. IEEE Top Reference.
WWW Version. 9704 BibRef
And: DARPA97(357-366). BibRef
And: CMU-CS-TR-97-111, February 1997. Language from audio produce a skim.
Postscript Version. BibRef

Smith, M.A.[Michael A.], Kanade, T.[Takeo],
Video Skimming for Quick Browsing based on Audio and Image Characterization,
CMU-CS-TR-95-186, July 1995.
Postscript Version. BibRef 9507

Sundaram, H.[Hari], Chang, S.F.[Shih-Fu],
Video Scene Segmentation Using Video and Audio Features,
ICME00(TP10). 0007 BibRef

Smith, J.R.[John R.], Li, C.S.[Chung-Sheng],
Adaptive Synthesis in Progressive Retrieval of Audio-Visual Data,
ICME00(MP5). 0007 BibRef

Toklu, C., Liou, S.P.,
Image and Audio Sequence Visualization and Interaction Mechanisms for Structured Video Browsing and Editing,
ICIP00(Vol II: 263-266).
IEEE Abstract. IEEE Top Reference. 0008 BibRef

Jiang, H.[Hao], Lin, T.[Tong], Zhang, H.J.[Hong-Jiang],
Video Segmentation with the Assistance of Audio Content Analysis,
ICME00(WP5). 0007 BibRef

Lim, Y.K., Choi, S.H., Lee, S.W.,
Text Extraction in MPEG Compressed Video for Content-based Indexing,
ICPR00(Vol IV: 409-412).
IEEE DOI may work or IEEE-CS DOI may work.
HTML Version. 0009 BibRef

Mariano, V.Y., Kasturi, R.,
Locating Uniform-colored Text in Video Frames,
ICPR00(Vol IV: 539-542).
IEEE DOI may work or IEEE-CS DOI may work.
HTML Version. 0009 BibRef

Antani, S., Crandall, D., Kasturi, R.,
Robust Extraction of Text in Video,
ICPR00(Vol I: 831-834).
IEEE DOI may work or IEEE-CS DOI may work.
HTML Version. 0009 BibRef

Li, H., Doermann, D.,
A Video Text Detection System Based on Automated Training,
ICPR00(Vol II: 223-226).
IEEE DOI may work or IEEE-CS DOI may work.
HTML Version. 0009 BibRef

Wernicke, A.[Axel], Lienhart, R.[Rainer],
On the Segmentation of Text in Videos,
ICME00(WP5). 0007 BibRef

Kasturi, R., Gargi, U.[Ullas], Antani, S.[Sameer],
Indexing Text Events in Digital Video Databases,
ICPR98(Vol I: 916-918).
IEEE DOI may work or IEEE-CS DOI may work. 9808 BibRef

Lazarescu, M.[Mihai], Venkatesh, S.[Svetha], Caelli, T.M.[Terry M.], West, G.A.W.[Geoff A.W.],
Combining NL Processing and Video Data to Query American Football,
ICPR98(Vol II: 1238-1240).
IEEE DOI may work or IEEE-CS DOI may work. 9808 BibRef

Pandit, M., Kittler, J.V., Li, Y., Chilton, E.,
A Comparative Study of Different Segmentation Approaches for Audio Track Indexing,
ICPR00(Vol II: 467-470).
IEEE DOI may work or IEEE-CS DOI may work.
HTML Version. 0009 BibRef

Huang, J.C.[Jin-Cheng], Liu, Z.[Zhu], Yao, W.[Wang],
Integration of audio and visual information for content-based video segmentation,
ICIP98(III: 526-529).
IEEE DOI may work or IEEE-CS DOI may work. 9810 BibRef

Cheung, C.H., and Po, L.M.,
Text-Driven Automatic Frame Generation Using MPEG-4 Synthetic/Natural Hybrid Coding for 2-D Head-and-Shoulder Scene,
ICIP97(II: 69-72).
IEEE DOI may work or IEEE-CS DOI may work. BibRef 9700

Srihari, R.K.,
Combining text and image information in content-based retrieval,
ICIP95(I: 326-329).
IEEE DOI may work or IEEE-CS DOI may work. 9510 BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
News Video Analysis, Cut Detection, Summaries, Indexing .


Last update:Oct 1, 2008 at 09:28:47