Saraceno, C.[Caterina],
Leonardi, R.[Riccardo],
Indexing audiovisual databases through joint audio and video processing,
IJIST(9), No. 5, 1999, pp. 320-331.
BibRef
9900
Earlier:
Identification of Successive Correlated Camera Shots Using Audio
and Video Information,
ICIP97(III: 166-169).
IEEE DOI
BibRef
And:
Audio-visual processing for scene change detection,
CIAP97(II: 124-131).
Springer DOI
9709
BibRef
Li, D.G.[Dong-Ge],
Sethi, I.K.[Ishwar K.],
Dimitrova, N.[Nevenka],
McGee, T.[Tom],
Classification of general audio data for content-based retrieval,
PRL(22), No. 5, April 2001, pp. 533-544.
Elsevier DOI
0105
BibRef
Tsekeridou, S.[Sofia],
Pitas, I.[Ioannis],
Content-based video parsing and indexing based on audio-visual
interaction,
CirSysVideo(11), No. 4, April 2001, pp. 522-535.
IEEE Top Reference.
0104
BibRef
Earlier:
Speaker dependent video indexing based on audio-visual interaction,
ICIP98(I: 358-362).
IEEE DOI
9810
BibRef
Tsekeridou, S.[Sofia],
Krinidis, S.[Stelios],
Pitas, I.[Ioannis],
Scene Change Detection Based on Audio-Visual Analysis and Interaction,
WTRCV01(214).
0103
BibRef
Kyperountas, M.,
Kotropoulos, C.,
Pitas, I.[Ioannis],
Enhanced Eigen-Audioframes for Audiovisual Scene Change Detection,
MultMed(9), No. 4, 2007, pp. 785-797.
IEEE DOI
0905
BibRef
Gauvain, J.L.[Jean-Luc],
Lamel, L.[Lori],
Adda, G.[Gilles],
Audio Partitioning and Transcription for Broadcast Data Indexation,
MultToolApp(14), No. 2, June 2001, pp. 187-200.
0106
BibRef
Amir, A.[Arnon],
Srinivasan, S.[Savitha],
Efrat, A.[Alon],
Search the Audio, Browse the Video:
A Generic Paradigm for Video Collections,
JASP(2003), No. 2, February 2003, pp. 209.
WWW Link.
0304
BibRef
Beal, M.J.[Matthew J.],
Jojic, N.[Nebojsa],
Attias, H.T.[Hagai T.],
A graphical model for audiovisual object tracking,
PAMI(25), No. 7, July 2003, pp. 828-836.
IEEE Abstract.
0307
BibRef
Earlier: A1, A3, A2:
Audio-Video Sensor Fusion with Probabilistic Graphical Models,
ECCV02(I: 736 ff.).
Springer DOI
0205
2 microphones and a camera. Track the moving object with clutter and noise.
BibRef
Wu, P.[Peng],
Li, Y.[Ying],
Tretter, D.[Daniel],
Scalable video summarization,
US_Patent7,047,494, May 16, 2006
WWW Link.
BibRef
0605
Gong, Y.H.[Yi-Hong],
Summarizing Audiovisual Contents of a Video Program,
JASP(2003), No. 2, February 2003, pp. 160.
WWW Link.
0304
BibRef
Gong, Y.H.[Yi-Hong],
Liu, X.[Xin],
Method and system for segmentation, classification, and summarization
of video images,
US_Patent7,016,540, Mar 21, 2006
WWW Link.
BibRef
0603
And:
US_Patent7,151,852, Dec 19, 2006
WWW Link.
BibRef
And:
Creating audio-centric, image-centric,
and integrated audio-visual summaries,
US_Patent6,925,455, Aug 2, 2005
WWW Link.
BibRef
And:
Video Summarization using Singular Value Decomposition,
CVPR00(II: 174-180).
IEEE DOI
0005
BibRef
And:
Video Shot Segmentation and Classification,
ICPR00(Vol I: 860-863).
IEEE DOI
0009
BibRef
Wang, H.L.[Hua-Lu],
Divakaran, A.[Ajay],
Vetro, A.[Anthony],
Chang, S.F.[Shih-Fu],
Sun, H.F.[Hui-Fang],
Survey of compressed-domain features used in audio-visual indexing and
analysis,
JVCIR(14), No. 2, June 2003, pp. 150-183.
Elsevier DOI
0306
Survey, Image Retrieval.
BibRef
Naphade, M.R.[Milind R.],
On supervision and statistical learning for semantic multimedia
analysis,
JVCIR(15), No. 3, September 2004, pp. 348-369.
Elsevier DOI
0711
Factor graphs; Sum product algorithm; Active learning;
Hidden Markov models; Dynamic Bayesian networks; Support vector machines
BibRef
Naphade, M.R.,
Kozintsev, I.V.,
Huang, T.S.,
A factor graph framework for semantic video indexing,
CirSysVideo(12), No. 1, January 2002, pp. 40-52.
IEEE Top Reference.
0202
BibRef
Naphade, M.R.,
Kozintsev, I.V.,
Huang, T.S.,
Ramchandran, K.,
A factor graph framework for semantic indexing and retrieval in video,
CBAIVL00(35-39).
0008
BibRef
Naphade, M.R.[Milind R.],
Huang, T.S.[Thomas S.],
Detecting Semantic Concepts Using Context and Audio/Visual Features,
EventVideo01(92-98).
IEEE DOI
0106
BibRef
Earlier:
Recognizing High-level Audio-visual Concepts Using Context,
ICIP01(III: 46-49).
IEEE DOI
0108
BibRef
Earlier:
Semantic Video Indexing Using a Probabilistic Framework,
ICPR00(Vol III: 79-84).
IEEE DOI
0009
BibRef
And:
A Probabilistic Framework for Semantic Indexing and Retrieval in Video,
ICME00(MP9).
0007
BibRef
And:
Inferring Semantic Concepts for Video Indexing and Retrieval,
ICIP00(Vol III: 766-769).
IEEE DOI
0008
BibRef
Naphade, M.R.,
Kristjansson, T.,
Frey, B.J.,
Huang, T.S.,
Probabilistic multimedia objects (multijects): a novel approach to
video indexing and retrieval in multimedia systems,
ICIP98(III: 536-540).
IEEE DOI
9810
BibRef
Xie, X.,
Lu, L.,
Jia, M.,
Li, H.,
Seide, F.,
Ma, W.Y.,
Mobile Search With Multimodal Queries,
PIEEE(96), No. 4, April 2008, pp. 589-601.
IEEE DOI
0804
Text, image, audio queries.
BibRef
Kiranyaz, S.,
Gabbouj, M.,
Generic content-based audio indexing and retrieval framework,
VISP(153), No. 3, June 2006, pp. 285-297.
DOI Link
0608
See also Novel multimedia retrieval technique: progressive query (why wait?).
BibRef
Monaci, G.,
Jost, P.,
Vandergheynst, P.,
Mailhe, B.,
Lesage, S.,
Gribonval, R.,
Learning Multimodal Dictionaries,
IP(16), No. 9, September 2007, pp. 2272-2283.
IEEE DOI
0709
Integrating audio-visual info.
BibRef
Zhang, T.[Tong],
Using background audio change detection for segmenting video,
US_Patent7,266,287, Sep 4, 2007
WWW Link.
BibRef
0709
Kotti, M.,
Ververidis, D.,
Evangelopoulos, G.,
Panagakis, I.,
Kotropoulos, C.,
Maragos, P.,
Pitas, I.,
Audio-Assisted Movie Dialogue Detection,
CirSysVideo(18), No. 11, November 2008, pp. 1618-1627.
IEEE DOI
0811
BibRef
Cristani, M.[Marco],
Bicego, M.[Manuele],
Murino, V.[Vittorio],
Audio-Visual Event Recognition in Surveillance Video Sequences,
MultMed(9), No. 2, February 2007, pp. 257-267.
IEEE DOI
0905
BibRef
Earlier:
Audio-Visual Foreground Extraction for Event Characterization,
SLAM06(116).
IEEE DOI
0609
BibRef
Earlier:
Audio-Video Integration for Background Modelling,
ECCV04(Vol II: 202-213).
Springer DOI
0405
BibRef
Zeng, Z.H.[Zhi-Hong],
Tu, J.L.[Ji-Lin],
Liu, M.[Ming],
Huang, T.S.[Thomas S.],
Pianfetti, B.[Brian],
Roth, D.[Dan],
Levinson, S.[Stephen],
Audio-Visual Affect Recognition,
MultMed(9), No. 2, February 2007, pp. 424-428.
IEEE DOI
0905
BibRef
Zeng, Z.H.[Zhi-Hong],
Tu, J.L.[Ji-Lin],
Pianfetti, B.M.,
Huang, T.S.,
Audio-Visual Affective Expression Recognition Through Multistream Fused
HMM,
MultMed(10), No. 4, June 2008, pp. 570-577.
IEEE DOI
0905
BibRef
Zeng, Z.H.[Zhi-Hong],
Tu, J.L.[Ji-Lin],
Pianfetti, B.[Brian],
Liu, M.[Ming],
Zhang, T.[Tong],
Zhang, Z.Q.[Zhen-Qiu],
Huang, T.S.[Thomas S.],
Levinson, S.[Stephen],
Audio-Visual Affect Recognition through Multi-Stream Fused HMM for HCI,
CVPR05(II: 967-972).
IEEE DOI
0507
BibRef
Zhang, S.L.,
Huang, Q.M.,
Jiang, S.,
Gao, W.,
Tian, Q.,
Affective Visualization and Retrieval for Music Video,
MultMed(12), No. 6, 2010, pp. 510-522.
IEEE DOI
1003
BibRef
Zhang, S.L.[Shi-Liang],
Tian, Q.[Qi],
Hua, G.,
Huang, Q.M.[Qing-Ming],
Gao, W.[Wen],
Generating Descriptive Visual Words and Visual Phrases for Large-Scale
Image Applications,
IP(20), No. 9, September 2011, pp. 2664-2677.
IEEE DOI
1109
See also Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search.
BibRef
Zhang, S.L.[Shi-Liang],
Tian, Q.[Qi],
Huang, Q.M.[Qing-Ming],
Gao, W.[Wen],
Rui, Y.[Yong],
USB: Ultrashort Binary Descriptor for Fast Visual Matching and
Retrieval,
IP(23), No. 8, August 2014, pp. 3671-3683.
IEEE DOI
1408
data compression
See also Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search.
BibRef
Zhang, S.L.[Shi-Liang],
Tian, Q.[Qi],
Huang, Q.M.[Qing-Ming],
Gao, W.[Wen],
Rui, Y.,
Cascade Category-Aware Visual Search,
IP(23), No. 6, June 2014, pp. 2514-2527.
IEEE DOI
1406
Accuracy
BibRef
Irie, G.,
Satou, T.,
Kojima, A.,
Yamasaki, T.,
Aizawa, K.,
Affective Audio-Visual Words and Latent Topic Driving Model for
Realizing Movie Affective Scene Classification,
MultMed(12), No. 6, 2010, pp. 523-535.
IEEE DOI
1003
BibRef
Ibrahim, Z.A.[Zein Al_Abidin],
Ferrane, I.[Isabelle],
Joly, P.[Philippe],
A Similarity-Based Approach for Audiovisual Document Classification
Using Temporal Relation Analysis,
JIVP(2011), No. 2011, pp. xx-yy.
DOI Link
1104
BibRef
Philippeau, J.[Jeremy],
Pinquier, J.[Julien],
Joly, P.[Philippe],
Carrive, J.[Jean],
Dynamic organization of audiovisual database using a user-defined
similarity measure based on low-level features,
ICIP08(33-36).
IEEE DOI
0810
BibRef
Haidar, S.[Siba],
Joly, P.[Philippe],
Chebaro, B.[Bilal],
Style Similarity Measure for Video Documents Comparison,
CIVR05(307-317).
Springer DOI
0507
BibRef
Huurnink, B.[Bouke],
Snoek, C.G.M.[Cees G. M.],
de Rijke, M.[Maarten],
Smeulders, A.W.M.[Arnold W. M.],
Content-Based Analysis Improves Audiovisual Archive Retrieval,
MultMed(14), No. 4, 2012, pp. 1166-1178.
IEEE DOI
1208
BibRef
Earlier:
Today's and tomorrow's retrieval practice in the audiovisual archive,
CIVR10(18-25).
DOI Link
1007
BibRef
Huurnink, B.[Bouke],
de Rijke, M.[Maarten],
The value of stories for speech-based video search,
CIVR07(266-271).
DOI Link
0707
BibRef
Jhuo, I.H.[I-Hong],
Ye, G.N.[Guang-Nan],
Gao, S.H.[Sheng-Hua],
Liu, D.[Dong],
Jiang, Y.G.[Yu-Gang],
Lee, D.T.,
Chang, S.F.[Shih-Fu],
Discovering joint audio-visual codewords for video event detection,
MVA(25), No. 1, January 2014, pp. 33-47.
Springer DOI
1412
BibRef
Earlier: A2, A1, A4, A5, A6, A7, Only:
Joint audio-visual bi-modal codewords for video event detection,
ICMR12(39).
DOI Link
1301
BibRef
Feki, I.[Issam],
Ben Ammar, A.[Anis],
Alimi, A.M.[Adel M.],
Automatic environmental sound concepts discovery for video retrieval,
MultInfoRetr(5), No. 2, June 2016, pp. 105-115.
WWW Link.
1605
BibRef
Khan, M.U.G.[Muhammad Usman Ghani],
Gotoh, Y.[Yoshihiko],
Generating natural language tags for video information management,
MVA(28), No. 3-4, May 2017, pp. 243-265.
WWW Link.
1704
BibRef
Khan, M.U.G.[Muhammad Usman Ghani],
Zhang, L.[Lei],
Gotoh, Y.[Yoshihiko],
Generating coherent natural language annotations for video streams,
ICIP12(2893-2896).
IEEE DOI
1302
BibRef
Earlier:
Towards coherent natural language description of video streams,
SIG11(664-671).
IEEE DOI
1201
BibRef
Earlier: A2, A1, A3:
Video scene classification based on natural language description,
ARTEMIS11(942-949).
IEEE DOI
1201
From the small amount of natural language description.
BibRef
Peri, D.[Dheeraj],
Sah, S.[Shagan],
Ptucha, R.[Raymond],
Show, Translate and Tell,
ICIP19(295-299)
IEEE DOI
1910
Joint images and captions.
BibRef
Chen, K.[Kan],
Zhang, C.X.[Chuan-Xi],
Fang, C.[Chen],
Wang, Z.W.[Zhao-Wen],
Bui, T.[Trung],
Nevatia, R.[Ram],
Visually Indicated Sound Generation by Perceptually Optimized
Classification,
MultLearnApp18(VI:560-574).
Springer DOI
1905
Predict visually consistent sound from the video content.
BibRef
Haurilet, M.L.,
Tapaswi, M.,
Al-Halah, Z.,
Stiefelhagen, R.,
Naming TV characters by watching and analyzing dialogs,
WACV16(1-9)
IEEE DOI
1606
Data models
BibRef
Numano, S.[Shunsuke],
Enami, N.[Naoko],
Ariki, Y.[Yasuo],
Task-Driven Saliency Detection on Music Video,
CV4AC14(658-671).
Springer DOI
1504
BibRef
Scott, D.[David],
Zhang, Z.X.[Zhen-Xing],
Albatal, R.[Rami],
McGuinness, K.[Kevin],
Acar, E.[Esra],
Hopfgartner, F.[Frank],
Gurrin, C.[Cathal],
O'Connor, N.E.[Noel E.],
Smeaton, A.F.[Alan F.],
Audio-Visual Classification Video Browser,
MMMod14(II: 398-401).
Springer DOI
1405
BibRef
Lin, Y.T.[Yin-Tzu],
Tsai, T.H.[Tsung-Hung],
Hu, M.C.[Min-Chun],
Cheng, W.H.[Wen-Huang],
Wu, J.L.[Ja-Ling],
Semantic Based Background Music Recommendation for Home Videos,
MMMod14(II: 283-290).
Springer DOI
1405
BibRef
Shamma, D.A.[David A.],
Kennedy, L.[Lyndon],
Churchill, E.F.[Elizabeth F.],
Watching and talking: media content as social nexus,
ICMR12(12).
DOI Link
1301
BibRef
Nowak, S.[Stefanie],
Paduschek, R.[Ronny],
Kühhirt, U.[Uwe],
Photo summary: automated selection of representative photos from a
digital collection,
ICMR11(75).
DOI Link
1301
Demo.
BibRef
Paduschek, R.[Ronny],
Nowak, S.[Stefanie],
Kühhirt, U.[Uwe],
Automated detection of errors and quality issues in audio-visual
content,
ICMR11(74).
DOI Link
1301
automated
detection of errors and quality issues in audio-visual content
AVInspector.
BibRef
Vretos, N.[Nicholas],
Nikolaidis, N.[Nikos],
Pitas, I.[Ioannis],
The use of Audio-Visual Description Profile in 3D video content
description,
3DTV12(1-4).
IEEE DOI
1212
BibRef
Ta, A.P.[Anh-Phuong],
Ben, M.[Mathieu],
Gravier, G.[Guillaume],
Improving Cluster Selection and Event Modeling in Unsupervised Mining
for Automatic Audiovisual Video Structuring,
MMMod12(529-540).
Springer DOI
1201
BibRef
Mühling, M.[Markus],
Ewerth, R.[Ralph],
Freisleben, B.[Bernd],
Improving Cross-Domain Concept Detection via Object-Based Features,
CAIP15(II:359-370).
Springer DOI
1511
BibRef
Earlier:
On the Spatial Extents of SIFT Descriptors for Visual Concept Detection,
CVS11(71-80).
Springer DOI
1109
BibRef
Mühling, M.[Markus],
Ewerth, R.[Ralph],
Zhou, J.[Jun],
Freisleben, B.[Bernd],
Multimodal Video Concept Detection via Bag of Auditory Words and
Multiple Kernel Learning,
MMMod12(40-50).
Springer DOI
1201
BibRef
Valio, F.B.[Felipe Braunger],
Pedrini, H.[Helio],
Leite, N.J.[Neucimar Jeronimo],
Fast Rotation-Invariant Video Caption Detection Based on Visual Rhythm,
CIARP11(157-164).
Springer DOI
1111
BibRef
Gianni, F.[Frédéric],
Pinquier, J.[Julien],
Irisa, E.K.[Ewa Kijak],
ACADI showcase: Automatic character indexing in audiovisual document,
CIVR07(109-112).
DOI Link
0707
BibRef
Putthividhy, D.[Duangmanee],
Attias, H.T.[Hagai T.],
Nagarajan, S.S.[Srikantan S.],
Topic regression multi-modal Latent Dirichlet Allocation for image
annotation,
CVPR10(3408-3415).
IEEE DOI
1006
Using annotation texts.
BibRef
Jung, K.H.[Kwang-Hee],
Choi, S.H.[Sung-Hyun],
Kim, H.S.[Hyung-Seok],
Hur, N.H.[Nam-Ho],
Kim, J.K.[Joong Kyu],
Caption insertion method for 3D broadcasting service,
3DTV10(1-4).
IEEE DOI
1006
BibRef
Pramod, S.K.[Sankar K.],
Jawahar, C.V.,
Zisserman, A.[Andrew],
Subtitle-free Movie to Script Alignment,
BMVC09(xx-yy).
PDF File.
0909
BibRef
Zeng, Z.[Zhi],
Liang, W.[Wei],
Li, H.P.[He-Ping],
Zhang, S.W.[Shu-Wu],
A Novel Video Classification Method Based on Hybrid
Generative/Discriminative Models,
SSPR08(705-713).
Springer DOI
0812
Using audio.
BibRef
Zhu, Y.Y.[Ying-Ying],
Ming, Z.[Zhong],
Huang, Q.A.[Qi-Ang],
SVM-Based Audio Classification for Content- Based Multimedia Retrieval,
MCAM07(474-482).
Springer DOI
0706
BibRef
Goldmann, L.,
Samour, A.,
Karaman, M.,
Sikora, T.,
Extracting High Level Semantics by Means of Speech, Audio, and Image
Primitives in Surveillance Applications,
ICIP06(2397-2400).
IEEE DOI
0610
BibRef
Luo, J.[Jie],
Caputo, B.[Barbara],
Zweig, A.[Alon],
Bach, J.H.[Jörg-Hendrik],
Anemüller, J.[Jörn],
Object Category Detection Using Audio-Visual Cues,
CVS08(xx-yy).
Springer DOI
0805
BibRef
Caputo, B.,
Wallraven, C.,
Nilsback, M.E.,
Object categorization via local kernels,
ICPR04(II: 132-135).
IEEE DOI
0409
BibRef
Schauer, C.,
Gross, H.M.,
A Computational Model of Early Auditory-Visual Integration,
DAGM03(362-369).
Springer DOI
0310
BibRef
Fu, T.Y.[Tie-Yan],
Liu, X.X.[Xiao Xing],
Liang, L.H.[Lu Hong],
Pi, X.B.[Xiao-Bo],
Nefian, A.V.,
A audio-visual speaker identification using coupled hidden Markov
models,
ICIP03(III: 29-32).
IEEE DOI
0312
BibRef
Yemez, Y.[Yücel],
Kanak, A.,
Erzin, E.,
Tekalp, A.M.,
Multimodal speaker identification with audio-video processing,
ICIP03(III: 5-8).
IEEE DOI
0312
BibRef
Sugano, M.,
Isaksson, R.,
Nakajima, Y.,
Yanagihara, H.,
Shot genre classification using compressed audio-visual features,
ICIP03(II: 17-20).
IEEE DOI
0312
BibRef
Moncrieff, S.,
Venkatesh, S., and
Dorai, C.,
Horror film genre typing and scene labeling via audio analysis,
ICME03(I: 193-196).
BibRef
0300
Moncrieff, S.,
Dorai, C.,
Venkatesh, S.,
Affect computing in film through sound energy dynamics,
ACMMM01(525-527).
BibRef
0100
Wachsmuth, S.,
Sagerer, G.,
Integrated analysis of speech and images as a probabilistic decoding
process,
ICPR02(II: 588-592).
IEEE DOI
0211
BibRef
Kulesh, V.,
Petrushin, V.A.,
Sethi, I.K.,
Video clip recognition using joint audio-visual processing model,
ICPR02(I: 500-503).
IEEE DOI
0211
BibRef
Miyamori, H.,
Improving accuracy in behaviour identification for content-based
retrieval by using audio and video information,
ICPR02(II: 826-830).
IEEE DOI
0211
BibRef
de Santo, M.,
Percannella, G.,
Sansone, C.,
Vento, M.,
Classifying audio of movies by a multi-expert system,
CIAP01(386-391).
IEEE DOI
0210
BibRef
Albiol, A.,
Torres, L.,
Delp, E.J.,
Video preprocessing for audiovisual indexing,
Southwest02(57-61).
IEEE Top Reference.
0208
BibRef
Bakker, E.M.[Erwin M.],
Lew, M.S.[Michael S.],
Semantic Video Retrieval Using Audio Analysis,
CIVR02(271-277).
Springer DOI
0208
BibRef
Kim, K.[Kyungsu],
Choi, J.[Junho],
Kim, N.[Namjung],
Kim, P.K.[Pan-Koo],
Extracting Semantic Information from Basketball Video Based on
Audio-Visual Features,
CIVR02(278-288).
Springer DOI
0208
BibRef
Fisher, J.W.[John W.],
Darrell, T.J.[Trevor J.],
Probabalistic Models and Informative Subspaces for Audiovisual
Correspondence,
ECCV02(III: 592 ff.).
Springer DOI
0205
BibRef
Chu, S.M.[Stephen M.],
Huang, T.S.[Thomas S.],
Audio-Visual Speech Fusion Using Coupled Hidden Markov Models,
MSCSAS07(1-2).
IEEE DOI
0706
BibRef
Naphade, M.R.[Milind R.],
Garg, A.[Ashutosh],
Huang, T.S.[Thomas S.],
Audio-Visual Event Detection using Duration Dependent Input Output
Markov Models,
CBAIVL01(30).
IEEE DOI
0110
BibRef
Alatan, A.A.,
Automatic Multi-modal Dialogue Scene Indexing,
ICIP01(III: 374-377).
IEEE DOI
0108
BibRef
Sundaram, H.[Hari],
Chang, S.F.[Shih-Fu],
Video Scene Segmentation Using Video and Audio Features,
ICME00(TP10).
0007
BibRef
Smith, J.R.[John R.],
Li, C.S.[Chung-Sheng],
Adaptive Synthesis in Progressive Retrieval of Audio-Visual Data,
ICME00(MP5).
0007
BibRef
Toklu, C.,
Liou, S.P.,
Image and Audio Sequence Visualization and Interaction Mechanisms for
Structured Video Browsing and Editing,
ICIP00(Vol II: 263-266).
IEEE DOI
0008
BibRef
Jiang, H.[Hao],
Lin, T.[Tong],
Zhang, H.J.[Hong-Jiang],
Video Segmentation with the Assistance of Audio Content Analysis,
ICME00(WP5).
0007
BibRef
Pandit, M.,
Kittler, J.V.,
Li, Y.,
Chilton, E.,
A Comparative Study of Different Segmentation Approaches for Audio
Track Indexing,
ICPR00(Vol II: 467-470).
IEEE DOI
0009
BibRef
Huang, J.C.[Jin-Cheng],
Liu, Z.[Zhu],
Yao, W.[Wang],
Integration of audio and visual information for content-based video
segmentation,
ICIP98(III: 526-529).
IEEE DOI
9810
BibRef
Saraceno, C.,
Leonardi, R.,
Identification of story units in audio-visual sequences by joint audio
and video processing,
ICIP98(I: 363-367).
IEEE DOI
9810
BibRef
Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Survey, Comparison, Evaluation, of Segmentation and Cut Detection, Summarization .