21.3.4 Lipreading, Lip Reading, Lip Tracking

Chapter Contents (Back)
Real Time Vision. Application, Lipreading. Lipreading. Lip Reading. Visual Speech. See also Combined Audio Visual Recognition and Analysis.

Language Independent Lip Reading,
2007.
HTML Version. Dataset, Lip Reading.

OuluVS database,
2009.
WWW Link. Dataset, Lip Reading.

Mase, K.,
Recognition of Facial Expression from Optical Flow,
IEICE(E74-xx), No. 10, 1991, pp. 3474-3483. BibRef 9100

Mase, K., and Pentland, A.P.,
Automatic Lipreading by Computer,
IEICE(J73-D-II), No. 6, June 1990, pp. 796-803. BibRef 9006
Earlier:
Lip Reading: Automatic Visual Recognition of Spoken Words,
OSAMV89(1565-1570). BibRef

Murase, H., Sakai, R.,
Moving Object Recognition in Eigenspace Representation: Gait Analysis and Lip Reading,
PRL(17), No. 2, February 8 1996, pp. 155-162. BibRef 9602

Zhou, Y.[Yong],
Low bit rate audio-visual communication having improved face and lip region detection,
US_Patent5,596,362, Jan 21, 1997
WWW Link. BibRef 9701

Luettin, J., Thacker, N.A.,
Speechreading Using Probabilistic Models,
CVIU(65), No. 2, February 1997, pp. 163-178.
DOI Link 9704
BibRef

Luettin, J., Thacker, N.A., Beet, S.W.,
Locating and Tracking Facial Speech Features,
ICPR96(I: 652-656).
IEEE DOI 9608
BibRef
And:
Learning to Recognise Talking Faces,
ICPR96(IV: 55-59).
IEEE DOI 9608
(Univ. of Sheffield, UK) BibRef

Yu, K., Jiang, X.Y., Bunke, H.,
Lipreading: A Classifier Combination Approach,
PRL(18), No. 11-13, November 1997, pp. 1421-1426. 9806
BibRef

Goldschen, A.J.[Alan J.], (MITRE), Petajan, E.D.[Eric D.], (ATT), and Garcia, O.N.[Oscar N.], (Wright State University),
Continuous Automatic Speech Recognition by Lipreading,
MBR97(Chapter 14). BibRef 9700

Nan, L.[Li], Dettmer, S.[Shawn], and Shah, M.[Mubarak],
Visually Recognizing Speech Using Eigen Sequences,
MBR97(Chapter 15), UCF. BibRef 9700

Graf, H.P.[Hans Peter],
Method for locating a subject's lips in a facial image,
US_Patent5,805,745, September 8, 1998.
WWW Link. BibRef 9809

Petajan, E.D., Graf, H.P.,
Robust face feature analysis for automatic speechreading and character animation,
AFGR96(357-362).
IEEE DOI 9610
BibRef

Yu, K.[Keren], Jiang, X.Y.[Xiao-Yi], Bunke, H.[Horst],
Lipreading using signal analysis over time,
SP(77), No. 2, 1 September 1999, pp. 195-208. BibRef 9909
Earlier:
Lipreading using Fourier transform over time,
CAIP97(472-479).
Springer DOI 9709
BibRef

Mak, M.W., Allen, W.G.,
A lip-tracking system based on morphological processing and block matching techniques,
SP:IC(6), No. 4, August 1994, pp. 335-348.
WWW Link. BibRef 9408

Lepsřy, S.[Skjalg], Curinga, S.[Sergio],
Conversion of articulatory parameters into active shape model coefficients for lip motion representation and synthesis,
SP:IC(13), No. 3, September 1998, pp. 209-225.
WWW Link. BibRef 9809

Chan, S., Ngo, C.W., Lai, K.F.,
Motion tracking of human mouth by generalized deformable models,
PRL(20), No. 8, August 1999, pp. 879-887. BibRef 9908

Oliver, N.M.[Nuria M.], Pentland, A.P.[Alex P.], Bérard, F.[François],
LAFTER: a real-time face and lips tracker with facial expression recognition,
PR(33), No. 8, August 2000, pp. 1369-1382.
WWW Link. 0005
BibRef
Earlier:
LAFTER: Lips and Face Real Time Tracker,
CVPR97(123-129).
IEEE DOI 9704
With demo. BibRef

Jebara, T.S., Pentland, A.P.,
Parameterized Structure from Motion for 3D Adaptive Feedback Tracking of Faces,
CVPR97(144-150).
IEEE DOI 9704
BibRef
And: Vismod--401, 1996.
HTML Version. MIT. Color. Symmetries. BibRef

Chiou, G.I., Hwang, J.N.,
Lipreading from Color Video,
IP(6), No. 8, August 1997, pp. 1192-1195.
IEEE DOI 9708
BibRef
Earlier:
Lipreading from Color Motion Video,
ICASSP96(XX) Dept of EE. University of Seattle. BibRef

Matthews, I.[Iain], Cootes, T.F.[Timothy F.], Bangham, J.A.[J. Andrew], Cox, S.J.[Stephen J.], Harvey, R.W.[Richard W.],
Extraction of Visual Features for Lipreading,
PAMI(24), No. 2, February 2002, pp. 198-213.
IEEE DOI 0202
Evaluation, Lip Reading. Compare 3 methods for evaluation. BibRef

Matthews, I., Bangham, J.A., Harvey, R.W., Cox, S.J.,
A Comparison of Active Shape Model and Scale Decomposition Based Features for Visual Speech Recognition,
ECCV98(II: 514).
Springer DOI BibRef 9800
Earlier: A3, A1, A2, A4:
Lip Reading from Scale-Space Measurements,
CVPR97(582-587).
IEEE DOI 9704
BibRef

Daubias, P.[Philippe], Deléglise, P.[Paul],
Statistical Lip-Appearance Models Trained Automatically Using Audio Information,
JASP(2002), No. 11, November 2002, pp. 1202.
WWW Link. 0304
BibRef

Zhang, X.Z.[Xiao-Zheng], Broun, C.C.[Charles C.], Mersereau, R.M.[Russell M.], Clements, M.A.[Mark A.],
Automatic Speechreading with Applications to Human-Computer Interfaces,
JASP(2002), No. 11, November 2002, pp. 1228.
WWW Link. 0304
BibRef

Zhang, X.Z.[Xiao-Zheng], Mersereau, R.M.[Russell M.],
Lip Feature Extraction Towards an Automatic Speechreading System,
ICIP00(Vol III: 226-229).
IEEE DOI 0008
BibRef

Gordan, M.[Mihaela], Kotropoulos, C.[Constantine], Pitas, I.[Ioannis],
A Support Vector Machine-Based Dynamic Network for Visual Speech Recognition Applications,
JASP(2002), No. 11, November 2002, pp. 1248.
WWW Link. 0304
BibRef
Earlier:
Application of support vector machines classifiers to visual speech recognition,
ICIP02(III: 129-132).
IEEE DOI 0210
BibRef

Siatras, S., Nikolaidis, N.[Nikos], Krinidis, M.[Michail], Pitas, I.[Ioannis],
Visual Lip Activity Detection and Speaker Detection Using Mouth Region Intensities,
CirSysVideo(19), No. 1, January 2009, pp. 133-137.
IEEE DOI 0902
BibRef

Luthon, F., Caplier, A., Lievin, M.,
Spatiotemporal MRF approach to video segmentation: Application to motion detection and lip segmentation,
SP(76), No. 1, 1 July 1999, pp. 61-80. BibRef 9907

Caplier, A.[Alice], Luthon, F.[Franck],
A new spatiotemporal approach for image analysis. Application to motion detection,
CAIP95(246-253).
Springer DOI 9509
BibRef

Lievin, M., Luthon, F.,
Nonlinear Color Space and Spatiotemporal MRF for Hierarchical Segmentation of Face Features in Video,
IP(13), No. 1, January 2004, pp. 63-71.
IEEE DOI 0402
BibRef

Lievin, M., Luthon, F.,
A Hierarchical Segmentation Algorithm for Face Analysis Application for Lipreading,
ICME00(TP8). 0007
BibRef
Earlier:
Lip features automatic extraction,
ICIP98(III: 168-172).
IEEE DOI 9810
BibRef

Luthon, F.[Franck], and Lievin, M.,
Lip Motion Automatic Detection,
SCIA97(xx-yy)
HTML Version. 9705
BibRef

Cetingul, H.E., Yemez, Y., Erzin, E., Tekalp, A.M.,
Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading,
IP(15), No. 10, October 2006, pp. 2879-2891.
IEEE DOI 0609
BibRef
Earlier:
Discriminative lip-motion features for biometric speaker identification,
ICIP04(III: 2023-2026).
IEEE DOI 0505
BibRef

Lafon, S.[Stephane], Keller, Y., Coifman, R.R.[Ronald R.],
Data Fusion and Multicue Data Matching by Diffusion Maps,
PAMI(28), No. 11, November 2006, pp. 1784-1797.
IEEE DOI 0609
Laplace-Beltrami approach for computing density invariant embeddings. Second, a refinement of the Nyström extension algorithm. Finally, a multicue data matching scheme based on nonlinear spectral graphs alignment. Apply to lipreading. BibRef

Bayro-Corrochano, E.[Eduardo], Trujillo, N.[Noel], Naranjo, M.[Michel],
Quaternion Fourier Descriptors for the Preprocessing and Recognition of Spoken Words Using Images of Spatiotemporal Representations,
JMIV(28), No. 2, June 2007, pp. 179-190.
Springer DOI 0710
BibRef

Yau, W.C.[Wai Chee], Kumar, D.K.[Dinesh Kant], Arjunan, S.P.[Sridhar Poosapadi],
Visual Speech Recognition Using Dynamic Features And Support Vector Machines,
IJIG(8), No. 3, July 2008, pp. 419-437. 0807
BibRef
Earlier:
Visual Speech Recognition Method Using Translation, Scale and Rotation Invariant Features,
AVSBS06(63-63).
IEEE DOI 0611
BibRef

Yau, W.C.[Wai Chee], Kumar, D.K.[Dinesh Kant], Chinnadurai, T.[Tharangini],
Lip-Reading Technique Using Spatio-Temporal Templates and Support Vector Machines,
CIARP08(610-617).
Springer DOI 0809
BibRef

Shaikh, A.A.[Ayaz A.], Kumar, D.K.[Dinesh K.], Gubbi, J.[Jayavardhana],
Automatic visual speech segmentation and recognition using directional motion history images and Zernike moments,
VC(29), No. 10, October 2013, pp. 969-982.
WWW Link. 1310
BibRef

Yau, W.C.[Wai Chee], Kumar, D.K.[Dinesh Kant], Weghorn, H.[Hans],
Visual Speech Recognition Using Motion Features and Hidden Markov Models,
CAIP07(832-839).
Springer DOI 0708
BibRef

Seymour, R.[Rowan], Stewart, D.[Darryl], Ming, J.[Ji],
Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos,
JIVP(2008), No. 2008, pp. xx-yy.
DOI Link 0804
BibRef

Stewart, D., Seymour, R., Pass, A., Ming, J.[Ji],
Robust Audio-Visual Speech Recognition Under Noisy Audio-Video Conditions,
Cyber(44), No. 2, February 2014, pp. 175-184.
IEEE DOI 1403
audio streaming BibRef

Borgstrom, B.J., Alwan, A.[Abeer],
A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition,
SMC-A(38), No. 6, November 2008, pp. 1273-1280.
IEEE DOI 0811
BibRef

Wang, S.L., Liew, A.W.C., Lau, W.H., Leung, S.H.,
An Automatic Lipreading System for Spoken Digits With Limited Training Data,
CirSysVideo(18), No. 12, December 2008, pp. 1760-1765.
IEEE DOI 0812
BibRef
Earlier: A1, A3, A4, Only:
Automatic Lipreading with Limited Training Data,
ICPR06(III: 881-884).
IEEE DOI 0609
BibRef

Zhao, G., Barnard, M., Pietikäinen, M.[Matti],
Lipreading With Local Spatiotemporal Descriptors,
MultMed(11), No. 7, November 2009, pp. 1254-1265.
IEEE DOI 0911
BibRef

Yang, M.T.[Mau-Tsuen], You, Z.W.[Zhen-Wei], Shih, Y.C.[Ya-Chun],
Lip contour extraction for language learning in VEC3D,
MVA(21), No. 1, November 2009, pp. xx-yy.
Springer DOI 0911
BibRef

Shin, J.J.[Jong-Ju], Lee, J.[Jin], Kim, D.J.[Dai-Jin],
Real-time lip reading system for isolated Korean word recognition,
PR(44), No. 3, March 2011, pp. 559-571.
Elsevier DOI 1011
Lip reading; Two-stage lip tracking; Word classifier; Automatic speech recognition; Audio-visual ASR BibRef

Stillittano, S.[Sébastien], Girondel, V.[Vincent], Caplier, A.[Alice],
Lip contour segmentation and tracking compliant with lip-reading application constraints,
MVA(24), No. 1, January 2013, pp. 1-18.
WWW Link. 1301
BibRef
Earlier:
Inner and outer lip contour tracking using cubic curve parametric models,
ICIP09(2469-2472).
IEEE DOI 0911
BibRef

Zhou, Z.H.[Zi-Heng], Hong, X.P.[Xiao-Peng], Zhao, G.Y.[Guo-Ying], Pietikainen, M.[Matti],
A Compact Representation of Visual Speech Data Using Latent Variables,
PAMI(36), No. 1, 2014, pp. 181-187.
IEEE DOI 1312
BibRef
Earlier: A1, A3, A4, Only:
Towards a practical lipreading system,
CVPR11(137-144).
IEEE DOI 1106
BibRef
Earlier:
Lipreading: A Graph Embedding Approach,
ICPR10(523-526).
IEEE DOI 1008
Data models BibRef

Saitoh, T.[Takeshi], Zhou, Z.H.[Zi-Heng], Zhao, G.Y.[Guo-Ying], Pietikäinen, M.[Matti],
Concatenated Frame Image Based CNN for Visual Speech Recognition,
LipRead16(II: 277-289).
Springer DOI 1704
See also Image-Based Visual Speech Animation System, An. BibRef

Cheung, Y.M.[Yiu-Ming], Li, M.[Meng], Cao, X.C.[Xiao-Chun], You, X.[Xinge],
Lip Segmentation under MAP-MRF Framework with Automatic Selection of Local Observation Scale and Number of Segments,
IP(23), No. 8, August 2014, pp. 3397-3411.
IEEE DOI 1408
BibRef
Earlier: A1, A2, Only:
MAP-MRF based lip segmentation without true segment number,
ICIP11(769-772).
IEEE DOI 1201
Markov processes BibRef

Zhou, Z.H.[Zi-Heng], Zhao, G.Y.[Guo-Ying], Hong, X.P.[Xiao-Peng], Pietikäinen, M.[Matti],
A review of recent advances in visual speech decoding,
IVC(32), No. 9, 2014, pp. 590-605.
Elsevier DOI 1408
Survey, Visual Speech. Visual speech decoding BibRef

Travieso, C.M.[Carlos M.], Zhang, J.G.[Jian-Guo], Miller, P.[Paul], Alonso, J.B.[Jesús B.],
Using a Discrete Hidden Markov Model Kernel for lip-based biometric identification,
IVC(32), No. 12, 2014, pp. 1080-1089.
Elsevier DOI 1412
Discrete Hidden Markov Model Kernel BibRef

Ibrahim, M.Z., Mulvaney, D.J.,
Geometrical-based lip-reading using template probabilistic multi-dimension dynamic time warping,
JVCIR(30), No. 1, 2015, pp. 219-233.
Elsevier DOI 1507
BibRef
Earlier:
Geometry based lip reading system using Multi Dimension Dynamic Time Warping,
VCIP12(1-6).
IEEE DOI 1302
Lip reading BibRef

Shin, J.[Jeyong], Kim, H.I.[Hong-In], Park, R.H.[Rae-Hong],
New interface for musical instruments using lip reading,
IET-IPR(9), No. 9, 2015, pp. 770-776.
DOI Link 1509
audio-visual systems BibRef

Le, T.H.N.[T. Hoang Ngan], Savvides, M.[Marios],
A novel Shape Constrained Feature-based Active Contour model for lips/mouth segmentation in the wild,
PR(54), No. 1, 2016, pp. 23-33.
Elsevier DOI 1603
Prior shape BibRef

Patrona, F., Iosifidis, A., Tefas, A., Nikolaidis, N., Pitas, I.,
Visual Voice Activity Detection in the Wild,
MultMed(18), No. 6, June 2016, pp. 967-977.
IEEE DOI 1605
Feature extraction. From facial videos. BibRef

Liu, X.[Xin], Cheung, Y.M.[Yiu-Ming], Tang, Y.Y.[Yuan Yan],
Lip event detection using oriented histograms of regional optical flow and low rank affinity pursuit,
CVIU(148), No. 1, 2016, pp. 153-163.
Elsevier DOI 1606
Lip event detection BibRef

Howell, D.[Dominic], Cox, S.[Stephen], Theobald, B.J.[Barry-John],
Visual units and confusion modelling for automatic lip-reading,
IVC(51), No. 1, 2016, pp. 1-12.
Elsevier DOI 1606
Lip-reading BibRef

Shaw, F., Theobald, B.J.[Barry-John],
Expressive Modulation of Neutral Visual Speech,
MultMedMag(23), No. 4, October 2016, pp. 68-78.
IEEE DOI 1612
computer animation BibRef

Zhang, H.P.[Hao-Peng], El-Gaaly, T.[Tarek], Elgammal, A.M.[Ahmed M.], Jiang, Z.G.[Zhi-Guo],
Factorization of view-object manifolds for joint object recognition and pose estimation,
CVIU(139), No. 1, 2015, pp. 89-103.
Elsevier DOI 1509
Homeomorphic manifold analysis. Object cateforization. BibRef

Bakry, A.[Amr], El-Gaaly, T.[Tarek], Elhoseiny, M.[Mohamed], Elgammal, A.M.[Ahmed M.],
Joint Object Recognition and Pose Estimation Using a Nonlinear View-Invariant Latent Generative Model,
WACV16(1-9)
IEEE DOI 1606
Computational modeling. Category recognition and pose. BibRef

Bakry, A.[Amr], Elgammal, A.M.[Ahmed M.],
Untangling Object-View Manifold for Multiview Recognition and Pose Estimation,
ECCV14(IV: 434-449).
Springer DOI 1408
BibRef
And:
MKPLS: Manifold Kernel Partial Least Squares for Lipreading and Speaker Identification,
CVPR13(684-691)
IEEE DOI 1309
AVLetters BibRef

Butt, W.U.R.[Waqqas Ur Rehman],
Speech Recognition Supported by Lip Analysis,
ELCVIA(15), No. 2, 2016, pp. 30-32.
DOI Link 1611
BibRef

Georgakis, C., Petridis, S., Pantic, M.,
Discrimination Between Native and Non-Native Speech Using Visual Features Only,
Cyber(46), No. 12, December 2016, pp. 2758-2771.
IEEE DOI 1612
Databases BibRef

Wei, J.G.[Jian-Guo], Zhang, J.S.[Jing-Shu], Ji, Y.[Yan], Fang, Q.A.[Qi-Ang], Lu, W.H.[Wen-Huan],
Morphological normalization of vowel images for articulatory speech recognition,
JVCIR(41), No. 1, 2016, pp. 352-360.
Elsevier DOI 1612
Vocal tract normalization BibRef


Ra, R., Aihara, R., Takiguchi, T., Ariki, Y.,
Visual-to-speech conversion based on maximum likelihood estimation,
MVA17(518-521)
DOI Link 1708
Feature extraction, Lips, Maximum likelihood estimation, Speech, Training, Visualization BibRef

Iwasaki, M., Kubokawa, M., Saitoh, T.,
Two features combination with gated recurrent unit for visual speech recognition,
MVA17(326-329)
DOI Link 1708
Feature extraction, Hidden Markov models, Lips, Logic gates, Mouth, Speech recognition, Visualization BibRef

Wielgat, R., Mik, L., Lorenc, A., Truchan, A., Szostek, M.,
Choice of optimal measurement conditions for calculating the correlation between EMA sensor and video marker position coordinates in electromagnetic articulography,
WSSIP17(1-5)
IEEE DOI 1707
Cameras, Correlation, Lips, Magnetic heads, Speech, Tongue, correlation, electormagnetic, articulography BibRef

Fernandez-Lopez, A., Martinez, O., Sukno, F.M.[Federico M.],
Towards Estimating the Upper Bound of Visual-Speech Recognition: The Visual Lip-Reading Feasibility Database,
FG17(208-215)
IEEE DOI 1707
Context, Decoding, Speech, Speech recognition, Visual databases, Visualization BibRef

Špetlík, R.[Radim], Cech, J.[Jan], Franc, V.[Vojtech], Matas, J.[Jirí],
Visual Language Identification from Facial Landmarks,
SCIA17(II: 389-400).
Springer DOI 1706
BibRef

Chung, J.S.[Joon Son], Zisserman, A.[Andrew],
Lip Reading in the Wild,
ACCV16(II: 87-103).
Springer DOI 1704
BibRef

Zimmermann, M.[Marina], Ghazi, M.M.[Mostafa Mehdipour], Ekenel, H.K.[Hazim Kemal], Thiran, J.P.[Jean-Philippe],
Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System,
LipRead16(II: 264-276).
Springer DOI 1704
BibRef

Lee, D.[Daehyun], Lee, J.[Jongmin], Kim, K.E.[Kee-Eung],
Multi-view Automatic Lip-Reading Using Neural Network,
LipRead16(II: 290-302).
Springer DOI 1704
BibRef

Watanabe, T.[Takuya], Katsurada, K.[Kouichi], Kanazawa, Y.S.[Yasu-Shi],
Lip Reading from Multi View Facial Images Using 3D-AAM,
LipRead16(II: 303-316).
Springer DOI 1704
BibRef

Shi, X.X., Wang, S.L., Lai, J.Y.,
Visual speaker authentication by ensemble learning over static and dynamic lip details,
ICIP16(3942-3946)
IEEE DOI 1610
Adaptation models BibRef

Aung, Z.H.[Zaw Htet], Ritthipravat, P.[Panrasee],
Robust Visual Voice Activity Detection Using Long Short-Term Memory Recurrent Neural Network,
PSIVT15(380-391).
Springer DOI 1602
BibRef

Patrona, F.[Foteini], Iosifidis, A.[Alexandros], Tefas, A.[Anastasios], Nikolaidis, N.[Nikolaos], Pitas, I.[Ioannis],
Visual voice activity detection based on spatiotemporal information and bag of words,
ICIP15(2334-2338)
IEEE DOI 1512
Bag of Words model BibRef

Liu, H.[Hong], Zhang, X.W.[Xue-Wu], Wu, P.P.[Ping-Ping],
Two-level multi-task metric learning with application to multi-classification,
ICIP15(2756-2760)
IEEE DOI 1512
Face Identification; Lipreading; Metric Learning; Multi-task Learning BibRef

Liu, H.[Hong], Zhang, X.[Xuewu], Wu, P.P.[Ping-Ping],
Regression based landmark estimation and multi-feature fusion for visual speech recognition,
ICIP15(808-812)
IEEE DOI 1512
Bayes Combination BibRef

Anina, I., Zhou, Z.H.[Zi-Heng], Zhao, G.Y.[Guo-Ying], Pietikainen, M.,
OuluVS2: A multi-view audiovisual database for non-rigid mouth motion analysis,
FG15(1-5)
IEEE DOI 1508
cameras BibRef

Antonakos, E., Roussos, A., Zafeiriou, S.,
A survey on mouth modeling and analysis for Sign Language recognition,
FG15(1-7)
IEEE DOI 1508
Internet BibRef

Shiraishi, J., Saitoh, T.,
Optical flow based lip reading using non rectangular ROI and head motion reduction,
FG15(1-6)
IEEE DOI 1508
face recognition BibRef

Ichino, M.[Masatsugu], Yamazaki, Y.S.[Yasu-Shi], Jian-Gang, W.[Wang], Yun, Y.W.[Yau Wei],
Text independent speaker gender recognition using lip movement,
ICARCV12(176-181).
IEEE DOI 1304
BibRef

Bear, H.L.[Helen L.], Harvey, R.W.[Richard W.], Theobald, B.J.[Barry-John], Lan, Y.X.[Yu-Xuan],
Resolution limits on visual speech recognition,
ICIP14(1371-1375)
IEEE DOI 1502
Accuracy BibRef
And:
Which Phoneme-to-Viseme Maps Best Improve Visual-Only Computer Lip-Reading?,
ISVC14(II: 230-239).
Springer DOI 1501
BibRef

Sad, G.D.[Gonzalo D.], Terissi, L.D.[Lucas Daniel], Gómez, J.C.[Juan Carlos],
Decision Level Fusion for Audio-Visual Speech Recognition in Noisy Conditions,
CIARP16(360-367).
Springer DOI 1703
BibRef

Terissi, L.D.[Lucas Daniel], Sad, G.D.[Gonzalo D.], Gómez, J.C.[Juan Carlos], Parodi, M.[Marianela],
Audio-Visual Speech Recognition Scheme Based on Wavelets and Random Forests Classification,
CIARP15(567-574).
Springer DOI 1511
BibRef
Earlier: A1, A4, A3, Only:
Lip Reading Using Wavelet-Based Features and Random Forests Classification,
ICPR14(791-796)
IEEE DOI 1412
Hidden Markov models BibRef

Rekik, A.[Ahmed], Ben-Hamadou, A.[Achraf], Mahdi, W.[Walid],
A New Visual Speech Recognition Approach for RGB-D Cameras,
ICIAR14(II: 21-28).
Springer DOI 1410
BibRef

Kawasaki, T., Ukai, N., Takumi, S., Tamura, S., Hayamizu, S.,
Improvement of Lip Reading Performance in Real Environments Using Speaker and Environmental Adaptation,
ACPR13(346-350)
IEEE DOI 1408
image recognition BibRef

Agarwal, S., Mukherjee, D.P.,
Lip tracking under varying expressions utilizing domain knowledge,
NCVPRIPG13(1-4)
IEEE DOI 1408
annealing BibRef

Pei, Y.R.[Yu-Ru], Kim, T.K.[Tae-Kyun], Zha, H.B.[Hong-Bin],
Unsupervised Random Forest Manifold Alignment for Lipreading,
ICCV13(129-136)
IEEE DOI 1403
Lipreading; Manifold Alignment; RFMA; Unsupervised Random Forest BibRef

Pei, Y.R.[Yu-Ru], Zha, H.B.[Hong-Bin],
Visyllable-specific facial transition motion embedding and extraction,
ICIP09(1809-1812).
IEEE DOI 0911
BibRef

Shi, F.H.[Fu-Hao], Pei, Y.R.[Yu-Ru], Zha, H.B.[Hong-Bin],
Interactive modeling of 3D facial expressions with hierarchical Gaussian process latent variable models,
ICIP09(2477-2480).
IEEE DOI 0911
BibRef

Lopes, C.B.O., Goncalves, A.L., Scharcanski, J., Jung, C.R.,
Color-based lips extraction applied to voice activity detection,
ICIP11(1057-1060).
IEEE DOI 1201
BibRef

Prajapati, G.I.[Ghanshyam I.], Patel, N.M.[Narendra M.],
DToLIP: Detection and tracking of lip contours from human facial images using Snake's method,
ICIIP11(1-6).
IEEE DOI 1112
BibRef

Koller, O.[Oscar], Ney, H.[Hermann], Bowden, R.[Richard],
Read My Lips: Continuous Signer Independent Weakly Supervised Viseme Recognition,
ECCV14(I: 281-296).
Springer DOI 1408
BibRef

Ong, E.J.[Eng-Jon], Bowden, R.[Richard],
Learning temporal signatures for Lip Reading,
ARTEMIS11(958-965).
IEEE DOI 1201
BibRef
And:
Learning Sequential Patterns for Lipreading,
BMVC11(xx-yy).
HTML Version. 1110
See also Learning multi-kernel distance functions using relative comparisons. BibRef

Sheerman-Chase, T.[Tim], Ong, E.J.[Eng-Jon], Bowden, R.[Richard],
Cultural factors in the regression of non-verbal communication perception,
HICV11(1242-1249).
IEEE DOI 1201
BibRef

Pass, A.[Adrian], Zhang, J.G.[Jian-Guo], Stewart, D.[Darryl],
AN investigation into features for multi-view lipreading,
ICIP10(2417-2420).
IEEE DOI 1009
BibRef

Li, K.[Kunlun], Wang, M.[Miao], Liu, M.[Ming], Zhao, A.[Aimin],
Improved level set method for lip contour detection,
ICIP10(673-676).
IEEE DOI 1009
BibRef

Nascimento, J.C.[Jacinto C.], Silva, J.S.[Jorge S.],
Lip contour tracking using multiple dynamic models on a manifold,
ICIP11(2321-2324).
IEEE DOI 1201
BibRef

Carneiro, G.[Gustavo], Nascimento, J.C.[Jacinto C.],
The Fusion of Deep Learning Architectures and Particle Filtering Applied to Lip Tracking,
ICPR10(2065-2068).
IEEE DOI 1008
BibRef
Earlier: A2, A1:
Efficient search methods and deep belief networks with particle filtering for non-rigid tracking: Application to lip tracking,
ICIP10(3817-3820).
IEEE DOI 1009
See also Combining Multiple Dynamic Models and Deep Learning Architectures for Tracking the Left Ventricle Endocardium in Ultrasound Data. BibRef

Pass, A.[Adrian], Ming, J.[Ji], Hanna, P.[Philip], Zhang, J.G.[Jian-Guo], Stewart, D.[Darryl],
Inter-frame contextual modelling for visual speech recognition,
ICIP10(93-96).
IEEE DOI 1009
BibRef

Saitoh, T.[Takeshi], Kato, T.[Tomoya], Konishi, R.[Ryosuke],
A Novel Transducer: From Lip Motion to Voice Message,
MVA09(410-).
PDF File. 0905
BibRef

Liu, X.[Xin], Cheung, Y.M.[Yiu-Ming], Li, M.[Meng], Liu, H.L.[Hai-Lin],
A Lip Contour Extraction Method Using Localized Active Contour Model with Automatic Parameter Selection,
ICPR10(4332-4335).
IEEE DOI 1008
BibRef

Heracleous, P.[Panikos], Hagita, N.[Norihiro], Beautemps, D.[Denis],
Gestures and Lip Shape Integration for Cued Speech Recognition,
ICPR10(2238-2241).
IEEE DOI 1008
BibRef

Estellers, V., Gurban, M., Thiran, J.P.,
Selecting relevant visual features for speechreading,
ICIP09(1433-1436).
IEEE DOI 0911
BibRef

Gastelum, A., Krueger, M., Marquez, J., Gimel'farb, G.L., Delmas, P.,
Automatic 3D lip shape segmentation and modelling,
IVCNZ08(1-6).
IEEE DOI 0811
BibRef

Saitoh, T.[Takeshi], Morishita, K.[Kazutoshi], Konishi, R.[Ryosuke],
Analysis of efficient lip reading method for various languages,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Saitoh, T.[Takeshi], Konishi, R.[Ryosuke],
Profile Lip Reading for Vowel and Word Recognition,
ICPR10(1356-1359).
IEEE DOI 1008
BibRef
Earlier:
Lip Reading Based on Sampled Active Contour Model,
ICIAR05(507-515).
Springer DOI 0509
BibRef

Terry, L.H.[Louis H.], Shiell, D.J.[Derek J.], Katsaggelos, A.K.[Aggelos K.],
Vector quantization with memory and multi-labeling for isolated video-only automatic speech recognition,
ICIP08(1320-1323).
IEEE DOI 0810
BibRef

Wu, G.Y.[Guan-Yong], Zhu, J.[Jie],
Modular BDPCA based visual feature representation for lip-reading,
ICIP08(1328-1331).
IEEE DOI 0810
BibRef

de la Cuesta, A.G., Zhang, J.G.[Jian-Guo], Miller, P.,
Biometric Identification Using Motion History Images of a Speaker's Lip Movements,
IMVIP08(83-88).
IEEE DOI 0809
BibRef

Pachoud, S.[Samuel], Gong, S.G.[Shao-Gang], Cavallaro, A.[Andrea],
Macro-cuboďd based probabilistic matching for lip-reading digits,
CVPR08(1-8).
IEEE DOI 0806
BibRef

Kumar, K.[Kshitiz], Chen, T.H.[Tsu-Han], and Stern, R.M.[Richard M.],
Profile View Lip Reading,
ICASSP07(IV: 429-432).
PDF File. Intro to profile view lip reading, compares to frontal, and combines with audio for comprenhensive system. BibRef 0700

Faraj, M.I.[Maycel Isaac], Bigun, J.[Josef],
Speaker and Digit Recognition by Audio-Visual Lip Biometrics,
ICB07(1016-1024).
Springer DOI 0708
BibRef

Faraj, M.I.[Maycel Isaac], Bigun, J.[Josef],
Lip Biometrics for Digit Recognition,
CAIP07(360-365).
Springer DOI 0708
BibRef
Earlier:
Motion Features from Lip Movement for Person Authentication,
ICPR06(III: 1059-1062).
IEEE DOI 0609
BibRef
And:
Person Verification by Lip-Motion,
Biometrics06(37).
IEEE DOI 0609
BibRef

Fu, Y.[Yun], Zhou, X.[Xi], Liu, M.[Ming], Hasegawa-Johnson, M.[Mark], Huang, T.S.[Thomas S.],
Lipreading by Locality Discriminant Graph,
ICIP07(III: 325-328).
IEEE DOI 0709
BibRef

Gómez, J.B.[Juan B.], Hernández, J.E.[Jorge E.], Prieto, F.[Flavio], Redarce, T.[Tanneguy],
Real-Time Robot Manipulation Using Mouth Gestures in Facial Video Sequences,
BVAI07(224-233).
Springer DOI 0710
BibRef

Yu, D.H.[Da-Hai], Ghita, O.[Ovidiu], Sutherland, A.[Alistair], Whelan, P.F.[Paul F.],
A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition,
PSIVT09(398-409).
Springer DOI 0901
BibRef
Earlier:
A New Manifold Representation for Visual Speech Recognition,
IMVIP07(210-210).
IEEE DOI 0709
BibRef
And: CAIP07(374-382).
Springer DOI 0708
BibRef

Kricke, R.[Ralph], Gernoth, T.[Thorsten], Grigat, R.R.[Rolf-Rainer],
Local binary patterns for lip motion analysis,
ICIP08(1472-1475).
IEEE DOI 0810
BibRef

Shafait, F., Kricke, R., Shdaifat, I., Grigat, R.R.,
Real Time Lip Motion Analysis for a Person Authentication System using Near Infrared Illumination,
ICIP06(1957-1960).
IEEE DOI 0610
BibRef

Kumatani, K.[Kenichi], Stiefelhagen, R.[Rainer],
Mouth Region Localization Method Based on Gaussian Mixture Model,
IWICPAS06(115-124).
Springer DOI 0608
BibRef

Ichino, M., Sakano, H., Komatsu, N.,
Multimodal Biometrics of Lip Movements and Voice using Kernel Fisher Discriminant Analysis,
ICARCV06(1-6).
IEEE DOI 0612
BibRef

Tsunekawa, T.[Takuya], Hotta, K.[Kazuhiro], Takahashi, H.[Haruhisa],
Lipreading Using Recurrent Neural Prediction Model,
ICIAR04(II: 405-412).
Springer DOI 0409
BibRef

Yin, P.[Pei], Essa, I.A., Rehg, J.M.,
Asymmetrically boosted HMM for speech reading,
CVPR04(II: 755-761).
IEEE DOI 0408
BibRef
Earlier:
Boosted audio-visual HMM for speech reading,
AMFG03(68-73).
IEEE Abstract. 0311
BibRef

Yao, H.X.[Hong-Xun], Gao, W.[Wen], Shan, W.[Wei], Xu, M.H.[Ming-Hui],
Visual Features Extracting and Selecting for Lipreading,
AVBPA03(251-259).
Springer DOI 0310
BibRef

Chindaro, S.[Samuel], Deravi, F.[Farzin],
Directional Properties of Colour Co-occurrence Features for Lip Location and Segmentation,
AVBPA01(84).
Springer DOI 0310
BibRef

Auckenthaler, R., Brand, J.D., Mason, J.S., Deravi, F., Chibelushi, C.C.,
Lip Signatures for Automatic Person Recognition,
AVBPA99(xx-yy). BibRef 9900

Brand, J.D., Mason, J.S., Colomb, S.[Sylvain],
Visual Speech: A Physiological or Behavioural Biometric?,
AVBPA01(157).
Springer DOI 0310
BibRef

Roach, M.J., Brand, J.D., Mason, J.S.D.,
Acoustic and Facial Features for Speaker Recognition,
ICPR00(Vol III: 258-261).
IEEE DOI
IEEE DOI 0009
BibRef

Lucey, S.[Simon],
An Evaluation of Visual Speech Features for the Tasks of Speech and Speaker Recognition,
AVBPA03(260-267).
Springer DOI 0310
BibRef

Kalberer, G.A.[Gregor A.], Müller, P.[Pascal], Van Gool, L.J.[Luc J.],
Biological Motion of Speech,
BMCV02(199 ff.).
Springer DOI 0303
People are sensitive to lip motion in speech. Analyze detailed motion of the face. BibRef

Delmas, P., Eveno, N., Lievin, M.,
Towards robust lip tracking,
ICPR02(II: 528-531).
IEEE DOI 0211
BibRef

Uda, K., Tagawa, N., Minagawa, A., Moriya, T.,
Effectiveness evaluation of word characteristics obtained from 3D image information for lipreading,
CIAP01(296-301).
WWW Link. 0210
BibRef

Murakami, H., Baba, H., Noma, T.,
MLSLib: A Lip Sync Library for Multi Agents and Languages,
WSCG02(295).
PDF File.
HTML Version. 0209
BibRef

Seguier, R., Cladel, N., Foucher, C., Mercier, D.,
Lipreading with Spiking Neurons: One Pass Learning,
WSCG02(397).
WWW Link.
HTML Version. 0209
BibRef

Mujal, M., Kirlin, R.L.,
Compression enhancement of video motion of mouth region using joint audio and video coding,
Southwest02(82-86).
IEEE Top Reference. 0208
BibRef

Arya, A., Hamidzadeh, B.,
Talking Face: Using Facial Feature Detection and Image Transformations for Visual Speech,
ICIP01(III: 943-946).
IEEE DOI 0108
BibRef

Potamianos, G., Neti, C.,
Improved ROI and Within Frame Discriminant Features for Lipreading,
ICIP01(III: 250-253).
IEEE DOI 0108
BibRef

Kshirsagar, S.[Sumedha], Magnenat-Thalmann, N.[Nadia],
Lip Synchronization Using Linear Predictive Analysis,
ICME00(TP8). 0007
BibRef

Caplier, A., Delmas, P., Lam, D.,
Robust Initialisation for Lips Edges Detection,
SCIA99(Image Analysis). BibRef 9900

Vanegas, O.[Oscar], Tokuda, K.[Keiichi], Kitamura, T.[Tadashi],
Location Normalization of HMM-Based Lip Reading: Experiments for the M2VTS Database,
ICIP99(II:343-347).
IEEE DOI BibRef 9900

Gao, L.[Lei], Mukaigawa, Y., Ohta, Y.,
Synthesis of Facial Images with Lip Motion from Several Real Views,
AFGR98(181-186).
IEEE DOI BibRef 9800

Kumar, V.P.[Vinay P.], Oren, M.[Mike], Osuna, E.[Edgar], Poggio, T.[Tomaso],
Real Time Analysis and Tracking of Mouths for Expression Recognition,
DARPA98(151-155). BibRef 9800

Kumar, V.P.[Vinay P.], Poggio, T.[Tomaso],
Recognizing Expressions by Direct Estimation of the Parameters of a Pixel Morphable Model,
BMCV02(519 ff.).
Springer DOI 0303
BibRef

Kumar, V.P.[Vinay P.], Poggio, T.[Tomaso],
Learning-Based Approach to Estimation of Morphable Model Parameters,
MIT AI Memo-1696, September, 2000. This paper describes a method for estimating the parameters of a linear morphable model (LMM) that models mouth images.
WWW Link. 0105
BibRef

Kumar, V.P.[Vinay P.],
Towards Man-Machine Interfaces: Combining Top-down Constraints with Bottom-up Learning in Facial Analysis,
MIT AI-TR-2002-008, September 2002.
WWW Link. BibRef 0209

Kumar, V.P.[Vinay P.], Poggio, T.[Tomaso],
Learning-Based Approach to Real Time Tracking and Analysis of Faces,
AFGR00(96-101).
IEEE DOI 0003
BibRef

Yu, K., Jiang, X., Bunke, H.,
Automatic Lipreading of Sentences Combining Hidden Markov Models and Grammars,
AVBPA99(xx-yy). BibRef 9900

Baig, A.R., Seguier, R., Vaucher, G.,
Image sequence analysis using a spatio-temporal coding for automatic lipreading,
CIAP99(544-549).
IEEE DOI 9909
BibRef

Sridharan, S.[Sridha], Wark, T.J.[Timothy J.], Chandran, V.,
An Approach to Statistical Lip Modelling for Speaker Identification via Chromatic Feature Extraction,
ICPR98(Vol I: 123-125).
IEEE DOI 9808
BibRef

Wark, T.J., Sridharan, S., Chandran, V.,
Robust Speaker Verification via Asynchronous Fusion of Speech and Lip Information,
AVBPA99(xx-yy). BibRef 9900

Potamianos, G., Graf, H.P., Cosatto, E.,
An image transform approach for HMM based automatic lipreading,
ICIP98(III: 173-177).
IEEE DOI 9810
BibRef

Jung, J.Y.[Jae Y.], Kim, M.H.[Moon H.],
Motion Estimation of Lips in Pronouncing Korean Vowels Based on Fuzzy Constraint Line Clustering,
ICIP96(III: 507-510).
IEEE DOI BibRef 9600

Bregler, C.[Christopher], and Omohundro, S.M.[Stephen M.],
Learning Visual Models for Lipreading,
MBR97(Chapter 13), Berkeley, and NEC. BibRef 9700

Bregler, C.[Christopher], Covell, M.[Michele], and Slaney, M.[Malcolm],
Video Rewrite: Driving Visual Speech with Audio,
SIGGraph-97(xx-yy).
WWW Link. BibRef 9700

Bregler, C.[Christoph], Omohundro, S.[Stephen],
Nonlinear Manifold Learning for Visual Speech Recognition,
ICCV95(494-499).
IEEE DOI BibRef 9500

Stork, D.G., Hennecke, M.E.,
Speechreading: an overview of image processing, feature extraction, sensory integration and pattern recognition techniques,
AFGR96(xvi-xxvi).
IEEE DOI 9610
BibRef

Chapter on Face Recognition, Detection, Tracking, Gesture Recognition, Fingerprints, Biometrics continues in
Combined Audio Visual Recognition and Analysis .


Last update:Sep 25, 2017 at 16:36:46