11.14.4.4 Talking Heads, Speech Driven Face Animation

Chapter Contents (Back)
Face Animation. Facial Animation. Speech Synthesis. See also Face Synthesis Using Three-Dimensional Models.

Bloomstein, R.W.[Richard W.],
Cinematic works with altered facial displays,
US_Patent4,827,532, May 2, 1989
WWW Version. alter lip motions BibRef 8905

Waters, K.[Keith], Levergood, T.M.[Thomas M.],
Method and apparatus for producing audio-visual synthetic speech,
US_Patent5,657,426, Aug 12, 1997
WWW Version. BibRef 9708

Gasper, E.[Elon], Matthews, III, J.H.[Joseph H.], Wesley, R.[Richard],
Advanced tools for speech synchronized animation,
US_Patent5,613,056, Mar 18, 1997
WWW Version. BibRef 9703
And: US_Patent5,630,017, May 13, 1997
WWW Version. BibRef
And: A1, A3, Only: US_Patent5,689,618, Nov 18, 1997
WWW Version. BibRef

Lyberg, B.[Bertil],
Device and method for dubbing an audio-visual presentation which generates synthesized speech and corresponding facial movements,
US_Patent5,826,234, Oct 20, 1998
WWW Version. BibRef 9810

Henton, C.G.[Caroline G.],
Method and apparatus for synthetic speech in facial animation,
US_Patent5,878,396, Mar 2, 1999
WWW Version. BibRef 9903

Goldenthal, W.D.[William D.], Van Thong, J.M.[Jean-Manuel], Waters, K.[Keith],
Automated speech alignment for image synthesis,
US_Patent5,884,267, Mar 16, 1999
WWW Version. BibRef 9903

Scott, K.C.[Kenneth C.], Yeates, M.C.[Matthew C.], Kagels, D.S.[David S.], Watson, S.H.[Stephen Hilary],
Method and apparatus for synthesizing realistic animations of a human speaking using a computer,
US_Patent6,097,381, Aug 1, 2000
WWW Version. BibRef 0008

Danieli, D.V.[Damon Vincent],
Method for generating mouth features of an animated or physical character,
US_Patent6,067,095, May 23, 2000
WWW Version. BibRef 0005

Grammalidis, N.[Nikos], Sarris, N.[Nikos], Deligianni, F.[Fani], Strintzis, M.G.[Michael G.],
Three-Dimensional Facial Adaptation for MPEG-4 Talking Heads,
JASP(2002), No. 10, October 2002, pp. 1005-1020.
HTML Version. 0211
BibRef

Cosatto, E., Ostermann, J., Graf, H.P., Schroeter, J.,
Lifelike talking faces for interactive services,
PIEEE(91), No. 9, September 2003, pp. 1406-1429.
IEEE DOI Link 0309
BibRef

Cosatto, E.[Eric], Graf, H.P.[Hans Peter], Potamianos, G.[Gerasimos], Schroeter, J.[Juergen],
Audio-visual selection process for the synthesis of photo-realistic talking-head animations,
US_Patent6,654,018, Nov 25, 2003
WWW Version. BibRef 0311

Cosatto, E.[Eric], Graf, H.P.[Hans Peter], Huang, F.J.[Fu Jie],
System and method for triphone-based unit selection for visual speech synthesis,
US_Patent7,209,882, Apr 24, 2007
WWW Version. BibRef 0704
And: US_Patent7,369,992, May 6, 2008
WWW Version. BibRef

Cosatto, E.[Eric], Potamianos, G.[Gerasimos], Graf, H.P.[Hans Peter],
Audio-Visual Unit Selection for the Synthesis of Photo-Realistic Talking-Heads,
ICME00(TA1). 0007
BibRef

Graf, H.P.[Hans Peter], Cosatto, E.[Eric], Ezzat, A.F.[Antoine F.],
Face Analysis for the Synthesis of Photo-Realistic Talking Heads,
AFGR00(189-194).
IEEE DOI Link 0003
BibRef

Graf, H.P.[Hans Peter],
Sample-based Synthesis of Talking Heads,
RATFG01(xx-yy). 0106
BibRef

Liu, K.[Kang], Ostermann, J.[Joern],
Minimized Database of Unit Selection in Visual Speech Synthesis without Loss of Naturalness,
CAIP09(1212-1219).
Springer DOI Link 0909
BibRef

Ostermann, J., Weissenfeld, A.,
Talking Faces: Technologies and Applications,
ICPR04(III: 826-833).
IEEE DOI Link 0409
BibRef

Rosenfeld, M.[Maury],
Method for automatically animating lip synchronization and facial expression of animated characters,
US_Patent6,307,576, Oct 23, 2001
WWW Version. BibRef 0110

Basu, S.[Sankar], Faruquie, T.A.[Tanveer Atzal], Neti, C.V.[Chalapathy V.], Rajput, N.[Nitendra], Senior, A.W.[Andrew William], Subramaniam, L.V.[L. Venkata], Verma, A.[Ashish],
Speech driven lip synthesis using viseme based hidden markov models,
US_Patent6,366,885, Apr 2, 2002
WWW Version. BibRef 0204

Dorvil, R.[Richemond],
Device and method for prosody generation at visual synthesis,
US_Patent6,389,396, May 14, 2002
WWW Version. BibRef 0205

Theobald, B.J., Kruse, S.M., Bangham, J.A., Cawley, G.C.,
Towards a low bandwidth talking face using appearance models,
IVC(21), No. 12-13, December 2003, pp. 1117-1124.
WWW Version. 0401
BibRef
Earlier: A1, A4, A2, A3: BMVC01(Session 6: Faces).
HTML Version. University of East Anglia 0110
BibRef

Theobald, B.J., Bangham, J.A., Matthews, I., Glauert, J.R.W., Cawley, G.C.,
2.5D Visual Speech Synthesis Using Appearance Models,
BMVC03(xx-yy).
HTML Version. 0409
BibRef

Devin, V.E.[Vincent E.], Hogg, D.C.[David C.],
Reactive Memories: An Interactive Talking-Head,
IVC(21), No. 12-13, December 2003, pp. 1125-1133.
WWW Version. 0401
BibRef
Earlier: BMVC01(Session 6: Faces).
HTML Version. University of Leeds 0110
BibRef

Sutton, S.[Stephen], Vermeulen, P.[Pieter],
Methods and devices for producing and using synthetic visual speech based on natural coarticulation,
US_Patent6,539,354, Mar 25, 2003
WWW Version. BibRef 0303

Cosker, D.P., Marshall, A.D., Rosin, P.L., Hicks, Y.A.,
Speech-driven facial animation using a hierarchical model,
VISP(151), No. 4, August 2004, pp. 314-321.
IEEE Abstract. 0411
BibRef
Earlier:
Speech driven facial animation using a hidden markov coarticulation model,
ICPR04(I: 128-131).
IEEE DOI Link 0409
BibRef

Cosker, D.P.[Darren P.], Borkett, R., Marshall, A.D.[A. David], Rosin, P.L.[Paul L.],
Towards automatic performance-driven animation between multiple types of facial model,
IET-CV(2), No. 3, September 2008, pp. 129-141.
WWW Version. 0905
BibRef

Cosker, D.P.[Darren P.], Roy, S.[Steven], Rosin, P.L.[Paul L.], Marshall, A.D.[A. David],
Re-mapping Animation Parameters Between Multiple Types of Facial Model,
MIRAGE07(365-376).
Springer DOI Link 0703
BibRef

Ma, J.Y.[Ji-Yong], Cole, R.[Ronald],
Animating visible speech and facial expressions,
VC(20), No. 2-3, May 2004, pp. 86-105.
Springer DOI Link 0405
BibRef

Muller, P., Kalberer, G.A., Proesmans, M., Van Gool, L.J.,
Realistic speech animation based on observed 3D face dynamics,
VISP(152), No. 4, August 2005, pp. 491-500.
WWW Version. 0512
BibRef

Hilton, A., Kalkavouras, M., Collins, G.,
3D studio production of animated actor models,
VISP(152), No. 4, August 2005, pp. 481-490.
WWW Version. 0512
BibRef

Hsieh, C.K.[Chao-Kuei], Chen, Y.C.[Yung-Chang],
Partial linear regression for speech-driven talking head application,
SP:IC(21), No. 1, January 2006, pp. 1-12.
WWW Version. 0512
BibRef

Haisma, N.[Nicoline], Sinke, J.G.[Johannes Govert], Bergevoet, B.A.J.[Bas Arnold Jan], van Gestel, H.A.W.[Henricus Antonius Wilhelmus],
Post-synchronizing an information stream including lip objects replacement,
US_Patent7,145,606, Dec 5, 2006
WWW Version. BibRef 0612

Xie, L.[Lei], Liu, Z.Q.A.[Zhi-Qi-Ang],
A coupled HMM approach to video-realistic speech animation,
PR(40), No. 8, August 2007, pp. 2325-2340.
WWW Version. 0704
BibRef
Earlier:
Speech Animation Using Coupled Hidden Markov Models,
ICPR06(I: 1128-1131).
IEEE DOI Link 0609
Speech animation; Audio-to-visual conversion; Talking faces; Facial animation; Coupled hidden Markov models (CHMMs) BibRef

Sargin, M.E.[Mehmet E.], Yemez, Y.[Yucel], Erzin, E.[Engin], Tekalp, A.M.[Ahmet M.],
Analysis of Head Gesture and Prosody Patterns for Prosody-Driven Head-Gesture Animation,
PAMI(30), No. 8, August 2008, pp. 1330-1345.
IEEE DOI Link 0806
BibRef

Ofli, F.[Ferda], Erzin, E.[Engin], Yemez, Y.[Yucel], Tekalp, A.M.[A. Murat],
Estimation and Analysis of Facial Animation Parameter Patterns,
ICIP07(IV: 293-296).
IEEE DOI Link 0709
BibRef

Erdem, A.T.[A. Tanju],
Method for animating a 3-D model of a face,
US_Patent6,731,287, May 4, 2004
WWW Version. BibRef 0405

Bozkurt, E.[Elif], Erdem, C.E.[Cigdem Eroglu], Erzin, E.[Engin], Erdem, T.[Tanju], Ozkan, M.K.[Mehmet K.], Tekalp, A.M.[A. Murat],
Speech-Driven Automatic Facial Expression Synthesis,
3DTV08(273-276).
IEEE DOI Link 0805
BibRef

Bozkurt, E.[Elif], Erdem, C.E.[Cigdem Eroglu], Erzin, E.[Engin], Erdem, T.[Tanju], Ozkan, M.K.[Mehmet K.],
Comparison of Phoneme and Viseme Based Acoustic Units for Speech Driven Realistic lip Animation,
3DTV07(1-4).
IEEE DOI Link 0705
BibRef

Hong, P.Y.[Peng-Yu], Wen, Z.[Zhen], Huang, T.S.[Thomas S.],
iface: A 3d Synthetic Talking Face,
IJIG(1), No. 1, January 2001, pp. 19-26. 0101
BibRef

Cheiky, M.[Michael], Gately, P.[Peter],
Photo realistic talking head creation system and method,
US_Patent6,919,892, Jul 19, 2005
WWW Version. BibRef 0507
And:
Do-it-yourself photo realistic talking head creation system and method,
US_Patent7,027,054, Apr 11, 2006
WWW Version. BibRef

Huang, Y.[Ying], Lin, S.S.T.[Stephen Ssu-Te], Guo, B.N.[Bai-Ning], Shum, H.Y.[Heung-Yeung],
System and method for real time lip synchronization,
US_Patent7,133,535, Nov 7, 2006
WWW Version. BibRef 0611

McAlpine, P.[Paul], Hernandez, T.[Todd], Bateman, J.[John], Zimmermann, R.[Remy], Depallens, P.[Philippe],
Facial feature-localized and global real-time video morphing,
US_Patent7,209,577, Apr 24, 2007
WWW Version. BibRef 0704

Massaro, D.W.[Dominic W.], Cohen, M.M.[Michael M.], Beskow, J.[Jonas],
Visual display methods for in computer-animated speech production models,
US_Patent7,225,129, May 29, 2007
WWW Version. BibRef 0705

Clarke, S.[Simon], Hovhannisyan, A.[Armen], Cutler, R.[Ross],
System and process for adding high frame-rate current speaker data to a low frame-rate video using delta frames,
US_Patent7,355,622, Apr 8, 2008
WWW Version. BibRef 0804

Cutler, R.[Ross],
System and process for adding high frame-rate current speaker data to a low frame-rate video using audio watermarking techniques,
US_Patent7,355,623, Apr 8, 2008
WWW Version. BibRef 0804
And: US_Patent7,362,350, Apr 22, 2008
WWW Version. BibRef

Yeung, M.[Minerva], Du, P.[Ping], Huang, C.[Chao],
Method and apparatus for animation of a human speaker,
US_Patent7,388,586, Jun 17, 2008
WWW Version. BibRef 0806

Buenaposada, J.M.[José Miguel], Muñoz, E.[Enrique], Baumela, L.[Luis],
Efficient illumination independent appearance-based face tracking,
IVC(27), No. 5, 2 April 2009, pp. 560-578.
Elsevier DOI Link
WWW Version. 0904
BibRef
Earlier:
Efficiently estimating facial expression and illumination in appearance-based tracking,
BMVC06(I:57).
PDF Version. 0609
BibRef
Earlier:
Performance Driven Facial Animation by Appearance Based Tracking,
IbPRIA05(I:476).
Springer DOI Link 0509
Linear models of appearance; Illumination invariance; Efficient linear subspace model fitting; Facial expression analysis BibRef

Buenaposada, J.M.[José Miguel], Muñoz, E.[Enrique],
Performance driven facial animation using illumination independent appearance-based tracking,
ICPR06(I: 303-306).
IEEE DOI Link 0609
BibRef

Munoz, E.[Enrique], Buenaposada, J.M.[Jose M.], Baumela, L.[Luis],
A direct approach for efficiently tracking with 3D morphable models,
ICCV09(1615-1622).
IEEE DOI Link 0909
BibRef

Aina, O.O.[Olusola O.],
Generating anatomical substructures for physically-based facial animation. Part 1: A methodology for skull fitting,
VC(25), No. 5-7, May 2009, pp. xx-yy.
Springer DOI Link 0905
BibRef

Schreer, O., Englert, R., Eisert, P., Tanger, R.,
Real-Time Vision and Speech Driven Avatars for Multimedia Applications,
MultMed(10), No. 3, April 2008, pp. 352-360.
IEEE DOI Link 0905
BibRef

Chen, H.[Hui], Wang, L.[Lan], Liu, W.[Wenxi], Heng, P.A.[Pheng-Ann],
Combined X-ray and facial videos for phoneme-level articulator dynamics,
VC(26), No. 6-8, June 2010, pp. 477-486.
WWW Version. 1101
BibRef

Kim, B.U.[Byung-Uck], Feng, W.W.[Wei-Wei], Yu, Y.Z.[Yi-Zhou],
Real-time data driven deformation with affine bones,
VC(26), No. 6-8, June 2010, pp. 487-495.
WWW Version. 1101
BibRef
And: Erratum: VC(26), No. 9, September 2010, pp. 1241.
WWW Version. 1101
BibRef

Xiong, B., Fan, X., Zhu, C., Jing, X., Peng, Q.,
Face Region Based Conversational Video Coding,
CirSysVideo(21), No. 7, July 2011, pp. 917-931.
IEEE DOI Link 1107
BibRef


Zhou, Z.H.[Zi-Heng], Zhao, G.Y.[Guo-Ying], Pietikäinen, M.[Matti],
Synthesizing a talking mouth,
ICCVGIP10(211-218).
WWW Version. 1111
BibRef

Tang, Y.Q.[Yong-Qing], Fang, Y.[Yong], Huang, Q.H.[Qing-Hua],
Audio personalization using head related transfer function in 3DTV,
3DTV11(1-4).
IEEE DOI Link 1105
BibRef

Liu, K.[Kang], Ostermann, J.[Joern],
Realistic head motion synthesis for an image-based talking head,
FG11(125-130).
IEEE DOI Link 1103
BibRef
And: FG11(221-226).
IEEE DOI Link 1103
BibRef

Chaloupka, J.[Josef], Chaloupka, Z.[Zdenek],
Czech Artificial Computerized Talking Head George,
COST08(324-330).
Springer DOI Link 0810
BibRef

Deena, S.[Salil], Galata, A.[Aphrodite],
Speech-Driven Facial Animation Using a Shared Gaussian Process Latent Variable Model,
ISVC09(I: 89-100).
Springer DOI Link 0911
BibRef

Zhao, H.[Hui], Chen, Y.B.[Yue-Bing], Shen, Y.M.[Ya-Min], Tang, C.J.[Chao-Jing],
Audio-Visual Speech Synthesis Based on Chinese Visual Triphone,
CISP09(1-5).
IEEE DOI Link 0910
BibRef

Hu, Y.L.[Yong-Li], Zhou, M.Q.[Ming-Quan], Wu, Z.K.[Zhong-Ke],
An Automatic Dense Point Registration Method for 3D Face Animation,
CISP09(1-6).
IEEE DOI Link 0910
BibRef

Berger, M.O.[Marie-Odile], Ponroy, J.[Jonathan], Wrobel-Dautcourt, B.[Brigitte],
Realistic Face Animation for Audiovisual Speech Applications: A Densification Approach Driven by Sparse Stereo Meshes,
MIRAGE09(297-307).
Springer DOI Link 0905
BibRef

Verdet, F.[Florian], Hennebert, J.[Jean],
Impostures of Talking Face Systems Using Automatic Face Animation,
BTAS08(1-4).
IEEE DOI Link 0809
BibRef

Gaur, U.[Utkarsh], Jain, A.[Amrita], Goel, S.[Sanjay],
Towards Real-Time Monocular Video-Based Avatar Animation,
ISVC08(II: 949-958).
Springer DOI Link 0812
BibRef

Badin, P.[Pierre], Elisei, F.[Frédéric], Bailly, G.[Gérard], Tarabalka, Y.[Yuliya],
An Audiovisual Talking Head for Augmented Speech Generation: Models and Animations Based on a Real Speaker's Articulatory Data,
AMDO08(xx-yy).
Springer DOI Link 0807
BibRef

Fanelli, G.[Gabriele], Fratarcangeli, M.[Marco],
A Non-Invasive Approach for Driving Virtual Talking Heads from Real Facial Movements,
3DTV07(1-4).
IEEE DOI Link 0705
BibRef

Xiong, L.[Lei], Zheng, N.N.[Nan-Ning], You, Q.[Qubo], Liu, J.Y.[Jian-Yi],
Facial Expression Sequence Synthesis Based on Shape and Texture Fusion Model,
ICIP07(IV: 473-476).
IEEE DOI Link 0709
BibRef

Beaumesnil, B.[Brice], Luthon, F.[Franck],
Real Time Tracking for 3D Realistic Lip Animation,
ICPR06(I: 219-222).
IEEE DOI Link 0609
BibRef

Ravindra de Silva, P., Madurapperuma, A.P., Marasinghe, A., Osano, M.,
Integrating Animated Pedagogical Agent as Motivational Supporter into Interactive System,
CRV06(34-34).
IEEE DOI Link 0607
BibRef

Pei, Y.R.[Yu-Ru], Zha, H.B.[Hong-Bin],
Vision Based Speech Animation Transferring with Underlying Anatomical Structure,
ACCV06(I:591-600).
Springer DOI Link 0601
BibRef

Liu, Y.H.[Yang-Hua], Xu, G.Y.[Guang-You], Tao, L.M.[Lin-Mi],
An Efficient Approach for Multi-view Face Animation Based on Quasi 3D Model,
ACCV06(II:913-922).
Springer DOI Link 0601
BibRef

Leszczynski, M.[Mariusz], Skarbek, W.[Wladyslaw],
Viseme Classification for Talking Head Application,
CAIP05(773).
Springer DOI Link 0509
BibRef
Earlier:
Viseme recognition: A comparative study,
AVSBS05(287-292).
IEEE DOI Link 0602
BibRef

Leszczynski, M.[Mariusz], Skarbek, W.[Wladyslaw], Badura, S.[Stanislaw],
Fast Viseme Recognition for Talking Head Application,
ICIAR05(516-523).
Springer DOI Link 0509
BibRef

Gracia-Roche, J.J.[Juan José], Orrite, C.[Carlos], Bernués, E.[Emiliano], Herrero, J.E.[José Elías],
Color Distribution Tracking for Facial Analysis,
IbPRIA05(I:484).
Springer DOI Link 0509
BibRef

Ypsilos, I.A., Hilton, A., Turkmani, A., Jackson, P.J.B.,
Speech-driven face synthesis from 3D video,
3DPVT04(58-65).
IEEE Abstract. 0412
BibRef

Saisan, P.[Payam], Bissacco, A.[Alessandro], Chiuso, A.[Alessandro], Soatto, S.[Stefano],
Modeling and Synthesis of Facial Motion Driven by Speech,
ECCV04(Vol III: 456-467).
WWW Version. 0405
BibRef

Malcangi, M.[Mario], de Tintis, R.[Raffaele],
Audio Based Real-Time Speech Animation of Embodied Conversational Agents,
GW03(350-360).
WWW Version. 0405
BibRef

Aleksic, P.S., Katsaggelos, A.K.,
Speech-to-video synthesis using facial animation parameters,
ICIP03(III: 1-4).
IEEE Abstract. 0312
BibRef

Hack, C.A., Taylor, C.J.,
Modelling 'Talking Head' Behaviour,
BMVC03(xx-yy).
HTML Version. 0409
BibRef

Choi, K.H.[Kyoung-Ho], Hwang, J.N.[Jenq-Neng],
Creating 3D speech-driven talking heads: a probabilistic network approach,
ICIP02(I: 984-987).
IEEE Abstract. 0210
BibRef

Hong, P.Y.[Peng-Yu], Wen, Z.[Zhen], Huang, T.S., Shum, H.Y.[Heung-Yeung],
Real-time speech-driven 3D face animation,
3DPVT02(713-716).
IEEE DOI Link 0206
BibRef

Morishima, S., Yotsukura, T.,
Hypermask: Talking Head Projected Onto Moving Surface,
ICIP01(III: 947-950).
IEEE Abstract. 0108
BibRef

Neumann, J.[Jan], Aloimonos, Y.[Yiannis],
Talking Heads: Introducing the tool of 3D motion fields in the study of action,
HUMO00(25-32).
IEEE Top Reference. 0010
BibRef

Melek, Z.[Zeki], Akarun, L.[Lale],
Automated Lip Synchronized Speech Driven Facial Animation,
ICME00(TA1). 0007
BibRef

Chen, T.H.[Tsu-Han], Wang, Y.[Yao], Graf, H.P., Swain, C.T.,
A new frame interpolation scheme for talking head sequences,
ICIP95(II: 591-594).
IEEE DOI Link 9510
BibRef

Shan, S.,
Individual 3d Face Synthesis Based on Orthogonal Photos and Speech-driven Facial Animation,
ICIP00(Vol III: 238-241).
IEEE Abstract. 0008
BibRef

Noh, J.Y.[Jun-Yong], Neumann, U.[Ulrich],
Talking Face,
ICME00(TA1). 0007
BibRef

Kakihara, K.[Kiyotsugu], Nakamura, S.[Satoshi], Shikano, K.[Kiyohiro],
Speech-To-Face Movement Synthesis Based on HMMS,
ICME00(MP7). 0007
BibRef

Van Gool, L.J.[Luc J.], Tuytelaars, T.[Tinne], Pollefeys, M.[Marc],
Adventurous Tourism for Couch Potatoes,
CAIP99(98-107).
WWW Version. 9909
Talking mask, display of scene, etc. BibRef

Ishikawa, T., Sera, H., Morishima, S., Terzopoulos, D.,
Facial Image Reconstruction by Estimated Muscle Parameter,
AFGR98(342-347).
IEEE DOI Link BibRef 9800

Bothe, H.H.[Hans H.],
A visual speech model based on fuzzy-neuro methods,
CIAP95(152-158).
Springer DOI Link 9509
BibRef

Chen, T.H.[Tsu-Han], Graf, H.P., Haskell, B.G., Petajan, E., Wang, Y.[Yao], Chen, H., Chou, W.[Wu],
Speech-assisted lip synchronization in audio-visual communications,
ICIP95(II: 579-582).
IEEE DOI Link 9510
BibRef

Chapter on 3-D Object Description and Computation Techniques, Surfaces, Deformable, View Generation, Video Conferencing continues in
Face Animation, Video Face Synthesis .


Last update:Feb 8, 2012 at 11:25:05