Emphatic visual speech synthesis

Melenchón, Javier; Martínez Marroquín, Elisa; Torre Frade, Fernando de la; Montero Morales, José Antonio

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10609/109843

Título :	Emphatic visual speech synthesis
Autoría:	Melenchón, Javier Martínez Marroquín, Elisa Torre Frade, Fernando de la Montero Morales, José Antonio
Otros:	Universitat Oberta de Catalunya. eLearning Innovation Center Universitat Ramon Llull Carnegie Mellon University
Citación :	Melenchón Maldonado, J., Martínez Marroquín, E., De la Torre Frade, F. & Montero, J. (2009). Emphatic Visual Speech Synthesis. IEEE Transactions on Audio, Speech and Language Processing, 17(3), 459-468. doi: 10.1109/TASL.2008.2010213
Resumen :	The synthesis of talking heads has been a flourishing research area over the last few years. Since human beings have an uncanny ability to read people's faces, most related applications (e.g., advertising, video-teleconferencing) require absolutely realistic photometric and behavioral synthesis of faces. This paper proposes a person-specific facial synthesis framework that allows high realism and includes a novel way to control visual emphasis (e.g., level of exaggeration of visible articulatory movements of the vocal tract). There are three main contributions: a geodesic interpolation with visual unit selection, a parameterization of visual emphasis, and the design of minimum size corpora. Perceptual tests with human subjects reveal high realism properties, achieving similar perceptual scores as real samples. Furthermore, the visual emphasis level and two communication styles show a statistical interaction relationship.
Palabras clave :	síntesis audiovisual de la voz discurso visual enfático tertuliano
DOI:	10.1109/TASL.2008.2010213
Tipo de documento:	info:eu-repo/semantics/article
Fecha de publicación :	ene-2009
Aparece en las colecciones:	Articles Articles cientÍfics

Ficheros en este ítem:

No hay ficheros asociados a este ítem.

Mostrar el registro completo del ítem

Comparte:

Impacto:

Google Scholar

Microsoft Academic

Exporta:

Consulta las estadísticas