Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/92577
Title: Voice quality modelling for expressive speech synthesis
Author: Monzo, Carlos  
Iriondo, Ignasi
Socoró, Joan Claudi  
Universitat Ramon Llull
Universitat Oberta de Catalunya (UOC)
Citation: Monzo, C., Iriondo, I. & Socoró, J. C. (2014). Voice quality modelling for expressive speech synthesis. The Scientific World Journal, 2014(). doi: 10.1155/2014/627189
Abstract: This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.
Keywords: voice quality
synthetic speech
DOI: 10.1155/2014/627189
Type: info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Issue Date: 22-Jan-2014
Publication license: http://creativecommons.org/licenses/by/3.0/es/  
Appears in Collections:Articles cientÍfics
Articles

Files in This Item:
File Description SizeFormat 
voice.pdf1,46 MBAdobe PDFThumbnail
View/Open