Τετάρτη 29 Ιουνίου 2016

Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis

S08852308.gif

Publication date: Available online 28 June 2016
Source:Computer Speech & Language
Author(s): Peter Birkholz, Lucia Martin, Yi Xu, Stefan Scherbaum, Christiane Neuschaefer-Rube
Vocal emotions, as well as different speaking styles and speaker traits are characterized by a complex interplay of multiple prosodic features. Natural sounding speech synthesis with the ability to control such paralinguistic aspects requires the manipulation of the corresponding prosodic features. With traditional concatenative speech synthesis it is easy to manipulate the "primary" prosodic features pitch, duration, and intensity, but it is very hard to individually control "secondary" prosodic features like phonation type, vocal tract length, articulatory precision and nasality. These secondary features can be controlled more directly with parametric synthesis methods. In the present study we analyze the ability of articulatory speech synthesis to control secondary prosodic features by rule. To this end, nine German words were re-synthesized with the software VocalTractLab 2.1 and then manipulated in different ways at the articulatory level to vary vocal tract length, articulatory precision and degree of nasality. Listening tests showed that most of the intended prosodic manipulations could be reliably identified with recognition rates between 77-96%. Only the manipulations to increase articulatory precision were hardly recognized. The results suggest that rule-based manipulations in articulatory synthesis are generally sufficient for the convincing synthesis of secondary prosodic features at the word level.



from Speech via a.lsfakia on Inoreader http://ift.tt/29oaseU
via IFTTT

Δεν υπάρχουν σχόλια:

Δημοσίευση σχολίου