Both auditory and audiovisual speech synthesis have been the subject of many research projects throughout the years. Unfortunately, in recent years only very few research focuses on synthesis for the Dutch language. Especially for audiovisual synthesis, hardly any available system or resource can be found. In this paper we describe the creation of a new extensive Dutch speech database, containing audiovisual recordings of a single speaker. The database is constructed as such that it can be employed in both auditory and audiovisual speech synthesis systems. Subsequently, we describe how we achieve high-quality auditory speech synthesis by applying the database in our text-to-speech framework. In addition, it is explained how we used the new database to attain photorealistic audiovisual text-to-speech synthesis for Dutch. The new database and its applications for synthesis are a significant addition to the resources for Dutch speech synthesis research.
|International Conference on Auditory-Visual Speech Processing 2011, Volterra, Italy
|Published - 31 aug 2011
Duur: 31 aug 2011 → …
|31/08/11 → …