A solution to the reduction of concatenation artefacts in speech synthesis

Esther Klabbers, Raymond Veldhuis, Kim Koppen

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

2 Downloads (Pure)

Abstract

One problem with speech synthesis impeding high quality is the occurrence of audible discontinuities at segment boundaries. Formant jumps across concatenation points suggest the problem to be due to spectral differences. The problem is most apparent in vowels and semi-vowels. We propose to reduce the number of audible discontinuities by adding context-sensitive diphones to the database. The number of additional diphones is limited by clustering contexts with similar spectral effects on the neighbouring vowels, using the Kullback-Leibler distance. A listening experiment has shown that the percentage of perceived discontinuities has significantly decreased.

Original languageEnglish
Title of host publication6th International Conference on Spoken Language Processing, ICSLP 2000
Place of PublicationBeijing
PublisherChina Military Friendship Pub.
Pages474-477
Volume3
ISBN (Print)7801501144
Publication statusPublished - 1 Jan 2000
Externally publishedYes
Event6th International Conference on Spoken Language Processing, ICSLP 2000 - Beijing, China
Duration: 16 Oct 200020 Oct 2000
Conference number: 6

Conference

Conference6th International Conference on Spoken Language Processing, ICSLP 2000
Abbreviated titleICSLP
CountryChina
CityBeijing
Period16/10/0020/10/00

Fingerprint Dive into the research topics of 'A solution to the reduction of concatenation artefacts in speech synthesis'. Together they form a unique fingerprint.

Cite this