您的位置: 专家智库 > >

国家自然科学基金(s60805008)

作品数:1 被引量:1H指数:1
发文基金:国家自然科学基金更多>>
相关领域:电子电信化学工程更多>>

文献类型

  • 1篇中文期刊文章

领域

  • 1篇化学工程
  • 1篇电子电信

主题

  • 1篇CHINES...
  • 1篇MODEL
  • 1篇MODELI...
  • 1篇SENTEN...
  • 1篇PROSOD...
  • 1篇MANDAR...

传媒

  • 1篇Tsingh...

年份

  • 1篇2012
1 条 记 录,以下是 1-1
排序方式:
Modeling Pitch Contour of Chinese Mandarin Sentences with the PENTA Model被引量:1
2012年
In continuous speech, the pitch contour of the same syllable may vary much due to its contextual information. The Parallel Encoding and Target Approximation (PENTA) model is applied here to Mandarin speech synthesis with a method to predict pitch contours for Chinese syllables with different contexts by combining the Classification And Regression Tree (CART) with the PENTA model to improve its prediction accuracy. CART was first used to cluster the syllables' normalized pitch contours according to the syllables contextual information and the distances between pitch contours. The average pitch contour was used to train the PENTA model with the average contour for each cluster. The initial pitch is required with the PENTA model to predict a continuous pitch contour. A Pitch Discontinuity Model (PDM) was used to predict the initial pitches at positions with voiceless consonants and prosodic boundaries. Initial tests on a Chinese four-syllable word corpus containing 2048 words were extended to tests with a continuous speech corpus containing 5445 sentences. The results are satisfactory in terms of the Root Mean Square Error (RMSE) comparing the predicted pitch contour with the original contour. This method can model pitch contours for Mandarin sentences with any text for speech synthesis.
Hui PangZhiyong WuLianhong Cai
共1页<1>
聚类工具0