TTS-project-synthesis

来自cslt Wiki
2017年12月1日 (五) 02:59Zhangzy讨论 | 贡献的版本

跳转至: 导航搜索

Project name

Text To Speech

Project members

Dong Wang, Zhiyong Zhang

Introduction

xxx

Sample waves

Synthesis text:好雨知时节,当春乃发声,随风潜入夜,润物细无声

Mono-speaker TTS

Multi-speaker mix-training without speaker-vector

  • Female & Male[4]
  • Female & Child[5]
  • Male & Child[6]


Multi-speaker mix-training with speaker-vector

When synthesis, we just replace the speaker-vector for specific person.

  • Specific person===
  • Interpolate the speaker-vector of different person
  • Female & Male with different ratio

(1) 0.0:1.0[9] (2) 0.1:0.9[10] (3) 0.2:0.8[11] (4) 0.3:0.7[12] (5) 0.4:0.6[13] (6) 0.5:0.5[14] (7) 0.6:0.4[15] (8) 0.7:0.3[16] (9) 0.8:0.2[17] (10) 0.9:0.1[18] (11) 1.0:0.0[19]