“第二十七章 语音合成”版本间的差异
来自cslt Wiki
(→高级读者) |
|||
第38行: | 第38行: | ||
* 汤志远,李蓝天,王东,石颖,蔡云麒,郑方,《语音识别基本法》,清华大学出牌社,2021. [https://item.jd.com/13143784.html] | * 汤志远,李蓝天,王东,石颖,蔡云麒,郑方,《语音识别基本法》,清华大学出牌社,2021. [https://item.jd.com/13143784.html] | ||
+ | * Dudley H. The vocoder—Electrical re-creation of speech[J]. Journal of the Society of Motion Picture Engineers, 1940, 34(3): 272-278. [https://ieeexplore.ieee.org/abstract/document/7250932] | ||
+ | * Dudley H. Remaking speech[J]. The Journal of the Acoustical Society of America, 1939, 11(2): 169-177.[https://asa.scitation.org/doi/pdf/10.1121/1.1916020] | ||
* Ning Y, He S, Wu Z, et al. A review of deep learning based speech synthesis[J]. Applied Sciences, 2019, 9(19): 4050. [https://www.mdpi.com/2076-3417/9/19/4050/pdf] | * Ning Y, He S, Wu Z, et al. A review of deep learning based speech synthesis[J]. Applied Sciences, 2019, 9(19): 4050. [https://www.mdpi.com/2076-3417/9/19/4050/pdf] | ||
* Zen H, Tokuda K, Black A W. Statistical parametric speech synthesis[J]. speech communication, 2009, 51(11): 1039-1064. [https://nitech.repo.nii.ac.jp/index.php?action=pages_view_main&active_action=repository_action_common_download&item_id=5432&item_no=1&attribute_id=39&file_no=1&page_id=13&block_id=21] | * Zen H, Tokuda K, Black A W. Statistical parametric speech synthesis[J]. speech communication, 2009, 51(11): 1039-1064. [https://nitech.repo.nii.ac.jp/index.php?action=pages_view_main&active_action=repository_action_common_download&item_id=5432&item_no=1&attribute_id=39&file_no=1&page_id=13&block_id=21] |
2022年8月27日 (六) 08:09的版本
教学资料
扩展阅读
视频展示
- 源-滤波器模型 [7]
- Vocoder 1939 (long) [8]
- Vocoder 1939 (short) [9]
- Vocal folder [10]
- Vocal tract [11]
- Auditory perception [12]
演示链接
- Tacotron2 [13]
- CycleFlow 语音转换 [14]
- Online demo for TTS and Voice conversion [15]
- Online TTS demo [16]
- IBM TTS demo [17]
开发者资源
高级读者
- 汤志远,李蓝天,王东,石颖,蔡云麒,郑方,《语音识别基本法》,清华大学出牌社,2021. [21]
- Dudley H. The vocoder—Electrical re-creation of speech[J]. Journal of the Society of Motion Picture Engineers, 1940, 34(3): 272-278. [22]
- Dudley H. Remaking speech[J]. The Journal of the Acoustical Society of America, 1939, 11(2): 169-177.[23]
- Ning Y, He S, Wu Z, et al. A review of deep learning based speech synthesis[J]. Applied Sciences, 2019, 9(19): 4050. [24]
- Zen H, Tokuda K, Black A W. Statistical parametric speech synthesis[J]. speech communication, 2009, 51(11): 1039-1064. [25]