|
|
(11 intermediate revisions by 2 users not shown) |
Line 1: |
Line 1: |
| | | |
| + | ==Introduction== |
| | | |
| + | '''《语音识别基本法》''' is an introductory book on speech processing. Most of the book covers speech recognition, |
| + | though it also touches on related topics, including speaker recognition, language recognition and speech synthesis. |
| | | |
− | '''《语音识别基本法》'''
| |
| | | |
| + | The [https://mp.weixin.qq.com/s/xxQXYqdbcsKIujxydayNxw paper version] has already been published. |
| | | |
− | *[[媒体文件:20200410 speech book.pdf | Version 20200410]]
| |
| | | |
− | *[[媒体文件:20190126 speech book.pdf | Version 20190126]]
| + | ==Current version== |
| | | |
− | *[[媒体文件:20190125 speech book.pdf | Version 20190125]] | + | *[[媒体文件:speech book.pdf | Version 20201012]] |
| | | |
− | *[[媒体文件:20190113a speech book.pdf | Version 20190113a]]
| + | ==TeX source== |
| | | |
− | *[[媒体文件:20190113 speech book.pdf | Version 20190113]] | + | *[https://gitlab.com/tzyll/speech_book GitLab repository] |
− | | + | |
− | *[[媒体文件:20181226 speech book.pdf | Version 20181226]]
| + | |
− | | + | |
− | *[[媒体文件:20181225 speech book.pdf | Version 20181225]]
| + | |
− | | + | |
− | *[[媒体文件:20181224 speech book.pdf | Version 20181224]]
| + | |
− | | + | |
− | *Tex on [https://gitlab.com/tzyll/speech_book GitLab]
| + | |
− | | + | |
− | | + | |
− | Note: the following content is out of date.
| + | |
− | | + | |
− | =Speech Recognition Basics (阿汤)=
| + | |
− | ==What Is Speech==
| + | |
− | ==Speech Recognition Methods==
| + | |
− | ==Speech Recognition Tools==
| + | |
− | | + | |
− | =Basic Speech Recognition Pipeline (阿汤)=
| + | |
− | ==Experiments First==
| + | |
− | ==Front-End Processing==
| + | |
− | ==Training and Decoding==
| + | |
− | | + | |
− | =Practical Issues in Speech Recognition=
| + | |
− | ==Speaker Adaptation (启明)==
| + | |
− | [[媒体文件:Spk.pdf|pdf]]
| + | |
− | | + | |
− | [http://wangd.cslt.org/book/kaldi/spk-adapt.rar LaTeX]
| + | |
− | | + | |
− | ==Noise Compensation and Environmental Robustness (启明)==
| + | |
− | | + | |
− | [[媒体文件:Noise.pdf|pdf]]
| + | |
− | | + | |
− | [http://wangd.cslt.org/book/kaldi/noise-robust.rar LaTeX]
| + | |
− | | + | |
− | | + | |
− | ==New Word Handling and Domain Generalization (to be updated)==
| + | |
− | | + | |
− | | + | |
− | <!--
| + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/f/fd/Domain.pdf pdf]
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3d/Domain_adaptation.rar LaTeX]
| + | |
− | -->
| + | |
− | | + | |
− | ==Minority Language Recognition (石颖)==
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/f/f4/Minority_20190109.pdf pdf]
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/0/02/Minority_asr.rar LaTeX]
| + | |
− | | + | |
− | ==Keyword Wake-Up and Embedded Systems (嘉瑶)==
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c5/Kws.pdf pdf]
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/70/Kws.rar LaTeX]
| + | |
− | | + | |
− | =Frontier Topics=
| + | |
− | ==Speaker Recognition (蓝天)==
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/2f/Spk.pdf pdf]
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/d/d5/SRE.rar LaTeX]
| + | |
− | | + | |
− | ==Language Recognition (诏娣)==
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/%E6%96%87%E4%BB%B6:Lid.pdf pdf]
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9e/LID-speekbook.rar LaTeX]
| + | |
− | | + | |
− | ==Emotion Recognition (嘉威)==
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/28/语音情绪识别.pdf pdf]
| + | |
− | | + | |
− | [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/78/Ser.rar LaTeX]
| + | |
− | | + | |
− | ==Speech Synthesis (启明)==
| + | |
− | | + | |
− | [[媒体文件:Tts.pdf|pdf]]
| + | |
− | | + | |
− | [http://wangd.cslt.org/book/kaldi/tts.rar LaTeX]
| + | |