Difference between revisions of "ASR Status Report 2017-9-11"
From cslt Wiki
(8 intermediate revisions by 4 users not shown) | |||
Line 22: | Line 22: | ||
* Recording and cutting the audios, a total of 12 groups | * Recording and cutting the audios, a total of 12 groups | ||
|| | || | ||
− | * | + | * Continue to record the audios with zhangmiao |
* Continue to ask people to do human test | * Continue to ask people to do human test | ||
|- | |- | ||
Line 34: | Line 34: | ||
|| | || | ||
* Continue to ask people to do human test | * Continue to ask people to do human test | ||
− | * Recording(the goal is to record 400 to 500 people)[ | + | * Recording(the goal is to record 400 to 500 people) [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/cc/录音说明.pdf here] |
|- | |- | ||
Line 41: | Line 41: | ||
|Yanqing Wang | |Yanqing Wang | ||
|| | || | ||
− | * | + | * Absent |
|| | || | ||
* | * | ||
Line 50: | Line 50: | ||
|Ying Shi | |Ying Shi | ||
|| | || | ||
− | * multi-decoding ASR model with more pdfs | + | * multi-decoding ASR model with more pdfs. Performance is better than before but still not good enough |
− | * add separate symbol to discriminate Kazakh and Uyghur | + | * add separate symbol to discriminate Kazakh and Uyghur word set |
* group-based softmax(in progress) | * group-based softmax(in progress) | ||
|| | || | ||
Line 72: | Line 72: | ||
* Go on speaker segmentation tasks, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=615 here] | * Go on speaker segmentation tasks, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=615 here] | ||
** Complete the phonetic-aware speaker segmentation. | ** Complete the phonetic-aware speaker segmentation. | ||
− | ** Word-level boundaries from the ASR. | + | *** Word-level boundaries from the ASR. |
− | ** Word-level d-vector and clustering. | + | *** Word-level d-vector and clustering. |
|| | || | ||
* Try some smoothing tricks.
Line 82: | Line 82: | ||
|Zhiyuan Tang | |Zhiyuan Tang | ||
|| | || | ||
− | * | + | * Organized the code and doc of Parrot system[http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=635] |
|| | || | ||
− | * | + | * Theoretical study of pronunciation detection |
− | + | ||
|- | |- | ||
Latest revision as of 00:45, 13 September 2017 (Wed)
Date | People | Last Week | This Week |
---|---|---|---|
2017.9.4
|
Jiayin Cai |
|
|
Xiaofei Kang |
|
| |
Miao Zhang |
|
| |
Yanqing Wang |
|
| |
Ying Shi |
|
| |
Yixiang Chen |
|
| |
Lantian Li |
|
| |
Zhiyuan Tang |
|
|