Difference between revisions of "ASR Status Report 2017-9-11"
From cslt Wiki
(10 intermediate revisions by 4 users not shown) | |||
Line 22: | Line 22: | ||
* Recording and cutting the audios, a total of 12 groups | * Recording and cutting the audios, a total of 12 groups | ||
|| | || | ||
− | * | + | * Continue to record the audios with zhangmiao |
* Continue to ask people to do human test | * Continue to ask people to do human test | ||
|- | |- | ||
Line 34: | Line 34: | ||
|| | || | ||
* Continue to ask people to do human test | * Continue to ask people to do human test | ||
− | * Recording (the goal is to record 400 to 500 people) | + | * Recording (the goal is to record 400 to 500 people) [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/cc/录音说明.pdf here] |
|- | |- | ||
Line 41: | Line 41: | ||
|Yanqing Wang | |Yanqing Wang | ||
|| | || | ||
− | * | + | * Absent |
|| | || | ||
* | * | ||
Line 50: | Line 50: | ||
|Ying Shi | |Ying Shi | ||
|| | || | ||
− | * multi-decoding ASR model with more pdfs | + | * multi-decoding ASR model with more pdfs. Performance is better than before but still not good enough |
− | * add a separate symbol to discriminate Kazakh and Uyghur | + | * add a separate symbol to discriminate the Kazakh and Uyghur word sets |
* group-based softmax (in progress) | * group-based softmax (in progress) | ||
|| | || | ||
Line 61: | Line 61: | ||
|Yixiang Chen | |Yixiang Chen | ||
|| | || | ||
− | * | + | * Absent |
|| | || | ||
* | * | ||
Line 70: | Line 70: | ||
|Lantian Li | |Lantian Li | ||
|| | || | ||
− | * | + | * Continue speaker segmentation tasks, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=615 here] |
+ | ** Complete the phonetic-aware speaker segmentation. | ||
+ | *** Word-level boundaries from the ASR. | ||
+ | *** Word-level d-vector and clustering. | ||
|| | || | ||
− | * | + | * Try some smoothing tricks. |
|- | |- | ||
Line 79: | Line 82: | ||
|Zhiyuan Tang | |Zhiyuan Tang | ||
|| | || | ||
− | * | + | * Organized the code and documentation of the Parrot system [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=635] |
|| | || | ||
− | * | + | * Theoretical study of pronunciation detection |
− | + | ||
|- | |- | ||
Latest revision as of 00:45, 13 September 2017 (Wednesday)
Date | People | Last Week | This Week |
---|---|---|---|
2017.9.4
|
Jiayin Cai |
|
|
Xiaofei Kang |
|
| |
Miao Zhang |
|
| |
Yanqing Wang |
|
| |
Ying Shi |
|
| |
Yixiang Chen |
|
| |
Lantian Li |
|
| |
Zhiyuan Tang |
|
|