“ASR Status Report 2017-9-4”版本间的差异
来自cslt Wiki
(以“{| class="wikitable" !Date!!People !! Last Week !! This Week |- | rowspan="9"|2017.8.21 |Jiayin Cai || * || * |- |- |Xiaofei Kang || * Recording new audios from...”为内容创建页面) |
|||
(5位用户的8个中间修订版本未显示) | |||
第2行: | 第2行: | ||
!Date!!People !! Last Week !! This Week | !Date!!People !! Last Week !! This Week | ||
|- | |- | ||
− | | rowspan="9"|2017. | + | | rowspan="9"|2017.9.4 |
|Jiayin Cai | |Jiayin Cai | ||
|| | || | ||
− | * | + | *Finished the phonetic i-vector experiment. |
|| | || | ||
− | * | + | *get BN feature and train i-vector LID. |
+ | *Get phonetic feat from a stronger phonetic network | ||
+ | *combine PTN and phonetic i-vector. | ||
|- | |- | ||
第16行: | 第18行: | ||
|Xiaofei Kang | |Xiaofei Kang | ||
|| | || | ||
− | * | + | * cutting audio and marking:21 speakers,a total of 1050 sentences |
− | * | + | * Finish the new speaker recognition using the two recordings. |
|| | || | ||
− | * | + | * improve the human Test website |
|- | |- | ||
第26行: | 第28行: | ||
|Miao Zhang | |Miao Zhang | ||
|| | || | ||
− | * | + | * Absent |
|| | || | ||
− | * | + | * Perform human test on 21-style speech(add the disguise) |
+ | * Draw spectrums and t-SNE plots compared with experiment results | ||
|- | |- | ||
第35行: | 第38行: | ||
|Yanqing Wang | |Yanqing Wang | ||
|| | || | ||
− | * | + | * Absent. |
|| | || | ||
− | * | + | * |
|- | |- | ||
第44行: | 第47行: | ||
|Ying Shi | |Ying Shi | ||
|| | || | ||
− | * | + | * multi decodeing ASR model |
− | * | + | * multi decodeing with fake Lid [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=shiying&step=view_request&cvssid=627 here] |
+ | * read code about TTS | ||
|| | || | ||
− | * train | + | * employ group softmax to train multi decoding ASR model |
− | + | * synthesis one 'real' speech | |
− | * | + | |
|- | |- | ||
第56行: | 第59行: | ||
|Yixiang Chen | |Yixiang Chen | ||
|| | || | ||
− | * | + | * Absent. |
|| | || | ||
* | * | ||
第65行: | 第68行: | ||
|Lantian Li | |Lantian Li | ||
|| | || | ||
− | * | + | * Go on speaker segmentation tasks, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=615 here] |
+ | ** Dimensionality reduction. | ||
+ | ** Clustering. | ||
+ | ** Visualization. | ||
|| | || | ||
− | * | + | * Phonetic-aware speaker segmentation. |
|- | |- | ||
第74行: | 第80行: | ||
|Zhiyuan Tang | |Zhiyuan Tang | ||
|| | || | ||
− | * | + | * more indicators for VV scoring system, see [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/a/a1/VV_scoring.pdf]. |
|| | || | ||
* more indicators, a demo with Shuai. | * more indicators, a demo with Shuai. |
2017年9月4日 (一) 05:22的最后版本
Date | People | Last Week | This Week |
---|---|---|---|
2017.9.4
|
Jiayin Cai |
|
|
Xiaofei Kang |
|
| |
Miao Zhang |
|
| |
Yanqing Wang |
|
| |
Ying Shi |
|
| |
Yixiang Chen |
|
| |
Lantian Li |
|
| |
Zhiyuan Tang |
|
|
Date | People | Last Week | This Week |
---|---|---|---|
2017.8.21 | Xiaofei Kang |
|
|
Miao Zhang |
|
| |
Yanqing Wang |
|
| |
Ying Shi |
|
| |
Yixiang Chen |
|
| |
Lantian Li |
|
| |
Zhiyuan Tang |
|
|