“ASR Status Report 2016-11-21”版本间的差异
来自cslt Wiki
(4位用户的15个中间修订版本未显示) | |||
第1行: | 第1行: | ||
{| class="wikitable" | {| class="wikitable" | ||
− | !Date!!People !! Last Week !! This Week | + | ! Date!!People !! Last Week !! This Week |
− | + | ||
− | + | ||
− | + | ||
|- | |- | ||
| rowspan="5"|2016.11.21 | | rowspan="5"|2016.11.21 | ||
|Hang Luo | |Hang Luo | ||
|| | || | ||
− | * | + | * Explore the language recognition models including: |
+ | * Evaluate the model in the aspect of sentence and frame, find the accuracy is very high. | ||
+ | * Minimize the language model, train it single and joint with speech model, evaluate its result. | ||
|| | || | ||
− | * | + | * Continue doing the basic explore of joint training. |
− | * | + | * Read paper about multi-language recognition models and others. |
|- | |- | ||
第23行: | 第22行: | ||
* change the size or word list and corpus this method not worked very well | * change the size or word list and corpus this method not worked very well | ||
* prune the LM .And the parameter been used to prune the LM is 2e-7 the size of LM reduce from 290M to 60M but the result about wer is very poor | * prune the LM .And the parameter been used to prune the LM is 2e-7 the size of LM reduce from 290M to 60M but the result about wer is very poor | ||
− | * I have upload some result about several experiment to CVSS | + | * I have upload some result about several experiment to CVSS[http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=576] |
|| | || | ||
* there are too much private affairs about myself so the job about visualization last week has been delayed I will try my best to finish it the week | * there are too much private affairs about myself so the job about visualization last week has been delayed I will try my best to finish it the week | ||
第33行: | 第32行: | ||
|Yixiang Chen | |Yixiang Chen | ||
|| | || | ||
− | * | + | * Learn MFCC extraction mechanism. |
− | * | + | * Read kaldi computer-feature code and find how to change MFCC. |
+ | * Frequency-weighting based feature extraction. | ||
|| | || | ||
− | * | + | * Continue replay detection (Freq-Weighting and Freq-Warping). |
|- | |- | ||
第43行: | 第43行: | ||
|Lantian Li | |Lantian Li | ||
|| | || | ||
− | * | + | * Joint-training on SRE and LRE (LRE task). [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=574] |
− | * | + | ** Tdnn is better than LSTM. |
− | * | + | ** LRE is a long-term task. |
− | * | + | * Briefly overview Interspeech SRE-related papers. |
+ | * CSLT-Replay detection. | ||
+ | ** Baseline done (Freq / Mel domain). | ||
+ | ** performance-driven based Freq-Weighting and Freq-Warping --> Yixiang. | ||
|| | || | ||
− | * | + | * LRE task. |
− | * | + | * Replay detection. |
|- | |- | ||
第56行: | 第59行: | ||
|Zhiyuan Tang | |Zhiyuan Tang | ||
|| | || | ||
− | * | + | * report for Weekly Reading (a brief review of interspeech16), just prepared; |
− | * | + | * language scores as decoding mask (1.multiply probability, very bad; 2.add log-softmax, a little bad) |
+ | * training with mask failed | ||
|| | || | ||
− | * | + | * training with shared layers; |
− | * | + | * explore single tasks. |
|} | |} | ||
第69行: | 第73行: | ||
{| class="wikitable" | {| class="wikitable" | ||
!Date!!People !! Last Week !! This Week | !Date!!People !! Last Week !! This Week | ||
− | |||
− | |||
− | |||
|- | |- | ||
| rowspan="5"|2016.11.14 | | rowspan="5"|2016.11.14 |
2016年11月28日 (一) 01:13的最后版本
Date | People | Last Week | This Week |
---|---|---|---|
2016.11.21 | Hang Luo |
|
|
Ying Shi |
There are several method I have tried
|
| |
Yixiang Chen |
|
| |
Lantian Li |
|
| |
Zhiyuan Tang |
|
|
Date | People | Last Week | This Week |
---|---|---|---|
2016.11.14 | Hang Luo |
|
|
Ying Shi |
|
| |
Yixiang Chen |
|
| |
Lantian Li |
| ||
Zhiyuan Tang |
|
|