“ASR Status Report 2016-11-21”版本间的差异
来自cslt Wiki
| (4位用户的16个中间修订版本未显示) | |||
| 第1行: | 第1行: | ||
{| class="wikitable" | {| class="wikitable" | ||
| − | !Date!!People !! Last Week !! This Week | + | ! Date!!People !! Last Week !! This Week |
| − | + | ||
| − | + | ||
| − | + | ||
|- | |- | ||
| − | | rowspan="5"|2016.11. | + | | rowspan="5"|2016.11.21 |
|Hang Luo | |Hang Luo | ||
|| | || | ||
| − | * | + | * Explore the language recognition models including: |
| + | * Evaluate the model in the aspect of sentence and frame, find the accuracy is very high. | ||
| + | * Minimize the language model, train it single and joint with speech model, evaluate its result. | ||
|| | || | ||
| − | * | + | * Continue doing the basic explore of joint training. |
| − | * | + | * Read paper about multi-language recognition models and others. |
|- | |- | ||
| 第23行: | 第22行: | ||
* change the size or word list and corpus this method not worked very well | * change the size or word list and corpus this method not worked very well | ||
* prune the LM .And the parameter been used to prune the LM is 2e-7 the size of LM reduce from 290M to 60M but the result about wer is very poor | * prune the LM .And the parameter been used to prune the LM is 2e-7 the size of LM reduce from 290M to 60M but the result about wer is very poor | ||
| − | * I have upload some result about several experiment to CVSS | + | * I have upload some result about several experiment to CVSS[http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=576] |
|| | || | ||
* there are too much private affairs about myself so the job about visualization last week has been delayed I will try my best to finish it the week | * there are too much private affairs about myself so the job about visualization last week has been delayed I will try my best to finish it the week | ||
| 第33行: | 第32行: | ||
|Yixiang Chen | |Yixiang Chen | ||
|| | || | ||
| − | * | + | * Learn MFCC extraction mechanism. |
| − | * | + | * Read kaldi computer-feature code and find how to change MFCC. |
| + | * Frequency-weighting based feature extraction. | ||
|| | || | ||
| − | * | + | * Continue replay detection (Freq-Weighting and Freq-Warping). |
|- | |- | ||
| 第43行: | 第43行: | ||
|Lantian Li | |Lantian Li | ||
|| | || | ||
| − | * | + | * Joint-training on SRE and LRE (LRE task). [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=574] |
| − | * | + | ** Tdnn is better than LSTM. |
| − | * | + | ** LRE is a long-term task. |
| − | * | + | * Briefly overview Interspeech SRE-related papers. |
| + | * CSLT-Replay detection. | ||
| + | ** Baseline done (Freq / Mel domain). | ||
| + | ** performance-driven based Freq-Weighting and Freq-Warping --> Yixiang. | ||
|| | || | ||
| − | * | + | * LRE task. |
| − | * | + | * Replay detection. |
|- | |- | ||
| 第56行: | 第59行: | ||
|Zhiyuan Tang | |Zhiyuan Tang | ||
|| | || | ||
| − | * | + | * report for Weekly Reading (a brief review of interspeech16), just prepared; |
| − | * | + | * language scores as decoding mask (1.multiply probability, very bad; 2.add log-softmax, a little bad) |
| + | * training with mask failed | ||
|| | || | ||
| − | * | + | * training with shared layers; |
| − | * | + | * explore single tasks. |
|} | |} | ||
| 第69行: | 第73行: | ||
{| class="wikitable" | {| class="wikitable" | ||
!Date!!People !! Last Week !! This Week | !Date!!People !! Last Week !! This Week | ||
| − | |||
| − | |||
| − | |||
|- | |- | ||
| rowspan="5"|2016.11.14 | | rowspan="5"|2016.11.14 | ||
2016年11月28日 (一) 01:13的最后版本
| Date | People | Last Week | This Week |
|---|---|---|---|
| 2016.11.21 | Hang Luo |
|
|
| Ying Shi |
There are several method I have tried
|
| |
| Yixiang Chen |
|
| |
| Lantian Li |
|
| |
| Zhiyuan Tang |
|
|
| Date | People | Last Week | This Week |
|---|---|---|---|
| 2016.11.14 | Hang Luo |
|
|
| Ying Shi |
|
| |
| Yixiang Chen |
|
| |
| Lantian Li |
| ||
| Zhiyuan Tang |
|
|