“ASR Status Report 2016-11-21”版本间的差异
来自cslt Wiki
| (4位用户的13个中间修订版本未显示) | |||
| 第1行: | 第1行: | ||
{| class="wikitable" | {| class="wikitable" | ||
| − | !Date!!People !! Last Week !! This Week | + | ! Date!!People !! Last Week !! This Week |
| − | + | ||
| − | + | ||
| − | + | ||
|- | |- | ||
| rowspan="5"|2016.11.21 | | rowspan="5"|2016.11.21 | ||
|Hang Luo | |Hang Luo | ||
|| | || | ||
| − | * | + | * Explore the language recognition models including: |
| + | * Evaluate the model in the aspect of sentence and frame, find the accuracy is very high. | ||
| + | * Minimize the language model, train it single and joint with speech model, evaluate its result. | ||
|| | || | ||
| − | * | + | * Continue doing the basic explore of joint training. |
| − | * | + | * Read paper about multi-language recognition models and others. |
|- | |- | ||
| 第33行: | 第32行: | ||
|Yixiang Chen | |Yixiang Chen | ||
|| | || | ||
| − | * Learn MFCC extraction mechanism | + | * Learn MFCC extraction mechanism. |
| − | * Read kaldi computer-feature code and find how to change MFCC | + | * Read kaldi computer-feature code and find how to change MFCC. |
| − | * | + | * Frequency-weighting based feature extraction. |
|| | || | ||
| − | * | + | * Continue replay detection (Freq-Weighting and Freq-Warping). |
|- | |- | ||
| 第44行: | 第43行: | ||
|Lantian Li | |Lantian Li | ||
|| | || | ||
| − | * | + | * Joint-training on SRE and LRE (LRE task). [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=574] |
| − | * | + | ** Tdnn is better than LSTM. |
| − | * | + | ** LRE is a long-term task. |
| − | * | + | * Briefly overview Interspeech SRE-related papers. |
| + | * CSLT-Replay detection. | ||
| + | ** Baseline done (Freq / Mel domain). | ||
| + | ** performance-driven based Freq-Weighting and Freq-Warping --> Yixiang. | ||
|| | || | ||
| − | * | + | * LRE task. |
| − | * | + | * Replay detection. |
|- | |- | ||
| 第57行: | 第59行: | ||
|Zhiyuan Tang | |Zhiyuan Tang | ||
|| | || | ||
| − | * | + | * report for Weekly Reading (a brief review of interspeech16), just prepared; |
| − | * | + | * language scores as decoding mask (1.multiply probability, very bad; 2.add log-softmax, a little bad) |
| + | * training with mask failed | ||
|| | || | ||
| − | * | + | * training with shared layers; |
| − | * | + | * explore single tasks. |
|} | |} | ||
| 第70行: | 第73行: | ||
{| class="wikitable" | {| class="wikitable" | ||
!Date!!People !! Last Week !! This Week | !Date!!People !! Last Week !! This Week | ||
| − | |||
| − | |||
| − | |||
|- | |- | ||
| rowspan="5"|2016.11.14 | | rowspan="5"|2016.11.14 | ||
2016年11月28日 (一) 01:13的最后版本
| Date | People | Last Week | This Week |
|---|---|---|---|
| 2016.11.21 | Hang Luo |
|
|
| Ying Shi |
There are several method I have tried
|
| |
| Yixiang Chen |
|
| |
| Lantian Li |
|
| |
| Zhiyuan Tang |
|
|
| Date | People | Last Week | This Week |
|---|---|---|---|
| 2016.11.14 | Hang Luo |
|
|
| Ying Shi |
|
| |
| Yixiang Chen |
|
| |
| Lantian Li |
| ||
| Zhiyuan Tang |
|
|