“ASR Status Report 2016-11-21”版本间的差异
来自cslt Wiki
(4位用户的13个中间修订版本未显示) | |||
第1行: | 第1行: | ||
{| class="wikitable" | {| class="wikitable" | ||
− | !Date!!People !! Last Week !! This Week | + | ! Date!!People !! Last Week !! This Week |
− | + | ||
− | + | ||
− | + | ||
|- | |- | ||
| rowspan="5"|2016.11.21 | | rowspan="5"|2016.11.21 | ||
|Hang Luo | |Hang Luo | ||
|| | || | ||
− | * | + | * Explore the language recognition models including: |
+ | * Evaluate the model in the aspect of sentence and frame, find the accuracy is very high. | ||
+ | * Minimize the language model, train it single and joint with speech model, evaluate its result. | ||
|| | || | ||
− | * | + | * Continue doing the basic explore of joint training. |
− | * | + | * Read paper about multi-language recognition models and others. |
|- | |- | ||
第33行: | 第32行: | ||
|Yixiang Chen | |Yixiang Chen | ||
|| | || | ||
− | * Learn MFCC extraction mechanism | + | * Learn MFCC extraction mechanism. |
− | * Read kaldi computer-feature code and find how to change MFCC | + | * Read kaldi computer-feature code and find how to change MFCC. |
− | * | + | * Frequency-weighting based feature extraction. |
|| | || | ||
− | * | + | * Continue replay detection (Freq-Weighting and Freq-Warping). |
|- | |- | ||
第44行: | 第43行: | ||
|Lantian Li | |Lantian Li | ||
|| | || | ||
− | * | + | * Joint-training on SRE and LRE (LRE task). [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=574] |
− | * | + | ** Tdnn is better than LSTM. |
− | * | + | ** LRE is a long-term task. |
− | * | + | * Briefly overview Interspeech SRE-related papers. |
+ | * CSLT-Replay detection. | ||
+ | ** Baseline done (Freq / Mel domain). | ||
+ | ** performance-driven based Freq-Weighting and Freq-Warping --> Yixiang. | ||
|| | || | ||
− | * | + | * LRE task. |
− | * | + | * Replay detection. |
|- | |- | ||
第57行: | 第59行: | ||
|Zhiyuan Tang | |Zhiyuan Tang | ||
|| | || | ||
− | * | + | * report for Weekly Reading (a brief review of interspeech16), just prepared; |
− | * | + | * language scores as decoding mask (1.multiply probability, very bad; 2.add log-softmax, a little bad) |
+ | * training with mask failed | ||
|| | || | ||
− | * | + | * training with shared layers; |
− | * | + | * explore single tasks. |
|} | |} | ||
第70行: | 第73行: | ||
{| class="wikitable" | {| class="wikitable" | ||
!Date!!People !! Last Week !! This Week | !Date!!People !! Last Week !! This Week | ||
− | |||
− | |||
− | |||
|- | |- | ||
| rowspan="5"|2016.11.14 | | rowspan="5"|2016.11.14 |
2016年11月28日 (一) 01:13的最后版本
Date | People | Last Week | This Week |
---|---|---|---|
2016.11.21 | Hang Luo |
|
|
Ying Shi |
There are several method I have tried
|
| |
Yixiang Chen |
|
| |
Lantian Li |
|
| |
Zhiyuan Tang |
|
|
Date | People | Last Week | This Week |
---|---|---|---|
2016.11.14 | Hang Luo |
|
|
Ying Shi |
|
| |
Yixiang Chen |
|
| |
Lantian Li |
| ||
Zhiyuan Tang |
|
|