“2013-05-10”版本间的差异
来自cslt Wiki
(以内容“== Data sharing == * status? == DNN progress == === Experiments === * setups for input layer : s1: mfcc(13), splice +-5[143] : s2: mfcc(13), splice +-5(143), LDA[143]...”创建新页面) |
(→Data sharing) |
||
第1行: | 第1行: | ||
== Data sharing == | == Data sharing == | ||
− | * | + | * LM count files still undelivered! |
== DNN progress == | == DNN progress == |
2013年5月13日 (一) 07:17的版本
目录
Data sharing
- LM count files still undelivered!
DNN progress
Experiments
- setups for input layer
- s1: mfcc(13), splice +-5[143]
- s2: mfcc(13), splice +-5(143), LDA[143]
- s3: mfcc(13), delta(39), splice +-5(429), LDA[143]
- s4: mfcc(13), delta(39), splice +-5(429), LDA[300]
- setups for alignment
- tri1: triphone training, feature input: mfcc(13), delta[39]. #pdfs 1651, #gaussians 10028
- tri2: LDA/MLLT training, feature input: mfcc(13), delta(39), splice +-4(351), LDA[40]. #pdfs 3536, #gaussians 39995
- other notes
- about 100 hours training data
- 88k LM, biglm decoding (1e-5 / 1e-9)
- gpu-based nnet training, in-1200-1200-1200-1200-out
- results
Test Set | fMMI | s1/tri1 | s2/tri1 | s3/tri1 | s4/tri1 | s2/tri2 | s4/tri2 | cpu-based (like s4/tri1) |
---|---|---|---|---|---|---|---|---|
map | 25.38 | 24.47 | 26.16 | 26.20 | 22.85 | 24.27 | 26.45 | |
2044 | 23.58 | 22.82 | 23.84 | 24.13 | 21.45 | 22.76 | 24.66 | |
notetp3 | 16.08 | 14.89 | 15.92 | 15.97 | 14.89 | 14.79 | 16.14 | |
1900 | 8.55 | 8.43 | 8.66 | 8.90 | 7.30 | 7.91 | 8.23 | |
general | 36.18 | 34.79 | 35.88 | 35.90 | 33.06 | 33.79 | 38.02 | |
online1 | 34.68 | 33.90 | 33.45 | 33.38 | 32.93 | 32.43 | 33.00 | |
online2 | 27.27 | 26.61 | 26.26 | 26.36 | 25.94 | 25.69 | 26.63 | |
speedup | 24.97 | 24.40 | 24.55 | 25.42 | 23.04 | 23.67 | 27.17 |
- conclusion
TODO
Tencent exps
TODO
GPU & CPU merge
TODO
L-1 sparse initial training
- experiments on L1 penalty
Test Set | 0 | 1.00E-06 | 1.00E-05 | 2.50E-05 | 5.00E-05 | 7.50E-05 | 1.00E-04 |
---|---|---|---|---|---|---|---|
map | 61.06 | 61.18 | 61.31 | 61.72 | 62.89 | 62.84 | 62.48 |
2044 | 49.53 | 49.58 | 49.84 | 49.94 | 50.71 | 51.08 | 51.08 |
notetp3 | 43.44 | 43.44 | 43.55 | 44.58 | 45.22 | 44.95 | 45.76 |
1900 | 38.50 | 38.54 | 39.00 | 39.21 | 40.33 | 40.53 | 40.60 |
general | 61.24 | 61.22 | 61.69 | 61.84 | 62.71 | 62.83 | 62.93 |
online1 | 58.02 | 58.05 | 58.31 | 58.23 | 58.83 | 59.06 | 59.46 |
online2 | 53.62 | 53.70 | 54.03 | 53.93 | 54.65 | 54.94 | 55.51 |
speedup | 57.51 | 57.49 | 57.93 | 58.31 | 59.75 | 60.08 | 59.52 |
- TODO
To be done
TODO
Kaldi/HTK merge
- HTK2Kaldi: hold.
- Kaldi2HTK: stuck. various sp models tried but don't help
- To be done
- TODO
Embedded progress
- Status:
- training acoustic model for sphinx
- To be done
- finish AM training
- run test