“Task List”版本间的差异
来自cslt Wiki
第1行: | 第1行: | ||
=Task To Do= | =Task To Do= | ||
− | * 1, RNN speech recognition (Tied-context-dependent-state and End-to-End) | + | |
− | * 2, Real environment noise cancellation( | + | [['''* 1, RNN speech recognition (Tied-context-dependent-state and End-to-End) --Chao Liu/Zhiyuan Tang |
− | * 3, Integrate the class information to HCLG fst for speech recognition | + | * 2, Real environment noise cancellation(RNN-DAE: echo or reverberation) --Zhiyong Zhang |
− | * 4, Multi-Mode features based VAD | + | * 3, Integrate the class information to HCLG fst for speech recognition |
− | * 5, DNN based Language identification and Speaker identification | + | * 4, Multi-Mode features based VAD --Shi Yin,Done |
− | * 6, Distant speech recognition(Reverberation, Mutli-microphones) | + | * 5, DNN based Language identification and Speaker identification |
− | * 7, Voice conversation | + | * 6, Distant speech recognition(Reverberation, Mutli-microphones) --(Lasso),Xuewei Zhang |
− | * 8, Unbound activation function(Rectifier/Maxout/Pnorm) go-through searching method. | + | * 7, Voice conversation --Zhongwei Yao |
− | * 9, Sparse DNN | + | * 8, Unbound activation function(Rectifier/Maxout/Pnorm) go-through searching method. --Zhiyong Zhang |
− | * 10, Neural network visulization | + | * 9, Sparse DNN --Zhiyuan Tang/Chao Liu |
− | * 11, | + | * 10, Neural network visulization --Mian Wang,Done |
− | + | * 11, DNN training GPU parallelization , nnet2 optimization. | |
− | + | * 12, Monmentum-like Hessien acceleration | |
+ | * 13, Correlation based SEONE cluster | ||
+ | * 14,''']] | ||
+ | |||
2015年4月22日 (三) 13:22的版本
Task To Do
[[* 1, RNN speech recognition (Tied-context-dependent-state and End-to-End) --Chao Liu/Zhiyuan Tang
- 2, Real environment noise cancellation(RNN-DAE: echo or reverberation) --Zhiyong Zhang
- 3, Integrate the class information to HCLG fst for speech recognition
- 4, Multi-Mode features based VAD --Shi Yin,Done
- 5, DNN based Language identification and Speaker identification
- 6, Distant speech recognition(Reverberation, Mutli-microphones) --(Lasso),Xuewei Zhang
- 7, Voice conversation --Zhongwei Yao
- 8, Unbound activation function(Rectifier/Maxout/Pnorm) go-through searching method. --Zhiyong Zhang
- 9, Sparse DNN --Zhiyuan Tang/Chao Liu
- 10, Neural network visulization --Mian Wang,Done
- 11, DNN training GPU parallelization , nnet2 optimization.
- 12, Monmentum-like Hessien acceleration
- 13, Correlation based SEONE cluster
- 14,]]
Technical Report To Write
- 1, DNN-DAE based noise cancellation -- Xiangyu Zeng / Mengyuan Zhao / Zhiyong Zhang
- 2, Speech Rate DNN speech recognition --Shi Yin
- 3, CNN+fbank feature combination --Mian Wang /Yiye Lin /Mengyuan Zhao /Shi Yin
- 4, Uyghur low-resource acoustic model enhancement -- Shi Yin / Mengyuan Zhao / Zhiyong Zhang
- 5, Uyghur 20h database release --Kaer /Shi Yin
Paper to Write
- 1, DNN-DAE Xiangyu Zeng/ Mengyuan Zhao Conference: ChinaSIP-2015
- 2, RNN-dAE Chao Liu / Zhiyiong Zhang Conference: Interspeech-2015