Differences between revisions of "Task List"

From cslt Wiki
Previous revision (from line 1)

=Task To Do=
* 1, RNN speech recognition (Tied-context-dependent-state and End-to-End)
* 2, Real environment noise cancellation (DNN-DAE/CNN-DAE/RNN-DAE: echo or reverberation)
* 3, Integrate the class information to HCLG fst for speech recognition
* 4, Multi-Mode features based VAD
* 5, DNN based Language identification and Speaker identification
* 6, Distant speech recognition (Reverberation, Multi-microphones)
* 7, Voice conversation
* 8, Unbound activation function (Rectifier/Maxout/Pnorm) go-through searching method.
* 9, Sparse DNN
* 10, Neural network visualization
* 11, DAE+dropout
:* DNN-DAE -- Xiangyu Zeng
:* CNN-DAE -- Yiye Lin

Revision as of 13:22, 22 April 2015 (Wed)

Task To Do

  • 1, RNN speech recognition (Tied-context-dependent-state and End-to-End) -- Chao Liu / Zhiyuan Tang
  • 2, Real environment noise cancellation (RNN-DAE: echo or reverberation) -- Zhiyong Zhang
  • 3, Integrate the class information to HCLG fst for speech recognition
  • 4, Multi-Mode features based VAD -- Shi Yin, Done
  • 5, DNN based Language identification and Speaker identification
  • 6, Distant speech recognition (Reverberation, Multi-microphones) -- (Lasso), Xuewei Zhang
  • 7, Voice conversation -- Zhongwei Yao
  • 8, Unbound activation function (Rectifier/Maxout/Pnorm) go-through searching method. -- Zhiyong Zhang
  • 9, Sparse DNN -- Zhiyuan Tang / Chao Liu
  • 10, Neural network visualization -- Mian Wang, Done
  • 11, DNN training GPU parallelization, nnet2 optimization.
  • 12, Momentum-like Hessian acceleration
  • 13, Correlation based SEONE cluster
  • 14,


Technical Report To Write

  • 1, DNN-DAE based noise cancellation -- Xiangyu Zeng / Mengyuan Zhao / Zhiyong Zhang
  • 2, Speech Rate DNN speech recognition -- Shi Yin
  • 3, CNN+fbank feature combination -- Mian Wang / Yiye Lin / Mengyuan Zhao / Shi Yin
  • 4, Uyghur low-resource acoustic model enhancement -- Shi Yin / Mengyuan Zhao / Zhiyong Zhang
  • 5, Uyghur 20h database release -- Kaer / Shi Yin

Paper to Write

  • 1, DNN-DAE -- Xiangyu Zeng / Mengyuan Zhao; Conference: ChinaSIP-2015
  • 2, RNN-DAE -- Chao Liu / Zhiyong Zhang; Conference: Interspeech-2015