Difference between revisions of "Task List"

From cslt Wiki
 
(22 intermediate revisions by 4 users not shown)
Line 1: Line 1:
=Task To Do=
 
==Speech Recognition==
 
*End-to-End speech recognition
 
:*Zhiyuan Tang/Mengyuan Zhao/Zhiyong Zhang
 
  
*Integrate the class information to HCLG fst for speech recognition
+
=Tasks at hand=
  
*Distant speech recognition
+
==Speech Recognition==
:*RNN-DAE: echo or reverberation
+
::*Mengyuan Zhao/Zhiyong Zhang
+
:*Reverberation
+
::*Multi-microphones
+
::*(Lasso),Xuewei Zhang
+
 
+
*Voice conversation
+
:*xx
+
 
+
*Unbounded activation function (Rectifier/Maxout/Pnorm) go-through searching method
+
:*Zhiyong Zhang/Zhiyuan Tang
+
 
+
*Sparse DNN
+
:*Zhiyuan Tang
+
 
+
*Momentum-like Hessian-Free acceleration
+
 
+
*Correlation-based senone clustering
+
 
+
*NN Multi-GPU parallel training
+
:*Multi-Machine
+
::* nnet2 optimization
+
:*Multi-GPU on one Machine
+
::*Sheng Su
+
 
+
*Audio Embedding
+
 
+
*Activation value normalization through time
+
:* For bigger learning rate
+
 
+
*Mix-training Balance decision tree
+
:* Zhiyong Zhang
+
 
+
*RNN training acceleration
+
 
+
*Data selection
+
:*Zhiyong Zhang
+
:*Sub-modular data selection
+
 
+
*Decoder
+
:*Confidence output for task-required
+
 
+
==Speaker Verification==
+
*SUSR-ivector
+
:*Lantian Li
+
 
+
*binary code
+
:*Lantian Li
+
 
+
*RNN-ivector
+
:*Lantian Li
+
  
*DNN clustering
+
===joint learning===
:*Lantian Li
+
* Hang Luo, Zhiyuan Tang
  
*Recording replay detection (patent)
+
===visualization===
:*Lantian Li
+
* Ying Shi, Zhiyuan Tang
  
=Task DONE=
+
==Speaker Recognition==
*Multi-Mode features based VAD*
+
*Lantian Li, Yixiang Chen
:*Shi Yin, DONE
+
  
*DNN based Language identification and Speaker identification*
 
:*Xuewei Zhang/Zhiyuan Tang
 
  
*Neural network visualization*
+
=Tasks Done=
:*Mian Wang, DONE
+
  
*Dark knowledge*
+
=Technical Reports to write=
:*Mengyuan Zhao, Xiangyu Zeng, Zhiyong Zhang, Chao Liu
+
  
*Normal RNN speech recognition*
+
=Papers to write=
:*Mengyuan Zhao
+
  
 +
=Patents to write=
  
=Technical Report To Write=
+
=Patents done=
1, DNN-DAE based noise cancellation -- Xiangyu Zeng / Mengyuan Zhao / Zhiyong Zhang  --DONE
+
2, Speech Rate DNN speech recognition --Shi Yin/Xiangyu Zeng --DONE
+
3, CNN+fbank feature combination --Mian Wang /Yiye Lin /Mengyuan Zhao /Shi Yin
+
4, Uyghur low-resource acoustic model enhancement -- Shi Yin / Mengyuan Zhao / Zhiyong Zhang --DONE
+
5, Uyghur 20h database release --Kaer /Shi Yin --DONE
+
6, Dark-Knowledge Transfer
+
    *: Xiangyu Zeng/ Mengyuan Zhao / Zhiyong Zhang
+
  
=Paper to Write=
+
=Projects=
  
=Project=
 
* Xiaomi TV
 
:*Mengyuan Zhao/Zhiyong Zhang
 
:*TAG-lm & Domain-specific general lm
 
  
*Chinese-English mix-training
+
------------------------------
 +
[[task previous]]

Latest revision as of 12:31, 16 October 2016 (Sunday)

Tasks at hand

Speech Recognition

joint learning

  • Hang Luo, Zhiyuan Tang

visualization

  • Ying Shi, Zhiyuan Tang

Speaker Recognition

  • Lantian Li, Yixiang Chen


Tasks Done

Technical Reports to write

Papers to write

Patents to write

Patents done

Projects


task previous