“ASR Status Report 2017-7-24”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(5位用户的7个中间修订版本未显示)
第2行: 第2行:
 
!Date!!People !! Last Week !! This Week
 
!Date!!People !! Last Week !! This Week
 
|-
 
|-
| rowspan="7"|2017.7.17
+
| rowspan="8"|2017.7.24
  
 
|Xiaofei Kang
 
|Xiaofei Kang
 
||  
 
||  
*  
+
* Prepare the data set of Speaker Recognition : pick out whisper
 +
* Learn the the nnet3 model, run the nnet3 experiment
 
||  
 
||  
*  
+
* Learn the Speaker Recognition model, run the Speaker Recognition experiment
 
|-
 
|-
  
第14行: 第15行:
 
|Miao Zhang
 
|Miao Zhang
 
||  
 
||  
*  
+
* joined a meeting in Chinese Academy of Social Sciences
 +
* worked out a recording plan
 +
* learnt kaldi and did experiments
 
||  
 
||  
*  
+
* test performances on 12 kinds of voices we have
 
|-
 
|-
  
第31行: 第34行:
 
|Yanqing Wang
 
|Yanqing Wang
 
||  
 
||  
*  
+
* change the source code of Kaldi to implement retraining ( with zero value fixed )
 +
* start to write a technical report of pruning the neural network ( not finished )
 
||
 
||
*  
+
* finish the retraining task
 +
* finish the technical report
 
|-
 
|-
  
第40行: 第45行:
 
|Ying Shi   
 
|Ying Shi   
 
||  
 
||  
*  
+
* data checking website
 +
* learn how to write a crawler program
 
||  
 
||  
*  
+
* write a more general crawler
 +
* realign kazak train and test data with transfer learning model
 
|-
 
|-
  
第49行: 第56行:
 
|Yixiang Chen   
 
|Yixiang Chen   
 
||  
 
||  
*  
+
* use wisper audio for speaker recognition
 +
* joined a meeting in Chinese Academy of Social Sciences
 
||  
 
||  
*  
+
* test performances on 12 kinds of voices
 
+
|-
  
 
|-
 
|-
 
|Lantian Li   
 
|Lantian Li   
 
||  
 
||  
*  
+
* deepspk on TASLP.
 +
* speaker segmentation.
 
||  
 
||  
*  
+
* recipe of deepspk.
 
|-
 
|-
  

2017年7月26日 (三) 04:21的最后版本

Date People Last Week This Week
2017.7.24 Xiaofei Kang
  • Prepare the data set of Speaker Recognition : pick out whisper
  • Learn the the nnet3 model, run the nnet3 experiment
  • Learn the Speaker Recognition model, run the Speaker Recognition experiment
Miao Zhang
  • joined a meeting in Chinese Academy of Social Sciences
  • worked out a recording plan
  • learnt kaldi and did experiments
  • test performances on 12 kinds of voices we have
Hui Tang
  • help jiayin to configure dnn and lstm in kaldi
  • left for postgraduate life
Yanqing Wang
  • change the source code of Kaldi to implement retraining ( with zero value fixed )
  • start to write a technical report of pruning the neural network ( not finished )
  • finish the retraining task
  • finish the technical report
Ying Shi
  • data checking website
  • learn how to write a crawler program
  • write a more general crawler
  • realign kazak train and test data with transfer learning model
Yixiang Chen
  • use wisper audio for speaker recognition
  • joined a meeting in Chinese Academy of Social Sciences
  • test performances on 12 kinds of voices
Lantian Li
  • deepspk on TASLP.
  • speaker segmentation.
  • recipe of deepspk.
Zhiyuan Tang
  • Replaced ATLAS lib with MKL lib for compiling auto-scoring system.
  • Kaldi book writing.
  • A basic demo for auto-scoring system.
  • Kaldi book writing.




Date People Last Week This Week
2017.7.17


Miao Zhang
  • Read the paralinguistic paper and material from Teacher Li
  • work out the recording plan (delayed)
  • work out the recording plan with instruction from Teacher Li
  • check the book of deep learning
Hui Tang
  • finish checking the speech database
  • help jiayin learn kaldi by training a language identification model
  • make sure which type of voice is what we need
  • help jiayin to configure dnn and lstm in kaldi
Yanqing Wang
  • check the former conclusions in a narrow network ( not finished yet )
  • read the source code as a preparation for retrain task.
  • finish checking the former conclusions and try to find the applicable conditions.
  • finish the retrain task.
Ying Shi
  • kazak transfer learning (WER)
    • dark:16.50
    • fix:18.33
    • org:23.69
  • data checking website
    • major function has been down
    • save state when refresh or close the page is in progress
  • finish the website(employ text database)
  • design more powerfull crawler
Yixiang Chen
  • Through voiceprint spectrum synthetic speech
  • read paralinguistic and challenge of paralinguistic 2009-2017
  • share paralinguistic
Lantian Li
Zhiyuan Tang
  • Replace the old version kaldi with new ones. (delayed)
  • Gather Part 1: 'Speech, Speech Processing and Tools' of Kaldi book for further release. (delayed)
  • Replace the old version kaldi with new ones for auto-scoring system.
  • Gather Part 1: 'Speech, Speech Processing and Tools' of Kaldi book for further release.