“ASR Status Report 2017-8-28”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第135行: 第135行:
 
|Lantian Li   
 
|Lantian Li   
 
||  
 
||  
*  
+
* Speaker segmentation
 +
** Speaker change points detection + K-means clustering.
 +
** GMM clustering.
 +
** Clean up codes.
 +
* Prepare IS2017 presentation.
 
||
 
||
*  
+
* Attend IS2017.
 
|-
 
|-
  

2017年8月28日 (一) 14:29的版本

Date People Last Week This Week
2017.8.21 Xiaofei Kang
  • Recording new audios from 38 person, located in /work7/tanghui/kangxf/workspaces/speaker/wavdata/V2.0
  • Improve the test website to judge before committing
  • Test the new recording。
Miao Zhang
Yanqing Wang
  • pruning the connections and refining, results
  • Absent.
Ying Shi
  • check toolkit code
  • multilingual baseline system
  • train language id model
  • use Lid to do multi-decoding
  • some experiments for zhiyong zhang about TTS
Yixiang Chen
Lantian Li
Zhiyuan Tang
  • several indicators for VV scoring system, see [1].
  • more indicators, a demo with Shuai.
  • toolbook writing.




Date People Last Week This Week
2017.8.21 Xiaofei Kang
  • Absence
  • finish experiments on 5 recorded speech.
  • Improve and test the human test website.
  • Learn 4 papers from lantian : about speaker recongnition and deep speaker feature.
Miao Zhang
  • Prepare the data and finish experiments on 5 recorded speech.
  • Finish the human test website(include 20 styles), express my apprecation to Shuai sister!
Yanqing Wang
  • explore how the pruning method influence the ( distribution of ) output of the network: result
  • after retraining, the distribution may reappear.
  • continue on exploration on the network's sparse structure.
Ying Shi
  • crawler program [finished]
  • tibetan asr system baseline (19.46%)
  • multilingual decoding
  • maybe I can help Zhiyong Zhang to do some work about TTS
  • check toolkit code(check data website and codemap) and check it into git
Yixiang Chen
Lantian Li
  • Speaker segmentation
    • Speaker change points detection + K-means clustering.
    • GMM clustering.
    • Clean up codes.
  • Prepare IS2017 presentation.
  • Attend IS2017.
Zhiyuan Tang
  • 1. align the candidate speech (fbank) with phone labels using nnet3-align-compiled (almost finished); 2.analyse the alignment with rhythm, tone, tune, for Parrot system, (revised goodness of pronunciation), to be done.
  • collecting material (PPT) for Kaldi toolbook.
  • analyse the alignment with rhythm, tone, tune, (revised goodness of pronunciation).
  • toolbook writing