ASR Status Report 2017-8-28

Date People Last Week This Week
2017.8.21 Xiaofei Kang
  • Recording new audio from 38 people, located in /work7/tanghui/kangxf/workspaces/speaker/wavdata/V2.0
  • Improve the test website to judge before committing
  • Test the new recordings.
Miao Zhang
Yanqing Wang
  • pruning the connections and refining: results
  • Absent.
Ying Shi
  • check toolkit code
  • multilingual baseline system
  • train language id model
  • use LID (language ID) for multilingual decoding
  • some TTS experiments for Zhiyong Zhang
Yixiang Chen
Lantian Li
  • Attend IS2017.
  • Continue speaker segmentation tasks.
Zhiyuan Tang
  • several indicators for VV scoring system, see [1].
  • more indicators, a demo with Shuai.
  • toolbook writing.




Date People Last Week This Week
2017.8.21 Xiaofei Kang
  • Absent.
  • Finish experiments on the 5 recorded speech samples.
  • Improve and test the human test website.
  • Read 4 papers from Lantian on speaker recognition and deep speaker features.
Miao Zhang
  • Prepare the data and finish experiments on the 5 recorded speech samples.
  • Finish the human test website (including 20 styles); many thanks to Shuai for her help!
Yanqing Wang
  • explore how the pruning method influences the (distribution of the) output of the network: result
  • after retraining, the distribution may reappear.
  • continue exploring the network's sparse structure.
Ying Shi
  • crawler program [finished]
  • Tibetan ASR system baseline (19.46%)
  • multilingual decoding
  • may help Zhiyong Zhang with some TTS work
  • check toolkit code (check data website and codemap) and check it into git
Yixiang Chen
Lantian Li
  • Speaker segmentation
    • Speaker change points detection + K-means clustering.
    • GMM clustering.
    • Clean up code.
  • Prepare IS2017 presentation.
  • Attend IS2017.
Zhiyuan Tang
  • For the Parrot system (revised goodness of pronunciation): 1. align the candidate speech (fbank) with phone labels using nnet3-align-compiled (almost finished); 2. analyse the alignment with rhythm, tone, tune (to be done).
  • collecting material (PPT) for Kaldi toolbook.
  • analyse the alignment with rhythm, tone, tune, (revised goodness of pronunciation).
  • toolbook writing