ASR Status Report 2017-8-14

Date People Last Week This Week
2017.8.14 Xiaofei Kang
  • Recording audio from 35 people; files are located in /work7/zhangmiao/speaker/wavdata/data_new
  • Learn the new test website from Miao Zhang
  • Go home with my mom, and come back on Friday night.
Miao Zhang
  • Recording work
  • Data preparation for the test website
  • check the linear chapter
  • Continue to record
  • do experiments on recorded speech if possible
  • check the NN chapter
Yanqing Wang
  • TRP uploaded.
  • explore the importance of sparseness structure:
    • After pruning, randomly initialize the non-zero values, then train.
    • Train an nnet with a 177-dimensional hidden layer.
    • result
  • continue exploring the values of trained nnet.
Ying Shi
  • General codeMap finished (Kazakh)
  • Crawler program delayed (most Kazakh websites are down; I will crawl data from overseas websites)
  • Collect more Unicode data, such as Tibetan and Mongolian.
  • Crawl Kazakh data from overseas websites.
Yixiang Chen
  • Study English and help Lantian do some Exps.
Lantian Li
  • Visualization and quantification for d-vector [1].
    • phone-aware and phone-blind.
    • within speaker variation and between speaker variation.
  • Speaker segmentation Exps.
  • Finish speaker segmentation Exp.
  • Prepare IS17 presentation.
Zhiyuan Tang
  • reorganize auto-scoring system, next ???
  • collecting material (PPT) for Kaldi toolbook.
  • prefer to rewrite the scoring part.
  • toolbook writing
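
The sparseness-structure experiment listed under Yanqing Wang above (prune, randomly re-initialize the surviving weights, then train) can be illustrated with a minimal sketch. The report does not say how pruning was done, so magnitude pruning, the 50% keep ratio, the uniform re-initialization range, and all function names here are assumptions for illustration only, not the actual experimental setup.

```python
import random

def magnitude_mask(weights, keep_ratio):
    """Return a 0/1 mask that keeps the largest-magnitude entries (assumed pruning rule)."""
    flat = sorted((abs(w) for row in weights for w in row), reverse=True)
    n_keep = max(1, int(round(keep_ratio * len(flat))))
    threshold = flat[n_keep - 1]
    return [[1 if abs(w) >= threshold else 0 for w in row] for row in weights]

def reinit_with_mask(mask, scale=0.1, seed=0):
    """Draw fresh random values for kept positions; pruned positions stay zero."""
    rng = random.Random(seed)
    return [[rng.uniform(-scale, scale) if m else 0.0 for m in row] for row in mask]

def apply_mask(weights, mask):
    """Re-apply the mask after a training update so the sparse structure stays fixed."""
    return [[w * m for w, m in zip(w_row, m_row)]
            for w_row, m_row in zip(weights, mask)]

# Toy "trained" weight matrix (hypothetical values).
trained = [[0.9, -0.05, 0.3], [0.01, -0.7, 0.02]]
mask = magnitude_mask(trained, keep_ratio=0.5)   # structure taken from the trained net
fresh = reinit_with_mask(mask)                   # values discarded, structure kept
```

Training `fresh` while calling `apply_mask` after every update keeps only the structure of the pruned network, which is one way to separate the contribution of structure from that of the learned values.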




Date People Last Week This Week
2017.8.7 Xiaofei Kang
  • Finish experiments of 12-style speech with ZhangMiao. (Results are shown in ZhangMiao's CVSS)
  • Complete a part of the recording work: collecting six types of sound from 13 people.
  • Finish the remaining recording work with ZhangMiao
  • Build a new test website with ZhangMiao
Miao Zhang
  • Finish experiments of 12-style speech with Xiaofei. (Results are shown in CVSS)
  • Build a new test website
  • Recording work
  • Improve the website by decreasing salience segments and replenishing other styles
Yanqing Wang
  • retrain experiments finished
  • TRP finished
  • Structure vs. value
Ying Shi
  • setup server for m2asr [finished]
  • design crawler program
  • finish the crawler program
  • CodeMap for Tibetan
Yixiang Chen
Lantian Li
  • Visualization and quantification for d-vector [2].
    • phone-aware and phone-blind.
    • within speaker variation and between speaker variation.
  • Lots of trifles.
  • Speaker segmentation task.
Zhiyuan Tang
  • Some functions of the auto-scoring system rewritten.
  • An app demo with Shuai Zhang.
  • Kaldi book writing.
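
The "within speaker variation and between speaker variation" item under Lantian Li above can be made concrete with a small sketch. The report does not state which measure was used, so the average squared-distance (scatter) measure below, the function names, and the toy vectors are assumptions for illustration.

```python
def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def speaker_variation(dvectors_by_speaker):
    """Return (within, between) variation as average squared distances."""
    all_vecs = [v for vecs in dvectors_by_speaker.values() for v in vecs]
    global_mean = mean_vector(all_vecs)
    within = 0.0
    between = 0.0
    for vecs in dvectors_by_speaker.values():
        spk_mean = mean_vector(vecs)
        # Spread of each speaker's vectors around that speaker's own mean.
        within += sum(sq_dist(v, spk_mean) for v in vecs)
        # Spread of speaker means around the global mean, weighted by count.
        between += len(vecs) * sq_dist(spk_mean, global_mean)
    n = len(all_vecs)
    return within / n, between / n

# Hypothetical 2-dimensional "d-vectors" for two speakers.
toy = {"spk_a": [[0.0, 0.0], [0.0, 2.0]], "spk_b": [[4.0, 0.0], [4.0, 2.0]]}
within, between = speaker_variation(toy)
print(within, between)  # prints 1.0 4.0
```

A larger between-to-within ratio indicates that the vectors separate speakers more cleanly; the same function could be applied to phone-aware and phone-blind vectors for comparison.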