ASR Status Report 2017-9-11

From cslt Wiki
Revision as of 05:22, 11 September 2017 (Mon) by Kangxf (talk | contribs)

Date People Last Week This Week
2017.9.4


Jiayin Cai
Xiaofei Kang
  • Improve the human test website: save the test recordings, reduce the number of positive samples
  • Record and cut the audio, 12 groups in total
  • Record the remaining 440 groups of audio with Miao Zhang
  • Continue to ask people to do human test
Miao Zhang
  • Perform human test
  • Record some other people and do the experiments again
  • Continue to ask people to do human test
  • Recording (the goal is to record 400 to 500 people)
Yanqing Wang
Ying Shi
Yixiang Chen
Lantian Li
Zhiyuan Tang

Date People Last Week This Week
2017.9.4


Jiayin Cai
  • Finished the phonetic i-vector experiment.
  • Get BN features and train i-vector LID.
  • Get phonetic features from a stronger phonetic network.
  • Combine PTN and phonetic i-vector.
Xiaofei Kang
  • Cut and label audio: 21 speakers, 1050 sentences in total
  • Finish the new speaker recognition using the two recordings.
  • Improve the human test website
Miao Zhang
  • Absent
  • Perform human test on 21-style speech (add the disguise)
  • Draw spectra and t-SNE plots to compare with the experimental results
Yanqing Wang
  • Absent.
Ying Shi
  • Multi-decoding ASR model
  • Multi-decoding with fake LID here
  • Read code about TTS
  • Employ group softmax to train the multi-decoding ASR model
  • Synthesize one 'real' speech
Yixiang Chen
  • Absent.
Lantian Li
  • Continue the speaker segmentation tasks, see here
    • Dimensionality reduction.
    • Clustering.
    • Visualization.
  • Phonetic-aware speaker segmentation.
Zhiyuan Tang
  • More indicators for the VV scoring system, see [1].
  • More indicators, a demo with Shuai.
  • Toolbook writing.