FreeNeb Status Report 2017-11-20

来自cslt Wiki
2017年11月20日 (一) 01:13Zhaomy讨论 | 贡献的版本

(差异) ←上一版本 | 最后版本 (差异) | 下一版本→ (差异)
跳转至: 导航搜索

This Week:

People Last Week This Week Next Week Task Tracing(DeadLine)
Mengyuan Zhao
  • Engineering
  1. Draft API document of embedded-ASR engine.
  2. Draft API document of Deep Feature Extractor(deepfe).
  3. Draft a server version ASR DEMO, but still have bugs.
  • Engineering
  1. Try to finish server version of TTS DEMO.
Zhiyong Zhang
  • Train multi-speaker TTS based on Huilian and roobo data
  • Base model done, but the synthesised wav is not good. It seems the acoustic model does not converge.


  • Continue to find the problem of poor acoustic predicting of Multi-speaker TTS;
  • To train duration-model using 16k data.
Yang Wei
  • Write test specification for FreeNeb TTS engine.
  • Test FreeNeb TTS engine.
Dong Wang
  • ICASSP
  • OC2017


Zhenlong Han
  • Finish training Japanese acoustic model with transfer learning. Now the MPE is on training.

ECO135 ECO_77 ETT SPC160
baseline_4X1200X9391_xent 21.06 13.23 26.73 16.6
embedded_6X512X800_xent 26.85 18.38 33.22 24.02
embedded_6X400X800_transfer_learning_xent 23.8 15.71 29.66 19.64

  • VAD is finished.
  • English embedded model training.
  • Uyghur embedded model training.
Shuai Zhang
  • Add the intention of statistics into the graph and test it.
  • According to the feedback, modify the project
  • Complete the VVParrot project
  • Add the documents about the two project


Yanchi Jin
  • Complete the megrez_tool (Fizzim) output format.
  • Start the compilation of the unit testing and overall test framework.
  • Complete the compilation of the framework.