140603 Xiaoxi Wang

来自cslt Wiki
2014年6月3日 (二) 04:38Wxx讨论 | 贡献的版本

(差异) ←上一版本 | 最后版本 (差异) | 下一版本→ (差异)
跳转至: 导航搜索

Last week:

Improved corpora proprecessing tools (http stripper, num2hanzi), and reprocessed weibo corpora

learned cross-entropy difference based domain specific corpora extraction method.

recorded voice of numbers for testing

This week:

Train new lm with new corpora (weibo)

Compare new in-domain corpora selection method and old topic spotting based method