“140603 Xiaoxi Wang”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
Wxx讨论 | 贡献
(以内容“Last week: Improved corpora proprecessing tools (http stripper, num2hanzi), and reprocessed weibo corpora learned cross-entropy difference based domain specific corpo...”创建新页面)
 
(没有差异)

2014年6月3日 (二) 04:38的最后版本

Last week:

Improved corpora proprecessing tools (http stripper, num2hanzi), and reprocessed weibo corpora

learned cross-entropy difference based domain specific corpora extraction method.

recorded voice of numbers for testing

This week:

Train new lm with new corpora (weibo)

Compare new in-domain corpora selection method and old topic spotting based method