“140603 Xiaoxi Wang”版本间的差异
来自cslt Wiki
(以内容“Last week: Improved corpora proprecessing tools (http stripper, num2hanzi), and reprocessed weibo corpora learned cross-entropy difference based domain specific corpo...”创建新页面) |
(没有差异)
|
2014年6月3日 (二) 04:38的最后版本
Last week:
Improved corpora proprecessing tools (http stripper, num2hanzi), and reprocessed weibo corpora
learned cross-entropy difference based domain specific corpora extraction method.
recorded voice of numbers for testing
This week:
Train new lm with new corpora (weibo)
Compare new in-domain corpora selection method and old topic spotting based method