“140428-Ma xi”版本间的差异
来自cslt Wiki
(以内容“Last week: 1.Learning the knowledge of language model 2.Segment the training/test sentences with the 150k lexicon; Get the ppl of the test: /nfs/home/zhangzhiyong/...”创建新页面) |
|||
第13行: | 第13行: | ||
This week: | This week: | ||
− | To extract sentences of the related field from the original corpus. | + | 1.To extract sentences of the related field from the original corpus. |
2014年4月28日 (一) 11:32的最后版本
Last week:
1.Learning the knowledge of language model
2.Segment the training/test sentences with the 150k lexicon; Get the ppl of the
test: /nfs/home/zhangzhiyong/work/train_470h/test/huawei_disanpi.txt. Using the following LM:
/home/thdnn/resource/lm/Hunhe_zhongzi_and_add_and_PPL_5yuan_1e9.lm
3.Build the new LM using the lexicon with the keywords involved; Re-segment the test files, and test the PPL.
This week:
1.To extract sentences of the related field from the original corpus.