“NLP Status Report 2017-8-21”版本间的差异
来自cslt Wiki
(1位用户的2个中间修订版本未显示) | |||
第19行: | 第19行: | ||
|- | |- | ||
|Shipan Ren || | |Shipan Ren || | ||
− | * organized all the experimental results(our baseline system,Moses,THUMT) | + | * organized all the experimental results(our baseline system,Moses,THUMT) [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/89/Nmt_baseline.xlsx] |
* trained and tested translation models(Toolkit:THUMT ) | * trained and tested translation models(Toolkit:THUMT ) | ||
* compared with our system | * compared with our system | ||
第45行: | 第45行: | ||
*3.data used Shiji and Zizhitongjian(43,0000 pairs), we can get BLEU about 9. | *3.data used Shiji and Zizhitongjian(43,0000 pairs), we can get BLEU about 9. | ||
*4.data used Shiji and Zizhitongjian(43,0000 pairs), and split the ancient language text one character by one, we can get BLEU 11.11 at most. | *4.data used Shiji and Zizhitongjian(43,0000 pairs), and split the ancient language text one character by one, we can get BLEU 11.11 at most. | ||
− | *The main factors now is the data( | + | *The main factors now is the data(pairs of sentence/the quality——the modern language text includes context information). |
|| | || | ||
− | *plan to read source code of seq2seq model; | + | *plan to read source code of seq2seq model and learn tensorflow; |
*plan to read a paper named Automatic Long Sentence Segmentation for NMT | *plan to read a paper named Automatic Long Sentence Segmentation for NMT | ||
|- | |- | ||
|} | |} |
2017年8月28日 (一) 07:44的最后版本
Date | People | Last Week | This Week |
---|---|---|---|
2017/8/14 | Jiyuan Zhang |
|
|
Aodong LI | |||
Shiyue Zhang | |||
Shipan Ren |
|
| |
Jiayu Guo |
checkpoint-100000 translation model BLEU: 11.11
|
|