“NLP Status Report 2017-8-21”版本间的差异
来自cslt Wiki
| (1位用户的2个中间修订版本未显示) | |||
| 第19行: | 第19行: | ||
|- | |- | ||
|Shipan Ren || | |Shipan Ren || | ||
| − | * organized all the experimental results(our baseline system,Moses,THUMT) | + | * organized all the experimental results(our baseline system,Moses,THUMT) [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/89/Nmt_baseline.xlsx] |
* trained and tested translation models(Toolkit:THUMT ) | * trained and tested translation models(Toolkit:THUMT ) | ||
* compared with our system | * compared with our system | ||
| 第45行: | 第45行: | ||
*3.data used Shiji and Zizhitongjian(43,0000 pairs), we can get BLEU about 9. | *3.data used Shiji and Zizhitongjian(43,0000 pairs), we can get BLEU about 9. | ||
*4.data used Shiji and Zizhitongjian(43,0000 pairs), and split the ancient language text one character by one, we can get BLEU 11.11 at most. | *4.data used Shiji and Zizhitongjian(43,0000 pairs), and split the ancient language text one character by one, we can get BLEU 11.11 at most. | ||
| − | *The main factors now is the data( | + | *The main factors now is the data(pairs of sentence/the quality——the modern language text includes context information). |
|| | || | ||
| − | *plan to read source code of seq2seq model; | + | *plan to read source code of seq2seq model and learn tensorflow; |
*plan to read a paper named Automatic Long Sentence Segmentation for NMT | *plan to read a paper named Automatic Long Sentence Segmentation for NMT | ||
|- | |- | ||
|} | |} | ||
2017年8月28日 (一) 07:44的最后版本
| Date | People | Last Week | This Week |
|---|---|---|---|
| 2017/8/14 | Jiyuan Zhang |
|
|
| Aodong LI | |||
| Shiyue Zhang | |||
| Shipan Ren |
|
| |
| Jiayu Guo |
checkpoint-100000 translation model BLEU: 11.11
|
|