Difference between revisions of "NLP Status Report 2017-8-21"
From cslt Wiki
Line 41: | Line 41: | ||
*target:现天子继承汉朝千年一统的大业,在泰山举行封禅典礼而我不能随行,这是命啊,是命啊! | *target:现天子继承汉朝千年一统的大业,在泰山举行封禅典礼而我不能随行,这是命啊,是命啊! | ||
*trans: 现在天子可以继承帝位的成就爵位,爵位至泰山,而我却未能执行先帝的命运。 | *trans: 现在天子可以继承帝位的成就爵位,爵位至泰山,而我却未能执行先帝的命运。 | ||
− | |||
− | |||
− | |||
*1. Using only Zizhitongjian data (6,000 pairs), we get a BLEU of at most 6. | *1. Using only Zizhitongjian data (6,000 pairs), we get a BLEU of at most 6. | ||
*2. Using only Zizhitongjian data (12,000 pairs), we get a BLEU of at most 7. | *2. Using only Zizhitongjian data (12,000 pairs), we get a BLEU of at most 7. | ||
Line 49: | Line 46: | ||
*4. Using Shiji and Zizhitongjian data (43,0000 pairs) and splitting the ancient-language text character by character, we get a BLEU of at most 11.11. | *4. Using Shiji and Zizhitongjian data (43,0000 pairs) and splitting the ancient-language text character by character, we get a BLEU of at most 11.11. | ||
*The main factor now is the data (including the number of sentence pairs and their quality, because the modern-language text includes context information). | *The main factor now is the data (including the number of sentence pairs and their quality, because the modern-language text includes context information). | ||
+ | || | ||
+ | *plan to read the source code of the seq2seq model;
+ | *plan to read the paper "Automatic Long Sentence Segmentation for NMT"
|- | |- | ||
|} | |} |
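The character-by-character splitting mentioned in item 4 can be sketched as below. This is only a minimal illustration, assuming the training pipeline expects whitespace-separated tokens; the function name `split_chars` and the sample string are hypothetical and are not taken from the report's data or code.

```python
def split_chars(sentence):
    """Split a sentence into single characters joined by spaces.

    Assumption: the downstream seq2seq toolkit tokenizes on whitespace,
    so every character becomes its own token. Existing whitespace in
    the input is dropped rather than kept as a token.
    """
    return " ".join(ch for ch in sentence if not ch.isspace())


if __name__ == "__main__":
    # Hypothetical sample string, used only to show the output format.
    print(split_chars("天子封禅"))
```

Splitting at the character level sidesteps word segmentation for the ancient-language side, at the cost of longer input sequences.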
Revision as of 06:37, 21 August 2017 (Mon)
Date | People | Last Week | This Week |
---|---|---|---|
2017/8/14 | Jiyuan Zhang |
|
|
Aodong LI | |||
Shiyue Zhang | |||
Shipan Ren |
|
| |
Jiayu Guo |
checkpoint-100000 translation model BLEU: 11.11
|
|
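For readers unfamiliar with the BLEU figures quoted in this report, the following is a minimal sketch of a sentence-level BLEU computation. The report does not say which tool or settings produced its scores, so everything here is an assumption: a single reference, n-grams up to length 4, no smoothing, and a 0 to 100 scale. The names `ngram_counts` and `sentence_bleu` are hypothetical. Corpus-level BLEU, which is what is usually reported, aggregates counts over all sentences and will give different numbers.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(reference, hypothesis, max_n=4):
    """Unsmoothed single-reference BLEU on a 0-100 scale (illustrative only)."""
    if not hypothesis:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        hyp = ngram_counts(hypothesis, n)
        ref = ngram_counts(reference, n)
        total = sum(hyp.values())
        # Clip each hypothesis n-gram count by its count in the reference.
        clipped = sum(min(count, ref[gram]) for gram, count in hyp.items())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU 0
        log_precision_sum += math.log(clipped / total)
    # Brevity penalty: penalize hypotheses shorter than the reference.
    if len(hypothesis) > len(reference):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1.0 - len(reference) / len(hypothesis))
    return 100.0 * brevity_penalty * math.exp(log_precision_sum / max_n)


if __name__ == "__main__":
    # Character-level tokens, matching the character splitting used in the report.
    reference = list("这是命啊")
    hypothesis = list("这是命啊")
    print(sentence_bleu(reference, hypothesis))
```

Tokens are passed in as lists, so the same function works for character-level or word-level segmentation.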