“NLP Status Report 2017-8-28”版本间的差异

2017年8月28日 (一) 07:50的最后版本

Date	People	Last Week	This Week
2017/8/14	Jiyuan Zhang	code refactoring wrote a document[1]
	Aodong LI
	Shiyue Zhang
	Shipan Ren	read the released information of other toolkits for nmt cleaned up the code wrote the documents [2] [3]	write the papers of our baseline system read augmented nmt code
	Jiayu Guo	learn the source code of the mode	learn the source code of seq2seq model and learn tensorflow

@@ 第4行： / 第4行： @@
 | rowspan="6"|2017/8/14
 |Jiyuan Zhang ||
-*done some work about code refactoring for poem system
+*code refactoring
+*wrote a document[http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/%E6%96%87%E4%BB%B6:VvPoem.docx]
 ||
-*plan to complete code refactoring for poem system
 |-
 |Aodong LI ||
@@ 第18行： / 第19行： @@
 |-
-|Shipan Ren ||
+|Shipan Ren
-* organized all the experimental results(our baseline system,Moses,THUMT)
+||
-* trained and tested translation models（Toolkit:THUMT ）
+* read the released information of other toolkits for nmt
-* compared with our system
+* cleaned up the code
-||
+* wrote the documents [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/5c/Manual_V1.0.docx] [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/38/Manual_V0.10.docx]
-* prepare to release the baseline system（tensorflow1.0 version）
+||
+* write the papers of our baseline system
+* read augmented nmt code
 |-
 |Jiayu Guo||
-* process data and run model;
+* learn the source code of the mode
-* test results.
-checkpoint-100000 translation model
-BLEU： 11.11
-*source:在秦者名错，与张仪争论,於是惠王使错将伐蜀，遂拔，因而守之。
-*target:在秦国的名叫司马错，曾与张仪发生争论，秦惠王采纳了他的意见，于是司马错率军攻蜀国，攻取后，又让他做了蜀地郡守。
-*trans：当时秦国的人都很欣赏他的建议，与张仪一起商议，所以吴王派使者率军攻打蜀地，一举攻，接着又下令守城 。
-*source:神大用则竭，形大劳则敝，形神离则死 。
-*target:精神过度使用就会衰竭，形体过度劳累就会疲惫，神形分离就会死亡。
-*trans: 精神过度就可衰竭,身体过度劳累就会疲惫，地形也就会死。
-*source:今天子接千岁之统，封泰山，而余不得从行，是命也夫，命也夫！
-*target:现天子继承汉朝千年一统的大业，在泰山举行封禅典礼而我不能随行，这是命啊，是命啊！
-*trans: 现在天子可以继承帝位的成就爵位，爵位至泰山，而我却未能执行先帝的命运。
-*1.data used Zizhitongjian only(6,000 pairs), we can get BLEU 6 at most.
-*2.data used Zizhitongjian only(12,000 pairs), we can get BLEU 7 at most.
-*3.data used Shiji and Zizhitongjian(43,0000 pairs), we can get BLEU about 9.
-*4.data used Shiji and Zizhitongjian(43,0000 pairs), and split the ancient language text one character by one, we can get BLEU 11.11 at most.
-*The main factors now is the data(pairs of sentence/the quality——the modern language text includes context information).
 ||
-*plan to read source code of seq2seq model and learn tensorflow;
+* learn the source code of seq2seq model and learn tensorflow
-*plan to read a paper named Automatic Long Sentence Segmentation for NMT
 |-
 |-
-|zhangshuai
-||
-* learn model source code
-||
-* learn tensorflow and source code
-|-
 |}

“NLP Status Report 2017-8-28”版本间的差异

2017年8月28日 (一) 07:50的最后版本

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具