“2017-3-6”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第7行: 第7行:
 
|-
 
|-
 
|Jiyuan Zhang ||
 
|Jiyuan Zhang ||
*  reproduced planning neural network
+
*  reproduced planning neural network [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/38/Planning_neural_network_initial_decode.pdf results]
 
||  
 
||  
*reproduce planning neural network
+
*reproduce planning neural network  
 
|-
 
|-
 
|Andi Zhang ||
 
|Andi Zhang ||

2017年3月6日 (一) 04:38的版本

Date People Last Week This Week
2017/1/3 Yang Feng
Jiyuan Zhang
  • reproduced planning neural network results
  • reproduce planning neural network
Andi Zhang
  • added source masks in attention_decoder where calculates attention and in gru_cell where calculates new states.
  • found the attribute sentence_length, perhaps it works better than my code
Shiyue Zhang
  • figured out the problem of attention: the initial value of V should be around 0
  • tested different modification, such as add mask, init b with 0.
  • Compared the results, and concluded only change the initial value of V is the best.
  • try to get right attention on memory
Peilun Xiao