Xiaoxi Wang 2016-01-04
来自cslt Wiki
Last week:
debug NTM code detail: a write head can be decomposed into an erase head and an add head if the initialised memory is all zeros the gradient of add vector wrt hidden W will be NaN
This week:
complete circle centre tasks and equation solving tasks.