Xiaoxi Wang 2016-01-04

From cslt Wiki

Last week:

Debugged the NTM code. Details: a write head can be decomposed into an erase head and an add head; if the memory is initialised to all zeros, the gradient of the add vector with respect to the hidden weights W will be NaN.
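
The erase/add decomposition of a write head can be sketched as below. This is a minimal NumPy illustration assuming the standard NTM write rule (erase, then add), not the code that was debugged; the function name `ntm_write`, the array shapes, and the example values are hypothetical.

```python
import numpy as np

def ntm_write(memory, w, erase, add):
    """Apply one write step, split into an erase part and an add part.

    memory: (N, M) array, N memory slots of width M
    w:      (N,) write weighting over the slots
    erase:  (M,) erase vector with entries in [0, 1]
    add:    (M,) add vector
    """
    # Erase head: scale each slot by (1 - w_i * erase), element-wise.
    erased = memory * (1.0 - np.outer(w, erase))
    # Add head: add w_i * add to each slot.
    return erased + np.outer(w, add)

# Example values (assumed): all-zero initial memory, write focused on slot 1.
memory = np.zeros((4, 3))
w = np.array([0.0, 1.0, 0.0, 0.0])
erase = np.ones(3)
add = np.array([0.5, -1.0, 2.0])
updated = ntm_write(memory, w, erase, add)
```

With an all-zero memory the erase step has no effect, so the updated slot holds just the add vector scaled by its write weight.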

This week:

Complete the circle-centre tasks and the equation-solving tasks.