Shiyue Zhang |
- added the trained memory-attention model to the neural model (43.0) and got a 2+ BLEU gain (45.19), but it needs more validation and improvement
- ran the baseline model on cs-en data; it performed well on the training set but poorly on the test set.
- ran the baseline model on en-fr data and hit an 'inf' problem.
- fixed the 'inf' problem by debugging the code of the mask-added baseline model.
- rerunning on cs-en and en-fr data.
|
- continue with the baseline on big data: get results on cs-en and en-fr data, train on zh-en data from WMT17
- continue refining the memory-attention model: retrain to check whether the 2+ gain is just by chance, try more memory-attention structures (relu, a(t-1), y(t-1)...)
|