“NLP Status Report 2017-5-31”版本间的差异
来自cslt Wiki
| 第7行: | 第7行: | ||
|- | |- | ||
|Aodong LI || | |Aodong LI || | ||
| − | + | * code double-attention model with '''final_attn = alpha * attn_ch + beta * attn_en''' | |
| + | * baseline bleu = '''43.87''' | ||
| + | * experiments with '''random''' initialized embedding: | ||
| + | {| class="wikitable" | ||
| + | |- | ||
| + | ! alpha | ||
| + | ! beta | ||
| + | ! result (bleu) | ||
| + | |- | ||
| + | | 1 | ||
| + | | 1 | ||
| + | | 43.50 | ||
| + | |- | ||
| + | | 4/3 | ||
| + | | 2/3 | ||
| + | | 43.58 (w/o retrained) | ||
| + | |- | ||
| + | | 2/3 | ||
| + | | 4/3 | ||
| + | | 42.22 (w/o retrained) | ||
| + | |- | ||
| + | | 2/3 | ||
| + | | 4/3 | ||
| + | | 42.36 (w/ retrained) | ||
| + | |} | ||
| + | * experiments with '''constant''' initialized embedding: | ||
| + | {| class="wikitable" | ||
| + | |- | ||
| + | ! alpha | ||
| + | ! beta | ||
| + | ! result (bleu) | ||
| + | |- | ||
| + | | 1 | ||
| + | | 1 | ||
| + | | '''45.41''' | ||
| + | |- | ||
| + | | 4/3 | ||
| + | | 2/3 | ||
| + | | '''45.79''' | ||
| + | |- | ||
| + | | 2/3 | ||
| + | | 4/3 | ||
| + | | '''45.32''' | ||
| + | |} | ||
| + | * This model is similar to multi-source neural translation but uses less resource | ||
|| | || | ||
| − | + | * Explore different attention merge strategies | |
| + | * Explore hierarchical model | ||
|- | |- | ||
|Shiyue Zhang || | |Shiyue Zhang || | ||
2017年5月31日 (三) 04:42的版本
| Date | People | Last Week | This Week | ||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2017/5/31 | Jiyuan Zhang | ||||||||||||||||||||||||||||
| Aodong LI |
|
| |||||||||||||||||||||||||||
| Shiyue Zhang |
|
| |||||||||||||||||||||||||||
| Shipan Ren |