“M2asr-doct-papershare”版本间的差异
来自cslt Wiki
(以“媒体文件:OUTRAGEOUSLYLARGENEURALNETWORKSTHESPARSELY-GATEDMIXTURE-OF-EXPERTSLAYER.pdf OUTRAGEOUSLY LARGE NEURAL NETWORKS: THE SPARSELY-GATED MIXTURE-OF-EXPERTS...”为内容创建页面) |
|||
(相同用户的3个中间修订版本未显示) | |||
第1行: | 第1行: | ||
− | [[媒体文件:OUTRAGEOUSLYLARGENEURALNETWORKSTHESPARSELY-GATEDMIXTURE-OF-EXPERTSLAYER.pdf OUTRAGEOUSLY LARGE NEURAL NETWORKS: | + | [[媒体文件:OUTRAGEOUSLYLARGENEURALNETWORKSTHESPARSELY-GATEDMIXTURE-OF-EXPERTSLAYER.pdf|ICLR2017: OUTRAGEOUSLY LARGE NEURAL NETWORKS: THE SPARSELY-GATED MIXTURE-OF-EXPERTS LAYER]] |
− | THE SPARSELY-GATED MIXTURE-OF-EXPERTS LAYER]] | + | |
+ | [https://papers.nips.cc/paper/6469-dual-learning-for-machine-translation.pdf Dual Learning for Machine Translation] |
2017年7月11日 (二) 06:00的最后版本
ICLR2017: OUTRAGEOUSLY LARGE NEURAL NETWORKS: THE SPARSELY-GATED MIXTURE-OF-EXPERTS LAYER