Difference between revisions of "Chapter 32: AI Games"

Latest revision as of 02:22, 13 August 2023 (Sunday)

Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning[J]. Nature, 2015, 518(7540): 529-533. [14]
Baker B, Kanitscheider I, Markov T, et al. Emergent tool use from multi-agent autocurricula[J]. arXiv preprint arXiv:1909.07528, 2019. [15]
Arulkumaran K, Cully A, Togelius J. AlphaStar: An evolutionary computation perspective[C]//Proceedings of the genetic and evolutionary computation conference companion. 2019: 314-315. [16]
Niels Justesen, Philip Bontrager, Julian Togelius, Sebastian Risi, Deep Learning for Video Game Playing [17][18]

@@ Line 11: / Line 11: @@
 * OpenAI hide-and-seek game [https://openai.com/blog/emergent-tool-use/]
 * DeepMind AlphaStar blog [https://www.deepmind.com/blog/alphastar-mastering-the-real-time-strategy-game-starcraft-ii]
+* Is AlphaStar really intelligent? [https://www.sohu.com/a/294455221_610473]
+* Reproducing AlphaStar, DeepMind's strongest StarCraft AI [https://zhuanlan.zhihu.com/p/56539931]
@@ Line 27: / Line 30: @@
 ==Developer Resources==
-* Dou Dizhu [https://github.com/kwai/DouZero/blob/main/README.zh-CN.md]
+* Dou Dizhu [*][https://github.com/kwai/DouZero/blob/main/README.zh-CN.md]
 ==Advanced Readers==
@@ Line 34: / Line 37: @@
 * Baker B, Kanitscheider I, Markov T, et al. Emergent tool use from multi-agent autocurricula[J]. arXiv preprint arXiv:1909.07528, 2019. [https://arxiv.org/pdf/1909.07528]
 * Arulkumaran K, Cully A, Togelius J. AlphaStar: An evolutionary computation perspective[C]//Proceedings of the genetic and evolutionary computation conference companion. 2019: 314-315. [https://arxiv.org/pdf/1902.01724]
+* Niels Justesen, Philip Bontrager, Julian Togelius, Sebastian Risi, Deep Learning for Video Game Playing [https://arxiv.org/abs/1708.07902][https://github.com/hijkzzz/deep-reinforcement-learning-notes]