Skip to content

Latest commit

 

History

History
11 lines (11 loc) · 1.57 KB

research.md

File metadata and controls

11 lines (11 loc) · 1.57 KB

Research

  1. Proximal Policy Optimization (PPO) https://arxiv.org/abs/1707.06347
  2. Multi-Agent DDPG https://github.com/openai/maddpg
  3. Monte Carlo Tree Search https://gnunet.org/sites/default/files/Browne%20et%20al%20-%20A%20survey%20of%20MCTS%20methods.pdf
  4. Monte Carlo Tree Search and Reinforcement Learning https://www.jair.org/media/5507/live-5507-10333-jair.pdf
  5. Cooperative Multi-Agent Learning https://link.springer.com/article/10.1007/s10458-005-2631-2
  6. Opponent Modeling in Deep Reinforcement Learning http://www.umiacs.umd.edu/~hal/docs/daume16opponent.pdf
  7. Machine Theory of Mind https://arxiv.org/pdf/1802.07740.pdf
  8. Coordinated Multi-Agent Imitation Learning https://arxiv.org/pdf/1703.03121.pdf
  9. Deep Reinforcement Learning from Self-Play in Imperfect-Information Games https://arxiv.org/pdf/1603.01121.pdf andhttp://proceedings.mlr.press/v37/heinrich15.pdf
  10. Autonomous Agents Modelling Other Agents http://www.cs.utexas.edu/~pstone/Papers/bib2html-links/AIJ18-Albrecht.pdf