Multi-agent reinforcement learning

Another promising area making significant strides is multi-agent reinforcement learning. Contrary to the problems we've seen where only one agent makes decisions, this topic involves having multiple agents make decisions simultaneously and cooperatively in order to achieve a common objective. One of the most significant works related to this has been OpenAI's Dota2-playing system, called OpenAI Five. Dota2 is one of the world's most popular Massively Multiplayer Online Role Playing Game (MMORPGs). Compared to traditional RL games such as Go and Atari, Dota2 is more complex for the following reasons:

For more information on OpenAI's Dota2 AI, check out their blogs on the project at https://blog.openai.com/openai-five/.

Moreover, by using novel upgrades on traditional reinforcement learning algorithms, each agent in OpenAI Five was able to learn to cooperate with one another in order to reach the common objective of destroying the enemy's base. They were even able to learn several team strategies that experienced human players employ. The following is a screenshot from a game being played between a team of Dota players and OpenAI Five:

OpenAI versus human players (source: https://www.youtube.com/watch?v=eaBYhLttETw)

Despite the extreme levels of resource requirements (240 GPUs, 120,000 CPU cores, ~200 human years of gameplay in a single day), this project demonstrates that current AI algorithms are indeed able to cooperate with one another to reach a common objective in a vastly complex environment. This work symbolizes another significant advancement in AI and RL research and demonstrates what the current technology is capable of.