“OpenAI Five”的意思、由来-开放百科全书

OpenAI Five is the name of a machine learning project and system that performs as a team of bots playing against human players in the competitive five-on-five video game, Dota 2. The bots learn to play against humans at a high skill level entirely through machine learning. The system was developed by OpenAI, a non-profit American AI research and development company founded with the mission to develop safe AI in a way that benefits humanity. OpenAI Five's development traces back to 2017, where it was first demonstrated in a live one-on-one game against a professional player of the game known as Dendi, who lost to it. The following year, the system had advanced to the point of performing as a full team of five, and began playing against professional teams.

According to OpenAI, the company uses the game as an experiment for general-purpose, applied machine learning to capture the unpredictability and continuous nature of the real world. The team stated that the complex nature of Dota 2 and its strong reliance on having to work together as a team to win was a major reason it was specifically chosen. The algorithms used for the project have also been applied to other systems, such as controlling a robotic hand. The project has also been compared to a number of other similar cases of AI playing against and defeating humans, such as Watson on the Jeopardy! television game show, Deep Blue in chess, and AlphaGo in the board game Go.

History

Development on the algorithms used for the bots began in November 2016 when Dota 2, a competitive five-on-five video game, was chosen due to it being popular on the live streaming platform Twitch.tv, had native support for Linux, and had an application programming interface (API) available.^[1] Before becoming a team of five, the first public demonstration occurred at The International 2017 in August, the annual premiere championship tournament for the game, where Dendi, a professional Ukrainian player of the game, lost against a bot in a live one-on-one matchup.^[2]^[3] After the match, CTO Greg Brockman explained that the bot had learned by playing against itself for two weeks of real time, and that the learning software was a step in the direction of creating software that can handle complex tasks "like being a surgeon".^[4]^[5] OpenAI calls the system "reinforcement learning", as the bots learn over time by playing against itself hundreds a times a day for months, in which they are rewarded for actions such as killing an enemy and destroying towers.^[6]^[7]^[8]

By June 2018, the ability of the bots expanded to play together as a full team of five and were able to defeat teams of amateur and semi-professional players.^[9]^[10]^[11]^[12] At The International 2018, OpenAI Five played in two games against professional teams, one against the Brazilian-based paiN Gaming and the other against an all-star team of former Chinese players.^[13]^[14] Although the bots lost both matches, OpenAI still considered it a successful venture, stating that playing against some of the best players in Dota 2 allowed them to analyze and adjust their algorithms for future games.^[15] In April 2019, the bots will play against The International 2018 champions OG at a live event in San Francisco.^[16]

Architecture

Each OpenAI Five network contains a single layer with a 1024-unit LSTM^[17] that observes the current game state extracted from the Dota developer’s API. The neural network conducts actions via numerous possible action heads (no human data involved), and every head has meaning. For instance, the number of ticks to delay an action, what action to select – the X or Y coordinate of this action in a grid around the unit. In addition, action heads are computed independently. The AI system observes the world as a list of 20,000 numbers and takes an action by conducting a list of eight enumeration values. Also, it selects different actions and targets to understand how to encode every action and observes the world.^[18]

OpenAI Five has been developed as a general-purpose reinforcement learning training system on the "Rapid" infrastructure. Rapid consists of two layers: it spins up thousands of machines and helps them ‘talk’ to each other and a second layer runs software. OpenAI Five has played 180 years’ worth of games in a self-play technique, which is a part of the reinforcement learning approach, using a scaled-up version of "Proximal Policy Optimization".^[19]^[20]running on 256 GPUs and 128,000 CPU cores.^[21]

Comparisons with other game AI systems

Prior to OpenAI Five, other AI versus human experiments and systems have been successful used before, such as Jeopardy! with Watson, chess with Deep Blue, and Go with AlphaGo.^[22]^[23]^[24] In comparison with other games that have used AI systems to play against human players, Dota 2 differs as explained below:

Reception

OpenAI Five have received acknowledgement from the AI, tech, and video game community at large. Microsoft founder Bill Gates called it a "big deal", as their victories "required teamwork and collaboration".^[30]^[31] Chess player Garry Kasparov, who lost against the Deep Blue AI in 1997, stated that despite their losing performance at The International 2018, the bots would eventually "get there, and sooner than expected".^[32]

Andreas Theodorou, an AI researcher at the University of Bath who uses computer games to study collaboration, says OpenAI Five was a "big step forward" in the AI industry, although noting that perhaps the most significant achievement was their use of transparent visualizations.^[33] In a conversation with MIT Technology Review, AI experts also considered OpenAI Five system as a significant achievement, as they noted that Dota 2 was an "extremely complicated game", so even beating non-professional players was impressive.^[34] Inspired by the success of OpenAI five, other AI companies have begun to develop systems that will be able to compete in similar complex video games that require strategic thinking, team play, and reasoning, such as StarCraft.^[35]^[36]

References

1. ^{{cite web|url=https://openai.com/five|title=OpenAI Five|website=openai.com/five}}
2. ^{{cite web |last1=Savov |first1=Vlad |title=My favorite game has been invaded by killer AI bots and Elon Musk hype |url=https://www.theverge.com/2017/8/14/16141938/dota-2-openai-bots-elon-musk-artificial-intelligence |website=The Verge |accessdate=25 June 2018}}
3. ^{{cite web|last1=Frank|first1=Blair Hanley|title=OpenAI’s bot beats top Dota 2 player so badly that he quits|url=https://venturebeat.com/2017/08/11/openais-bot-beats-top-dota-2-player-so-badly-that-he-quits/|website=Venture Beat|accessdate=12 August 2017|deadurl=no|archiveurl=https://web.archive.org/web/20170812065202/https://venturebeat.com/2017/08/11/openais-bot-beats-top-dota-2-player-so-badly-that-he-quits/|archivedate=12 August 2017}}
4. ^{{cite web|title=Dota 2|url=https://blog.openai.com/dota-2/|website=blog.openai.com|accessdate=12 August 2017}}
5. ^{{cite web|title=More on Dota 2|url=https://blog.openai.com/more-on-dota-2/|website=blog.openai.com|accessdate=16 August 2017}}
6. ^{{cite web |last1=Simonite |first1=Tom |title=Can Bots Outwit Humans in One of the Biggest Esports Games? |url=https://www.wired.com/story/can-bots-outwit-humans-in-one-of-the-biggest-esports-games/ |website=Wired |accessdate=25 June 2018}}
7. ^{{cite web |last1=Kahn |first1=Jeremy |title=A Bot Backed by Elon Musk Has Made an AI Breakthrough in Video Game World |url=https://www.bloomberg.com/news/articles/2018-06-25/musk-backed-bot-conquers-e-gamer-teams-in-ai-breakthrough |website=Bloomberg |accessdate=27 June 2018}}
8. ^{{cite web |last1=Clifford |first1=Catherine |title=Bill Gates says gamer bots from Elon Musk-backed nonprofit are 'huge milestone' in A.I. |url=https://www.cnbc.com/2018/06/27/bill-gates-openai-robots-beating-humans-at-dota-2-is-ai-milestone.html |website=CNBC |accessdate=29 June 2018}}
9. ^{{cite web |title=OpenAI Five Benchmark |url=https://blog.openai.com/openai-five-benchmark/ |website=blog.openai.com |accessdate=25 August 2018}}
10. ^{{cite web |last1=Simonite |first1=Tom |title=Can Bots Outwit Humans in One of the Biggest Esports Games? |url=https://www.wired.com/story/can-bots-outwit-humans-in-one-of-the-biggest-esports-games/ |website=Wired |accessdate=25 June 2018}}
11. ^{{cite web |last1=Vincent |first1=James |title=AI bots trained for 180 years a day to beat humans at Dota 2 |url=https://www.theverge.com/2018/6/25/17492918/openai-dota-2-bot-ai-five-5v5-matches |website=The Verge |accessdate=25 June 2018}}
12. ^{{cite web |last1=Savov |first1=Vlad |title=The OpenAI Dota 2 bots just defeated a team of former pros |url=https://www.theverge.com/2018/8/6/17655086/dota2-openai-bots-professional-gaming-ai |website=The Verge |accessdate=7 August 2018}}
13. ^{{cite web |last1=Simonite |first1=Tom |title=Pro Gamers Fend off Elon Musk-Backed AI Bots—for Now |url=https://www.wired.com/story/pro-gamers-fend-off-elon-musks-ai-bots/ |website=Wired |accessdate=25 August 2018}}
14. ^{{cite web |last1=Quach |first1=Katyanna |title=Game over, machines: Humans defeat OpenAI bots once again at video games Olympics |url=https://www.theregister.co.uk/2018/08/24/openai_bots_eliminated_dota_2/ |website=The Register |accessdate=25 August 2018}}
15. ^{{cite web |title=The International 2018: Results |url=https://blog.openai.com/the-international-2018-results/ |website=blog.openai.com |accessdate=25 August 2018}}
16. ^{{cite web |author1=Gopya |title=TI8 champions OG to take on OpenAI in San Francisco |url=https://www.vpesports.com/dota2/news/ti8-champions-og-to-take-on-openai-in-san-francisco |website=VP Esports |accessdate=27 March 2019}}
17. ^{{cite web|url=https://colah.github.io/posts/2015-08-Understanding-LSTMs/#lstm-networks|title=Understanding LSTM Networks|work=colah's blog|accessdate=27 August 2015}}
18. ^{{cite web|url=https://blog.openai.com/openai-five|title=OpenAI Five|website=blog.openai.com|accessdate=25 June 2018|date=25 June 2018}}
19. ^{{cite web|url=https://qz.com/1348177/why-are-ai-researchers-so-obsessed-with-games|title=Why are AI researchers so obsessed with games?|work=QUARTZ|accessdate=4 August 2018}}
20. ^{{Cite arxiv|title=Proximal Policy Optimization Algorithms|eprint = 1707.06347|last1 = Schulman|first1 = John|last2 = Wolski|first2 = Filip|last3 = Dhariwal|first3 = Prafulla|last4 = Radford|first4 = Alec|last5 = Klimov|first5 = Oleg|class = cs.LG|year = 2017}}
21. ^{{cite web|url=https://blog.openai.com/openai-five|title=OpenAI Five|website=blog.openai.com|accessdate=25 June 2018|date=25 June 2018}}
22. ^{{Cite newspaper|url=https://www.theguardian.com/technology/2011/feb/17/ibm-computer-watson-wins-jeopardy|title=IBM computer Watson wins Jeopardy clash|journal=The Guardian|accessdate=17 February 2011|date=17 February 2011|last1=Gabbatt|first1=Adam}}
23. ^{{cite web|url=https://www.businessinsider.com/garry-kasparov-talks-about-artificial-intelligence-2017-12|title=Chess grandmaster Garry Kasparov on what happens when machines 'reach the level that is impossible for humans to compete'|work=Business Insider|accessdate=29 December 2017}}
24. ^{{cite web|url=https://www.theverge.com/2017/10/18/16495548/deepmind-ai-go-alphago-zero-self-taught|title=DeepMind's Go-playing AI doesn't need human help to beat us anymore|work=Verge|accessdate=18 October 2017|date=18 October 2017}}
25. ^{{cite web|url=https://blog.openai.com/openai-five/#coordination|title=OpenAI Five|website=blog.openai.com|accessdate=25 June 2018|date=25 June 2018}}
26. ^{{cite web|url=https://blog.openai.com/openai-five/#coordination|title=OpenAI Five|website=blog.openai.com|accessdate=25 June 2018|date=25 June 2018}}
27. ^{{cite web|url=https://www.technologyreview.com/s/611536/a-team-of-ai-algorithms-just-crushed-expert-humans-in-a-complex-computer-game|title=A team of AI algorithms just crushed humans in a complex computer game|work= MIT Tech Review|accessdate=25 June 2018}}
28. ^{{cite web|url=https://blog.openai.com/openai-five/#coordination|title=OpenAI Five|website=blog.openai.com|accessdate=25 June 2018|date=25 June 2018}}
29. ^{{cite web|url=https://blog.openai.com/openai-five/#coordination|title=OpenAI Five|website=blog.openai.com|accessdate=25 June 2018|date=25 June 2018}}
30. ^{{cite web|url=https://www.cnbc.com/2018/06/27/bill-gates-openai-robots-beating-humans-at-dota-2-is-ai-milestone.html|title=Bill Gates says gamer bots from Elon Musk-backed nonprofit are 'huge milestone' in A.I.|work=CNBC|accessdate=28 June 2018|date=28 June 2018}}
31. ^{{cite web|url=https://www.businessinsider.com/bill-gates-hails-huge-milestone-for-ai-as-bots-beat-humans-at-dota-2-2018-6|title=Bill Gates hails 'huge milestone' for AI as bots work in a team to destroy humans at video game 'Dota 2'|work=Business Insider|accessdate=27 June 2018}}
32. ^{{cite web|url=https://twitter.com/Kasparov63/status/1033108573151092736|title=Gary Kasparov's Twitter|accessdate=24 August 2018|date=24 August 2018}}
33. ^{{cite web|url=https://www.theverge.com/2018/6/25/17492918/openai-dota-2-bot-ai-five-5v5-matches|title=AI bots trained for 180 years a day to beat humans at Dota 2|work=Verge|accessdate=25 June 2018|date=25 June 2018}}
34. ^{{cite web|url=https://www.technologyreview.com/s/611536/a-team-of-ai-algorithms-just-crushed-expert-humans-in-a-complex-computer-game|title=A team of AI algorithms just crushed humans in a complex computer game|work= MIT Tech Review|accessdate=25 June 2018}}
35. ^{{cite web|url=https://www.technologyreview.com/s/609242/humans-are-still-better-than-ai-at-starcraftfor-now|title=Humans Are Still Better Than AI at StarCraft for Now|work=MIT Tech Review|accessdate=1 November 2017}}
36. ^{{cite web|url=https://www.wired.com/story/can-bots-outwit-humans-in-one-of-the-biggest-esports-games|title=CAN BOTS OUTWIT HUMANS IN ONE OF THE BIGGEST ESPORTS GAMES?|work=WIRED|accessdate=25 June 2018}}

History

Architecture

Comparisons with other game AI systems

Reception

References

External links