Exploring Computer Science and Engineering : Prompt 9: Recognizing Significance - AlphaGo Beats One of the World's Best Go Players

By Qingyang Li

In March 2016, Google and its subsidiary DeepMind set up a Go match between AlphaGo, the artificial intelligence program they developed, and the Korean Go player Lee Sedol, who was ranked No. 3 among Go players. The match lasted 7 days and ended 4:1 in AlphaGo's favor. Lee Sedol lost the match, and it was the first time a computer program had beaten a 9-dan professional without handicaps. AlphaGo was awarded an honorary 9-dan rank by the Korea Baduk Association.

Then, between December 2016 and January 2017, a player named "Master" suddenly appeared online and beat several of the best Go players in human history, including three victories over Go's top-ranked player, Ke Jie. During this time, Master's record was 60 wins and 0 losses. DeepMind then confirmed publicly that "Master" was AlphaGo. This showed that artificial intelligence could beat the best human players at one of the most complex games humans play.

According to DeepMind and Google, AlphaGo's algorithm uses a combination of machine learning and tree search techniques, together with extensive training on both human and computer play. It uses Monte Carlo tree search, guided by a "value network" and a "policy network", both implemented with deep neural networks. A limited amount of game-specific feature detection is applied to the input as pre-processing before it is sent to the neural networks.
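To make the idea of a search "guided by" two networks more concrete, here is a minimal sketch in Python. It is not AlphaGo's actual code. It assumes a simplified PUCT-style selection rule, where each candidate move is scored by its average evaluation so far (the kind of number a value network would supply) plus an exploration bonus that grows with the policy network's prior and shrinks as the move is visited more often. The names `Edge`, `select_move`, `backup`, and `c_puct`, and all the numbers, are made up for illustration.

```python
import math
from dataclasses import dataclass
from typing import Dict


@dataclass
class Edge:
    """Search statistics for one candidate move (hypothetical structure)."""
    prior: float            # probability a policy network would assign to the move
    visits: int = 0         # how many times the search has tried the move
    value_sum: float = 0.0  # sum of the evaluations backed up through the move

    def mean_value(self) -> float:
        # Average evaluation so far; 0.0 for a move that was never tried.
        return self.value_sum / self.visits if self.visits else 0.0


def select_move(edges: Dict[str, Edge], c_puct: float = 1.0) -> str:
    """Pick the move with the best (average value + exploration bonus)."""
    total_visits = sum(e.visits for e in edges.values())

    def score(e: Edge) -> float:
        bonus = c_puct * e.prior * math.sqrt(total_visits) / (1 + e.visits)
        return e.mean_value() + bonus

    return max(edges, key=lambda move: score(edges[move]))


def backup(edge: Edge, value: float) -> None:
    """Record one new evaluation (for example, a value-network estimate)."""
    edge.visits += 1
    edge.value_sum += value


# Toy position with three candidate moves and invented statistics.
edges = {
    "a": Edge(prior=0.6, visits=10, value_sum=5.0),
    "b": Edge(prior=0.3, visits=1, value_sum=0.6),
    "c": Edge(prior=0.1),
}
first_choice = select_move(edges)   # "b": decent value, barely explored
backup(edges["b"], -1.0)            # the new evaluation of "b" is bad
second_choice = select_move(edges)  # the search turns back to "a"
```

In this toy run the search first tries the promising but barely explored move "b", and after one bad evaluation it goes back to the well-tested move "a". That balance between trying new moves and trusting good ones is the basic job of the tree search.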

The system's neural networks were initially bootstrapped from human gameplay expertise. AlphaGo was first trained to mimic human play by attempting to match the moves of expert players in recorded historical games, using a database of around 30 million moves. Once it had reached a certain degree of proficiency, it was trained further by playing large numbers of games against other instances of itself, using reinforcement learning to improve its play. To avoid "disrespectfully" wasting its opponent's time, the program is specifically programmed to resign if its estimate of its win probability falls below a certain threshold. For the March 2016 match against Lee, the resignation threshold was set to 20%.
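The resignation rule is simple enough to write down. The sketch below only illustrates the rule as described above, not DeepMind's implementation. The function name `should_resign` and the constant `RESIGN_THRESHOLD` are made up, and the win probability is assumed to be a number between 0 and 1.

```python
RESIGN_THRESHOLD = 0.20  # the 20% threshold reported for the March 2016 match


def should_resign(win_probability: float,
                  threshold: float = RESIGN_THRESHOLD) -> bool:
    """Return True when the estimated chance of winning is below the threshold."""
    if not 0.0 <= win_probability <= 1.0:
        raise ValueError("win_probability must be between 0 and 1")
    return win_probability < threshold


# Invented estimates at three points in a game.
estimates = [0.55, 0.31, 0.12]
decisions = [should_resign(p) for p in estimates]
```

With these invented estimates, the program would keep playing at 55% and 31% and resign at 12%.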

Although the theory of AI has been around for almost 60 years, in all that time no achievement has shocked us the way AlphaGo has. A wise man once said that human history is the history of learning, and AlphaGo also grew up by learning. If AI technology develops to a mature level, it will be a great advantage for all of mankind. But what if AI turns against us, as in "The Matrix"? Scientists have argued about this topic for many years, but who knows, maybe a few hundred years from now we will see.


Sunday, February 19, 2017
