What we learned in Seoul with AlphaGo

March 16, 2016

Go isn’t just a game—it’s a living, breathing culture of players, analysts, fans, and legends.

Over the last 10 days in Seoul, South Korea, we’ve been lucky enough to witness some of

that incredible excitement firsthand. We've also had the chance to see something that's never

happened before: DeepMind's AlphaGo took on and defeated legendary Go player,

Lee Sedol (9-dan professional with 18 world titles), marking a major milestone for artificial

intelligence.

Pedestrians checking in on the AlphaGo vs. Lee Sedol Go match on the streets of Seoul (March 13)

Go may be one of the oldest games in existence, but the attention to our five-game tournament

exceeded even our wildest imaginations. Searches for Go rules and Go boards spiked in the U.S.

In China, tens of millions watched live streams of the matches, and the

“Man vs. Machine Go Showdown”

hashtag saw 200 million pageviews on Sina Weibo. Sales of Go boards even surged in Korea.

Our public test of AlphaGo, however, was about more than winning at Go. We founded DeepMind

in 2010 to create general-purpose artificial intelligence (AI) that can learn on its own—and, eventually,

be used as a tool to help society solve some of its biggest and most pressing problems, from

climate change to disease diagnosis.

Like many researchers before us, we've been developing and testing our algorithms through games.

We first revealed AlphaGo in January—the first AI program that could beat a professional player at

the most complex board game mankind has devised, using deep learning and reinforcement learning.

The ultimate challenge was for AlphaGo to take on the best Go player of the past decade—Lee Sedol.

To everyone's surprise, including ours, AlphaGo won four of the five games. Commentators noted

that AlphaGo played many unprecedented, creative, and even“beautiful” moves. Based on our

data, AlphaGo’s bold move 37 in Game 2 had a 1 in 10,000 chance of being played by a human.

Lee countered with innovative moves of his own, such as his move 78 against AlphaGo

in Game 4—again, a 1 in 10,000 chance of being played—which ultimately resulted in a win.

The final score was 4-1. We're contributing the $1 million in prize money to organizations that

support science, technology, engineering and math (STEM) education and Go, as well as UNICEF.

We’ve learned two important things from this experience. First, this test bodes well for AI’s potential

in solving other problems. AlphaGo has the ability to look “globally” across a board—and find solutions

that humans either have been trained not to play or would not consider. This has huge potential for

using AlphaGo-like technology to find solutions that humans don’t necessarily see in other areas.

Second, while the match has been widely billed as "man vs. machine," AlphaGo is really a human

achievement. Lee Sedol and the AlphaGo team both pushed each other toward new ideas,

opportunities and solutions—and in the long run that's something we all stand to benefit from.

But as they say about Go in Korean: “Don’t be arrogant when you win or you’ll lose your luck.”

This is just one small, albeit significant, step along the way to making machines smart. We’ve

demonstrated that our cutting edge deep reinforcement learning techniques can be used to

make strong Go and Atari players. Deep neural networks are already used at Google for specific

tasks—like image recognition, speech recognition, and Search ranking. However, we’re still a long

way from a machine that can learn to flexibly perform the full range of intellectual tasks

a human can—the hallmark of trueartificial general intelligence.

Demis and Lee Sedol hold up the signed Go board from the Google DeepMind Challenge Match

With this tournament, we wanted to test the limits of AlphaGo. The genius of Lee Sedol did

that brilliantly—and we’ll spend the next few weeks studying the games he and AlphaGo played

in detail. And because the machine learning methods we’ve used in AlphaGo are general purpose,

we hope to apply some of these techniques to other challenges in the future. Game on!

Posted by Demis Hassabis, CEO and Co-Founder of DeepMind

What we learned in Seoul with AlphaGo的更多相关文章

AlphaGo：用机器学习技术古老的围棋游戏掌握AlphaGo: Mastering the ancient game of Go with Machine Learning
AlphaGo: Mastering the ancient game of Go with Machine Learning Posted by David Silver and Demis Has ...
（转）The AlphaGo Replication Wiki
The AlphaGo Replication Wiki 摘自:https://github.com/Rochester-NRT/RocAlphaGo/wiki/01.-Home Contents : ...
世界围棋人机大战、顶峰对决第一盘：围棋世界冠军Lee Sedol（李世石，围棋职业九段）对战Google DeepMind AlphaGo围棋程序
Match 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo 很多网站对世界围棋大战进行了现场直播,比如YouTube.新浪.乐视.腾 ...
Elasticsearch Mantanence Lessons Learned Today
Today I troubleshooted an Elasticsearch-cluster-down issue. Several lessons were learned: When many ...
也谈谈AlphaGo
距离AlphaGo击败李世石已经过去数月了,心中的震撼至今犹在,全刊报道此项比赛的<围棋天地>杂志我已经看了不下十遍.总也想说点自己的意见,却也不知道从哪里说起,更不知道想表达些什么. 作 ...
人机大战之AlphaGo的硬件配置和算法研究
AlphaGo的硬件配置最近AlphaGo与李世石的比赛如火如荼,关于第四盘李世石神之一手不在我们的讨论范围之内.我们重点讨论下AlphaGo的硬件配置: AlphaGo有多个版本,其中最强的是分布 ...
(转) 一张图解AlphaGo原理及弱点
一张图解AlphaGo原理及弱点 2016-03-23 郑宇,张钧波 CKDD 作者简介: 郑宇,博士, Editor-in-Chief of ACM Transactions on Intellig ...
曲率已驱动了头发——深度分析谷歌AlphaGo击败职业棋手
这篇是我们自开设星际随笔以来写得最长的一篇.我们也花了不少力气.包括把那5盘棋各打了两遍的谱,包括从Nature官网上把那篇谷歌的报告花了200元下载下来研究它的算法(后来发现谷歌网站上可以免费下载 ...
田渊栋：AlphaGo系统即使在单机上也有职业水平
Facebook人工智能组研究员田渊栋博士在知乎专栏上更新了一篇文章,详细分析了AlphaGo在<自然>杂志上发表的论文,他认为AlphaGo整个系统即使在单机上也已具有了职业水平,与李世 ...

随机推荐

Linux 命令 - jobs: 显示后台作业的状态信息
命令格式 jobs [-lnprs] [jobspec ...] jobs -x command [args] 命令参数 -l 额外显示作业的进程 ID. -n 只列出状态发生变化的进程. -p 只列 ...
函数 datediff(根据objid 获取同name 同年度最近的4条记录)
显示包括选择的这条,在加上选择年度的此人最近的 3条.(最多显示4条) . 记录数大于4条 . 全显示 create table temp( objid ,) primary key , nam ...
hive 未初始化元数据库报错
启动hive-metastore和hive-server2 用beeline连接hive报错 [root@node04 hive]# beeline Beeline version 0.13.1-cd ...
如何设置Win7系统中的上帝模式GodMode（转载）
如何设置Win7系统中的上帝模式GodMode(转载) NT6系统中隐藏了一个秘密的“GodMode”,字面上译为“上帝模式”.God Mode其实就是一个简单的文件夹窗口,但包含了几乎所有系统的设置 ...
使用 EF Power Tool Code Frist 生成 Mysql 实体
原文:使用 EF Power Tool Code Frist 生成 Mysql 实体 1,在要生成的项目上右键 2, 3, 4, 5, 生成后的效果已知问题: 1,在Mys ...
ZipArchive 的使用
新建一个项目,首先添加 System.IO.Compression.FileSystem 引用. 解压文件 using System.IO.Compression; namespace cl { st ...
css3学习笔记之2D转换
translate() 方法 translate()方法,根据左(X轴)和顶部(Y轴)位置给定的参数,从当前元素位置移动. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 ...
iOS开发基础之排序
Objective-C 有排序的API,省了我们很多事. 主要有以下3种方法. NSComparator NSArray *unsortedArray = @[@5,@3,@8,@1,@7]; NSA ...
asp.net 中使用less
首先 ,需要知道 whats the less; 实际上less 只是针对css比较难于维护和抽象这种现象,而创造的一个工具. 然后,在抛开语言环境的情况下(例如.net 是vs环境,java是ecl ...
WeX5与阿里内测的Weex与有何纠葛？快来看HTML5开发圈那些逗逼事儿！
4月21日~23日,由infoQ主办的2016 Qcon大会北京站如期举行. HTML5开发已经成为移动开发/前端专题中无可争议的焦点,核心议题已经由前几年的是否该用HTML5转向了如何高性能.高效率 ...

What we learned in Seoul with AlphaGo

What we learned in Seoul with AlphaGo

What we learned in Seoul with AlphaGo的更多相关文章

随机推荐

热门专题