一、阿里巴巴CTO程立(花名:鲁肃)的个人经历
我的经历很简单,2004年之前一直在学校读书,读到30岁。2004年跟着一个师兄做外包项目,为淘宝网做一个重要的架构升级——把原来PHP/MySQL架构换成Java EE架构。2005年2月份正式以实习生身份加入支付宝。
2005到2014年,我主要的岗位是支付宝架构师。2013年,蚂蚁的CTO要回美国,蚂蚁CEO彭蕾找我谈过话,我那时候对带这么大团队完全没有信心。我最没有信心的不是带技术团队,而是CTO另外一半的职责,作为公司经营管理的一部分,怎么跟CEO对话。
前任CTO会在公司管理会议上和业务吵得面红耳赤,我觉得我做不到。后来王坚博士约我去喝茶,他说没有关系,每个人都是不一样的,但是每个人都能用自己的方式解决问题。这给了我信心。
于是,2013年我开始带支付宝整个技术团队,经过一年的考察,在2014年公司任命我为蚂蚁CTO,2014年到2019年我一直在这个岗位。
2018年,公司发现技术确实会对某些业务起到更加关键作用,需要更强的技术背景同学进入业务。当时对技术有很强需求的是蚂蚁的国际业务,就让我做了两年蚂蚁国际业务的COO。2019年年底,转任阿里巴巴的CTO。又过了一段时间,菜鸟也需要一位CTO,就让我兼任菜鸟CTO。
二、商业与技术共同进化之旅——蚂蚁
2003
首推担保交易“支付宝”应运而生
2005
CTU上线,开创互联网支付实时风控
2007
金融型系统分布式“服务化”
2010
阿里小贷上线
2011
条码支付
2013
“余额宝”上线,支付最后一台小型机下线
2014
OceanBase首次支撑“双11”核心交易,峰值支付破3万笔/秒
2015
“三地五中心”异地多活,网商银行开业,蚂蚁金融云发布
2016
印度PayTM钱包上线,蚂蚁技术向海外开放
2017
“刷脸支付”上线
2018
蚂蚁BASIC金融科技全面开放:蚂蚁链、金融智能、安全风控、小程序、蜻蜓、金融PaaS、OceanBase
我挑了几个蚂蚁CTO工作的节点,非常好地印证了“商业和技术的共同进化”。最早期支付宝非常简单,就是要做一个互联网的第三方支付平台,技术只要能把业务需求实现就好了,项目最大特点就是快、轻量。
2005年,某个国际支付巨头要进入中国,很多同事很紧张,那时陆兆禧是支付宝的CEO。他说:这有什么可怕的,他们一个需求两个月才能实现,我们支付宝两天就能实现,他们肯定不是我们的对手。
到2007年,公司面临一个技术分叉点:互联网支付一定会成为行业非常主流的支付方式,我们已经能够看到业务的规模,但需要做一个判断——该用什么技术架构支撑。
支付宝最早对技术架构没有很强的要求,什么好用就用什么。我们既有非常互联网的架构,脱胎于淘宝网;也有非常金融的架构,2005年我们敲锣打鼓、披红挂彩买了一台小型机处理核心账务。
但在2007年我们必须做一个判断,未来我们更像一家银行,还是更像一家互联网公司?幸好后来没有走弯路,我们判断未来必须要用互联网架构去解决支付相关的问题。
于是决定把系统做一个面向未来的架构设计:分布式改造。整整一年时间,我们将核心交易、核心账务、核心会员、核心支付,全部分布式化。
但涉及到一些核心技术如何解决的问题,比如在这么大的分布式系统里,怎么解决核心事务的问题——这其实不容易。
2007年某个国际支付巨头也曾经想解这个问题,结果系统上线后宕机两周,CTO离职。在对这个问题的改造过程中,我们也胆战心惊,但最终还是把这块硬骨头啃下来了。
第二个关键节点是2011年,移动业务起来了,但移动场景下支付到底是个怎样的形态,需要做一个非常关键的判断。当用户拿着手机和商家支付时,到底用什么方式进行?
那时智能手机的格局并没有完全定形,一些手机还没有摄像头。当时公司部署了很多方案,有扫一扫的,当时叫悦享拍,拍一下就能完成支付;有叫“咻一咻”的,让手机发出声波,商家接收这个声音;还有靠蓝牙的。几轮下来之后,其实很多商家装了咻一咻的设备,很多商家用二维码,最后发现用户能接受的是扫码支付。
2013年,也是我担任蚂蚁CTO的第一年,发生了很多重要的事情。我们内部起了非常多的重要项目,编号从1号到9号。
2013年这一年,1号项目是网商银行;2号项目是余额宝上线,让支付宝业务实现真正破圈,真正做数字普惠金融;3号项目是花呗。当然还有456,我们酝酿了很多项目,有成功的,也有不那么成功的。
这个过程对整个技术架构造成了非常巨大的冲击,从原来互联网支付脱胎的技术架构,开始往数字金融这个方向走。在这个过程中,我反而觉得技术正好可以发力,帮助公司技术实现变革。那一年支付宝开始正式“去IOE”,最后一台小型机下线。
三、商业与技术共同进化之旅@阿里巴巴
1999
阿里巴巴成立,开启B2B业务
阿里巴巴的技术其实非常广,后半程的发展会以阿里云为主线。
2003
淘宝网上线,开启C2C业务
第一阶段非常清楚,就是技术支撑商业发展,从淘宝网上线开始。那时淘宝网为什么能够赢,除了它商业上有很多创新之外,“快”也是非常关键的因素。在这个过程中,CTO要能在业务快速增长时,提前在技术上做布局,让商业成长能够持续。2004年,我做淘宝架构升级外包业务的阶段其实是淘宝网非常关键的时期,Java EE架构的形成对淘宝的增长奠定了非常好的基础。
2004
淘宝网全面转向Java开发,奠定高性能网站基础
2005
支付宝正式成为第三方支付平台
2006
淘宝网服务平台化
2007
阿里妈妈“直通车”上线开启互联网营销
2008
淘宝商城(天猫)上线,五彩石项目实现,淘宝商城与淘宝网全面打通
2009
阿里云成立、阿里团队在北京写下第一行代码
2009年有一个神来之笔——那个时间点阿里巴巴敢于判断云是未来,敢于为云写下第一行代码。阿里巴巴内部很多人都不看好云,但是为什么那时候能判断对呢?我觉得阿里巴巴有一句话说的非常好:“因为相信,所以看见”。就像电商业务也一样,很早阿里巴巴就判断电子商务一定是未来,先发先至,所以在很长的一段时间,坚定相信这个方向、持续地走,只要时机到来,就自然可以取得商业上的成果。技术其实也一样,阿里巴巴很早就做了一个判断:云计算一定是未来,后面所有发展都以这个为主线。
现在回头来看,后面每个关键节点都是和云相关的。比如阿里云2009年写下第一行代码后,内部找了一圈都找不到客户,这时候CEO就指定阿里小贷业务必须用阿里云,给阿里云一个场景去打磨,这才给了阿里云一线生机。2010年孙权是阿里小贷的负责人,后来当了阿里云的负责人。
2010
手机淘宝上线,阿里云正式对外公测
2011
阿里云开始大规模对外提供云计算服务
2013
阿里巴巴去“IOE”完成,阿里云成为首家对外提供5K云计算服务公司
2014
大数据计算平台MaxCompute上线
2015
启动“大中台,小前台”战略
2016
手机淘宝实现“千人千面”,城市大脑正式上发布
2017
达摩院正式成立,天猫精灵上市
2018
宣布芯片战略,成立平头哥,张北30万台服务器数据交付中心
2019
开启阿里云智能时代
2020
“全面上云”封顶,进入云原生时代,云钉一体
我自己印象比较深的是2013、2014年,经过几年和阿里小贷的打磨,阿里云开始有一点技术了,大数据处理开始慢慢成熟。当时有一个很重要的“5K项目”,阿里云开始具备5000台机器大集群处理数据的能力。具备这个能力之后,公司又做了一个重要决定,阿里巴巴和支付宝所有的数据全部要上到阿里云的大数据平台上,那时我正好是蚂蚁的CTO,我举双手支持——因为我知道即使不支持,集团也会按下来。阿里云也因此又上了一个台阶。
2015年,大家都听过“大中台、小前台”,听上去很牛。“大中台、小前台”背后完成了一件事情:把阿里巴巴和支付宝所有的基础技术全部统一到阿里云上,这是个重大的技术变革,为了完成这个技术变革,阿里巴巴做了非常好的组织设计,让当时的阿里巴巴CTO行癫兼任阿里云CTO,把技术收在一起,加强阿里云。“大中台、小前台”策略执行了三年,阿里云整个技术架构就非常完整了。阿里巴巴是集全公司之力支持一个技术战略的达成。
2017年之后,这个阶段用行癫的话讲,是“用技术创造新商业”,开始去探索技术本身能不能成为商业的一个部分。2017年达摩院成立,同样是“因为相信,所以看见”。因为我们判断未来一定是数字经济,围绕数字经济所需要的核心技术达摩院要进行布局,无论是AI、计算,还是芯片,都是围绕数字经济的核心问题的思考和布局。
2020年,我已经是阿里巴巴集团的CTO了,我们完成了上云的封顶,所有核心业务基本都跑在云上,包括最具挑战性的业务,比如搜索、推荐、广告业务,云都能支撑,我觉得这是云非常大的技术进步,也是集团非常重大的技术变革。
现在我们进入一个新的时代,要把云和业务技术的边界定义清楚,边界就是云原生。我们未来业务应用全是云原生的应用,这是一个云原生时代。另外,云本身也在升级,就是大家都知道的“云钉一体”。
四、不同业务阶段下的三种技术领导力
三种技术领导力,我是受到俞永福的启发,他觉得企业发展就是这样波浪式的发展过程:一开始要先找到一个方向,进入一个业务的轨道;如果这个方向判断准了,企业就会进入快速增长阶段;发展到一定阶段,就必须要脱离现有的惯性,再去找新的发展方向——就是这么一波波的过程……
在波浪式发展过程中,技术在每个阶段起的作用不一样。在入轨的阶段,CTO应该是整个公司业务一号位班子的成员,是支持一号位的二号位,班子一起看清方向,把业务带入正轨。
一旦入轨之后,业务进入快速增长期,CTO的核心不是看方向,而是怎么做好技术,这时首席架构师会变得非常重要,技术让业务更高速增长、加速成长,业务不要被技术拖慢增速。同时,组织设计在这个阶段和技术架构一样重要。然后不能等到业务停滞时才去判断未来,CTO要提前判断未来会发生什么,第二曲线是什么,设计一条未来的路线。
我是觉得整体上组织需要有两种领导力:一、专业的领导力:CTO、首席架构师、技术总监和VP;二、组织设计的管理领导力。CTO在不同时候戴着不同帽子,有时会承担一个专业角色,有的时候会承担一个管理的角色,有的时候会承担一个战略的角色。
很难说一个CTO是万能的,CTO一定有强项和短板。比如我自己,我的定位是一个工程师,工程能力是我相对比较强的一项能力,组织是我的弱项。在这个过程中,一个核心班子里需要非常好的配合,具备不同的领导力,给不同的人戴上不同的帽子。
五、CTO的职责
(一)建立商业与技术的“共振”连接
CTO在一个公司中到底扮演什么角色和职责?
我觉得第一个非常基础的职责,是要建立起商业和技术连接。前面讲到商业和技术是共同进化的,而共同进化的过程中两者要发生很好的共振连接。为了实现社会价值、客户价值,企业要判断需要怎样的核心能力?这个判断不是一个纯粹技术判断,也不是纯业务判断,而是一个公司核心班子共同的判断。
这个过程中,CEO和CTO怎么对话就变得非常关键了,能不能说一样的语言?CTO要和CEO问清楚几个框架性的问题:一是我们公司服务谁,要把客户定义的非常清楚;二是我们为这些客户创造什么样的核心价值、差异化价值;三是我们的商业模式,用什么样的商业模式实现核心价值;四是为了实现这样的商业模式,我们需要什么样的能力、走过什么路径、构建什么样的组织。把这些问题理解清楚之后,技术就能理解业务要干什么了。
我在阿里巴巴接到任何一个新业务,都一定要对它进行新的理解,后续可能还会有多的问题、更深入的理解,但这是我做的第一件事情,这是CTO的职责。
(二)一张图、一场仗、一颗心,愿景牵引前行
很多时候,牵引团队往前是非常有挑战的事情,小团队还可以靠关系。大团队里大部分人你都不能认识,怎么还能“一张图、一场仗、一颗心”?这时候就需要有领导力了。
我分享几个例子,比较有挑战,也有很多不一样的方法。
第一个例子,我在团队里建立领导力的方式,是跟团队一起定义非常清晰的目标。2013年经过一番犹豫之后,我决定接受负责蚂蚁整个技术团队的任命。当时我遇到的最大的挑战就是和团队之间的信任——本来大家都是平级,突然我要成为这个部门的总负责人。当时大家给我的反馈是,觉得鲁肃做技术可以,但不懂业务技术。
我觉得信任是要靠跟大家一起把事情做成,甚至没有必要让大家建立起对领导者的认同,就看大家能不能相信共同的目标,并把目标实现。2013年,我跟团队共识了“1234”目标:
1.公司从互联网平台向数字普惠金融平台转型,我们的技术架构怎么去支撑?需要重新定义支撑数字普惠金融新的平台。做平台架构这件事情,至少是我的本行,大家认同。
2.蚂蚁哪些核心技术是我们今年必须要突破的?当时定了两个核心点:一是OceanBase在这一年能落地;二是我们要把蚂蚁的平台建在云上。
3.我们能不能在这一年大促的时候实现3万笔/秒的支付?大家知道从2010年开始,每年会有一次大考:大促能扛多高的峰值。前两年,蚂蚁都是非常吃力的去扛。甚至2012年对团队来说是个很大的耻辱——大促前50分钟系统被冲垮了,50分钟后才恢复。2013年大家都想一雪前耻:2012年时峰值是每秒3000笔,已经是性能极限了。我们定一个10倍的目标实现3万笔/秒!
4.因为我们的服务不仅仅是支付,还开始提供金融服务,稳定性必须像银行一样,甚至比银行更稳定。我们定下可运行目标4个9(99.99%),之前3个9都很吃力。
这四个目标,大家非常认同,虽然都很有挑战。年底的时候,经过大家的努力,四个目标里实现了三个半:完成了平台的转型,余额宝、花呗业务都起来了;OceanBase在蚂蚁系统里落地了;做到4个9;但我们没有做到3万笔/秒,只做到1.5万笔/秒,是5倍的提升。之后大家开始建立起对我的信任。
我到了阿里巴巴之后,第一件事情也是需要大家聚在一起,一起定下目标。空降到一个平台,其实很难给团队定一个全新的愿景。如果新CTO上来就先画一张未来的图,基本是不太靠谱的,还是要和团队一起定下非常扎实的目标、一起达成目标。于是,我选择先在云这个关键战略上稳中有进,全面上云必须全部完成。二是在云之上,如何从“上云”到“用云”,把云原生做深入,将阿里巴巴中台做深入,包括AI中台、数据中台和业务中台。另外一个目标是组织数字化,阿里巴巴虽然生于数字时代,但也需要做数字化升级和转型,阿里巴巴数字原生商业系统和数字原生组织是去年和团队一起确定的方向。
CTO和团队在一起要有一个面向未来的思考,不只是当下与业务的连接。未来是什么,关键的路径是什么,核心的几场仗是什么,这是CTO的牵引力。
(三)关键决策,扫清前进中的障碍
CTO需要帮团队解决问题,扫清一些障碍,做出一些关键的决策。在蚂蚁做CTO时,第一个关键的决策是技术架构往金融方向还是互联网方向?最后决策是互联网。第二个关键决策是关于OceanBase的转型。
2010年,阳振坤老师带着团队开始打造OceanBase。最早OceanBase还不是一个真正的数据库,当时第一个场景是解决淘宝的收藏夹问题,需要海量数据但不需要很强的数据处理能力。但到了2012年,OceanBase面临一个发展决策,要不要成为一个真正的数据库。
蚂蚁团队对OceanBase的怀疑有合理的理由,我们现在的业务对数据库要求这么高,Oracle那么先进的数据库团队打造了几十年,才刚刚能满足我们的业务需求。阳老师带着几十个人花两年时间打造一个数据库,能把Oracle替代掉吗?
我觉得王坚博士是很有领导艺术的,当时他做了一个选择,他说:“把OceanBase送给支付宝吧。”到了蚂蚁这边就成了我当时定的“1234目标”中“2”的部分:看OceanBase能不能再往前发展。
当时我们采用了什么办法呢?第一先了解清楚OceanBase能干什么;第二,既然公司整体不能做,就搞一个小场景,在蚂蚁的核心交易里切了1%的流量给OceanBase,让它在1%流量里证明能力。OceanBase也很珍惜这个舞台,撑住了1%的流量,最终在这一年完成了从非关系型数据库向真正关系型数据库的转变。2014年我们再大胆往前迈一步,把“双11”大促10%的流量切给它,让它进核心账务系统——账务系统比核心交易系统要求更高。当时这个决策是有一些冒险的,但现在回头看,正是这些决策让OceanBase有了不一样的未来。
后面几个决策也类似,作为CTO如何拿捏好风险和稳定,是非常关键的。
比如余额宝上线的第一天我们就知道这个产品一定会成功,因为它的客户价值特别清晰,让用户每一块钱的余额都有收益。但我们没有料到,它成功得那么快,一个月时间迅速就把原来准备的系统容量全部用掉了。我们自己还好,因为蚂蚁的平台已经完全分布式化了,可以快速扩容。但我们的合作伙伴天弘基金,因为老系统无法支撑余额宝这么快速的增长,扩容成了核心难题。
于是我们做了一个非常重要的决定:把天弘的系统搬到云上,用分布式架构重写一遍,三个月内必须完成。这件事也证明了金融云的价值,金融云的业务也就起来了。
这些都说明,CTO要在关键决策中发挥作用,帮助团队下决心并把握好不确定性。
(四)应对风险,化危为机
公司的技术风险有几类。
第一类,技术架构不能够支持业务发展,这是业务不能接受的风险。但这类的风险,相对显而易见。阿里巴巴为什么在2009年启动“去IOE”,就是判断到这个风险早晚会出现,“IOE”架构已经不能支持公司业务发展,必须去掉。蚂蚁在2010年开始启动“去IOE”,因为那一年“双11”大促让我们看到原来架构基本不能支持业务了。
2010年的“双11”大促被我们称为“人肉云计算”。大促开始,我们看到流量疯涨,判断白天肯定会冲破设计容量,就赶快把服务器库存全部加进去了,结果发现还是不行,又马上去“杀”服务,把不必要的服务全部关掉,把容量放在核心关键链路上,结果还不够,我们又去看关键服务里还有没有能“杀”的。那年“双11”最后几分钟,我们的核心账务数据库和会计数据库都到了极限,还有10秒钟就要挂掉了。最后关头,团队非常果断,把会计数据库杀掉了。会计记账本身很重要,但断个几分钟还能承受,核心账务不能挂,否则整个业务就全部中断了。就这样,2010年的“双11”大促是硬撑下来的。
其实后来很长一段时间,都是“人肉云计算”,人在做资源调配的事情,人来做决策判断。这太痛苦了,这一切逼着蚂蚁非常坚定地拥抱云的分布式架构,要上云。
第二类风险不是技术架构的问题,而是稳定性出现重大问题。CTO必须要判断出这个情况,CEO一定要给CTO相应的资源支持。
比如蚂蚁的“527”。2015年我们实现了整个支付宝系统的异地多活,华南机房和华东机房异地可以做分布式交易,这在金融系统里应该是第一次。我们还特地做了断网演练,直接把华东区域全部关掉,华南直接接管业务,结果非常成功,大家非常开心。我还给当时的CEO彭蕾发了一个战报,说蚂蚁首次实现了异地多活,你们可以放心睡觉了,以后任何情况下系统都会稳如磐石。
2015年5月27日,我开车在路上收到了报警,通常遇到这种情况团队就直接处理了,但那个报警就一直响。我就把车停在路边,才发现光纤被挖断了,系统中断将近2个小时。原来异地多活,光纤一断,系统就切不过去了。从那之后,我再也没有发过任何一份战报,没有和CEO报过任何一次喜。
当时这个事情对蚂蚁技术团队打击非常大,公司受到最大的打击是客户的信任。外面有很多文章调侃蚂蚁的技术,蚂蚁的技术形象和外部的信任丧失了。内部从我到整个团队与CEO的信任也开始出现了危机——以后再讲话,别人只听一半,因为说的话不见得靠谱。
危机也可以是好事情。“527”之后,我们做了几件非常关键的事情,第一,开始跟公司沟通,我们需要在蚂蚁成立一个专门的部门,叫技术风险部。这个部门就一个职责,守住蚂蚁技术底盘的风险,公司额外拨10%的技术资源给到这个团队。也就是说,我们愿意付出额外10%的技术资源专门确保蚂蚁系统未来没有风险,这件事情至少帮我们争取到这个资源。第二,立刻启动实战演练,前面说其实我们做过异地多活的断电演练,但这是有准备的演练。从那一天开始我们要做无准备演练,一直到今天。第三,我们确定把5月27日这一天作为蚂蚁的技术日,蚂蚁未来无论是存在102年还是更长的时间,所有技术工程师都要记住这一天,让我们永远能够记住风险。
这三件事情做完,反而让团队更加凝聚了,蚂蚁技术的风险防控水平有很大的提升,我觉得这是转危为机。
第三类风险,可能是CTO和CEO都最担心的,一个新技术出现之后会不会颠覆原有的业务模式。不仅是很多发展中的公司会担心,即便阿里巴巴这样规模的公司也非常担心。当一个新技术出现,无论公司大小,都有被完全颠覆的风险。蚂蚁面对移动互联网时代时,我们有一段时间很担心,通过好几年努力定义移动支付,基本上算是渡过这个危机,但当时对蚂蚁的挑战还是非常大的。当比特币开始成为一个现象时,我们也是非常担心的:它会不会把支付完全颠覆了?2019年6月Libra出现,更让人担心了:全球支付会不会被一种新的货币重新定义,这是一种降维打击。
这时候CTO必须要站到一号位班子里去,帮助CEO做判断。每一次对未来危机的判断,都可以触发未来新的商业机会。大的策略是:面对任何技术风险不能只是看,要亲自去试,需要公司投入一些有价值的浪费。
第四类风险,是温水煮青蛙。技术会不会反过来伤害公司,它不像风险那么直接,但是如果因为技术、架构或组织问题让公司效率变慢了,公司会慢慢丧失竞争力、创新力。随着时间的推移,这是最大的危机。阿里巴巴的中台就是一个很好的例子,中台的优点在于可以减少很多重复建设,让业务可以基于中台快速创新,因为重复建设的繁忙约等于效率低下。但阿里巴巴的中台已经非常完善了,开始进入了另外一个阶段。目前,业务中台里有面向核心电商的中台、数字供应链的中台、职能线业务中台、数据中台、AI中台等一系列技术中台。当我做一个新业务的时候,我需要跟这么多中台打交道,需要他们去支撑我,过程中如果有任何一个中台支持不能到位,我的业务可能就做不成。我们现在开始大力推动中台能力进一步升级,改善中台的交付方式,把中台再升级。这里涉及到很多技术架构的变化。为了防止温水煮青蛙就是要一直做系统化的梳理,再去一个阶段一个阶段的解决。
(五)组织设计与治理——平衡秩序与创新
一个人当CTO的时间越长,专业能力下降得就越严重。我判断自己的专业能力大概每隔两年会降一级。也有好处,你可以跟团队一起做,团队会更强。组织设计的核心是要既有秩序又能创新。这是有矛盾的,创新是在一个混乱的土壤里长出来的,秩序让效率高,但会把很多创新吃掉。这个过程有点像踩钢丝,处理秩序和创新的平衡。
阿里巴巴围绕技术的组织,是有两条线的。一条线是实的管理线,是分层分布的:
前台,面向业务的,为客户赢;中台,是能力中心,中台的客户是前台,让前台更加高效,让前台更有竞争力;底层后台,是强调技术先进性的,确保业务永续。这个组织每一层都是独立的业务经营单元,现在我们在做一件事情,让每个独立的业务经营单元都有CTO。这个CTO会对这个业务经营单元负全责。实现管理机制的核心就是把每一层之间的界面定义清楚。
这种治理让每一块业务都很灵活,都可以自主发展,但又带来了一个新问题,我们该统一发展的技术怎么形成合力,所以我们有另一条虚线:技术委员会,下设二三十个核心的技术小组,把所有的共性领域横向拉通。通过技术委员会和技术小组的专业领导力,实现策略通、人才通。这两条线会同时运作,随着管理线越来越清晰的分层,这条专业的虚线就会变得非常重要。比如音视频技术,每个业务里都会用到音视频技术,中台也有音视频能力,底层需要优化以提升音视频技术的竞争力。前台、中台和底层跟音视频相关的技术专家组成一个技术小组,由一个专家带领。
这条虚线会转化成实线的管理决策,我们必须要打通这个链路,这个体系的运作会比较有挑战。作为CTO,我觉得要在过程中保持非常开放的心态,要信赖每个领域技术专家的判断。技术专家意见不一致时要有独立思考,做出自己独立判断,要知道创新和秩序的平衡点在哪里。
组织大到一定程度之后就会遇到这样的问题,我确实也不建议技术组织还不大时就把组织结构搞的太复杂,这会带来额外的管理和沟通成本。
(六)凝心聚气、薪火相传
凝心聚气其实是最重要,也是最难的,这是一个技术文化的事情。每隔一段时间,我们都会聚在一起讨论阿里巴巴的技术文化。去年我们讨论了一轮,三年前我们也讨论过一轮。三年前定下阿里巴巴技术的slogan叫“技术创造新商业”,再之前的slogan是“技术拓展商业边界”。
除了slogan,阿里巴巴的技术文化底色是务实、有一说一,不要打花腔,不要做包装,事实是什么样就什么样,用事实说话。没有数据、没有事实的时候,就不要说话。如果大家都能践行这个文化的话,很多事情会变得特别简单,但其实践行文化并不容易。
我们有一些技术土话,比如“面向未来去思考”、“因为相信,所以看见”,是需要看准未来,敢于投入,真正做到先发先至。因为如果别人做、你再去做,永远是追赶者,会非常累,但看准未来、敢于投入,也不会轻松。
这些文化、愿景能不能在一个大的场景里形成共识,尤其这个组织每年都有人离开、有新同学进来,还能保证一致的文化,其实是非常有挑战的。但只有文化做好,很多事情才能顺理成章。
六、CTO可能不是思想家,但一定是行动派
前面是我思考的CTO的六个职责,虽然不完整,虽然不可能每件事情都做得非常好,但核心思考是,我是相信CTO或者说技术团队需要跟着公司业务发展不断进化。我从蚂蚁到阿里巴巴,差不多经历了前面六个阶段,我称之为CTO的六步曲:
1.跟团队先一起定义好目标,先一起做成一些事情。
2.多了解团队、了解业务,知道未来要去哪里,跟团队一起共创一个愿景,把大家热情点燃。
3.CTO的一个核心工作,是怎么能够让自己不要成为团队的天花板,而是把自己当成团队的地板,用人做事。如果CTO是公司技术天花板的话,那你把公司技术就压在一定的高度和范围内,公司技术永远是在一个小的、狭窄的领域。当CTO的技术能力是公司的地板时,公司可以通过新同学扩展边界。成为CTO还是用人做事为主,而不是做事用人为主。
4.一切都很好时,别忘了晴天去修屋顶,永远居安思危。一旦危机出现,乐观地看待,每个危机背后都有机会,转危为机。
5.过程中不只看当下,也要布局未来,为公司建立技术纵深。在业务发展早期,技术的纵深就是一个点。当发展到像阿里巴巴现在这个规模时,技术纵深就是一个多面体,必须有充分的、多面的布局,才能支撑一个大公司的发展。决定布局投入多少,要和CEO充分对焦。
6.最后一点,人才。薪火相传,人才才是公司未来发展的关键。我记得阿里云曾经有一位技术负责人分享说:什么是一家公司技术的最高境界,就是谁来当CTO都能当好,我觉得这是我要努力的一件事情。
七、如何发展与培养CTO
最重要的事情放在最后,就是人才。
前面讲到我们的分层分布治理,每个经营单元要有一个小CTO,这个CTO怎么培养?基本上我们让他在战场里去练,为他设计发展路径,也有“奇点营”这样的班专门培养业务小CTO。
当然最难的事情就是培养接班人,自己的接班人和团队的接班人,这件事情其实是我上任第一天就在想的事。但选准人、选好人,有非常多的挑战。
我有两个小经验:
1.CTO发展是“Z”字型的路线。
直线成为CTO的人,往往会因为路径太单一、没有足够磨炼而出问题。行癫负责过淘宝的业务,负责过B2B业务,再做阿里巴巴的CTO,再做阿里云的总裁。我做过蚂蚁的技术,做了两年蚂蚁国际业务,再做阿里巴巴的CTO。做过业务再回头看技术,跟CEO对话会有共同的框架,这一点很关键。
2.做“L”型职责设计。
CTO最怕做虚了,毕竟这是公司里非常高的位置,每件具体事情都有相应的核心骨干帮你负责,但你手不伸下去就很容易做虚,看不到下面的实际情况,会导致决策出错。
阿里巴巴怎么解决这个问题呢?就是给CTO一横一纵:横向管理的CTO,也给你一个纵向的业务技术一号岗位,保持与一线的对接。比如我现在既是阿里巴巴集团的CTO,又是菜鸟的CTO。行癫也曾经既是阿里巴巴的CTO,也是阿里云的CTO,也是一横一纵。
CTO应该具备怎样的特质?每个CTO都有不一样的风格,但有几点是共通的:一是要求真务实,真正“No Data No BB”,永远不是高高在上地做决定,而是做决定时能够看得到下面,这很重要;二是要有担当,在做关键决策时敢负历史责任,有进取心;三是必须时时自省和开放。如果不具备自省和开放的能力,是很难去进化的;四是一个大组织和大业务的CTO要有全局观,能够做架构,能把各方面、各种信息形成一张大图。
曾国藩非常懂用人之道,他这么选人:“专从危难之际,默察朴拙之人,则几矣”。我特别喜欢这句话,把自己的钉钉签名都写为“朴拙”,这是对自己的要求。
到现在为止,我觉得自己很多方面做得依然不够,包括对未来的判断。CEO和CTO实际上是要共同成长、共同进化,在工作中形成默契的,这是非常重要的。
翻译:
One is my experience
My story is simple. I was in school until 2004, until I was 30. In 2004, I followed a senior to do an outsourcing project and made an important architecture upgrade for Taobao.com — replacing the original PHP/MySQL architecture with Java EE architecture. In February 2005, I officially joined Alipay as an intern.
From 2005 to 2014, my main position was Alipay architect. In 2013, the CTO of Ant was going back to the United States. Peng Lei, CEO of Ant, talked to me. At that time, I had no confidence in leading such a large team. What I have the least confidence in is not leading the technical team, but the other half of the responsibility of the CTO, as part of the management of the company, how to talk to the CEO.
The previous CTO would have fought with the business in management meetings, and I didn’t think I could do that. Later, Dr. Wang Jian asked me to have tea. He said it didn’t matter, everyone was different, but everyone could solve problems in their own way. That gives me confidence.
Therefore, in 2013, I started to lead the whole technical team of Alipay. After a year’s investigation, the company appointed me as the CTO of Ant in 2014, and I have been working in this position from 2014 to 2019.
In 2018, the company found that technology will indeed become more critical to certain businesses, requiring students with stronger technical backgrounds to enter the business. At that time, Ant international business had a strong demand for technology, so I worked as COO of Ant International business for two years. At the end of 2019, he became the CTO of Alibaba. After a while, the rookie needed a CTO, so I was the rookie CTO.
The Journey of Business and Technology Co-evolution:Ant
2003 The first guaranteed transaction “Alipay” came into being
2005 CTU goes online to create real-time risk control for Internet payment
2007 Distributed “servitization” of financial system
2010 Ali Xiaodai online
2011 Barcode payment
2013 “Yu ‘e Bao” is online, and the last small payment machine is offline
2014 OceanBase supported core transactions of “Double 11” for the first time, and the peak payment exceeded 30,000 transactions per second
2015 “Three places five centers” live in different places, the opening of the network bank, Ant financial cloud release
2016 India’s PayTM wallet goes live, Ant technology opens overseas
2017 Pay with your face online
2018 Ant BASIC fintech is fully open: Ant chain, financial intelligence, security risk control, small program, Dragonfly, financial PaaS, OceanBase
I picked a few nodes where ant Ctos work, which is a good example of the “co-evolution of business and technology.” In the early stage, Alipay was very simple, which was to be an Internet third-party payment platform. As long as the technology could meet the business needs, it would be good. The biggest feature of the project was speed and light weight.
In 2005, when an international payment giant was about to enter China, many colleagues were nervous. At that time, Jonathan Lu was the CEO of Alipay. ‘It takes them two months to do it, but we can do it in two days,’ he said. ‘They are definitely not our rivals.’
By 2007, the company was facing a technology fork point: Internet payment was bound to become a very mainstream payment method in the industry. We could see the scale of the business, but we needed to make a judgment about what technical architecture to support it.
Alipay did not have strong requirements for technical architecture
At the earliest, Alipay did not have strong requirements for technical architecture, so it used whatever was useful. We have a very Internet structure, born out of Taobao; We also have a very financial structure. In 2005, we bought a small machine to handle the core accounting.
But in 2007 we have to make a judgment, will we be more like a bank or more like an Internet company in the future? Fortunately, it didn’t get too far, and we decided that we would have to use the Internet architecture to solve payment related problems in the future.
So I decided to make the system a future-oriented architecture design: distributed transformation. For a whole year, we distributed all the core transactions, core accounts, core membership and core payments.
But when it comes to solving some of the core technology problems, such as solving the core business problems in such a large distributed system — it’s not easy.
An international payment giant also tried to solve this problem.
In 2007, an international payment giant also tried to solve this problem. As a result, the system went down for two weeks after launching, and the CTO resigned. In the process of fixing this problem, we were terrified, but we finally got this hard bone down.
The second key point was in 2011, when mobile services came into being. However, a critical judgment was needed to be made on the form of payment under the mobile scenario. When users hold their phones and pay with merchants, how exactly do they do it?
Back then, the smartphone landscape wasn’t fully formed, and some phones didn’t have cameras. At that time, the company deployed a lot of schemes, some sweep, then called Yue enjoy clap, clap can complete the payment; Have called “xiu xiu”, let phones emit sound waves and merchants to receive the voice; And Bluetooth. After a couple of rounds, a lot of vendors actually installed a device to do it, and a lot of merchants used QR codes, and finally found that what users could accept was a scan code payment.
My first year as Ant CTO, many important things happened
In 2013, my first year as Ant CTO, many important things happened. We have a lot of important projects internally, numbered from 1 to 9.
In 2013, the No.1 project was MyBank; The second project is the launch of Yu ‘ebao, which enables Alipay to truly break the circle and do digital inclusive finance. Project No. 3 is a flower. And of course 456, we have a lot of projects in the pipeline, some successful, some not so successful.
This process has caused a huge impact on the whole technical framework, from the original Internet payment out of the technical framework, began to move towards the direction of digital finance. In this process, I felt that technology could be just the power to help companies achieve technological change. That year, Alipay began to officially “IOE”, the last small machine offline.
The Journey of Co-evolution of commerce and Technology :Alibaba
1999 Alibaba was founded to open B2B business
Alibaba’s technology is very broad, the second half of the development will be Ali cloud as the main line.
2003 Taobao online, open C2C business
The first stage is very clear: technology supports business development, starting from the launch of Taobao. At that time, why Taobao was able to win, in addition to its commercial innovation, “fast” is also a very key factor. In this process, the CTO should be able to make technical layout in advance when the business is growing rapidly, so that the business growth can be sustained. In 2004, when I was engaged in the outsourcing business of upgrading Taobao architecture, it was actually a very critical period for Taobao. The formation of Java EE architecture laid a very good foundation for the growth of Taobao.
2004 Taobao.com has turned to Java development to lay the foundation for a high-performance website
2005 Alipay has officially become a third-party payment platform
2006 Taobao.com service platform
2007 Ali mother “through train” online open Internet marketing
2008 Taobao Mall (Tmall) online, the realization of the five-color stone project, Taobao mall and Taobao comprehensive open
Ali Cloud was established
2009 Ali Cloud was established, Ali team wrote the first line of code in Beijing
One of the masterstrokes of 2009 was the point at which Alibaba dared to call the cloud the future and write the first line of code for it. Many people inside Alibaba are not optimistic about cloud, but why can judge right at that time? I think Alibaba has a good saying: “Because you believe, you see”. Just like e-commerce business, Alibaba has long believed that e-commerce must be the future, first come first come, so for a long time, we firmly believe in this direction and continue to move, as long as the time comes, we will naturally achieve commercial results. In fact, technology is the same. Alibaba made a judgment very early on: cloud computing must be the future, and all future developments will take this as the main line.
Now, in retrospect, each of the key nodes is related to the cloud. For example, after Ali Cloud wrote the first line of code in 2009, it failed to find customers after a circle of internal search. At this time, the CEO designated that Ali small loan business must use Ali Cloud and gave Ali Cloud a scene to be polished, which gave Ali Cloud a chance of survival. Sun Quan was the head of Ali Xiaodai in 2010 and later became the head of Aliyun.
Ali cloud officially external public beta
2010 Mobile Taobao online, Ali cloud officially external public beta
2011 Ali Cloud began to provide large-scale external cloud computing services
2013 Alibaba to “IOE” completed, Ali Cloud became the first external 5K cloud computing services company
2014 MaxCompute, a big data computing platform, went online
2015 Launch the strategy of “big middle desk, small front desk”
2016 Mobile Taobao to achieve “thousands of faces”, the city brain officially released
2017 Dharma courtyard officially established, Tmall genie listed
2018 Announced the chip strategy, the establishment of Pingtou brother, Zhang Bei 300,000 server data delivery center
2019 Open the era of Ali Cloud intelligence
2020 “Comprehensive cloud” capping, into the cloud native era, cloud nail one
There was a very important “5K project”
What impressed me most is that in 2013 and 2014, after several years of polishing with Ali Xiaodai, Ali Cloud began to acquire some technology and its big data processing began to mature gradually. At that time, there was a very important “5K project”, Alibaba Cloud began to have the ability to process data in a large cluster of 5,000 machines. With this ability, the company made an important decision that all the data of Alibaba and Alipay should be uploaded to the big data platform of Ali Cloud. At that time, I happened to be the CTO of Ant. I raised my hands to support it, because I knew that even if I did not support it, the group would press down. Alibaba Cloud has thus moved to a higher level.
In 2015, everyone has heard the phrase “big middle desk, small front desk”. It sounds great. “Big middle desk, small front desk” behind the completion of a thing: Alibaba and Alipay all the basic technology all unified to Ali cloud, this is a major technological change, in order to complete the technological change, Alibaba did a very good organization design, let the then Alibaba CTO act as a part-time Ali cloud CTO, put the technology together, strengthen Ali cloud. “Big middle desk, small front desk” strategy has been implemented for three years, Ali Cloud’s entire technical architecture is very complete. Alibaba is pulling together the entire company to support a technology strategy.
Creating a new business with technology
After 2017, this phase is “creating a new business with technology,” in the parlance of geeks, and starting to explore whether technology itself can be a part of business. In 2017, the Dharma Academy was established, which is also based on “seeing because you believe”. Because we judge that the future must be the digital economy, we need to make layout around the core technologies needed by the digital economy. Whether it is AI, computing, or chips, they are all thinking and layout around the core issues of the digital economy.
In 2020, I have become the CTO of Alibaba Group. We have completed the top of the cloud, and all core businesses are basically running on the cloud, including the most challenging businesses, such as search, recommendation and advertising, which cloud can support. I think this is a very big technological progress for cloud and a very significant technological change for the group.
We are now entering a new era where the boundary between cloud and business technology is clearly defined, and the boundary is cloud native. Our future business applications are all cloud native applications, which is a cloud native era. In addition, the cloud itself is also upgrading, which is known as the “cloud nail in one.”
Three types of technical leadership in four different business phases
I was inspired by Yu Yongfu, who believed that the development of an enterprise is a wavy development process. At the beginning, one should first find a direction and enter the track of a business. If this direction is judged correctly, the enterprise will enter the stage of rapid growth; Development to a certain stage, it is necessary to break away from the existing inertia, and then to find a new direction of development – is such a wave of process……
In the wavy development process, technology plays a different role at each stage. In the on-orbit phase, the CTO should be the first team member of the entire company business, is the second team to support the first team, the team together to see the direction, to bring the business into the right track.
The core of CTO is not to look at the direction, but how to do well in technology
Once in orbit and the business enters a period of rapid growth, the core of CTO is not to look at the direction, but how to do well in technology. At this time, the chief architect will become very important. Technology enables the business to grow faster and accelerate growth, and the business will not be slowed down by technology. Also, organizational design is as important at this stage as technical architecture. Then, instead of waiting for business to stall to judge the future, Ctos have to judge in advance what will happen in the future, what the second curve is, and design a future route.
I think there are two kinds of leadership in the organization as a whole: first, professional leadership: CTO, Chief architect, technical director and VP; 2. Management leadership of organizational design. Ctos wear different hats at different times, sometimes assuming a professional role, sometimes a managerial role, and sometimes a strategic role.
It is hard to say that a CTO is omnipotent. CTO must have strengths and weaknesses. For example, I am an engineer. Engineering ability is my strong ability, while organization is my weak point. In this process, a core team needs very good coordination, have different leadership, and put different hats on different people.
Five duties of the CTO
Create a “resonant” connection between business and technology
What is the role and responsibility of a CTO in a company?
The first, I think, very basic responsibility is to build a business and technology connection. Business and technology have coevolved, and the process of coevolution requires a good resonance connection between the two. In order to realize social value and customer value, what kind of core competence should an enterprise judge? This judgment is not a pure technical judgment, nor is it a pure business judgment, but a collective judgment of the core team of the company.
In this process, how the CEO and CTO talk to each other becomes very critical. Can they speak the same language? The CTO should ask the CEO several framework questions: first, who is our company to serve, to define the customer very clearly; Second, what kind of core value and differentiated value we create for these customers; Third, our business model, which business model to achieve the core value; Fourth, in order to realize such a business model, we need what kind of capability, what kind of path, what kind of organization. With these issues understood, the technology can understand what the business is doing.
Whenever I receive a new business in Alibaba, I must have a new understanding of it. There may be more problems and deeper understanding in the future, but this is the first thing I do. This is the responsibility of the CTO.
A picture, a battle, a heart, vision traction forward
In many cases, the traction team forward is very challenging things, small teams can also rely on the relationship. If you don’t know most of the people in a big team, how can you still have “one picture, one battle, one heart”? That’s where leadership comes in.
I’m going to share a few examples that are a little bit challenging and a lot of different approaches.
The first example is that the way I build leadership in a team is by working with the team to define very clear goals. In 2013, after much hesitation, I decided to accept the offer to run Ant’s entire technical team. The biggest challenge I had was trust with my team — everyone was on a level, and suddenly I had to become the head of the department. At that time, everyone gave me feedback, think Lu Su can do technology, but do not understand business technology.
working with others to make things happen
I think trust depends on working with others to make things happen. It is not even necessary for everyone to establish the identity of the leader. It depends on whether everyone can believe in the common goal and achieve the goal. In 2013, my team and I agreed on the “1234” goal:
How does our technical framework support the company’s transformation from an Internet platform to a digital inclusive financial platform? New platforms underpinning digital financial inclusion need to be redefined. Doing platform architecture this thing, at least is my profession, everyone agrees.
Which core ant technologies are we must break through this year?
Which core ant technologies are we must break through this year? At that time, two core points were determined: first, OceanBase could be landed in this year; Second, we’re going to build ant platforms on the cloud.
Can we achieve 30,000 payments per second in this year’s boom? As you know, since 2010, there has been a big test every year: how high can you carry the peak? For the first two years, the ants had a hard time carrying them. Even 2012 was a big shame for the team — the system crashed in the first 50 minutes, only to recover 50 minutes later. In 2013, everyone wanted to get their revenge: in 2012, the peak was 3,000 strokes per second, which was the limit. We set a 10x goal to achieve 30,000 strokes per second!
Because our service is not just about payment, but also about providing financial services, stability must be like or even more stable than a bank. We settled on four 9s (99.99%) to run. The first three 9s struggled.
These are four goals that everyone agrees on
These are four goals that everyone agrees on, even if they are challenging. By the end of the year, after everyone’s efforts, three and a half of the four goals were achieved. After the transformation of the platform was completed, the businesses of Yu ‘ebao and Huabei were both thriving. OceanBase landed in Ant system; Do four nines; We didn’t do 30,000 strokes per second, we did 15,000 strokes per second, which is a five-fold increase. And then people started to build trust in me.
he first thing I needed was for everyone to get together
When I arrived at Alibaba, the first thing I needed was for everyone to get together and set goals together. Parachuting into a platform, it’s hard to give the team a new vision. If a new CTO starts by drawing a picture of the future, it is not very reliable. It is still necessary to set solid goals and achieve them together with the team.
Therefore, I choose to make steady progress in the key strategy of cloud first, and the comprehensive cloud must be completed. The second is on the cloud, how to from “cloud” to “cloud”, the cloud native do in-depth, the Alibaba center to do in-depth, including AI center, data center and business center. Another goal is to organize digitalization. Although Alibaba was born in the digital era, it also needs to do digital upgrading and transformation. Alibaba digital native business system and digital native organization are the direction determined together with the team last year.
The CTO and the team together need to have a future-oriented thinking, not just a current connection to the business. What is the future, what is the critical path, what are the core battles, this is the traction of the CTO.
Key decisions to clear obstacles ahead
The CTO needs to help the team solve problems, clear the way, and make key decisions. When Ant became a CTO, the first key decision was whether the technology structure should be oriented towards finance or the Internet? The final decision was the Internet. The second key decision was about the transformation of OceanBase
In 2010, Mr. Yang Zhenkun led his team to build OceanBase. OceanBase was not a real database at the earliest. At that time, the first scenario was to solve the problem of Taobao’s favorites, which required massive data but did not require strong data processing ability. But by 2012 OceanBase was facing a growing decision to become a real database.
Ant team has a reasonable reason to doubt OceanBase.
Ant team has a reasonable reason to doubt OceanBase. Our current business has such a high requirement for database, and Oracle’s advanced database team has been built for decades, which just can meet our business needs. Yang teacher with dozens of people spent two years to build a database, can replace Oracle?
I think Dr. Wang Jian is very leadership, at that time he made a choice, he said: “Give OceanBase to Alipay.” Getting to the ants became the “2” part of my “Goal 1234” : See if OceanBase can go any further.
What did we do then? First, understand what OceanBase can do; Second, since the company as a whole can not do it, we made a small scene, cut 1% of the ant’s core transaction traffic to OceanBase, let it prove its ability in 1% of the traffic. OceanBase also cherished this stage, supporting 1% of the traffic, and finally completed the transition from a non-relational database to a true relational database that year. In 2014, we took another bold step forward and allocated 10% of the “Double 11” traffic to it, allowing it to enter the core accounting system — the accounting system has higher requirements than the core trading system. It was a risky decision at the time, but looking back, it was those decisions that set OceanBase up for a different future.
The next few decisions are similar. As a CTO, how to balance risk and stability is very crucial.
For example, on the first day of the launch of Yu ‘ebao, we knew that this product would be a success, because its customer value was very clear and users could benefit from every dollar of balance. But we did not expect it to succeed so quickly, in a month to use up the original system capacity. We’re in good shape because Ant’s platform is fully distributed and can be scaled up quickly. However, our partner Tianhong Fund, because the old system cannot support the rapid growth of Yu ‘ebao, has become the core problem of expansion.
So we made a very important decision: to move our system to the cloud and rewrite it with a distributed architecture. It had to be done in three months. This event also proved the value of the financial cloud, the financial cloud business is up.
All this means that Ctos have a role to play in key decisions, helping teams make decisions and manage uncertainty.
Deal with risks and turn crises into opportunities
There are several types of technological risks for companies.
First, the technical architecture does not support the business, which is an unacceptable risk to the business. But the risks are relatively obvious. The reason why Alibaba started “IOE removal” in 2009 is that it judged that this risk would appear sooner or later. The “IOE” structure could no longer support the company’s business development and must be removed. Ant started to “de-ioe” in 2010, because the “Double 11” sales that year made us see that the original structure could not support the business.
The “Double 11” promotion in 2010 is called “human cloud computing”. Big promote the beginning, we see the traffic surge, judge will break the design capacity during the day, quickly put the server inventory all added in, the results found or not, and immediately to “kill” the service, the unnecessary service all shut down, put the capacity on the core key link, the results are not enough, we go to see the key service there is no “kill”.
In the last few minutes of the “Double 11” that year, our core accounting database and accounting database were all stretched to the limit, with 10 seconds to go. At the last minute, the team was very decisive and killed the accounting database. Accounting accounting itself is very important, but a few minutes can also bear, the core accounts can not hang, otherwise the whole business is all interrupted. In this way, the “Double 11” promotion in 2010 is persevered.
it was all about “human cloud computing”.
In fact, for a long time afterwards, it was all about “human cloud computing”. People were doing resource allocation and making decisions and judgments. It was so painful that it forced the ants to embrace the distributed architecture of the cloud very firmly and go to the cloud.
The second type of risk is not a problem of technical architecture, but a major problem of stability. The CTO must recognize the situation, and the CEO must give the CTO the resources to support it.
Like the ant’s “527.”
Like the ant’s “527.” In 2015, we realized the multi-activity of the whole Alipay system in different places. The computer rooms in South China and East China can conduct distributed transactions in different places, which is probably the first time in the financial system. We also specially conducted a network disconnection drill. We directly shut down the East China region and directly took over the business in South China. The result was very successful and everyone was very happy. I also sent a report to Peng Lei, the CEO at the time, saying that ants have achieved the first time to live in different places, you can rest assured to sleep, and the system will be as stable as a rock in any case.
not a single greeting to the CEO
On May 27, 2015, I was driving down the road when I received an alarm, which is usually handled directly by the team, but the alarm just kept going off. I pulled over to the side of the road, only to discover that the optical fiber had been cut, the system was out for nearly two hours. The original live in different places, a fiber is broken, the system can not be cut in the past. Since then, I have not sent a single battle report, not a single greeting to the CEO.
At that time, this issue dealt a great blow to the Ant technical team. The biggest blow to the company was the customer’s trust. There are a lot of articles poking fun at ant technology, and the ant’s technological image and external trust have been lost. There was also a crisis of trust within the company, from me to the whole team and the CEO. When I spoke later, people would only listen to half of what I said because what I said was not reliable.
After “527”, we did several key things.
A crisis can be a good thing. After “527”, we did several key things. First, we started to communicate with the company that we need to set up a special department in Ant, called the Technical Risk Department. This department has only one responsibility, guarding the risks of Ant technology chassis, and the company allocates an additional 10% of technical resources to this team.
In other words, we’re willing to spend an additional 10% of our technical resources specifically to ensure that Ant systems are risk-free in the future, and this at least gets us that resource. Second, we will immediately start the live drill. As I said before, we have done the power outage drill in different places, but this is a prepared drill. From that day on we were going to do a walk-through and continue to this day. Third, we decided to take May 27 as the ant technology day. Whether the ant exists for 102 years or longer, all technical engineers should remember this day, so that we can always remember the risk.
these three things
The completion of these three things, on the contrary, makes the team more cohesive, and the risk prevention and control level of ant technology has been greatly improved. In my opinion, this is to turn the crisis into an opportunity.
The third type of risk, which Ctos and ceos are probably most worried about, is whether a new technology will come along and disrupt the existing business model. And it’s not just a lot of growing companies that are worried. Even a company the size of Alibaba is very worried.
When a new technology comes along, no matter how big or small a company is, it risks being completely disrupted. When the Ants faced the mobile Internet era, we were worried for a while, and after several years of trying to define mobile payments, we basically got over the crisis, but the challenges for the ants were still very big. When Bitcoin started to become a phenomenon, we were also very worried: would it disrupt payments completely? The arrival of Libra in June 2019 raises even more concerns about whether global payments will be redefined by a new currency, a dimensionality reduction blow.
The CTO must stand in the top team and help the CEO make judgment
At this time, the CTO must stand in the top team and help the CEO make judgment. Every judgment on a future crisis can trigger new business opportunities in the future. The big strategy: to face any technological risk not just by looking at it, but by trying it out for yourself, requires the company to invest some valuable waste.
The fourth risk is boiling frogs in warm water.
Whether the technology will come back to hurt the company is not as straightforward as the risk, but if the company slows down due to technical, architectural or organizational problems, the company will slowly lose competitiveness and innovation. Over time, this is the biggest crisis. Alibaba’s Middle Taiwan is a good example. The advantage of middle Taiwan is that it can reduce a lot of repetitive construction, so that businesses can quickly innovate based on the middle Taiwan, because the busy construction of repetitive construction is equal to low efficiency.
But Alibaba’s middle platform has been perfected and is starting to move into another phase. At present, there are a series of technical middle desks for core e-commerce, digital supply chain, functional line business, data, AI, etc. When I start a new business, I have to deal with so many people in the middle to support me, and if any of them are not in place, my business may not be successful. We are now launching a major push to further upgrade the capabilities of the middle Taiwan, improve the delivery methods of the middle Taiwan, and upgrade the Middle Taiwan. There are a lot of technical architecture changes involved here. To prevent the frog from boiling in warm water is to always do systematic combing, and then go to the stage by stage to solve.
Organization design and governance — Balance order and innovation
The longer a person serves as a CTO, the more their professional skills decline. I judge my professional abilities to drop a grade every two years or so. It’s also good. You can work with a team, and the team is stronger. The core of organizational design is to have both order and innovation. This is paradoxical. Innovation grows in a soil of chaos. Order makes efficiency high, but it eats up a lot of innovation. The process is a bit like walking a tightrope, balancing order and innovation.
Alibaba’s organization around technology, there are two lines. One line is a real management line and is distributed in layers:
Front desk, business oriented, win for customers; Middle Taiwan is the ability center, and the customers in Middle Taiwan are the reception desk, which makes the reception desk more efficient and more competitive. The bottom background emphasizes the advanced technology to ensure the sustainable business. Each level of the organization is a separate business unit, and now we’re working on a process where each of the separate business units has a CTO. The CTO will take full responsibility for the business unit. The core of implementing the management mechanism is to define the interface between each layer clearly.
This kind of governance makes each business very flexible and can develop independently
This kind of governance makes each business very flexible and can develop independently, but it also brings a new problem, how should we unify the development of technology to form a synergy, so we have another dotted line: technical committee, under the 20 or 30 core technical groups, horizontal pull all the common areas. Through the professional leadership of the technical committee and technical team, we can realize the common strategy and talents.
These two lines operate simultaneously, and as management lines become more clearly layered, this professional dotted line becomes very important. For example, audio and video technology is used in every business, and the middle platform also has audio and video capabilities. The bottom layer needs to be optimized to improve the competitiveness of audio and video technology. Front desk, middle desk and bottom related to audio and video technical experts to form a technical team, led by a specialist.
solid line of management decisions
This dotted line will translate into a solid line of management decisions. We have to get through this link, and the system will be challenging to operate. As a CTO, I think we should keep a very open mind in the process and rely on the judgment of technical experts in each field. When technical experts disagree, they should think independently and make their own judgment. They should know where to strike a balance between innovation and order.
This is a problem that happens when an organization is large enough, and I really don’t recommend making the structure too complex when the technical organization is still small, which will bring additional administrative and communication costs.
Condensing heart gather qi, torch from generation to generation
Gathering the heart and gathering the breath is actually the most important, but also the most difficult, this is a technical and cultural thing. Every once in a while, we get together to talk about Alibaba’s technology culture. We had a round last year, we had a round three years ago. Three years ago, the slogan of Alibaba’s technology was “Technology creates new business”. Before that, the slogan was “Technology expands business boundaries”
In addition to slogan, Alibaba’s technology and culture background color is pragmatic, have one say, do not play the colorway, do not do packaging, what the facts are like, speak with the facts. When there are no figures, no facts, don’t speak. If everyone can practice this culture, many things will be very simple, but practice culture is not easy.
future-oriented thinking
Some of our technical idioms, such as “future-oriented thinking” and “seeing because you believe”, mean that you need to look into the future, have the courage to invest, and truly be a first starter. Because if others do, you will always be a catch-up, will be very tired, but see the future, dare to invest, will not be easy.
Whether these cultures and visions can form a consensus in a large setting, especially in an organization where people leave and new students come in every year, and still maintain a consistent culture, is actually very challenging. But a lot can only be made sense if the culture is done right.
Ctos may not be thinkers, but they must be actionists
The above is my thinking about the six responsibilities of CTO. Although it is not complete and it is impossible to do everything well, the core thinking is that I believe that CTO or the technical team needs to evolve with the development of the company’s business. From Ant to Alibaba, I pretty much went through the first six stages, which I call the six steps of CTO:
Define goals and accomplish things together with your team.
Know more about the team and business, know where to go in the future, create a vision with the team, and ignite everyone’s enthusiasm.
One of the core tasks of CTO is how to make oneself not become the ceiling of the team
One of the core tasks of CTO is how to make oneself not become the ceiling of the team, but as the floor of the team. If CTO is the technical ceiling of the company, then you put the company technology in a certain height and scope, the company technology is always in a small, narrow field. When Ctos’ technical competence is the floor of the company, the company can expand its boundaries with new students. To be a CTO is to work for people, not work for people.
Don’t forget to repair the roof on a sunny day
When everything is going well, don’t forget to repair the roof on a sunny day. Once a crisis occurs, look at it optimistically, behind every crisis there is an opportunity to turn a crisis into an opportunity.
In the process, we should not only look at the present, but also layout the future and establish technical depth for the company. The depth of the technology is a point early in the business. When it comes to the scale of Alibaba, the depth of technology is a polyhedron, which must have a full and multifaceted layout to support the development of a large company. Deciding how much to invest in the layout should be fully focused with the CEO.
And finally, talent. From generation to generation, talent is the key to the future development of the company. I remember that a technical director of Alibaba Cloud once shared that “what is the highest level of a company’s technology is that anyone can be a good CTO. I think this is something I will work hard on.
How to develop and cultivate Ctos
As for our hierarchical and distributed governance, each business unit should have a small CTO. How to cultivate this CTO? Basically, we put him in the field, we design a path for him, and we have classes like Camp Singularity that train business Ctos.
Of course, the hardest thing is to train your successor, your successor and your team’s successor, which is what I was thinking about the first day. But choosing the right people, choosing the right people, has a lot of challenges.
I have two small lessons:
CTO development is a “Z” shaped route.
Straight line to become a CTO, often because the path is too simple, not enough practice and problems. The Epilepsy was responsible for Taobao’s business, responsible for B2B business, and then do Alibaba’s CTO, and then do Ali Cloud president. I worked in Ant technology for two years and then as the CTO of Alibaba. When you look back at technology after doing business, it’s critical that you have a common framework for talking to the CEO.
Do “L” type responsibility design.
After all, this is a very high position in the company, and every specific thing has the corresponding core cadre to help you take charge of it. However, if you don’t reach down, it is easy to make a false job, and you can’t see the actual situation below, which will lead to mistakes in decision-making.
How does Alibaba solve this problem? It is to give a horizontal and vertical position to the CTO: a horizontal management of the CTO, also give you a vertical business technology No.1 position, to maintain the connection with the front line. For example, I am now the CTO of Alibaba Group and Cainiao. Doting also used to be both Alibaba CTO, but also Ali cloud CTO, is also a horizontal vertical.
What characteristics should a CTO have?
What characteristics should a CTO have? Each CTO has a different style, but there are several points in common: first, the requirement is really pragmatic, the real “No Data No BB”, never make decisions above, but can see below when making decisions, which is very important.
Second, we need to be responsible. When making key decisions, we should take historical responsibility and be enterprising. Third, we must always be introspective and open. It’s hard to evolve without the ability to be introspective and open; Fourth, a CTO of a large organization or business should have a holistic view, be able to structure, and form a big picture of all aspects and information.
Zeng Guofan knew very well how to employ people. He chose people in this way: “Only in times of crisis can we observe those who are simple and incompetent.” I especially like this sentence, I write my signature as “simple”, this is the requirement of myself.
Up to now, I feel I have not done enough in many areas, including the judgment of the future. Ceos and Ctos actually grow and evolve together and form a tacit understanding at work, which is very important.
本文由CXO UNION-CXO联盟(cxounion.cn)转载而成,来源于网络;编辑/翻译:CXO UNIONCXO联盟小C。
如需加入CXO UNION(CXO联盟)高管社群,请联系社群小伙伴哦~

免责声明: 本网站(http://www.cxounion.cn/)内容主要来自原创、合作媒体供稿和第三方投稿,凡在本网站出现的信息,均仅供参考。本网站将尽力确保所提供信息的准确性及可靠性,但不保证有关资料的准确性及可靠性,读者在使用前请进一步核实,并对任何自主决定的行为负责。本网站对有关资料所引致的错误、不确或遗漏,概不负任何法律责任。
本网站刊载的所有内容(包括但不仅限文字、图片、LOGO、音频、视频、软件、程序等) 版权归原作者所有。任何单位或个人认为本网站中的内容可能涉嫌侵犯其知识产权或存在不实内容时,请及时通知本站,予以删除。
如需加入CXO UNION(CXO联盟)高管社群,请联系社群小伙伴哦~

免责声明: 本网站(http://www.cxounion.cn/)内容主要来自原创、合作媒体供稿和第三方投稿,凡在本网站出现的信息,均仅供参考。本网站将尽力确保所提供信息的准确性及可靠性,但不保证有关资料的准确性及可靠性,读者在使用前请进一步核实,并对任何自主决定的行为负责。本网站对有关资料所引致的错误、不确或遗漏,概不负任何法律责任。
本网站刊载的所有内容(包括但不仅限文字、图片、LOGO、音频、视频、软件、程序等) 版权归原作者所有。任何单位或个人认为本网站中的内容可能涉嫌侵犯其知识产权或存在不实内容时,请及时通知本站,予以删除。
Search
Popular Posts
-
2024数字化灯塔案例评选申报开启!
“2024数字化灯塔案例评选”于3月正式启动,诚挚欢迎业界同仁自荐和推荐,一起推动产业数字化进程,助力赋能企业…
-
2024 X-Award星盘奖申报通道已开启!
X-Award星盘奖是数字化转型服务、IT服务行业重要的商业奖项,旨在表彰行业里提供杰出数字化转型服务与IT服…
-
2024 N-Award星云奖申报通道已开启!
N-Award是数字化转型领域重要的商业奖项,旨在表彰那些以非凡的远见、超群的领导才能和卓越的成就来激励他人的…






