A conversation with Kevin Scott: Whatâs next in AI
A conversation with Kevin Scott: Whatâs next in AI
Artificial intelligence systems powered by large language models today are transforming how people work and create, from generating lines of code for software developers to sketches for graphic designers.
Kevin Scott, Microsoftâs chief technology officer, expects these AI systems to continue to grow in sophistication and scaleâfrom helping address global challenges such as climate change and childhood education to revolutionizing fields from healthcare and law to materials science and science fiction.
Scott recently shared his thoughts with us on the impact of AI for knowledge workers and whatâs next in AI. The biggest takeaways:
- Advances in large AI models and generative AI will continue to boost productivity, creativity and satisfaction.
- AI will enable scientific breakthroughs and help the world solve some of its biggest challenges.
- As these models become platforms and Microsoft continues to responsibly scale AI advancements for customers, the cloud, infrastructure investments and a strong responsible AI approach are critical.
In your mind, what were some of the most important advancements in AI this year?
When we were heading into 2022, I think just about everybody in AI was anticipating really impressive things to take place over the next twelve or so months. But now that weâre pretty much through the year, and even with those lofty expectations, itâs kind of genuinely mind-blowing to look back at the magnitude of innovation that we saw left-to-right in AI. The things that researchers and other folks have done to advance the state-of-the-art are just light years beyond what we thought possible even a few years ago. And almost all of this is a result of the incredibly rapid advancement that has happened with large AI models.
The three things Iâve been most impressed by this year were the launch of GitHub Copilot, which is a large language model-based system that turns natural language prompts into code and has this dramatic positive impact on developer productivity. It opens up coding to a much broader group of people than weâve ever had before, which is awesome because so much of the future is dependent on our ability to write software.
The second thing is these generative image models such as DALLâE 2 that have become very popular and more accessible. A fairly high degree of skill is required to sketch and draw and master all of the tools of graphic design, illustration and art. An AI system such as DALLâE 2 doesnât turn ordinary people into professional artists, but it gives a ton of people a visual vocabulary that they didnât have beforeâa new superpower they didnât think they would ever have.
(Editorâs note: All images in this post except for Kevin Scottâs photograph were generated by a producer using DALLâE 2.)
Weâve also seen that AI models are becoming more powerful and delivering even more substantial gains for the problems that theyâre being used to solve. I think the work on protein folding this year has been really good throughout the technology industry, including the work that weâve done with David Bakerâs laboratory at the University of Washington, the Institute for Protein Design with RoseTTAFold, and helping that with a bunch of advanced AI to do transformational things.
And so thatâs just tremendously exciting. Anything thatâs a force multiplier on science and medicine is just net beneficial to the world because those are where some of our biggest, nastiest problems live.
Thatâs a big, impressive year. And I think next year will be better.
Where do you see AI technology having the greatest impact next year and beyond?
I think with some confidence I can say that 2023 is going to be the most exciting year that the AI community has ever had. And I say that after really, genuinely believing that 2022 was the most exciting year that weâd ever had. The pace of innovation just keeps rolling in at a fast clip.
I talked about GitHub Copilot already, and itâs amazing. But itâs the tip of the iceberg for what large AI models are going to be able to do going forwardâextrapolating the same idea to all kinds of different scenarios for how they can assist in other kinds of intellectual labor beyond coding. The entire knowledge economy is going to see a transformation in how AI helps out with repetitive aspects of your work and makes it generally more pleasant and fulfilling. This is going to apply to almost anythingâdesigning new molecules to create medicine, making manufacturing ârecipesâ from 3D models, or simply writing and editing.
For example, Iâve been playing around with an experimental system I built for myself using GPT-3 designed to help me write a science fiction book, which is something that Iâve wanted to do since I was a teenager. I have notebooks full of synopses Iâve created for theoretical books, describing what the books are about and the universes where they take place. With this experimental tool, I have been able to get the logjam broken. When I wrote a book the old-fashioned way, if I got 2,000 words out of a day, Iâd feel really good about myself. With this tool, Iâve had days where I can write 6,000 words in a day, which for me feels like a lot. It feels like a qualitatively more energizing process than what I was doing before.
This is the âcopilot for everythingâ dreamâthat you would have a copilot that could sit alongside you as youâre doing any kind of cognitive work, helping you not just get more done, but also enhancing your creativity in new and exciting ways.
This increase in productivity is clearly a boost to your satisfaction. Why do these tools bring more joy to work?
All of us use tools to do our work. Some of us really enjoy acquiring the tools and mastering them and figuring out how to deploy them in a super effective way to do the thing that weâre trying to do. I think that is part of whatâs going on here. In many cases, people now have new and interesting and fundamentally more effective tools than theyâve had before. We did a study that found using no-code or low-code tools led to more than an 80% positive impact on work satisfaction, overall workload and morale by users. Especially for tools that are in their relatively early stages, thatâs just a huge benefit to see.
For some workers, itâs literally enhancing that core flow that you get into when youâre doing the work; it speeds you up. Itâs like having a better set of running shoes to go run a race or marathon. This is exactly what weâre seeing with the experiences developers are having with Copilot; they are reporting that Copilot helps them stay in the flow and keeps their minds sharper during what used to be boring and repetitive tasks. And when AI tools can help to eliminate drudgery from a job, something that is super repetitive or annoying or that was getting in their way of getting to the thing that they really enjoy, it unsurprisingly improves satisfaction.
Personally, these tools let me be in flow state longer than I was before. The enemy of creative flow is distraction and getting stuck. I get to a point where I donât know quite how to solve the next thing, or the next thing is, like, âIâve got to go look this thing up. Iâve got to context switch out of what I was doing to go solve the subproblem.â These tools increasingly solve the subproblem for me so that I stay in the flow.
In addition to GitHub Copilot and DALLâE 2, AI is showing up in Microsoft products and services in other ways. How is next-generation AI improving current products such as Teams and Word?
This is the big untold story of AI. To date, most of AIâs benefits are spread across 1,000 different things where you may not even fully appreciate how much of the product experience that youâre getting is coming from a machine learned system.
For example, weâre sitting here in this Teams call on video and, in the system, there are all these parameters that were learned by a machine learning algorithm. There are jitter buffers for the audio system to smooth out the communication. The blur behind you on your screen is a machine learning algorithm at work. There are more than a dozen machine learning systems that make this experience more delightful for the both of us. And that is certainly true across Microsoft.
Weâve gone from machine learning in a few places to literally 1,000 machine learning things spread across different products, everything from how your Outlook email client works, your predictive text in Word, your Bing search experience, to what your feed looks like in Xbox Cloud Gaming and LinkedIn. Thereâs AI all over the place making these products better.
One of the big things that has changed in the past two years is it used to be the case that you would have a model that was specialized to each one of these tasks that we have across all our products. Now you have a single model that gets used in lots of places because theyâre broadly useful. Being able to invest in these models that become more powerful with scaleâand then having all the things built on top of the model benefit simultaneously from improvements that youâre makingâis tremendous.
Microsoftâs AI research and development continues through initiatives such as AI4Science and AI for Good. What excites you most about this area of AI?
The most challenging problems we face as a society right now are in the sciences. How do you cure these intractably complicated diseases? How do you prepare yourself for the next pandemic? How do you provide affordable, high-quality healthcare to an aging population? How do you help educate more kids at scale in the skills that they will need for the future? How do you develop technologies that will reverse some of the negative effects of carbon emissions into the atmosphere? Weâre exploring how to take some of these exciting developments in AI to those problems.
The models in these basic science applications have the same scaling properties as large language models. You build a model, you get it into some self-supervised mode where itâs learning from a simulation or itâs learning from its own ability to observe a particular domain, and then the model that you get out of it lets you dramatically change the performance of an applicationâwhether youâre doing a computational fluid dynamics simulation or youâre doing molecular dynamics for drug design.
Thereâs immense opportunity there. This means better medicines, it means maybe we can find the catalyst we donât have yet to fix our carbon emission problem, it means across the board accelerating how scientists and other folks with big ideas can work to try to solve societyâs biggest challenges.
How have breakthroughs in computing techniques and hardware contributed to the advances in AI?
The fundamental thing underlying almost all of the recent progress weâve seen in AI is how critical the importance of scale has proven to be. It turns out that models trained on more data with more compute power just have a much richer and more generalized set of capabilities. If we want to keep driving this progress furtherâand to be clear, right now we donât see any end to the benefits of increased scaleâwe need to optimize and scale up our compute power as much as we possibly can.
We announced our first Azure AI supercomputer two years ago, and at our Build developer conference this year I shared that we now have multiple supercomputing systems that weâre pretty sure are the largest and most powerful AI supercomputers in the world today. We and OpenAI use this infrastructure to train nearly all of our state-of-the-art large models, whether thatâs our Turing, Z-code and Florence models at Microsoft or the GPT, DALLâE and Codex models at OpenAI. And we just recently announced a collaboration with NVIDIA to build a supercomputer powered by Azure infrastructure combined with NVIDIA GPUs.
Some of this progress has just been via brute force compute scale with bigger and bigger clusters of GPUs. But maybe even a bigger breakthrough is the layer of software that optimizes how models and data are distributed across these giant systems, both to train the models and then to serve them to customers. If weâre going to put forth these large models as platforms that people can create with, they canât only be accessible to the tiny number of tech companies in the world with enough resources to build giant supercomputers.
So, weâve invested a ton in software like DeepSpeed to boost training efficiency, and the ONNX Runtime for inference. They optimize for cost and latency and generally help us make bigger AI models more accessible and valuable for people. Iâm super proud of the teams we have working on these technologies because Microsoft is really leading the industry here, and weâre open sourcing all of it so others can keep improving.
These advances are all playing out amid an ongoing concern that AI is going to impact jobs. How do you think about the issue of AI and jobs?
We live in a time of extraordinary complexity and historic macroeconomic change, and as we look out 5, 10 years into the future, even to just achieve a net neutral balance for the whole world, weâre going to need new forms of productivity for all of us to be able to continue enjoying progress. We want to be building these AI tools as platforms that lots of people can use to build businesses and solve problems. We believe that these platforms democratize access to AI to far more people. With them, youâll get a richer set of problems solved and youâll have a more diverse group of people being able to participate in the creation of technology.
With the previous instantiation of AI, you needed a huge amount of expertise just to get started. Now you can call Azure Cognitive Services, you can call the Azure OpenAI Service and build complicated products on top of these things without necessarily having to be so expert at AI that youâve got to be able to train your own large model from scratch.
As all these huge AI systems continue to grow and evolve, I think we can expect that these advances are going to fundamentally change the nature of work, in some places more than others, and in some cases create a whole spate of new jobs that didnât exist before. You can look back and see the same thing happen adjacent to all kinds of famous paradigm shifts in technology over history: the telephone, the automobile, the internet. And I think that just like those examples, weâre going to need new ways to think about work, new ways to think about skills and to be super focused on making sure that weâve got enough talented folks around and trained for the really critical jobs.
Another concern associated with AI technologies is the potential for misuse and abuse. What are the concrete steps that Microsoft is taking to ensure its AI tools and services are developed and used responsibly?
This is a thing that we take super seriously. We have a responsible AI process that our AI systems go through, and we continue to improve that process. We scrutinize what weâre doing with a multidisciplinary team of experts to try to make sure that we understand all the potential harmful things that could happen, and we mitigate as many of them as possible. Examples of that are things like refining the dataset used to train models, deploying filters to limit the generation of harmful content, integrating techniques like query blocking on sensitive topics that helps to prevent misuse by bad actors or applying technology that can return more helpful and diverse responses and results. And we have a plan in place with the AI system where we can detect and mitigate as quickly as possible post-launch any harms that happen that we didnât anticipate.
Another very important safeguard is intentional and iterative deployment. Most of the work that we do is on models that have broad capability. We host them in our cloud, and we make them accessible by API or through our products. For the API, any developer can get access to it, but they have to comply with the terms of service in order to use it, and if they violate the terms of service, their access can be taken away. And for other products, we may start with a limited preview with a select number of customers with well-defined use cases in mind. Collaborations with these early customers will help us make sure the responsible AI safeguards are working in practice so we can scale adoption more broadly.
We truly believe safety and responsibility is important. Hopefully, we can offer some encouragement to the whole industry. To that end, all the resources and expertise that weâve applied against developing some solutions are being shared with the rest of the broader community through our Responsible AI Standard and Principles.
Top image: Center photograph of Microsoft Chief Technology Officer Kevin Scott is courtesy of Microsoft. Left and right images were created by a producer using DALLâE 2, OpenAIâs AI system that can create realistic images and artwork from text descriptions.
How it works
Once you click Generate, Ollama reads this article and crafts 5 comprehension questions. Your answers are graded against the article content â general knowledge won't be enough. Score 70+ to count toward your certificate.
Questions are cached â you'll always get the same 5 for this article.