How Devin replaces your junior engineers with infinite AI interns that never sleep | Scott Wu (Cognition CEO)

**Scott Wu **is the co-founder and CEO of Cognition Labs, the creators of Devin, an AI agent designed to function as a junior engineer on software development teams. In this conversation, Scott demonstrates how his team uses their own product to accelerate development workflows, reduce engineering toil, and handle routine tasks asynchronously. Scott walks us through real examples of how Devin integrates into Cognition’s daily operations—from researching and implementing new features to responding to crashes and handling frontend fixes. He explains how Devin differs from traditional AI coding assistants by functioning more like a team member than a tool, allowing engineers to delegate well-scoped tasks while focusing on higher-level problems. What you’ll learn: 1. How to use DeepWiki to research your codebase and generate better prompts for AI engineering tasks 2. A workflow for treating AI agents as asynchronous junior engineers who can handle multiple tasks while you attend meetings 3. Why public channels create better learning environments for both humans and AI when implementing engineering solutions 4. The top five engineering tasks AI excels at: frontend fixes, version upgrades, documentation, incident response, and testing 5. How to implement a “first line of defense” system where AI agents analyze crashes before humans need to intervene 6. A technique for bringing voice AI into meetings as an additional participant to answer questions without disrupting flow — Brought to you by: Google Gemini—Your everyday AI assistant Vanta—Automate compliance. Simplify security. — Where to find Scott Wu: X: https://x.com/ScottWu46 LinkedIn: https://www.linkedin.com/in/scott-wu-8b94ab96/ — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — In this episode, we cover: (00:00) Introduction to Scott Wu and Devin (03:53) Where Devin excels (06:08) Using DeepWiki to research codebases and create better prompts (10:27) Prompting tips (11:24) The asynchronous nature of working with Devin (13:38) Multithreading tasks (14:43) Using Devin to implement an MCP server integration (18:38) Setting up workflows in Slack for first-line responses (23:22) Encouraging AI adoption in public Slack channels (25:50) Top five engineering tasks for Devin (32:17) Using ChatGPT voice as a meeting participant (35:57) Lightning round — Tools referenced: • Devin: https://devin.ai/ • DeepWiki: https://deepwiki.org/ • ChatGPT: https://chat.openai.com/ • Windsurf: https://windsurf.ai/ • Slack: https://slack.com/ • Linear: https://linear.app/ • GitHub: https://github.com/ — Other references: • MCP (model context protocol): https://www.anthropic.com/news/model-context-protocol • TanStack Router: https://tanstack.com/router/ — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email [redacted email].

Published: Published Sep 8, 2025
Uploaded: Uploaded Jun 13, 2026
File type: Podcast
Queried: 00
Source: podcasters.spotify.com

Full transcript

Showing the full transcript for this episode.

AI-generated transcript with timestamped sections.

0:00-1:51

[00:00] - Devin is async. Once you kick off the Devin session, Devin's gonna start working and looking through the code, but you're not expected to be there with it. It's just as if you gave your intern a project and your intern is going and working on it. - Devin's my favorite intern on my team and I have infinite of them. Why don't you pick a task that you might bite off for your product and show us how you would work through that end to end. - I'll say, please go research the chat PRD MCP server. So this will produce a pull request for us. Often you're running a few of these at once, [00:30] way to have multiple tasks going and then check in on each of them. One of the benefits of this from a How I AI use case is you can multi-thread a lot with tools like this and set two, three, four, five, ten of these going at once on different projects and not feel like you have to sit there and babysit things. Welcome back to How I AI. I'm Claire Vaux, product leader and AI obsessive here on a mission to help you build better with these new tools. Today is a very [01:00] me because we're talking to Scott Wu, CEO and founder of Cognition Labs. [01:04] and the builder of one of my favorite AI products, Devon. We're going to hear about how Scott uses DeepWiki and Devon to kick off well-scoped tasks to get things done, [01:14] uses Devin as his favorite and most tagged employee inside of Slack, and how he's making it not weird to bring ChatGPT voice into your meetings. Let's get to it. [01:26] This podcast is supported by Google. Hey everyone, Sreshta here from Google DeepMind. The Gemini 2.5 family of models is now generally available. 2.5 Pro, our most advanced model, is great for reasoning over complex tasks. 2.5 Flash finds the sweet spot between performance and price. And 2.5 Flash Light is ideal for low latency, high volume tasks.

1:52-3:25

[01:52] Start building in Google AI Studio at ai.dev. [01:56] Scott, thanks for joining How I AI as Devin's number one reply guy on X. I am really excited about this conversation and for you to show off how your company works [02:09] uses and you use the product that at least makes me very happy and I'm sure makes lots of software engineering teams out there very happy. So welcome. Thank you so much for having me. Now, I'm honored to be here, honestly. I'm a big fan of you guys and all the work you do. So [02:24] - Great, well, we wanna get into, we have lots of stuff to talk about, but what we really wanna do is get into [02:30] how you AI and in particular how you AI with the products that you've built. And, you know, I think what's really fun as somebody who's building AI products, [02:43] It's something you get to use every day and get really good at, but also probably show some of our listeners and watchers some tips and tricks about using the tools that you've built that they may not have thought about so far. So we're getting the expert look into how to AI with the Cognition product. So what are you going to show us first? And what are some of your common workflows when you're doing engineering work or trying to move the product forward? [03:13] an AI that can code is has got to be one of the coolest things that we could probably spend our time on. I wanted to show a couple of flows actually of how we use the Devon stack because there are a few different pieces involved with flack and linear. There's

3:26-5:02

[03:26] the wiki, obviously, and then there's like, ask Debit, and then there's, you know, starting Devon sessions and getting call requests out of it. I think there's some real, um, [03:34] I think there's some real nuance and like, what are the right flows of like, how do you work with Devon as an employee? Because I think it really is quite different from a lot of the tools out there, which are much more kind of like an IDE, for example, or like a terminal UI or like Devon is, is. [03:47] I think first and foremost, almost like an engineer on your team. Yep. [03:52] Totally. So what are some of the things that you reach for with Devin and the capabilities that you think really make a difference for you as a software engineer? The way that we like to describe it is Devin is a junior engineer. And so Devin is not going to go. And, you know, we're working on getting Devin to senior engineer, obviously, you know, we'll get Devin the promotion and everything. [04:22] make and then kind of like execute on for the next month, like you probably want to be involved in those as well. Devon can help you with the decision, obviously, by kind of like, referencing the right things or giving a few things or input. But I think where Devon really shines is [04:36] One way that we say also is kind of like tasks, not problems. And so often when you have a very clear, like, here's exactly what we need to go do, and here's the task, and here's all the details of what we need, Devin is really great at going and executing that for you and makes that much faster. And so naturally, I think the next question that comes to mind then is like, how do you figure out the spec or the task exactly that you want to do?

5:02-6:33

[05:02] And so a lot of other tools like Wiki and Search, you know, really are here for you to be able to kind of like ask the right questions that you want about understanding the code base or what needs to be done and then putting a task together. I think in practice, a lot of the use cases that we see all the time are, you know, probably number one is just. [05:21] Crawling through your issue backlog, you know, whenever, um, [05:25] Whenever you have an issue that comes up, or, you know, we have a lot of Slack channels where we talk about issues, and then on every single one of them, we just tag Devon as the first pass. That's a big one. And so, like, you know, someone says, oh, you know, we need to go fix this thing in front end, or, you know, maybe we need to go support this other, you know, support this other MCP, for example, which will show in a second. Things like that. And then for a lot of the other kind of, like, [05:47] I'll say like engineering toil use cases, it also does really, really well. And so often that's like, you know, going and doing a version upgrade or added documentation throughout, you know, your repo or adding unit tests for a specific thing that you have up or responding to, you know, a crash report that just came up and trying to diagnose what went wrong. Yep. I love that. What you said about, you know, Devin's a junior engineer. I say Devin's my favorite intern on my team. [06:17] of them. And then I like this idea of scoping task, not problem. And I do think it's something that people are working with AI and AI. [06:26] Even other AI tools, not in the engineering space, really thinking about task level orientation.

6:34-8:04

[06:34] sets you up for success or at least a sequence of tasks can be very helpful. And so why don't you pick a task that you might bite off for for your product? And let's you know, show us how you would [06:46] work through that end to end. Yeah. [06:49] Yeah, let's do it. [06:50] So as you might know, I'm a huge fan, actually, of Chatbeardy. [06:54] And the national thing, [06:56] that change in line for me was... [06:58] we need to integrate into chat priority as MCP server. [07:01] And so I was looking into how to do that with Devin. [07:04] And so the first thing that I always kind of go to as an initial thing is what we call the dbookie, which basically for any repo, this is true for public or private repos, you can come in and get a full AI generated. [07:17] documentation of the repo. And so in this case, this is the Devon web app repo, appropriately, there's not too sensitive here. But it's basically, you know, it explains Devon, it's pulling a lot of information from readme or understanding the system architecture. [07:33] And I can search this and pull up different things. And so, you know, if I understand, [07:38] how the MCP marketplace is set up, you know, that'll point out what particular components there are, or what particular files are called here. [07:46] Um, [07:47] And I can read up on this and kind of understand exactly how this is set up. [07:51] And the natural question here that I might ask is, [07:56] Okay, cool. But just show me where the MCP server list is implemented. [07:59] And to this, we'll look through our repo. And Devon, at this point, has done a lot of work.

8:04-9:37

[08:04] in the Devon web app repo for your standard work. And so that helps a lot, which is Devon builds this representation of the code base over time. [08:12] And we can see what's going on here. It has all this. [08:15] And so you're getting both like sort of a natural language explanation of how the server list is implemented. And then you also on the right side of this for folks that are watching, get the actual code snippets and reference files that you can view and really understand the deep layer of the code. So you have like sort of a combination of let me explain. [08:36] how it works and then this is the nitty gritty. [08:40] Yep, combination of English and code. I think it's an interesting one where it's like, you know, I think someday it'll probably all be English, you know, but I think especially now, you know, in this current period, I think we're really in the era where obviously you have to pick up you as the engineer want to be looking at BERT, English and code. [08:55] And you can see here, it's giving you kind of the answers of what's going on. And in particular, I'll point out, OK, here's our list of all the different marketplace servers that we have. And we have an Atlassian, MCP, and there's a HubSpot, MCP, and so on, right? [09:09] And from here, the natural thing that I'll want to do here, which is what we found to be a big flow for folks, is to use this to produce actually a props for depth. [09:21] And so the whole idea is [09:24] Now that we're in this context, we know what the questions were, we know what part of the code base that we're looking at. [09:29] It gives a lot for Devin to be able to start from. And if we have it after a pass in mind, then we can get that going. So I'll say,

9:37-11:13

[09:37] Please go. [09:38] research the chat PRD MCP server. [09:42] And add [09:45] that. [09:45] Get to this. [09:48] And so what this will do, [09:50] I use basically constructed debit prompt from this. And so this has, you know, my prompts here, which I just typed in, which is not super refined. But it also has on detail about, you know, the part of the code that we're in and what components we're looking at, and so on. [10:07] And so then it will generate for me this prompt in Denon that I can just go ahead and use the Meteor. [10:12] And you can see here, it'll tell, you know, [10:15] You want to follow the pattern of existing servers like Atlassian and HubSpot. Here's the exact type dip structure that's being used here. Here are the functions that you should be working at. And here's what you should check to make sure that it works. [10:27] One of the things that I want to call out for folks in terms of a workflow that they should think about is a lot of people, myself included, sorry, Devin. [10:35] would have just sent that prompt, which is... [10:38] Add, you know, add chat parodies, MCP server to the list. And I do think that one very short but important loop of... [10:48] take this prompt and turn it into an effective prompt given the context you know and then sending that into the task [10:55] to do [10:56] just saves you a lot of heartache and it feels like extra friction at the time. [11:02] But I think pretty soon is one going to be the job to be done of the tool itself. So does that like loop become invisible either through these reasoning models or some application layer that manages it?

11:13-12:51

[11:13] and two is just worthwhile for people to do so this you know when you're thinking about sending a five-word prompt [11:19] Think instead saying, here's my Fiverr prompt, build me a better prompt and sending that into your system. [11:24] Yeah, for sure. And I think it's, you know, it's a great call because, you know, as we said, Devin is async, right? And so from this point onward, the nice thing about this is, you know, once you kick off the Devin session, Devin's going to start working and looking through the code and reading online about chat, for example, right? I'm going to do all this, but, but you're not expected to be there with it. [11:43] And so it's going to work on its own. It's just as if you gave your intern a project, and your intern is going and working on it. And so they can ping you on Slack and ask you if there's questions or something, or you can go kind of like, [11:55] You can go take a quick look and see how your intern is doing. But you don't have to be sitting there with Devin for every step of the way here. And so one way that we kind of describe it is, [12:04] For a lot of tasks, there's often this sync component, like a synchronized component, and then an asynchronous component. And a lot of what Surge and Wiki is for, is for doing the synchronous part of the task before you do the async. And so if you had an intern, for example, would you just send them the five word Slack message and just leave it at that? [12:22] maybe sometimes for something that is like, you know, super clear. And then, you know, exactly often what you actually would do [12:27] is you would sit down with them, talk it through for two minutes and be like, okay, yeah, like, you know, you know how we have this MCP marketplace, and then we go and look at it together, you know, we read the particular line of the code. And then you say, okay, yeah, so let's, let's add chat PRD to this. And, you know, just go take a look at how that MCP server is implemented and make sure we add it to the list. And then you kind of hand off there, right? So you kind of have the first

12:51-14:23

[12:51] two minutes of going back and forth with Devin, your intern, and then as soon as you [12:56] hit go on the Devon prompt, you're kind of expecting it to be more of an async reason where you don't have to be in the loop. Well, and one of the things I want to call out for people that are building AI products out there, you know, like you, like me is... [13:07] In these sync products, latency really matters. People get really frustrated with wait times. But if you set up your product [13:15] to really be this asynchronous modality, you actually buy yourself a lot of user love on waiting time because there's not that expectation. Just like you would not say, "Hey, intern, [13:26] okay, now go research this other MCP and do a PR for me and come back when it's ready. [13:32] You know, just like that would not be something you expect an intern to come back to you immediately. [13:39] You also, from a product perspective, don't expect Devin to come back immediately. Now, one of the [13:44] benefits of this from a how I use case is you can multi-thread a lot with tools like this and set [13:53] two, three, four, five, ten of these going at once on different projects and not feel like you have to sit there and [13:59] and babysit things. And so I'm wondering, [14:02] you know, while this is running, do you go pop off and go to a meeting or get a coffee? What has this sort of like asynchronous workflow done? [14:09] enabled for you. [14:10] For better or for worse, I'm in meetings for a lot of the day. And it's great to be able to just kind of kick these off or, you know, you have an issue backlog or pay them at least three or four things I was hoping to look at today, right? And you click off each one with Devin.

14:23-15:59

[14:23] And then, you know, these go and work asynchronously. And it'll make the pull request for you in GitHub. And it'll kind of show you the diff and what work it went through. If it's like a chronic change or something like that, it'll send you the screenshots of what, of the before and after, right? You can see it's going and researching chat PRD. Well, I will say just my, clearly my SEO on the MCP is not good. But Devin did make my MCP homepage. So it's in the top nav. Yeah. That's funny. [14:53] It doesn't work for me, so it should know. Yeah. Cool. So this is pretty. So I think for sure, often you're running a few of these at once. And like you said, it's just a nice way to be able to have multiple tasks going and then check in on each of them. [15:09] Yep. And so what this would do, and maybe we can come back to it later when it's done thinking, what this is going to go do is it's going to go do research. [15:18] It's going to find my docs page on the MCP server that Devin did make for us. And then it's going to pull that docs in and then you're going to get actual code out of this. Your goal for this is to get a PR, right? Right. [15:31] Yep. So this will produce a pull request for us. [15:34] And then from there, I'll be able to review the pull request. And then if that looks good, then I'll merge it. And then obviously we'll have this out in the next step. Amazing. And then your prompts are going to be so much better. And I'm feeling guilty. So... [15:46] I am just going to slack you the the the MCP homepage and you can give that to Devin to go. Yeah, sure, sure, sure. You're going to get a true live, true live demo here.

15:59-17:35

[15:59] Yes, yes. This is when your intern comes back to you and says, hey, I was looking this up and I couldn't find it. Can you point me to where? Okay, you have it. chatprd.ai slash product slash MCP. Okay. [16:12] Okay. It has code snippets and everything. Okay. Okay. Here we go. Great. So this, this is a good example of you've done your research, you use that research to create a better prompt, you use that prompt to kick off a task. [16:26] that task is worked asynchronously as a sort of like more junior engineer would work, including doing [16:33] research in your code. [16:35] external to your your business and then it's going to go ahead with the context of your repo [16:41] and do a PR and ship this feature. And otherwise you would have had to like ask somebody to do this. And I think about... [16:48] For me, I think about the people [16:50] that you'd have to involve in something like this. Like you'd have to go find the senior engineer that wrote [16:55] The MCP server... [16:56] code and say, like, please explain it to me. [16:59] You'd have to, you know, get you take the time to write out that nice spec of this is what you want to do. And then you'd have to task it to somebody to actually implement. And so I think you can press that. [17:11] workflow of a team of like, you know, even three people's time into, you know, about 10 minutes to get something done. [17:21] Yeah, yeah. No, and I think often a lot of the folks who we see who really, really love debtor and, and use it this way, especially are, you know, folks who are like tech reads or product managers or, or things like that, you know, it's, it's a great kind of intersection of

17:36-19:12

[17:36] One is, on the one hand, you're already used to the flow of kind of like, you know, figuring out an issue and getting into what is going on there and then handing off something, you know. [17:44] of here's exactly what we need to build, right? And then I think two is naturally like the async workflow for people who are in meetings or who have a lot of back-to-back going on. [17:53] is just a great way to kick off and check in on tasks quickly. And so, um, [17:58] from starting things from the web app or serving things from Slack, for example, is a nice thing if you're not in your IDE all the time. [18:05] You can start tasks from the IDE as well, obviously. But we see this kind of flow a lot with leads and PMs, basically, who are going back and forth with a lot of things. [18:17] Yeah, one of the things that I've been telling people more and more is as part of your PM onboarding, you should be now giving everybody access to GitHub, which isn't something that typically happens in a lot of product organizations. Giving access to GitHub, giving access to tools like this, because I think it does enable product managers to... [18:37] do a lot more so while this is running what i wanted to talk about is you know before we got into the show [18:44] You and I are saying you're just a little bit busy, you know, over the last month, doing a few, few interesting things with with the business, in addition to I'm sure wanting to build and spending time with the team. And so. [18:57] You know, this asynchronous nature and this junior engineer on demand. How do you actually use that day to day to just keep a float on top of all the stuff that's coming in your team? You know, not I have a feature I want to build. Let's go build it. We just saw that flow. But like.

19:12-20:54

[19:12] the kind of reactive stuff in your company? How are you... [19:16] using AI to stay on top of that and keep the velocity high? For us, a lot of it is just setting the right workflows in our Slack and in our org and so on. And so, you know, Devon obviously has knowledge, which means it'll learn your code base with time as we keep working with it, or you can kind of give it more details about how certain things work. And a lot of things are almost just like [19:37] institutionalizing Devin as first line of response, is how I would describe it. And so I could show a few examples. The big thing is to really get to the point where [19:48] For a lot of these different things that we file, you know, like Devin is the first person that gets tagged on all of these, right? And Devin won't be able to do every single thing, you know, on one shot on the first try, but often you're working back and forth with Devin and Devin puts up a PR and there's some slight touch up that you have to do at the end or that you have to build, then you're able to do that. And so we have a ton of channels where we go and talk about issues or [20:12] various things that we need to build or things like that. We have one for all the crashes that come in. We have one for core infrastructure things that come up. We have one for-- this is the one for our web app. [20:24] Um, [20:25] which is hopefully a little bit less sensitive. And you can see here, basically every single thing that folks talk about and we do, we start in Dev and Fetch. And so it's like, hey, can you standardize? [20:39] The font size, spacing, and style for these three levels, right? And then, you know, we just go and start the Devon session. And Devon will make the PR, it'll go through the PR. This one gets merged, because there's some back and forth feedback here.

20:54-22:25

[20:54] And so like Devin goes and edits, let's see. [20:59] Thank you. [21:00] And so, Devin made this PR, there were a couple back and forth edits, and then Dave, our engineer, went and merged this. [21:08] And this is often how we work. You know, it's how this is another good example. Hey, Devin, can you make it to that when you command click on a notification? It takes you to that in a new tab, right? Natural feature, probably one of our users requested it. And you just started Devin's session. [21:23] And Devin will give you this progress update of here's what I'm doing so far. Here are the files that I'm looking at. Here's what I see. [21:29] In this case, by the way, it's actually confidence medium. And then says, oh, no, no, no, you should take a look at this thing instead. [21:36] One of the cool things I want to point out too, [21:39] is because of this, Devon is a naturally multiplayer experience. And so we will often have a few different folks going back and forth. Or if somebody else is looking at this issue, or if somebody else is the expert on this part of the code base, [21:52] They'll go and give their own kind of input here, and Devon will just go back and forth with them as well. And so really, it is just a thread where a group of you are communicating and figuring out how to work on this issue. And Devon is just one of the players in the thread, right? And so, you know, Ethan comes into Walden's thread here and says, "Hey, you know, I'm [22:10] Make sure to use a link element from tan stock router, and then gives that feedback, right? And then, [22:17] Devin goes and makes that change in the pull request. And so you can see Devin had an initial thing. [22:22] and then had some additional [22:24] Commit.

22:25-23:55

[22:25] And it went and did this... [22:27] link from Tansac Roger instead. [22:30] As an AI founder, you're used to sprinting towards product market fit, your next round, or that first enterprise contract. But speed isn't enough for AI startups. Buyers expect security, compliance, and transparency from day one. That's why serious AI startups use Vanta. With deep integrations and automated workflows built for fast-moving AI teams, [23:00] as your models, infra, and customers evolve. AI innovators like Langchain, Rider, and Cursor scaled faster and closed bigger deals by getting security right early with Vanta. Listeners can claim a special offer of $1,000 off Vanta at vanta.com slash howiai. You know, one of the things that I like about this, and again, kind of a shout out on our use case [23:30] or AI adoption in their teams is doing this as much as possible in public is really helpful from a learning perspective. So one of the experiences I had running the engineering team at LaunchDarkly. [23:45] was when we started putting Devon and Devon-like agents in public channels, we saw a lot more adoption.

23:55-25:36

[23:55] and upskilling of our team on how to actually talk to [24:01] these agents how to get the right outcomes. And so, you know, I, we were talking earlier and I was saying, I DM Devin all the time. It's because I have no employees, no one to talk to. He's my only buddy. But I DM, I DM Devin all the time. And we have these sort of like side conversations. He's [24:19] "my intern on the side, [24:21] But in larger organizations, I was very much a... [24:25] do it in public channels, do it where people can see it. Because not only does the work get done, and it's nice kind of muscle memory to tag in these tools immediately, but also just learning [24:37] how you use them. What is an effective prompt? What are the kind of things that it's good at and not good at is really useful for just overall engagement with these tools. And so [24:49] I think hiding your AI use is kind of the worst thing you can do. You can do it in work. So I say do it all in public. [24:56] Yeah, yeah, yeah. And I think there's two sides of it. And then I was gonna say work work. One is like that kind of like, [25:02] When we talk about these multiplayer experiences, right? I think there are two benefits, right? One is this kind of like, [25:09] Um, [25:10] that the knowledge transfer for the agent itself, which I think more and more products are starting to have, which is, you know, one person uses Devin or uses this tool or that tool, right? And that adds to the knowledge of the tool itself. So that, you know, a week later, when somebody else does that session, Devin's like, Hey, oh, yeah, I just touched this piece of the code last week. Like, I know exactly what you're talking about. Let me go and find that. And then the other side is kind of like educating the humans, right? Or like, you're showing each other what your experiences

25:40-27:23

[25:40] - Yeah. [25:40] And I totally agree. I think because of both of those, United will see a lot of experiences in AI productivity get more and more multiplayer. Yeah. Yeah, that's my hope. OK, before we move on from from Devin and your use of it for engineering, I want to get really specific. So you'll go and then I'll go. [25:58] What are your top five [26:01] Like everybody can reach for them tasks that Devin can do for you. And you pick kind of like five categories of tasks and I'll pick five. [26:08] Okay, sounds good. Yeah, so top five. I think miscellaneous front-end fixes, it's amazing for. I mean, because often that full workload is like, [26:18] you know, for various reasons, like you said, you have to get like three different people involved, right? And it's like, here's what we're going to do. And then you bring in somebody who looks at that code, and there's somebody else who's reviewing or something. And now with this, [26:30] You tag that, you explain, here's a screenshot. [26:33] you know, I want to make this button a little bit more round, or, you know, I want to touch up the design here and I want to do AppFindMe, right? And it'll go and do that, it'll find the right parts of the curve, it'll do the implementation, but also, it'll [26:45] sends you the before and after screenshot as well. Right. And so you can just kind of review it in line there. And that's just like a really, really great use case both I think because [26:54] Similarly, it's verifiable for the ages, but it's also verifiable for the human rights. Yeah. And while while you're you're saying that, I will just pull up an example of this, which is. [27:05] Let me share my screen, which I rarely get to do here. [27:08] Yeah. [27:09] It's a very exciting window. Let's do always thrilling to share your Slack. As you can see, my only friends are agents. But here's an example of it I just did very recently, which is I'm working on the Chat Parity homepage. And

27:23-28:59

[27:23] you know, Devin shoots back to me, here's a new a new hero image that I like. And I was able to give [27:29] feedback on on that. So that's this is kind of exactly what you're talking about, which is like, let's make changes. [27:35] and then get kind of that immediate [27:38] feedback back right in your workflow. [27:41] Yeah, fixes, new Capernan changes that you want to make in your front end. It's super, super nice because, yeah, as you're saying, you can just kind of [27:49] Do this all inside, basically. So that's probably number one for me. I think number two that comes to mind is... [27:56] version upgrades, migrations, things like that. And so, you know, like upgrading your node version or getting onto, you know, the latest packages and so on. So it's a big time. So, you know, we all have to do it. And then, you know, somehow these new packages just come out so quick. But obviously the devil is in the details of like finding, you know, that this new version will say, oh, you know, every instance of this component is, you know, we recommend that you use, you know, [28:26] and do the semantic search and find each of the components and make the right changes. Number three, I'd say is [28:32] Um, documentation they want as well. Um, and so we have our, you know, Devon docs, for example, um, like our, our, our own kind of like docs page, uh, like the external docs page. [28:42] And I mean, Devon has written the entire thing. DeepRicky itself, obviously, is kind of an extension of that. But even writing your own docs pages or putting the series together, a lot of what Devon does is go again processing the code base and understanding

28:59-30:48

[28:59] this references that and, you know, here's what this does, and so on. And so it's a funny one, in the sense that it's not strictly a writing code use case, or it isn't always but but but I think it's so closely related to it that a lot of the same capabilities are really valuable there. [29:16] Hey, number four that I would say is, um, [29:19] instant response actually. And so we have this setup so that whenever there's a crash, the first line defender [29:26] you know, on pager duty, basically, is Dev. And Devon gets the page and Devon gets started. [29:32] goes and you know, kind of runs a session. And obviously, you probably want a human there to you, especially for these big incidents that to make sure what's going on. But the nice thing is [29:41] you know, it's like 4am and, and you're kind of like half asleep. And then you get to your computer and Devin has already written a report of like, hey, I looked at it, [29:49] I think it was this change from like last week that happened or yesterday that happened. You know, here's exactly where, you know, the trace of the error goes. So we use that a lot. It's a huge lifesaver for us. [30:01] And then number five, let's see. I would say... [30:05] Adding testing. [30:06] is a big one for us. You know, it's a very common thing where this is especially for kind of like, individual engineers as they're going and working on things, you know, you have your whole PR, you built things out, you built a new feature. And, and, [30:22] always be the last thing that you have to do before you ship it is you have to go and add your own unit tests and make sure your thing works right um and the nice thing again is like devon will go and do that it will make a test and then it will run the test locally itself and make sure those tests pass and so we can iterate with you to nurture the link pass make sure the ci pass and so on um and just kind of like add those for you all right well we're very close my my five are very close so i love those so to recap and i'll augment yours with mine so

30:48-32:25

[30:48] Number one, friend and fixes. My particular version of friend and fixes is I think these AI tools can really help you do polish, really nice interactive user experiences where you wouldn't normally be able to spend time on them. So any of those like little magical moments. [31:03] that you don't want to toil in front end on, I think it's really good at [31:06] Docs, I think, is underrated. I actually have a GitHub action that every PR gets opened, gets reviewed by Devin, gets the PR description rewritten by Devin. And then after the PR is closed, Devin goes and ships. [31:19] our documentation, internal documentation into our repo so that Devin has access to the doc. So I think he's like an excellent technical writer. I too have Devin first line of defense for incidents. So Devin actually has a Sentry login and logs in to Sentry and goes through all of our open issues and starts to fix stuff for us. Definitely upgrades. And then the one that I didn't hear you say, but I just think is a more like operational and [31:49] personal benefit is [31:51] it's like 24-7 availability rubber ducking, which is like when you're working on something and you're just like, [31:58] Can you just look at this and see if I'm being crazy? If this is crazy? You know, Sunday night, Monday night, Saturday morning, where you like really don't want to bother a colleague. I just think having... [32:10] something to like sort of rubber duck with is, is really nice. And so, [32:15] Those would be those would be my use cases. Very similar. [32:18] Okay, Scott, we're going to close with just one really high-level use case outside of the Devon ecosystem, which is...

32:26-33:59

[32:26] voice. And you were telling me a really interesting chat GPT voice use case that I hadn't heard before. So do you mind spending a few minutes just telling us about that? [32:36] Yeah, for sure. So I'm a big fan of voice. I actually think there are a lot of interesting, you know, we've played around, you know, we have voice in Windsor now, actually, as of wave 11 to it partially because of that. But in short part, the way I'm describing it is like, [32:50] I think, you know, Google itself, like 20, 25 years ago, what was [32:56] basically a better encyclopedia, right? You know, we have all sorts of things that you want to look up and pull together and so on, right? And it basically, it got you a faster answer, and it got it to you, you know, [33:07] with more up-to-date information of what was going on. And I almost think of ChatGP voice as like a better Google, you know, like you can get an even faster answer. It's fully synchronous, you can do it in the conversation. And then obviously you have all of the detail on it. It can go and research and do these other things too. [33:24] What I'll often do is, you know, if I'm in a meeting, and we'll be talking about things, you know, there are always questions that come up, like yesterday, I was in a meeting, and we were talking about this, which is, [33:34] Um, [33:35] There are so many orgs out there with tons of software engineers. And so we were kind of thinking, what are all the companies that have, let's say, 10,000 plus software engineers? And how many are there in the world? Obviously, the big bench out there, tens of thousands of software engineers, big tech companies. Those are the first couple, maybe the Accenture, Infosys, that category. Those are the first ones that come to mind. But what are all these different companies that have it?

33:59-35:42

[33:59] Um. [34:01] And, you know, naturally in a meeting, it's kind of rude to just go on your phone and just kind of like, you know, be totally unresponsive for like two minutes as you're looking. So it said what I'll often do is I'll just pull out ChatsyPT and go on voice. [34:14] And it's basically like adding chat to keep different conversations. And so when I say, hey, can you please tell us how many companies that have 10,000 plus software engineers? And then whether it's voice to voice or whether it's voice and then you kind of get the response in text, I use both of those modes a lot. [34:44] I was going to say, in the encyclopedia area, if you were going to look something up, it took like, I don't know, five minutes or something. So you had to go pull the right letter of the alphabet or something and find this. And then Google got it for like 10 seconds. Yeah. [34:58] And like voice is kind of like getting it from like 10 seconds down to like one or two seconds where you can just get on instantly and just say what you want to say. [35:05] And that actually matters, I think, for being able to go back and forth or just like having, you know, very off the cusp, like off the cuff questions that you that you want to ask. [35:15] Yeah, I was going to say, you know, you've maybe changed my mind here because I used to think that voice mode was like super socially disruptive in that it feels so unnatural to like talk during a meeting. But if you flip it on its head and you're like, no, this is just another meeting participant that I'm putting into the room. It actually is is more socially inclusive. Everybody hears the result, right? You're not like slacking around links and then people are opening them up on their laptop and reading while somebody is talking like.

35:42-37:18

[35:42] everybody sort of like clued into the synchronous nature of this new, new information. So yeah, [35:47] If I had people to be in meetings with and not to brag, but I have very few meetings, then maybe I will bring chat GPT into it. Okay, we will do. Must be nice. Must be nice. It's the dream, man. [36:00] So quick lightning round questions. We will get you back to your work. First one, it's like picking between your children. I know now. [36:06] The IDE, the terminal, or the agent. What is going to be the form factor to rule AI engineering? [36:14] I really think of this in the future as [36:17] you know, we call it coding agent and control. Like a lot of what this becomes is actually just, [36:21] the next generation of human computer interface [36:25] And like the way that I like to say it is Tony Stark doesn't have a laptop. [36:29] Like, like, you don't need one at some point, if you're just you have your Jarvis plugged in, and you're going back and forth with your agent, and then go and do these things for you, right? And you can imagine that building software is just kind of like, [36:40] You're not looking at your code. You're not looking, you're just working at your own products, right? And you're looking at your own products and you're saying, hey, let's make this button rounder. Look, let me add this thing over here. Let's save this and, you know, let's ask the user for this and that info, you know, and you're just making the changes in real time in your products. And your agent obviously is going and implementing this for you. And so I think it's a, it's, it's. [37:00] Certainly very agentic, but I think it's almost like [37:02] we might whether we call it an idea or an agent or whatever, it really is basically just like a [37:08] a different human computer interface where you are just looking directly at the product rather than having to go through all your code or go through. You know, and so I think that's that that's the future version.

37:19-38:51

[37:19] some years out. I think today [37:22] I would say [37:24] I think a lot of it depends on the cohort. And so I'm, for example, in meetings all the time. [37:32] Unfortunately, you know, not that. But yeah, you know, and because of that, I actually think the Slack agent workflow is like a super, super natural one, you know, or like linear, for example, and tagging, you know, dev and perm linear. I think for an engineering IC user is, you know, gets to code for, you know, eight or 10 hours a day, again, must be nice. But then the idea [37:54] is kind of the natural place for a lot of this starts, right? Which is, you know, you'll have these things that run in the background and you'll have these asynchronous processes that are going as you're doing your thing. But the natural place to get started for that is the IDE today, I'd say. I also just think what's nice about this era is like the form factor can come to you and you can decide what the interface is that works best for your workflow. Okay, you know, as somebody, Devin is my buddy, I am sure you get lots of chats that would give us very good insight [38:24] which is, [38:25] when you are frustrated with our sweet, sweet intern, Devin, [38:29] What is your prompting technique? And I know you all monitor this because when I get frustrated, [38:35] Sometimes I get little credits back, little credits back. Like you did that wrong. I get credit back. So I know you see a lot of human language to agents, but what is your strategy? What do you find yourself doing in a moment of, you know, frustration or being blocked?

38:51-40:23

[38:51] I can give some advice. I can't say that I've always followed my own advice. But a lot of what it looks like, I'd say for an agent, especially is, I think the agent is a little bit different from from a chatbot in the sense that like a chatbot, [39:05] There's less to go off of is kind of like how I want to say it, right? Where with a chatbot, it's like, you know, you ask the question, it gives you the wrong answer. And it's like, [39:12] No, that was the wrong answer. That's all you can really say. With an agent, one of the nice things that you can do is you can go through [39:18] and look through all the history of what he was doing. And so we had an example of that just now where Devon got stuck of like, [39:26] I see the chat security page. It's probably have an MCP server. I'm like trying to find the documentation on this. Right. And if we go and scroll through the logs and we'll see like what happened, that it Googled it and it found some other things. Right. And that was what the issue was. Right. And so, so from there, it's kind of like. [39:41] you take that information and then you understand, oh, Devin was missing the link to this page. And then you send that. And so I think a lot of it actually with agents is just [39:50] Um, it's kind of like pair programming or pair debugging with an intern. Like you want to, you know, first you get to go through and see, okay, here's all the steps that you took. Oh, by the way, it's like, you know, I think you missed this one file, which is, you know, the downstream reference of this. And that's why there was the bug or something like that. [40:07] I think that's the biggest thing that will really move the needle. Okay. So review the history, figure out where it went wrong, and then re-instruct. Okay, Scott, this has been so fun. Thank you for showing us. Where can we find you and how can we be helpful?

40:24-41:08

[40:24] Yeah, yeah. So we're Cognition and Devin on Twitter. We officially got the Twitter of slash Cognition, which is great. And then obviously it's Devin.ai if you'd like use the product. Great. Well, thank you so much and I appreciate you spending the time with us. Cool. Thank you so much for having me. [40:50] You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. [41:01] You can see all our episodes and learn more about the show at howiaipod.com. [41:08] See you next time.

Want to learn more?