“Vibe analysis”: How Faire’s data team uses AI to investigate conversion drops, analyze experiment results, and convert raw data into executive-ready insights
Tim Trueman and Alexa Cerf from Faire’s data team demonstrate how AI tools are revolutionizing data analysis workflows. They show how data teams, product managers, and engineers can use tools like Cursor, ChatGPT, and custom agents to investigate business metrics, analyze experiment results, and extract insights from user surveys—all while dramatically reducing the time and technical expertise required. What you’ll learn: 1. How to use AI to investigate sudden drops in business metrics by searching documentation and codebases 2. Techniques for creating a semantic layer that helps AI understand your business data 3. How to build end-to-end analytics workflows using Cursor and Model Context Protocols (MCPs) 4. Ways to automate experiment analysis and create standardized reports 5. How AI can help design and analyze customer surveys 6. Strategies for creating executive-ready documents from raw data analysis 7. Why every team member should have access to code repositories—not just engineers — Brought to you by: Zapier—The most connected AI orchestration platform Brex—The intelligent finance platform built for founders — Where to find Tim Trueman: LinkedIn: https://www.linkedin.com/in/tim-trueman-99788592/ — Where to find Alexa Cerf: LinkedIn: https://www.linkedin.com/in/alexandra-cerf/ — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — In this episode, we cover: (00:00) Introduction to Tim and Alexa from Faire (02:53) The challenge of analyzing product quality and usage (04:14) Breaking down what analytics actually involves beyond data manipulation (05:46) Demo: Investigating a conversion rate drop using enterprise AI search (09:05) Using ChatGPT Deep Research to analyze code changes (12:40) Leveraging Cursor as the ultimate context engine for code analysis (18:55) Analyzing a new product feature’s performance with Cursor (26:27) How semantic layers make AI tools more effective for data analysis (30:00) Using Model Context Protocols (MCPs) to connect AI with data tools (34:17) Creating visualizations and dashboards with Mode integration (37:04) Generating structured analysis documents with Notion integration (44:39) Building custom agents to automate experiment result documentation (53:10) Designing and analyzing customer surveys (59:40) Lightning round and final thoughts — Tools referenced: • Cursor: https://cursor.com/ • ChatGPT: https://chat.openai.com/ • Notion: https://www.notion.so/ • Snowflake: https://www.snowflake.com/ • Mode: https://mode.com • Qualtrics: https://www.qualtrics.com/ • GitHub: https://github.com/ — Other references: • Model Context Protocol (MCP): https://www.anthropic.com/news/model-context-protocol • Faire Careers: https://www.faire.com/careers — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email [redacted email].
- Published
- Published Nov 3, 2025
- Uploaded
- Uploaded Jun 12, 2026
- File type
- Podcast
- Queried
- 00
- Source
- podcasters.spotify.com
Full transcript
Showing the full transcript for this episode.
AI-generated transcript with timestamped sections.
[00:00] How do we start at the very beginning of analyzing a product and its quality and its usage through analyzing conversion rates? The new AI tools have just absolutely transformed the process of just getting all that context. You can go as broad as you like self-serve into an unfamiliar topic just incredibly quickly and that means you can not only deliver quicker analysis, you can just deliver a much better analysis too. I'm going to start just by doing an enterprise AI search. So I'm just going to start very simply by asking Notion, what experiments were new features launched between [00:30] December to December 2024. [00:31] that could have added friction to the checkout process for new retailers in Europe or North America. And I've just said to focus on XP Docs, PRDs and launch announcements. I've got straight away [00:42] a really interesting list of hypotheses to dig into it. No work. You can see it searched across [00:46] Slack, Notion, Jira, and everything else very, very quickly. So Alexa, how do we do actual analysis of data when we've identified a problem or an opportunity we want to go after? Without AI, especially the context gathering would mean hours spent digging through all the specs and PRDs, writing SQL queries from scratch, and then spending a lot of time writing and editing a doc. Using Cursor to actually create, edit, write SQL has been pretty game-changing. [01:18] Welcome back to How I AI. I'm Claire Vo, product leader and AI obsessive here on a mission to help you build better with these new tools. [01:26] Today I have a great episode with Tim and Alexa from the data team at [01:29] fair.
[01:30] They're gonna show us how you can use cursor, MCPs, [01:34] chat gbt and even write your own agents to do data analysis we're going to see everything from decomposing that scary question what went wrong in september to doing detailed funnel analysis on experiments and surveys [01:48] Let's get to it. [01:49] AI is supposed to make work easier, but I've been there. Weeks of setup, endless back and forth with engineering, and yet another tool the team never really adopts. That's why I use Zapier's AI orchestration platform. It connects with nearly 8,000 apps, so I can finally put AI to work without the drama, without the delays, and without pulling engineering in every time I want to automate something. With Zapier, you can roll out AI-powered workflows that do real work across your whole [02:19] It automatically responds to leads with enriched personalized data. It checks my calendar weekly and offers smarter ways to manage my time. And it even drafts emails for every new request that lands in my inbox. All of that running quietly in the background so I can focus on the work that matters. And Zapier's built for scale. With enterprise-grade security, compliance, and governance, it's trusted by teams at Dropbox, Airbnb, Opendoor, and thousands more. [02:49] to learn more about how Zapier can bring the power of AI orchestration to your entire org. [02:55] Alexa, Tim, thank you for joining How I AI. [02:59] Great to be here. Thanks for having us.
[03:00] Thank you so much. One of the things that we can do now that I am probably personally causing in the internet world is we can just build... [03:11] A lot of product. I am always out there like, I was thinking the other day, I was like, I'm going to tweet something where I tell PMs that they should just spend a month saying yes, instead of saying no. Like, let's ship some features. And I think AI has really accelerated product development, software engineering, getting innovation to the hands of customers. But the problem it has created is we don't know if those products are any different. [03:34] Any good. So the perennial product problem, which is you can ship things and they can not make the difference that you hope they would make. And so I'm really excited about this conversation because you are going to show us how to use... [03:47] AI and even some of these tools that software engineers or product managers might be familiar with. [03:52] to do really deep, meaningful product analysis. And I spent a lot of time in experimentation. And so [04:01] I love a good conversion rate optimization. So Tim, we're going to kick it to you. [04:06] To start with, how do we start at the very beginning of analyzing kind of a product and its quality and its usage through analyzing conversion rates? Yeah, I love this. I think everyone's talking about Vibe coding, but no one's really talking about Vibe analysis. And we're heading in that direction very quickly. So let's get into it. So before we do anything too technical, I think we want to share a really broad range of examples here from the really complicated to the like. [04:28] actually incredibly simple. I think everyone knows PMs are going to have to become engineers. And then we've got a lot of issues where all of you guys are going to have to become an analyst as well. So I think there's a lot we can show here.
[04:38] So we want to start off with just a really simple use case that should be familiar to, I think, everyone listening. But I think it illustrates the point. There's often the most simple AI tools that can actually have the biggest impact here. [04:48] I think before we get into the actual demo, I think it's useful just to pause very quickly for a second on the question of what analytics actually is. [04:56] I think once you break that down, you get a much clearer view of where these current tools can be most valuable. [05:00] I think most people jump straight to the nuts and bolts from actually manipulating and crunching data. But actually, it's really just a small part of the overall process. [05:10] and the most important of the most, [05:12] difficult thing is actually just getting the right context in the first place. Because that's what separates good analysis from bad. Like you need to know it to ask the right questions, to come up with the right hypotheses, to know what analyses are even worth doing in the first place. You need to know where the data lives and you need to be able to interpret it all very, very well. And the new AI tools have just absolutely transformed the process of just getting all that context. You can go as broad as you like self-serve into an unfamiliar topic just incredibly quickly. [05:41] And that means you can not only deliver quicker analysis, you can still have a much better analysis too. So to illustrate the point, I'm gonna talk through what Sandhya, I'm guessing is a very familiar situation where a business metric suddenly drops off a cliff [05:53] and no one's got a clue what to do with it. So I'm actually, I'm going to use a real example from FAIR for this. And this happened to our new customer conversion funnel at the end of last year. [06:05] So if you've ever worked in growth, everyone's going to know new customers. They're just extremely sensitive to even the tiniest little friction.
[06:12] So almost anything anyone does in the business, anywhere, [06:16] can affect these kind of things, whether it's a sign up flow, a search algorithm, a shipping policy, like this all can affect these things. And if you're not careful, you're going to have to decomp the entire business. So let me show you how these things can just be done so much quicker. And so imagine this problem lands on my desk. I might look at a couple of just existing dashboards that exist to say, [06:37] what's going on here. And you can see very quickly the issues started in September and there was another drop in December and it seems to be concentrated in the checkout stage. [06:47] But beyond that, I've really got no idea what could have actually caused that. So let's start really bored. I'm just going to share my screen. [06:54] I'm going to start just by doing an enterprise AI search. You know, we use Notion, but frankly, every document system now is going to have an AI system. If they haven't got one yet, it's coming. And they are just game changers. [07:03] So I'm just going to start very simply by asking Notion, [07:07] what happened. Okay, so the only thing I'm going to do, I'm going to just make this more realistic. I'm going to filter the date range. I don't want it cheating and looking at the answer. It's only going to have access to the things I had access to when I actually did this. So I'm going to put it up to the end of April last year, which is fine right now. Okay. [07:22] And then we're just going to get that running. So if you think of this, all I've asked is, what experiments were new features launched between September to December 2024 that could have added friction to the checkout process for new retailers in Europe or North America? And I've just said, focus on XP Docs, PRDs and launch announcements. [07:41] Okay, so if you think about what I've done in the past, I'd have to be crawling through a million documents, doing a load of searches, going through a ton of different Slack channels, trying to work out what's going on. And instead, look, I've got straight away
[07:55] a really interesting list of hypotheses to dig into it. And they work. You can see it searched across Slack, Notion, Jira and everything else very, very quickly. [08:04] And [08:05] Let's just pull out a couple of these. So what's happening? So let's go. So you've got clearly we launched some kind of checkout experiment around this time. That's definitely worth looking in. [08:15] We've done something with a checkout blocker in Europe. Okay, lots of interesting things to dig into. Now, with a couple of clicks, I've got a good long list, but I don't really know what these things are. So I've got all the links of extra documents I could go click into, but let's just ask, as a starting point, what is EORI? Let's pick one of them. What is EORI? [08:34] so we'll just ask that it's going to run another little search give us more things now [08:37] You've got a little bit here, but it's going to start bring up a little bit more information. [08:42] to just get a bit more detail on this thing. So let's see where that goes. [08:47] Okay, so very quickly, it's saying, give me the term of what it is. And you can kind of see it's, okay, it's a regulation that's involved Europe, and someone's done something to start asking for more details, clearly trying to improve checkout and conversion rates, and then trying to bring that one in. [09:05] But... [09:06] I think [09:06] This is a great starting point. I've got some detail. But I think what's really interesting here is [09:12] Everyone knows like a PID is one part of the story, but between a PID being written and something going into the code base, a lot can happen. So to actually understand what's going on, you usually need to go... [09:21] one layer data into the actual technical implementation. I want to show you a quick trick of how I do that.
[09:28] So I think one of the best things about these AI tools is just the ability of someone who's like non-technical to access things that they couldn't previously access. And a great example of that is just being able to talk to the product code base. [09:42] I'm not an engineer. I can't write Kotlin or Swift. I used to be a lawyer, for God's sake. Instead, I can run a deep research against our code base, [09:50] to find out exactly what got implemented for this particular feature and when. [09:54] Now I'm going to do this in two different ways. I'm going to do it on ChatGPT [09:58] which I think is very simple and anyone can replicate incredibly quickly. Everyone's familiar with it. [10:03] And I'm going to do it on cursor, which is a bit more specialized, but just incredibly powerful. So I'm going to open up a new chat and I'm going to put it into... [10:12] Deep Research Mode [10:14] and make sure my GitHub is connected. So we do, it's not technical to do that. You just need to say yes a few times to get your GitHub connected. The only reason you do it on deep research is just because it's the only way you can actually access it. It's going to search our code base now. [10:27] in exactly the same way it would normally search the web on a deep research. [10:32] So I'm just going to put in a prompt. [10:34] Let's just copy that in. Now, let me talk a little bit about what this prompt is doing. [10:38] I've given it a roll. [10:40] I've said you're a senior staff engineer, and you've got expertise in all these different code bases, Kotlin, Swift, TypeScript. [10:45] and they were working at fair. [10:47] and I've given it a task to say, please conduct a forensic investigation of the co-base, [10:52] to produce a comprehensive [10:54] time sequence report of all changes to the EORI collection process at checkout between June 24 and February 25. So just making sure we don't miss anything. And the rest is just a bit of detail as to what I want this to look like.
[11:06] So I've said I want an exec sum. I want a table with all the different PRs and commits, what they've gone into. And I really wanted to focus in on the actual impact these commits had on the retailer experience. Explain it to me in layman's terms. [11:21] And then I've just put a few requirements in here just to give it a bit more context. So be precise, simple, clear language, only use GitHub sources. [11:30] I want to call out here, you're using this prompt in the context of sort of what I would call like a business incident, right? New user signups just dropped. But this is a prompt that I want the engineers watching or listening to the podcast to really pay attention to. Because if you're in the middle of a, you know, Sev1 incident and you need to trace who did what, I know so many of our engineering teams are looking either manually looking through code, looking at these specialized kind of code gen tools to do this. [12:00] aren't reaching for something like ChatGPT, [12:02] deep research to just go ahead and do this for you. And if you're a product manager looking to be helpful during an incident, this is maybe a task you can take. [12:10] on on behalf of your engineering team just to provide some additional context in the background. [12:15] 100%. I think this is great for engineers. I think it's great for just getting people to talk better to engineers. I think there's just so much you can do here. So as always, deep researchers ask me a few questions. So use discretion. We'll just answer a few of those to make sure we get it. [12:28] Use discretion and yes, please. So that'll get it going. But no. You prompt just like I do. I just say you pick, you decide, you go. I don't care. I think the fact that Pro doesn't ask you these questions made me think it's more to like make you feel like it's doing it rather than anything else. So. Yeah.
[12:43] That's going to take a bit of time. So while that's running, I want to show you how to do this in Cursor. Because I think Cursor is one of those tools that everyone thinks of for Vibe, coders, they think of it for engineers. They're not really thinking about what else it can do. And I think for both... [12:58] Analysts and non-analysts alike, it's an incredible tool. So I think [13:04] More and more people are talking about the phrase context engineering well than context engineering. I love that. It sort of actually explains what we're trying to do here. And for me, just cursor is the ultimate context engine. You can hook it up. [13:15] to MCPs. So basically, I can hook it up to every single system in our business to get all the data I need. [13:22] And that just makes it such an incredibly good accelerator for getting context and doing analysis. So I actually find increasingly, this is getting better results than deep research on GPT. So both are good, both are getting changes, but I think this is just a little bit quicker and better. So I'm just gonna make sure my... [13:37] and then all I'm going to do is just to make sure [13:40] is I'm going to drop exactly the same prompt into cursor and we'll see the two running. So exactly the same prompt. So just for context, [13:49] We are not even started on our, hasn't even got off to the races at all on the Chats of T. And straight away in Cursor, we're going and finding it's got a nice to-do list. It's saying it's going to search all the right things in GitHub. It's going to then forensic analyze it. And we'll just let this run for a little bit. You can see it's already starting to pull in the code and the pull request that we want. [14:10] One of the things that I think is interesting to call out is, you know, I've run a lot of product engineering data orgs before,
[14:17] engineering certainly day one what are you doing you're getting access to all the repos you're getting set up with github you're pulling your your local environment together i know that data teams often have a similar onboarding because they're working so closely with production data one of the things i think is going to change or if it hasn't already should change right now is i think product managers and designer onboarding first seven days has to include [14:42] at least read access to GitHub. [14:45] getting your local repository pulled down, getting all your MCPs set up, because it just... [14:52] code has become now a data source for anybody doing work, not just people [14:57] writing code. So I look at this and I think leaders out there need to pay attention and rethink basically their onboarding process because you don't want to be in a situation like this and go like, can something give me GitHub? Like, [15:07] Can I get access? I think it goes even beyond that. Like everyone should have access to every system. And it should be from day one. These tools are just the best onboarding accelerator. We've seen it for analysts. We've seen it for engineers. Suddenly, people get the content very quickly. OK, so we're already on. It's summarized everything. It's written as a national problem. We're actually starting to write things out here. [15:24] Straight away, you can see I've got a nice, exact summary. It's given me a few things, but this is what I was most interested in ago. [15:29] So I'm getting a table here for those who can't see my screen. I'm getting a table with every single PR that affected this part of the flow. [15:39] from local start in July [redacted address] to still going [15:44] but it'll probably go to somewhere like December or February, if anyone's going to go with all of these things. Now, let's just call out what this is doing. So it's given me an exact link to the specific PR that actually pushed this into the code base.
[15:57] It's giving me the name of it and it's giving me a summary of what it did. [16:01] It's saying who was affected [16:03] and it's saying, what was the impact on a retail experience? Now, if anyone's done this kind of thing, it's so difficult to do, and actually pick through all the codes and actually understand what's going on on this. And it can just be incredibly quick. And so very quickly, knowing nothing about [16:16] this feature I can already start to get really smart on what happened and I can see if I dive down here yeah you can see there was an experiment launched in mid-September right in the sweet spot of when this drop first happened and if I scroll through [16:30] getting through to looking at December, [16:33] uh yeah you can see it launched all treatment uh all users that experiment went live so this now looks like a really interesting potentially smoking gun that we can debutt into and so instead of [16:44] days talking to people about all the potential hypotheses. I can now speak to exactly the right colleagues [16:50] and have a really targeted conversation, an informed conversation right from the off rhythm, [16:55] to crunch through this problem in a matter of like hours rather than weeks here. So even if we've done any data crunching, this can just be absolutely... [17:02] game changing for us. [17:03] Yeah, and it allows you to go a lot deeper than, you know, I've been able to do historically on these kinds of analyses. You know, when you're running these high-velocity experimentation programs, [17:13] You have so many concurrent experiments. You have experiments colliding with rollouts, colliding with just plain launches. And just trying to decompose what was the state of your app on any single day is really challenging. And even if you can do the manual research to get this at a feature level, like, yeah, today we launched the one-
[17:31] One page checkout. [17:32] I think the real challenge is, well, did we implement it well? Is there anything in there that we should worry about? Did we exclude any users from that? And so I do think the ability to use code as a detailed source of truth when doing these kinds of forensic analyses really makes the difference in... [17:50] figuring out what's going on with your business. And they're getting smart enough to go one level deep as well. You can ask follow-up questions to say, how did it differ for different segments? Are there other ones interested? Like you can get so much detail just by asking questions on these kind of things without speaking to any engineers. And [18:05] And this gives me a little bit of some inspiration on other use cases for querying your code base and GitHub history for events. One of the things that I do very frequently is I do a very similar analysis to this. But I say, what is everything that shipped in the last week from the context of a customer? And then I use it to write my newsletter. So, again, like I'm starting to use our code base as a source of truth for our marketing materials. I don't have to proxy through it. [18:30] like what was in the PRD or what did a PM write or any of that stuff. I'm just like, just tell me what was in the code in the code commits, because that's what I know went live. It can interpret what the customer facing experience and intention would be. And then you can create these really interesting business and market facing assets out of that. So I just think the ability to query [18:48] your code base and your GitHub history for any use case, including this one, is really useful. Yeah, I love that. [18:54] Great. [18:55] Now, what do we do after this? So you've identified, you have a conversion rate problem, you've identified maybe a couple sources of the issue, you're going to go talk to your colleagues, you're going to look at the code.
[19:05] How do we actually do some analysis? I know we said we were going to do some vibe analysis and we have seen very few numbers. So Alexa, how do we do actual analysis of data when we've identified a problem or an opportunity we want to go after? [19:19] Yeah, so obviously like a quite classic analytics task. I'm going to take us through [19:26] You know, we launched a new product feature and we actually [19:30] want to understand how it did. So I'll take us [19:33] end to end from understanding how the feature was built, analyzing its performance, [19:37] and then producing a summary that could eventually go to our exec team. [19:42] like Tim kind of touched on without AI, especially the context gathering would mean hours spent digging through all the specs and PRDs, writing SQL queries from scratch, and then, you know, spending a lot of time writing and editing a doc. So with AI, [19:58] I can pull context similar to what Tim just did directly from the code base. I can generate queries and I can draft a synthesized doc. [20:08] And so I am going to start sharing my screen. - And while you pull that up, I have to say, people think that, [20:15] why I got into AI in a deep way was because I thought it was so fun to code. [20:20] And it was actually it made my sequel so much less ugly than it used to be. It was like my number one use case. However, many years ago, I was like, thank God. Now I don't have to bother my colleague with my disgusting sequel. I bother AI with my horrifying sequel and it can make it a little bit more efficient.
[20:40] Yeah, I mean, even just [20:42] chat GPT for the last couple months has been a game changer for [20:46] SQL queries, the problem with ChatGBT is you had to spend a good amount of time giving context, like the exact table names, the exact field names. And so... [20:54] using I mean, it's not it's [20:57] sort of [20:58] most marketed use case, but using [21:01] cursor which is what i'm going to show today to actually create edit write sequel has been pretty game-changing [21:07] especially because it's so context-aware, and I will talk about that. So Cursor can take like three to four minutes to run some queries. So I'm going to just kick off [21:16] this prompt [21:17] And then I'll explain. [21:19] the context and what I have done. [21:21] So, [21:22] While it's running, I will set the stage. [21:26] Last month in July, we redesigned the signup flow for a new payment method that we have been piloting. And this process of signup is successful when a customer links their bank account for the payments. And our old flow had been live for a few months, we had a hypothesis that we could improve it. So we redesigned the flow. [21:45] Because this is a pilot, we actually didn't have enough retailers or users. [21:51] to run an A/B test. So I just needed to do a pretty straightforward, you know, how is this performing before? How is it performing after? Historically, again, that would have meant a lot of digging through documentation, or more realistically, just pinging an engineer to ask questions like, okay, what did we build? Who sees it? And why? What front end events are omitted that I can use to analyze this?
[22:15] And while I do work closely with our engineers during the N-Spec phase to figure this out, those details are easy to lose track of, especially, we're often coming back to analyze things. [22:26] you know, weeks or even months after the feature launched. I will say that I probably would start with Notion AI context building similar to Tim, but we already showed that so I'm skipping straight to the code base. And if we [22:40] go up to this prompt. My prompts are way less pretty than Tim's. I don't spend a lot of time on them. I feel like with cursor you can always iterate. And so I wanted to understand the setup wizard, which is what we [22:53] called this new flow. I told it to research our code base and I essentially asked who, what, where, when, why. [22:59] And so if we go to this [23:01] answer, we can see, okay, it is, you know, looking in to the code base and, you know, [23:08] I'm not an engineer, I don't really know what this means, but it, you know, we called this in our code, the first run user experience. And it tells me about some flags cannot be sub users, there's just like a lot of detail here. [23:19] And it's telling me [23:21] when users see this flow, what happens during the flow, the order of steps that happen. That's pretty important. If I'm going to analyze a funnel, I need to know in what order did things happen. [23:33] And then, [23:34] if there is a success event when the setup is complete. And then it gives me a bunch of events that I can use to analyze it. [23:40] So, [23:41] This is already such a game changer. Like in the past, I would have leaned on...
[23:46] secondhand sources like Notion to piece together how it was built. With Cursor, like you were saying, I can go straight to the source and have it translated into natural language. [23:56] And that just gives me a lot more confidence because [23:58] It reflects what's actually live, not what someone remembered to write down. [24:03] One thing I want to call out while you're going to your next step is one of the steps that I see [24:08] skipped by engineering teams is good event tracking when they release a feature. Because, you know, you start up front in the PRD and you like define a tracking plan and then it gets to implementation and people forget it should be a front end event, should be back end event. And one of my favorite follow up AI tasks after something has been released or it's in code review is I do a quick prompt and I go, is this, is everything appropriately tracked in this feature? [24:38] and make sure that the schemas are normalized. So for all the data analysts out there, [24:42] be annoying and do a PR for your own events on new features. So you're not, you know, stuck with what the engineers built for you. [24:50] That inspires me to-- I can take the M spec and just put it into-- [24:55] any AI tool and say, what front end events do I, or what events do I need to ask for to be able to measure the success of this effectively? Because right now I'm just doing that in my head. That is not something that I have. Yeah. Don't do it in your head. That's the title of how I AI, how I AI. Yes. Don't do it in your head. [25:13] So with this next prompt, I again, not the most
[25:18] like, [25:19] sophisticated prompt. I'm just saying I want to understand at a high level how this feature has been performing. And I give the quick context of, you know, our goal is to make it better. [25:29] That's pretty obvious. I just want to spell that out. And I, like Tim, I'm giving a fair amount of discretion to the cursor agent. I'm saying... [25:38] OK, come up with the ideal output fields. I have some ideas, but like, you know, [25:42] It's up to you. And then two, I do find that telling it explicitly [25:47] to create a file, sometimes forgets to do that and just writes the SQL directly in the conversation sidebar, use the MCP connection like I went through all this trouble to set it up. [25:59] I want it to use the snowflake mcp connection and then actually qa the file and that's what's [26:04] so powerful about this Cursor agent and the Snowflake MCP is not only is it writing the sequel, which is what ChatGVT has been doing for me for the last year, [26:14] It is running it, looking at the output, and then making its own sniff test sense check decisions, which is just... [26:22] So cool. [26:23] OK, and then another thing I want to call out [26:26] as we are running this [26:28] The reason why I have a fair amount of confidence that this is going to work [26:32] relatively quickly is because I and our data team have done a fair amount of [26:37] work to create what's called a semantic layer. And so first, our amazing data engineering team, like, [26:44] six months ago, decided we were going to create like a general company semantic layer. And a semantic layer is essentially just a translation for an LLM of like our business terms, tables, fields, filters, metrics, etc. And AI can look at those files and understand what our tables mean.
[27:02] This general one covered like our most used products [27:05] generic tables, orders, items, users, etc. And so they connected it to a custom GPT. And anyone in the company can go ask pretty basic questions like, [27:16] what was the average order size in Europe last year and get an answer really quickly. And so that's been a huge unlock to save our analytics team time of like, we're not answering these [27:26] questions for people, they can self-serve. [27:29] It's just... [27:30] democratizing data and, you know, saving us a lot of time so that we can focus on more deep analysis and for deeper analysis like, [27:39] we needed something more than just these basic tables. And so I, with a lot of help from one of our data engineers, she's built a specialized semantic layer just for like my scope as a test. So I was, you know, where the I was the first one in the company to do this, but we're planning on kind of rolling it out to [27:57] all of the areas of scope and you know basically this [28:01] semantic layer just defines the tables that I use the most, the joins, the filters, the metrics, [28:07] And because it lives in our code base, it's like in our data science repo, Cursor can just tap into it and it just makes the zero shot ability like insane. [28:16] Of. [28:17] running SQL. [28:19] I've seen a couple of these and I don't know what yours looks like, but they really just look like defined terms tables. This table means this. This field means that. If you're trying to query average order value, this is how you do it.
[28:32] And it's almost your documentation in a little bit more of a structured form around... [28:38] common queries. And what I think is nice about this is [28:42] its ability to be managed by code. You can change it, you can update it, you can add new things. I also think for the data engineers out there, [28:49] it reduces a little bit of needed complexity on the data warehouse setup because previously you were creating these like aggregate tables and these like defined metrics and you're hoping people were writing queries the right way. And now you can define these canonical queries and, [29:03] And know that no matter kind of like what your tables look like, they're going to get to the right answer, which I think is quite nice on the data engineering side. [29:11] Yeah, so this is an example of like what you're talking about. It's just a very structured JSON file. And from what I understand, I did not do this, but I had the engineer explain the process to me. And honestly, LLM's helped a lot with creating this, you know, he fed in, [29:25] details about our data warehouse and just a million queries that I had previously written. And it kind of helped spit out [29:32] this type of thing. He also used Langchain to like [29:35] change the names of a bunch of the reports that we had into question form. [29:41] Because obviously when I'm querying this, whether it's through a custom GPT or cursor, I'm often asking a question. And so... [29:49] I thought that was pretty cool. Like translating it to a question makes the semantic layer work so much better. [29:55] Oh, this is gonna be my next project. This is so fun. Amazing. Glad to inspire. So, [30:02] To go back to the actual sequel that was run...
[30:06] And I will actually just [30:08] Run this. [30:10] Let's see, hopefully this. And just in case people miss this, you did call out the Snowflake MCP, which was what we're seeing right now, which is a programmatic way to hook into running queries in your Snowflake. [30:21] data warehouse. So you can not only generate the SQL here, but instead of like copying and pasting it and going into like Snowflake Cloud and running it or whatever your visualization tool is, [30:30] You can just run it right here. You're getting your tables right here. So again, like you're eliminating that context switching. [30:36] You're eliminating the copy and paste and you're getting your data right here. [30:39] Yep, exactly. And so... [30:41] I am oh this is interesting this actually I am looking at this and it's I think it showed a mistake [30:49] But, you know, I asked it to QA itself. Normally this has done... [30:54] does a very good job, but one of the quick QAs that I do for something like this is I want to see [31:00] No skip steps. Oh, actually, you know what? [31:04] I remember from the context, this is a temporary... [31:07] This is a step that only some people see. But usually when I'm looking through this, you know, [31:12] in a, in, [31:13] If we were not doing this demo, I would spend probably a lot longer QAing this. But I just want to see drop-off that makes sense, right? Like, I don't want to see 0, 0, and then 1, or then 0. And so that's just a quick QA. [31:26] that I can do. You know, it's not the AI's name on this analysis, it's mine. So I hope I do that. The other thing that I have done [31:34] to really make sure that I can QA this effectively is I, in my cursor rules,
[31:40] I tell it to comment every single CTE. [31:43] so that I know what the and sorry, CTEs are like sections of SQL that often are created when you're writing SQL. And I just want to know each step of what is happening so that as I'm looking at the SQL, I can say, okay, [31:59] the agent said it's doing this and like looking at this code, I can actually tell that it's doing this. [32:04] So engineers cover your ears because engineers hate, hate, hate, hate, hate when I say this. They hate it. [32:10] I love over commented AI code. And let me tell you why, because when you are not writing this code, [32:17] You really need to understand the thought process behind how the code was designed. And having AI comment the code that it writes gives you a natural language way to understand if your understanding of the implementation matches the actual technical implementation of the code itself based on the AI's own reasoning. [32:38] Fine, delete it if you want to. I don't care. I know all the arguments against overcommented code. And I think there's a lot of benefits for human review. Absolutely. [32:47] And it's also great context for AI when they go back and work on it. So engineers, you can now uncover your ears. You can yell at me on Twitter if you want to, or an X if you want to, but yeah. [32:57] I do the same thing where I say, go ahead and comment in the code so I can understand how you decompose these step by step. [33:03] Yeah, it's pretty, pretty awesome. It's also I even have a custom GPT in chat GPT to comment code I've written before I just insert code and then, you know, if I'm ever handing off dashboards to someone I really don't want anyone
[33:17] to [33:17] be so confused that they have to bother me. You know, my goal is to have it be quite self-serve. Look, those lines of code are not going to expand themselves. Let's get some common play. [33:27] This episode is brought to you by Brex. If you're listening to the show, you already know AI is changing how we work in real practical ways. Brex is bringing that same power to finance. Brex is the intelligent finance platform built for founders. With autonomous agents running in the background, your finance stack basically runs itself. Cards are issues, expenses are filed, [33:57] Add Brex's banking solution with a high yield treasury account. And you've got a system that helps you spend smarter, move faster and scale with confidence. One in three startups in the U.S. already runs on Brex. You can too at brex.com slash how I AI. [34:17] So I'm gonna kick off my next prompt, but basically like we're gonna skip ahead a couple hours here because [34:25] Up until this point, my goal was to get this kind of clean base query that I could use for dashboards in mode, which is [34:31] FAIRs BI tool. You know, a lot of what we are doing as the strategy and analytics team is creating [34:38] creating tables that then can be used for pretty charts to tell a story. [34:44] And so let's pretend that I spent... [34:46] a few hours with cursor like refining queries. I actually did one for the old flow and the new flow. I actually did do this. This is also a real use case like Tim's. And then I built some
[34:57] visualizations in mode. [34:59] What's really cool is that there is actually a mode MCP [35:04] And I can tell it to view a dashboard [35:08] directly for those who are listening. Here we have on the left hand side, our legacy flow, and on the right hand side, our new flow. You'll see that there's one step that is only present in some of the entry points. There's a split by entry point. And basically it's just showing, you know, like what is the overall success rate [35:31] and success rate by step for each of these flows. [35:34] And so this is what I have pointed the mode MCP [35:39] towards in this prompt. So if we go back to this prompt, [35:45] and I'm just going to tell it to run this tool. [35:48] OK, so I'm telling it. [35:50] Again, like, hey, go look at this mode dashboard and use this MCP. [35:55] I also give it the direct [35:58] sequel that's um that i wrote with cursor [36:01] that's powering that dashboard. I was asking it for some detailed takeaways [36:06] And next steps, I give it a little bit of context. [36:09] And I tell it to ask clarifying questions and use the MCPs if necessary. The MCPs, I think, I'm not sure if we've defined it yet, but model context protocol, I believe is what it stands for, are like so powerful. I think that that's when this has felt like magic. [36:26] the most?
[36:27] Like at first, I assumed that they were similar to APIs where [36:31] everything needs to be defined. Like some engineer on both sides needs to go to find endpoints that there's a very specific structure. It seemed like a lot of work. [36:41] These models just like know what to do. It's just wild to me. I will say that there's a lot of work [36:48] on our data engineering side to get some of these MCPs set up. So I think [36:52] Ben on our analytics [36:54] platform team has just spent a lot of time on this. Like, I don't want to minimize that step. But as the end user of them, it is like it just feels magical every time it can just access something. [37:05] And so, [37:07] if we go into the results over here, next [37:11] key takeaways and next steps. [37:13] Cool. So we [37:15] Looks like we did a good job. [37:17] Yay, fair. And it gives like a pretty detailed [37:21] list of [37:23] you know, the funnel analysis, insights and concerns, [37:26] actionable next steps, etc. Like this is already [37:30] a pretty good sort of output to start with. [37:33] But at the end of the day, like, [37:36] analysis like this only matters if you can [37:39] communicated clearly, right? Like you need to sort of convince people of whatever you are trying to communicate. So we also have a Notion MCP, and I'm going to ask Cursor to create a doc [37:52] That captures our findings in a structured way. [37:55] And I want to pause really quickly because we have done this in maybe 15 minutes where you have
[38:01] taken a problem, kind of like a pre and post analysis of a feature change. [38:06] You have written [38:08] SQL [38:09] have not used a WYSIWYG analytics tool. You have written straight up good sequel, traceable sequel, to do a funnel analysis of that on a daily basis. Very interesting. [38:20] You have made a dashboard for it so that your business users can use it. You have then done a meta analysis of that dashboard using [38:27] the MCP to actually read the dashboard, do a first-pass analysis, create a summary not only of the results, [38:33] but if recommended next steps, [38:35] And then you are going to publish that to your business using Notion. Now, I have to say, I have worked with a lot of data teams. [38:43] And most of them spending their time saying, what is the priority of this analysis? We have a backlog, I need data engineering, and fine, here's the dashboard. [38:53] *laughs* [38:54] like it's like the ones that like get promoted three times in a year that go the extra step where they're like and here's the analysis and here are my recommended next steps and i made it pretty so you can share it with your boss and i just think like i was watching this and i was like oh man i'm gonna promote this data analyst like they're pretty they're pretty they're pretty good and so i just think the ability to level up the quality of your work and think through the interesting things the interesting thing isn't like did i write this sequel join correctly
[39:24] do I have any creative ideas on what we could do next? [39:27] Can we improve this analysis for the future? And so I really like this end to end flow because it just shows how you are leveraging up [39:34] into higher strategic tasks, as opposed to spending your time sort of in the tactics. [39:40] Yeah, I mean, it's I totally agree. And we are almost done. But like you said, you know, we need to we need to communicate this. And so one thing that we have done on strategy and analytics is, um, [39:52] our chief strategy officer, Dan, like he really cares about synthesized writing and all the leaders on his team care about synthesized writing. And so we worked with him a couple months ago to actually create some guidance on how to write at FAIR, like [40:07] fair is very much a vertical doc culture, you know, pre-read culture. We're not creating a lot of slides. We are writing a lot of [40:13] docks and so we have this sort of [40:17] like use answer for structure key principles doc. And then we also have a template for what docs should look like. [40:25] And so actually in this prompt, you'll see [40:27] Like, [40:28] I tell it to follow these, um, [40:32] to follow these [40:33] rules that are in these docs. And that's like another thing that I love about Cursor is you can just tell it what rules to follow in a variety of ways. Okay, Alexa, I'm going to give you an upgrade here, which is you should reference these files in your cursor rules so you don't always have to answer them. [40:49] That's a great-- I should. [40:52] I wanted to, you know, show the full flow, but...
[40:55] The reason I don't is because it would have actually done it in the previous step. [40:58] Oh, yeah. Because it would have known, and then I wouldn't have gotten to talk about it. But yes, I will do that once we are done. [41:05] It's showbiz, folks. That's what this is. And so the last thing is I am going to pull over. [41:12] the doc, this is one that created from a previous time I did this just because I wanted to highlight in yellow. I gave instructions in this prompt to tell me what to add. [41:23] I think one thing I want to get across is this [41:27] I don't think that Cursor yet or AI can zero shot Cursor. [41:31] like an executive ready doc. [41:34] yet. [41:34] Like there's, that is where I think that we still need to do three to four revs of, um, [41:41] of sort of editing, adding analysis, making sure this makes sense. [41:46] These tools have so much context, but we have, we still have some context that is just this like je ne sais quoi thing. [41:52] like humans are still valuable. And so this is like a pretty good start. And I think what's cool about Cursor is like I cut out some of the middlemen. I got to this point like really, really quickly. [42:04] But we're not just creating like AI slop docs all over the place. We are, you know, just accelerating how fast analysts can do things like this. We, you know, [42:15] And the other thing that's really helpful about, I would run this through that guidance three or four times. It can be hard when you're so in the weeds of an analysis. [42:24] to like take a step back and
[42:26] make sure your story makes sense. And so that's what LLMs are really good for. So can like cover my blind spots. [42:33] Well, you know, what's more painful than running this three times through your guidance is sitting three times with your SVP of strategy and having them tell you this makes no sense and you need to go back and edit stuff. So again, I think what a what a nicer way to get to a higher quality output than yes, I'm having to. [42:53] It saves me time and it saves the leaders on my team time and hopefully improves the quality. You know, it's fundamentally [43:00] improving [43:01] how, you know, we are doing work on analytics team. [43:05] And one thing I want to call out for folks that are maybe listening and not watching is Alexa, my friend here, is smiling. This is fun. This is like interesting and it's fun. And you're not sitting here saying, I have no role to play anymore. The machines are going to take over. [43:20] You're saying, man, it was really boring to dig through tables and write all this sequel that I know how to write. [43:26] And I've done it a couple of times. So let's let the machines do it. And now you're able to focus on interfacing with the business, having impact. And it's just I think it's fun. Every time I get in these tools, I feel like it's magical. I feel like it's really fun. And so I want to call out, we got smiles across the board here on how I I didn't show this, but the type ahead, like if when you're actually editing the sequel, that's also so fun. [43:51] It's just fun. It knows what you want to do. So yeah, this, this whole process is very fun.
[43:57] I think what's so powerful about this is not just like making the good analysts just incredible. It's also democratizing data. So this is something that can be done. SQL can be written by people all over our business, whether you're in sales, designers, anyone else can write this. So the people with the context can do analysis just like this. And then the analysts can do the really complicated stuff where these tools could help them get really into the weeds. [44:18] I think it's great. [44:20] For people early in their career, I've said this before, and I mean it to be true. If you want to know the inflection point of Claire Vos career, it is when she learned SQL. I mean, truly, I became unstoppable at that point. And so lowering the barrier to entry on data analysis is just going to create a whole bunch of really high, high impact folks. [44:39] Awesome. Okay, Alexa, so we just saw how cursor can do end-to-end funnel analysis all the way to the proverbial front door of your SVP strategy. [44:48] Tim, let's talk about another kind of analysis, which is experimentation analysis by analysis. [44:54] Yeah, you should have close to your heart. So, look, we've talked about the big picture. We've talked about like a really detailed sort of actual analyst of how they do their day job. But I think one of the other things these AI tools are just so good is just accelerating process, like automating away some of those. [45:10] routine, lower impact steps in the analytics journey. And so as a good example, we want to show you a quick agent we build. [45:17] which automates the process of writing up experiment results. [45:21] So across FAIR, we might be running, I don't know, hundreds of A-B tests on the product a month. [45:26] And each of those experiments needs to be monitored, assessed, documented. And that just takes up so much time for analysts. So if we're going to stay on top of this very quickly,
[45:36] It's our team that can become the bottleneck and slow down our launch velocity, which is the last thing anyone wants. And I know this is something that's happening up and down the country around every single tech company. So we thought it'd be a good example just to demonstrate. So let me show you how I built this. One thing I want to really, really stress here is just how straightforward these things are to build. Once you've gone through the pain of setting up cursor, getting your MCPs in place, [46:00] Actually spinning up any new agent you can think about is just so quick and so non-technical for anyone to do. [46:06] So it all runs off a cursor rules file. So if you don't know what these are, they're literally just a type of file [46:13] an NDC file that these agents know to look for and know they're likely to contain instructions. [46:18] they're really easy to set up. It's basically plain English. So you just write a simple one line, and just a description of what it is. So format for writing experiment result using Epo data. And Epo is just, [46:31] the experiment tool that we use it basically takes our data does a bit of analysis slaps a ui around it and writes it up for us um uh so you then select when you want to apply i've just selected apply intelligent i trust the model to work out when it needs to use it they do a pretty good job [46:47] And then other than that, it literally is just writing out what you want the agent to do. Now, this might look a bit complicated. I'll generally write... [46:55] this in a few minutes in plain text what I wanted to write a last cursor to them. [46:59] Tear the thing down. [47:00] and I'll rewrite it a couple of times and just get it right in the format I want. But ultimately, [47:05] It's just a step-by-step guide of what I want this thing to do. So I've just said for those who are listening, I've said, if you're asked to write up experiment results, do the following thing. So ask the experiment name if you haven't already got it, and then go collect the data you're going to need. So use the EPO MCP we've set up. So go talk to our experiment space, pull in the actual results of the experiment, and then use our Notion MCP that we've already talked about to go pull in all the other contexts that you might need.
[47:35] that data and write up this report. And I've got a little bit down here, you can see telling it, [47:39] exactly what kinds of documents to look for. So PRDs, experiment docs, technical specifications, that's what's going to help it look for. And then I ask it to basically write out [47:51] those results in the format I give it. And then I'm pretty prescriptive about the format I want, because I want this to do it. [47:57] really consistently in the format we want with really tight takeaways. [48:02] So actually, I've asked it to create it in just a local file on my cursor on my computer. And that just means I can actually look at it before it goes create. So the Notion docs can take a peek, refine the prompt if I need to. [48:14] but that's just a fallback. And then ultimately, it's going to turn into another Notion doc so everyone else in the business can see it. And it's going to do all this incredibly quickly. And we're going to essentially see what this thing looks like in reality. So let's just run it on an experiment result. So I've just said, please write up the experiment results for, and I've given it the name of the experiment, which is vertical product iron images, and straight off, [48:35] it's gone off and it's found uh it's written itself a nice to-do list it's found the epa results so it's just called the results and it's found its results great it's found the the rules and that's going to start working this all out for me [48:47] Which is great to see. [48:49] And then while it's doing all that, we'll just have a look. So the format we've gone through, [48:53] we can just show here. So basically, the rest of this is all just showing exactly what the format of this thing is going to look like. So I've asked it to give me the document links, exactly what I want. So if I click into more context, the brief summary of the experiment, and then the key bit, the actual metrics that it's got from EPO. So it's going to show me the actual results, the confidence intervals, it's going to pull out the most important ones, and it'll give me a nice little color coding for it. And then I just want the actual answer from
[49:23] do next. And so it's written the takeaway section. So I want to clear, should we roll this out? Should we roll it back? What should we do? And give me the reasons why, like why are we doing this? And are there any other interesting insights that you found that we should call out from this? [49:37] So let's see. Right. So let's have a look at what it's doing here. [49:40] It has... [49:42] found everything we need. [49:45] It's starting to write out the doc, which is nice to see. [49:48] in this little thing. I'm just going to go ahead and queue up. So turn this into a notion. So as soon as I've read it, while we look at the actual results, [49:57] it will start writing the Notion doc. [49:59] And let's have a look. So straight away in a second, while it's running that, I've got a write up with all the right context I need. So it's got the links I needed. It's got the context. [50:10] It's pulled the right data. Good. The nice thing is this result. So this was just literally sharing vertical images rather than square images, like a really standard growth experiment, like which one performs better. And you can see a nice stat-sig lift. [50:22] or about 3.5% for the treatment. [50:26] And then it's pulled out some other interesting business metrics. [50:29] And let's have a look at these takeaways. So it's saying... [50:32] Ah. [50:33] Great, roll it out, that's the right answer because of that lift. [50:37] And it's also pulled out some interesting things. So it said our data science prediction models are also actually positive. So it's saying not only have we got more retailers, there are actually higher quality retailers, the ones we've got. So this looked good. As a first pass, this looks... [50:49] Great. [50:50] And just to call out one thing here personally, like,
[50:55] We have a standard format for doing these where you have to type the confidence interval and type the emojis. And that is like, [51:03] work that is not valuable for our team. And so it's pretty awesome that like it came up with takeaways, but also saved us. [51:11] five minutes of like fiddling around with emojis and decimal points. Yeah. I mean, AI has a translation layer between... [51:19] a SaaS interface or a SQL query into natural language in the format that you like, that your boss likes, [51:25] That's just a time saver in and of itself. So I love using AI as like the universal... [51:32] format translator. [51:34] So as you can see, I've just asked the notion link, it should produce the notion. So let's just open that up. [51:39] And let's put it on the screen. And look, straight away, I've got a nice document I can share around with everyone with all the right color codes, the takeaways. And even as a little bonus, let's see if it's done. It always has trouble getting things in a little toggle. But right at the bottom here, I've even asked it to spit out a slack with an even more summarized version. So I can just drop this into the right review channels and straight away this can go and get approved. [52:01] Are we going to do this for every complicated experiment? Probably not. There might need to be a bit of analysis. But for the simple ones, straight one shot. Even the complicated ones, this accelerates you. But also anyone in the business can start doing this, which means we can pass more and more of these things down to engineers, PMs, other people to write this kind of stuff and do the analysis for them, which again can just massively accelerate our launch velocity affair, which we're really excited for. Yeah, I'm sorry. And I know this is my brand, but I feel like AI is just accruing to every task
[52:30] Sorry, PM, it's your job now. I do like that little trend that's happening. This is amazing. Love it, have done these kinds of analyses before. They have not been this easy to read and they certainly haven't been generated in 90 seconds. [52:45] really useful tool for experimentation analysis, a call out to the experimentation tools out there that I know and love. If you have not made an MCP for access to your data, you're limiting your customers. And so I do think sort of AI integration of SaaS tools is going to be a way that teams start to evaluate the quality of tools that they're working with. So just something to think about if you're out there building data analysis tools. Okay, we are going to wrap up very quickly [53:15] cases, but yours are all so good. We're going to do a speed run through a bonus use case, which is actually designing and analyzing kind of unstructured data in a user survey. So Tim, you're going to whip us through how you could use AI to make surveys and survey analysis a lot better. Yeah, I'm going to do this really quickly. We don't want to spend time on this, but let's just show. I think it's just another one of those incredibly common analytics use cases that everyone has to do, and they are just so time consuming. You've got to design the survey [53:45] and then analyze all those dots. It's really time consuming. But end to end, AI can just like transform the whole process. Let's show another one. I'm just going to stop it. I'm not going to run these. I'm just going to go straight to my backup. So let's just start on design. [53:55] So what I love doing is I think you can do it in cursing, you can do it in many things. I think ChatGT project is really good for this. And again, incredibly accessible. Everyone knows how these work. It's just a great way of giving context.
[54:07] So if we switch over to this one, which is lovely and taking a bit of time to load, you see in files, what I did was give it a bit of background information. So what is our bit of business? This was a survey we want to design on Fair Direct tools. So that's our tools that we give all our brands to help them accelerate their sales with their own customers. [54:26] And so I've given a ton of information to the model that just says like, what actually is FairDirect? What are these tools? What's the strategy? And then [54:35] Whenever I do a survey like this, whether I'm doing AI or not, I'll start with hypotheses. That's ultimately what you want to test. And so this is my way. If I just open up. [54:45] those hypotheses. So this is what I fed it into. [54:48] I just gave it a list of simple hypotheses on what... [54:53] what we want to learn. We do align. We've got everyone aligned on some hypotheses. There's 14 in here. And then really simple. I'll just call that one like, [55:00] Higher sales on fare leads to more usage of these tools. Things like that that we ask. Now I've just given [55:07] that into it. And all I did, if I just look at this problem that we ran, [55:11] So this was a simple problem. All I did was drop it in saying, you're a specialist at doing these customer insight surveys. Design me a 10 minute survey for the thousand brands to test those hypotheses. I said, these are the inputs I've given you. Here's a bit of... [55:25] design requirements that we want. And I asked the three things. I said, turn those hypotheses into a full questionnaire that we can go out and sell customers. [55:33] But also, don't just do that. Give me the coding file that turns that questionnaire into the actual, in this case, Qualtrics, the platform you use to actually run these things, can actually design that straight away in one click and give me an analysis plan for what to do there.
[55:46] I have to pause you really quickly because [55:48] this whole episode has been Tim saying, I just did this really simple prompt. And then you see this like 1000 word, hyper structured, very organized prompt. And Alex is like, I mean, I would just go in there and be like, maybe a nice survey. I love it. So I'm a big believer that 99% of my prompts are going to be one line. [56:12] And then if I'm going to send a model, a big model to go do work for 15 minutes, I'll probably ask another model just to turn my one line into something more detailed. I want the A-B test of Alexei. You run this exact same GPT. [56:26] with a tinier prompt. And you tell me if you get the same quote. See what happens. See what happens. Maybe I'm just there. I don't trust it quite as much as Alexa does. Just yet. [56:34] Okay, so what do we get from that? So very quickly from a list of hypotheses, I've got straight away a really nice first pass of a survey. [56:42] It's going to ask a load of questions, it's about the right length. Like this can just massively accelerate the process. [56:48] And then once we've got that right, it's also given me that coding file, which I just scroll on screen. These things are painful to write. So just having this a one-liner to tell exactly how the system should prompt this and write it out. [56:58] is just like saves hours of time for our research operations team. And it even then translates that into an analysis plan that says, [57:05] this is what the outputs from that are going to look like. So straight away, this whole thing can go from a list of hypotheses into something we could probably get out to our customers by the end of the day. Now that's like shortens this enormously.
[57:17] But what happens when you get the results back? That's the other thing this can do. And so [57:21] Again, I'll do this incredibly quickly and just show you the final result. But I did a very similar prompt as well. So all I did, I'm going to show you the file I dropped into this, just show you how painful this is. [57:30] So, [57:31] I just get the same hypotheses and look how bad this is. Like it's the raw output from Qualtrics. Like these usually take a lot of cleaning. It's one line for every respondent and then one column, not just for every question, but for every possible answer to every question. So these things are incredibly dense for anyone who's worth them. And they take a bit of time, a bit of playing with. [57:52] So the only other thing I gave it was [57:54] a sub helper file [57:57] which was... [57:58] basically that sort of coding file that I just showed you. So it's the, what's the question ID? What's the question language? What's the answers? And then is it, I just add these two columns, which is like, [58:10] Is it a demographic question or an answer? And is it a single choice or is it a multiple choice? That's all I gave there. And then I've written another one of my fun and simple prompts. So roll, task here, just analyze the survey results. Find the right most interesting things in this data and then judge the predefined hypotheses. So I want a table that basically says like, for those hypotheses, was it right or was it wrong? [58:35] And then again, I always end on little quad check rest. I don't want it to go away 15 minutes before and come back with something that isn't very useful. [58:43] And let's have a look at this just very quickly. [58:45] So, [58:46] I've got a nice little somewhere out front.
[58:47] And then there are my 14 hypotheses. Oh. And it's got a nice table that says proved... [58:53] neutral, disproved for each of them. And it's even because I asked it to give me a nice confidence score. So I said, one, it's really confident in this. Five, it's not very confident at all. And you can kind of see the different levels throughout this. [59:05] And then beneath it, I've got for each of these, actually a specific analysis that I have to do. So just throw all the insights it found to back up those findings. [59:13] So is this the only analysis we're going to do on this survey? Almost certainly not. [59:17] But day one, I've got the results, I've thrown it into this, and within a matter of minutes, [59:22] I've got a much, much better intuition of what all that day is showing. So while I might go and do some analysis on this, I can be so much more targeted on exactly what we want to look into and where I want to spend my time. And straight away, we can start sort of sharing some of these findings out with people very, very quickly. [59:40] Oh, no. So I'm reflecting now after this episode. [59:46] Like, okay, I've told everybody to ship a bunch of features. And now I'm going to be like, do a bunch of analysis. Like in my mind, I'm like, oh my gosh, I'm underusing AI to actually understand my business. And it's so accessible. [59:58] And if I can just write, [1:00:00] 17 point prompts like Tim, I can get really high quality insights. But I do want to call out, just reflecting on this whole episode in your four workflows, what I love about what you're showing us [1:00:11] is so many people think that AI is an input to producing [1:00:15] a thing. [1:00:16] but haven't done that full circle back to analyzing the thing, sharing the thing, communicating about the thing. And I think you're showing both sides. You can create
[1:00:24] with AI and you can analyze and communicate with AI. And I think looking at both sides of that coin is really [1:00:30] useful. Okay, we are going to do the one and only lightning round question because we have gotten long on this episode. And I want to get you all back to all of your agents and MCPs and analysis. [1:00:41] We're going to go back to prompts one last time. We're going to figure out. [1:00:45] your personality around prompts, [1:00:47] Alexa, Tim. [1:00:49] when ai is not listening when you're mcp [1:00:52] will not call the tool, [1:00:54] What is your prompting technique? Alexa, what do you do? [1:00:57] I think that mine's pretty straightforward where I think the problem that I run into most frequently is that I'm clearly running out of context. [1:01:05] Like a conversation has gone so long that it's starting to be wonky. And so while I think, you know, level one is just starting over, what AI is best at is summarizing. [1:01:18] So I'll say, hey, summarize like what we've done so far in this, you know, 30 turn conversation and then use that to start over because, you know, like. [1:01:29] Like I've heard other episodes, people say you want to figure out like where it got off track. [1:01:33] Clearly, I'm a pretty efficient person. I don't you know, I'm not Tim, I'm not like writing out the entire prompt for 20 minutes. Like I don't have time for that. I just want to say, hey, summarize what happened. We're going to start over, but I'm going to give it that summary. So at least the new conversation can get some context from the old. [1:01:50] Great. And Tim, what about you? So much to do for my prompts. It's all AI. It's all AI. Work my chat to you. So I generally will go and open up
[1:01:58] three windows on cursor. [1:02:01] And I'll do three chats with three different models and put the same prompt in and get myself a cup of tea and see what comes back. That's the British stereotype in me and getting my cup of tea while I do it. Yeah, you run the A-B test is what you do. Yeah, exactly that. Okay. [1:02:12] I love this. Tim, Alexa, where can we find you, and what can we be helpful with? [1:02:17] You can find me on LinkedIn. My full name is Alexandra. And ways to be helpful. Our strategy and analytics team is hiring across the board. Our team partners super closely with PMs and our go to market team. [1:02:31] We make [1:02:32] strategic data-driven decisions, super fun. We have tons of open roles. So if you like experimenting with AI, we are very AI forward. [1:02:40] So you can learn more at [1:02:42] fair.com slash careers. [1:02:44] And you can find me on LinkedIn as well. And I echo that as well. Like, come join us. If you love AI, come join us and show us how we can do more here. [1:02:50] Okay, we will link to your careers page in the show notes. Alexa, Tim, this has been so fun. Thank you for joining How I AI. [1:02:57] Thank you for having us. Thanks so much for watching. If you enjoyed the show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts. [1:03:08] You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. See you next time.
Want to learn more?