Everyday AI Podcast – An AI and ChatGPT Podcast
The Everyday AI podcast is a daily livestream, podcast and free newsletter where we help everyday people grow their careers with AI.
The Everyday AI podcast is hosted by Jordan Wilson, a former journalist who's now the owner of a boutique digital strategy company with 20 years of martech experience.
Our main focus is to help you keep up with AI trends to make your job easier. Get your work done faster. Increase your output.
- Make sure to sign up for our daily newsletter at: https://youreverydayai.com
- Email us: info@youreverydayai.com
- Connect with Jordan on LinkedIn: https://www.linkedin.com/in/jordanwilson04/
In the Everyday AI podcast, we'll cover all things artificial intelligence, machine learning, and practical tips on how to use both in your daily life. We'll touch on a variety of topics, software and applications. We may be covering the latest AI news from Microsoft, Google, Facebook, Adobe and social channels like Snapchat, TikTok, and Instagram. Or, we may be diving into software like ChatGPT, Midjourney, Bard, or Runway ML.
Ep 805: Codex Record and Replay: How to Teach an Agent Once Your Most Time-Consuming Workflows
Ever wish your agent would just watch you work and copy you? 👯
Oh wait, that's exactly what OpenAI released.
The new 'Record and Replay' feature in Codex allows you to simply record anything you do and turn it into a repeatable workflow for your agents.
Join us LIVE as we show you how. ⏺️
Codex Record and Replay: How to Teach an Agent Once Your Most Time-Consuming Workflows -- An Everyday AI Chat with Jordan Wilson
Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Today's Episode on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn
Topics Covered in This Episode:
- Codex Record and Replay Feature Overview
- Transition from Prompting to Demonstration
- Skill Markdown Files as Business Artifacts
- Codex vs Claude Desktop Agent Comparison
- Workflow Recording and Security Best Practices
- Step-by-Step Codex Record and Replay Demo
- Video Understanding Capability in Codex
- Editing and Sharing Skills Across Platforms
- Automating Multi-App, Cross-System Workflows
- Validation, Scheduling, and Automation of Skills
Timestamps:
00:00 OpenAI's record and replay feature
04:41 Preparing for screen sharing
08:56 Managing small tasks efficiently
10:03 Automating top story analysis
15:43 Debating AI business strategies
18:29 Codex's video ingest feature
22:01 Practicing low stakes automation
24:57 Using skills and plugins efficiently
26:19 Using ChatGPT for workflow automation
Keywords:
codex record and replay, OpenAI record and replay, record and replay feature, codex workflow automation, proactive AI agents, skill.md file, skill markdown file, editable skill files, demonstration over prompting, desktop AI agents, Mac AI features, video ingestion AI, video understanding by AI, cross-platform AI workflows, AI workflow scheduling, codex plugins, codex Chrome profile, AI screen recording, reusable AI skills, skill file cross-compatibility, Claude desktop import, Google Gemini video support, AI workflow privacy, enterprise AI permissions, low stakes workflow automation, scheduled AI automations, computer use access, browser automation AI, AI handling Chrome profiles, prompt engineering replacement, automation library, validation of AI skill files, skill validator, PyYAML error handling, skill replay, Codex versus Claude, multi-app workflow automation, real business artifact AI, automation success criteria, desktop program integrations, computer task recording, mundane workflow automation, non-technical AI setup, AI scheduled worker, AI skill refinement
Start Here ▶️
Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Circle community and all episodes: StartHereSeries.com
Also, here's a link to the entire series on a Spotify playlist.
As smart as today's AI systems are, do you ever wish that they would just watch you do all your work and then magically do it all for you? Well, that's what we have now with a new feature from OpenAI and Codex called Record and Replay. Because here's the reality: as we continue to shift from the responsive chatbot era to the proactive agent era, there's still a lot of manual work that we have to do, whether that's scheduling and automating your workflows, creating new skill files by hand, or whatever it may be. Whether you're using ChatGPT, Codex, Claude, Copilot, or Gemini, there's a lot of manual work that needs to go in to create these repeatable and autonomous processes. But I think that may start to change. And I think that with this new feature in Codex, Record and Replay, we have maybe a preview of what's coming next as we have these extremely powerful super apps that live on our desktop and can do work for us. So that's what we're going to be going over on today's show.
But here is the big picture of Codex Record and Replay. I think prompting is kind of losing to demonstration. And maybe that's not a bad thing, even if you've spent a lot of time refining your prompt engineering skills; how you type to a chatbot could be falling by the wayside. OpenAI shipped this new feature less than a week ago. And essentially, you demonstrate something on your actual computer, on your Mac right now, and it becomes an editable skill.md file that the agent can reuse. So the AI interface, I think, is moving away from just typing instructions to just showing your work and showing a series of tasks that go into a project that you may want to start automating. Right now it seems like a small feature, right? And I'm actually surprised this one didn't blow up more than it did. But I think this is a big step in transferring agency and workflow to autonomous systems.
The other reason I think this is actually a big deal is that OpenAI is kind of showing their cards in terms of model capabilities. Through whatever magic they did, they can now understand and ingest video, which is huge, because this has been a big advantage that Google Gemini has had for the last two years as the only big player that you could upload a video to and it could actually understand what was going on inside that video. So that's the bigger picture. Maybe we'll talk about that a little bit more.
But stick with me for the next 25 minutes. This is gonna be a short one on today's show. Here's what you're gonna learn. You're gonna learn what OpenAI actually changed inside of Codex beyond just the launch demo. You're gonna know why the skill.md, or the skill markdown file, is the real business artifact here that leaders should understand. You're gonna know how the shared format changes the Codex versus Claude comparison. Yeah, stick around for that. And you're gonna know which workflows you should try to record first without creating unnecessary security risk.
All right, I hope you're excited for this. This is Everyday AI and our weekly segment, Putting AI to Work on Wednesday. So once a week, I get a little brave. I demo some live technology; it's unedited, unscripted. We'll see how it goes. Sometimes these hands-on demos are amazing. Sometimes we get bugs, but that's the reality of generative AI and working on the cutting edge. Sometimes you get cut. All right. So if you haven't already, please make sure to subscribe to the podcast if this is helpful. And then make sure you go to youreverydayai.com and sign up for the free daily newsletter. We're gonna be recapping not just the highlights from today's show, but all the other AI news and developments you need to know to be the smartest person in AI at your company. All right, so without further ado, yeah, let's get dirty, shall we?
All right, so this one's gonna be a little tricky because I'm gonna be jumping back and forth between some different screens and some different things I have going on, but we're gonna try to make this work. And if you are listening on the podcast, this might be one of those that you should tune into for the video version. You can always see the video version on our website at youreverydayai.com. All right, I'm just doing one more check here before I start sharing my screen, making sure that I don't have anything too private. So yeah, not gonna be doxing myself or anyone else, or I'll only be doxing myself slightly.
All right, so let's go ahead and share the screen. And one thing to know: this is a plugin. Okay. So if you don't know too much about Codex, I've done a couple of shows recently on Codex. But if you have a paid ChatGPT subscription, then you have a Codex subscription. It uses the same subscription, which is really nice. Codex is a desktop program you can download, whether you use Mac or Windows, although a lot of the features come to Mac first, like Record and Replay.
If you haven't downloaded Codex already, go ahead and do that. You can follow along with me. After you are logged into Codex, go into your upper left-hand corner. There's a plugins section. All you're gonna do is search for Record and Replay. Then you are going to add the plugin. After you do that, if you are brand new to Codex, you're probably gonna have to go through your Mac security settings and grant it access to some different privacy features that it needs. Because, as an example, this uses computer use. So you do give it access to essentially read and write to your entire computer.
So, you know, if you are using this in an enterprise setting, make sure to get the permissions you need before doing this. Understand the pros and the cons and the risks. And that goes not just for Record and Replay in Codex, but for anything desktop, right? So whether you're talking about Codex, Claude Desktop, Claude Code, Claude Cowork, Antigravity, whatever you're using on your desktop, always keep that in mind.
So after you have Codex, you're logged in with your ChatGPT plan, and you've installed the Record and Replay plugin, here's what we're gonna do. It might seem a little backwards, all right, but stick with me here. I'm doing it like this for a reason, mainly so hopefully the demo can work. All right, so livestream audience, let me know if you can see my screen here. All right. I think we're pretty good, not giving away too much. Aside from the fact that, yes, I do have Codex and Claude Code running here at the bottom, like I always do. And a couple other secret projects I'm working on down there.
Okay, so I'm going to put in a prompt first describing what I'm going to do. And you might be saying, Jordan, that defeats the entire purpose. Yes and no. Okay, because my example here requires a little nuance. So also think of what this is and what it isn't, because right now, all this feature does is see your screen. I do think that they're going to add in the feature where you can talk to it at the same time, so you can talk it through and walk it through visually at the same time. But essentially, this is gonna record my screen, everything that I do. So, because I'm doing a live demo and I want this to hopefully work and, you know, not have to accidentally log into all these different places, I'm first gonna give it a precursor on what I'm doing. So I'm just putting in a simple prompt.
And for this, I'm gonna go ahead and put this prompt in. Okay. Nothing crazy here. I'm just telling it what I'm gonna do. So I want you to think of that manual thing that you do that usually requires what I call this context carry. So, you know, carrying context between different programs. I'm doing something extremely simple, very straightforward. All right. The thing that I'm trying to automate is something I don't have time for, even though it's not super time consuming. It's just all of these small little tasks. When I think about them in my head, I'm like, oh, this requires three, four, five different websites, all these different steps. Even if it only takes 10 minutes, a lot of times I just don't do that because my day-to-day schedule is absolutely bonkers bananas, right? In my mind, even if a task is 10 minutes, if it requires me going into five, six, seven, eight different systems, doing a lot of copying and pasting, carrying context over, even a little bit of brain power, sometimes I just don't do them, which is why I really like this new feature.
So, all I'm doing in this prompt is saying: for this chat, I'm going to be using Record and Replay eventually, but do not use it yet. And I'm just saying, here's a primer on what I'm going to do before I hit record. Yes, I know this is kind of counterintuitive, but because it requires a little bit of nuance, this is why I'm typing it out. And the reason being, what I'm trying to do is go into my Beehive newsletter, that is our email newsletter provider, and find the five most clicked on stories from our last newsletter. But here's the thing: most days, that's our actual website, right? Everyone clicks on the episode page, they go to our website. I don't care about that, right? So, what I'm ultimately trying to do is see the five most clicked stories that are not ours, right?
And not something like, you know, sometimes we link out to Twitter, or a lot of times it's Spotify. People click on our Spotify link. So I can't just go click on the five top stories. I do have to explain that nuance. And Codex would get this right, because what this ultimately produces is an editable skill that I can run whenever I want by calling on the skill, or I can make this a daily automation, which is what I'm going to do. All I'm gonna do is take the five most popular stories, take all the text in there, copy and paste it, put it into NotebookLM, and then create an audio overview, the debate style. So essentially every day I can wake up and know the five most important stories that our audience thinks about. Not only that, I get the content of everything, have it be extremely relevant, and then have two hosts, right? I love NotebookLM for having two hosts kind of debate different sides of those stories. So hopefully I can better understand how our audience is thinking about these things.
All right. So that's all I'm doing. I went through, I told it, this is what I'm doing. So now I'm gonna click the at button, all right, and then I'm gonna click Record and Replay. And all I'm doing is saying, you know, let's start. So I just clicked the at button, I typed in Record and Replay, and I clicked let's start. So now what should happen is it's gonna say, okay, we're recording now, and then it's gonna create a little recording thing at the bottom. And I want to make sure I have everything ready. Yes, I do. I'm gonna drag this over here, make sure I'm not showing anything. All right, we're good. Yeah, I can't even show all my NotebookLM notebooks because there's some private things in there. All right, so here we go. I'm just inside Codex. I at-tagged Record and Replay. I said, let's start. It should.
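The filtering nuance described above (most clicked stories, minus the show's own site and social or Spotify links) can be sketched in a few lines of Python. This is only an illustration of the rule, not what Codex generates: the click data, the example URLs, and the exclusion list are all hypothetical, and real numbers would come from the newsletter platform's performance page.

```python
from urllib.parse import urlparse

# Hypothetical click data: (url, clicks). Invented for illustration only.
CLICKS = [
    ("https://youreverydayai.com/episode-page", 410),
    ("https://open.spotify.com/show/example", 220),
    ("https://news.example.com/story-a", 180),
    ("https://x.com/example/status/1", 150),
    ("https://blog.example.org/story-b", 140),
    ("https://www.example.net/story-c", 90),
]

# Assumed exclusion list: the show's own site plus social/audio platforms.
EXCLUDED_DOMAINS = {"youreverydayai.com", "spotify.com", "twitter.com", "x.com"}


def domain_of(url):
    """Return the lowercase host of a URL without a leading 'www.'."""
    host = (urlparse(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def is_excluded(url, excluded=EXCLUDED_DOMAINS):
    """True if the URL's host is an excluded domain or one of its subdomains."""
    host = domain_of(url)
    return any(host == d or host.endswith("." + d) for d in excluded)


def top_external_stories(clicks, limit=5):
    """Return up to `limit` URLs, most clicked first, skipping excluded domains."""
    external = [(url, count) for url, count in clicks if not is_excluded(url)]
    external.sort(key=lambda pair: pair[1], reverse=True)
    return [url for url, _ in external[:limit]]


if __name__ == "__main__":
    for url in top_external_stories(CLICKS):
        print(url)
```

Writing the rule down this explicitly is the same idea as the typed primer in the demo: the exclusions are not visible from clicks alone, so they have to be stated somewhere.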
Let's see if it goes here. All right, it says it's reading the skill, it's thinking. There we go. And so now I have a little recording indicator at the bottom of my screen. So I'm gonna try to go as fast as I can, but I'll talk you through it for the podcast audience. Nothing crazy is happening here, but Codex is now recording my screen. I do have dual monitors, so hopefully this will work okay.
All right, so now I'm going into my Beehive account. I'm logged in, I'm going to the newsletter, I'm going to yesterday's newsletter. I'm going to, where am I going? Performance, yes. Okay. So now here's all the different clicks. These are all the different stories. So I'm gonna click what I think are five of the most read stories. I'm just going in order here. All right. I want to make sure all of these are actual news stories. There we go. It looks like we have them all. Okay, one is just for GLM, which just takes you to their website. So that's not gonna tell me anything. So I'm just actually gonna Google it because it looks like we just linked to their actual website. But let me just go ahead and grab the actual announcement post there. There we go. Okay, so I think I'm good.
All right, this one couldn't be reached for whatever reason. All right, we're good, we're good. What is this? This was a random tool. All right, so I'm not gonna do that one. We've got to verify myself via Cloudflare. Yeah, this is fun. All right, let's do one more. I do want to make sure that we have five that we can actually copy and paste and grab the text from. All right, it looks like this Fortune article is not working. Cool. So maybe Codex will see that I can't copy and paste this because it's, you know, apparently behind a paywall. We'll see if Codex picks up on that or not. But again, this is editable. So I can always go and say, if you can't access the text, then go down to the next story. Okay, so it looks like we're good here.
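The fallback rule mentioned above (if a story's text can't be accessed, go down to the next one) is easy to state as code. The sketch below is a hedged illustration, not the generated skill: `fetch_text` is a hypothetical callable passed in by the caller, the fake pages are invented, and the 200-character threshold is an arbitrary assumption for detecting paywall stubs.

```python
def pick_readable(candidates, fetch_text, needed=5, min_chars=200):
    """Walk candidates in order and keep the first `needed` with usable text.

    `fetch_text` is a hypothetical callable (url -> str) that may raise when
    a page is unreachable, paywalled, or blocked by a verification step.
    """
    picked = []
    for url in candidates:
        try:
            text = fetch_text(url)
        except Exception:
            continue  # unreachable or blocked: move on to the next story
        if text and len(text.strip()) >= min_chars:
            picked.append((url, text))
        if len(picked) == needed:
            break
    return picked


# Invented stand-in pages so the sketch runs without network access.
FAKE_PAGES = {
    "https://news.example.com/story-a": "A" * 300,
    "https://news.example.com/paywalled": None,       # simulates a failure
    "https://news.example.com/stub": "Subscribe to read",
    "https://news.example.com/story-b": "B" * 300,
}


def fake_fetch(url):
    """Return canned text, raising for pages marked as unavailable."""
    text = FAKE_PAGES.get(url)
    if text is None:
        raise ConnectionError(f"could not read {url}")
    return text


if __name__ == "__main__":
    for url, _ in pick_readable(list(FAKE_PAGES), fake_fetch, needed=2):
        print(url)
```

Passing the fetcher in as a parameter keeps the selection rule testable on its own, which matters if the rule later gets edited into the skill file by hand.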
So I'm gonna go ahead and grab all of the text from these. I'm gonna copy and paste. I'm gonna go into copied text. I'm gonna paste, insert, next. Let me just X out of that one. That was something about NVIDIA, about water sustainability. I'm gonna click on add sources. Here's OpenAI announcing how they're using AI to diagnose rare genetic diseases. I'm pasting that into NotebookLM. Again, I'm just copying and pasting the content of all of these pages. All right, next, there was a Micron and Anthropic announcement. I'm copying and pasting that. I'm adding this inside of NotebookLM. We have three stories so far. Let's get two more. I'm copying and pasting the GLM 5.2 one. All right, putting that in there. There we go. And then our last one here is something about someone leaving DeepMind for Anthropic. Actually, there's a ton of extra stuff on that page, so we'll see if Codex picks up that there's a lot of extra stuff here. It's a shorter article. So I'm gonna copy and paste that in, and done.
All right, the next thing, and this is all laid out in the instructions: I want Codex to go on here and click audio, but I want to customize this. So there's a little caret. I'm gonna click that. I want it to go to the debate format. All right, and then I actually have a prompt that I'm going to want it to copy and paste. And I told it this. The custom prompt that I'm using for this audio overview is this: the included sources are web page content of the most clicked articles from the Everyday AI newsletter. Please simulate how business, I'm gonna say how busy business leaders who are leading AI implementation at their companies (that's you guys, right?) may be interpreting these stories, and debate how each side thoughtfully thinks about these with pinpoint detail; avoid generalities or soft takes. All right. And then I told Codex, once I hit generate, it is done. So let me go back.
All I have to do is click the record button one more time. And what that does is send a message to Codex that says, I'm done recording. Okay, so now what it's saying is: I'm stopping the recorder now. Then I'll inspect the captured event stream directly. After that, I'll build the reusable skill rather than just summarizing the demo. So, pretty good. I'm gonna give that a minute to cook as we go over some of the other details, and then hopefully we'll see it at the end and I'll see if I can actually go and run this skill. We'll see how long it takes. Also, I just realized I forgot to do a couple of things that I told Codex I was gonna do, like renaming the notebook, which I didn't do. So we'll see what it picks up on, some of the nuance there.
But let's just quickly go over what's new. I talked about a lot of this as I was going through, but right now this is Mac only. And during the recording, Codex observes everything you do. So actions, what you click on. You can use different programs. I just told it to go to a specific Chrome profile, so when it builds this skill, it is going to tell itself to use that Chrome profile. Codex has all of these different skills, right? One of the big differences between Codex and ChatGPT is, well, Codex is an autonomous coding agent. Don't worry if you're not technical. All that means is it can use your computer, it can use any programs, any other desktop programs. One of the most used things I have Codex do is use Claude Code on the desktop, or use Google Gemini on the desktop. So I just have it use all my other desktop programs. It can read and write files, all that good stuff. But Codex observes everything you do: the different actions you take, what you click on, what you highlight, what you type in, et cetera, and also the window content.
And then those observations, in order, sequentially, essentially become this skill file that changes what you're doing, right? So when the recording stops, and this is what's happening now live on my other screen, Codex starts to create this skill. And this is what I think is really important, because a skill is an editable file, which is really nice. At any time, I can go in and update this.
But here's the big unlock that I don't think most people are talking about. Aside from the fact that, like, oh my gosh, apparently GPT-5. So before, it could use Python, you could upload a video, and it could look at things frame by frame. But as far as I know, this is the first time that OpenAI, slash Codex, slash ChatGPT, can actually ingest and understand video. And like I said, this is a huge advantage that Google has had for like a year and a half. This process here, I've actually been doing well before Codex Record and Replay. I was using Google Gemini, because I would dictate something, I would upload these little recordings, and I was creating skills. I was already doing this, but again, I'm a little dorky. I always try to break these things. But this is pretty big.
Keep in mind, this skill.md file is an asset that doesn't just have to live inside of Codex. Obviously, it's gonna work best in Codex, because everything that it writes, you know, if it's using different skills, if it's using different apps, it's going to be tagging those in a Codex manner that Codex can read. However, you can use this in Claude Desktop, right? You'll have to go in, but you can import this skill into Claude Desktop. If it does call a Codex-specific skill, you can change that over. If it's using Codex computer use or Codex Chrome, you can go and update those things accordingly.
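Before importing a recorded skill into another desktop agent, it can help to find the lines that would need changing. The Python sketch below is a simple review aid under stated assumptions: the sample skill text is invented, and matching on the word "codex" is only a rough heuristic, not a list of real tool identifiers.

```python
def flag_platform_lines(skill_text, markers=("codex",)):
    """Return (line_number, line) pairs that mention any marker, case-insensitively.

    The markers are an assumed heuristic for spotting platform-specific
    references a human should review before reusing the skill elsewhere.
    """
    flagged = []
    for number, line in enumerate(skill_text.splitlines(), start=1):
        lowered = line.lower()
        if any(marker in lowered for marker in markers):
            flagged.append((number, line.strip()))
    return flagged


# Invented sample steps; a real recorded skill will look different.
SAMPLE_STEPS = """\
# Steps
1. Open Chrome with the saved profile using Codex computer use.
2. Paste each article into the notebook as copied text.
3. Call the Codex Chrome plugin to click Generate.
"""

if __name__ == "__main__":
    for number, line in flag_platform_lines(SAMPLE_STEPS):
        print(f"line {number}: {line}")
```

This only lists candidates. Deciding what each reference should become on the other platform is still a manual edit, as described in the episode.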
But essentially, you are using this record feature to literally do anything, right? And this is why I think Codex and Claude can actually become more connected now. Don't think of this as just locking yourself into Codex, although I do think the harness in Codex is much better than Claude Desktop, and all the benchmarks show that. But if you are a heavy Claude Desktop user and that's where you get the majority of your work done, but you still also have a ChatGPT account, you should probably go in and give this a try, because that shared structure matters. Now these replays, the record and replays, can be used across many different surfaces.
And remember, it works through your screens, your browsers, and your plugins. So you can use other saved skills, you can use other plugins, because this is an editable feature. So anything that I would have typed in Codex, right? All I did when I started was say, let's start. I could have typed something out and said, let's use this skill, let's use these apps, you're gonna watch me work on my computer. I just did something simple that would hopefully work. All right. Also, scripts and connectors can increase the reliability, but screen control does increase those trust requirements. So if you are rolling this out in your enterprise, make sure that you get all the proper permissions that you need.
And the limits right now are privacy and scope. So, you know, not everyone has this. You do have to be on a Mac. If you are in the EU, UK, or Switzerland, you don't have access to this. And I would always start off with low-stakes workflows. So what I just showed, right, this isn't something that's being published live. This is something that I'm using internally. This is still a very new feature, even for me. I've been using it a lot. I like it. It's still a new feature. So I would start by using this with low-stakes things.
So, you know, I would go through and list your weekly computer tasks with all the steps and clear success criteria. For me, the success criteria for this is, okay, is it going to accurately find those five things? And I open my NotebookLM app every morning. If I go take a walk, I can listen to this and be better up to date on, hopefully, what all of you are thinking. So I would start by recording just one short workflow using realistic inputs, no secrets, no passwords, nothing like that. Read each skill when it's done before rerunning it or setting it up as a scheduled automation.
And to do that in Codex, literally, so it looks like this is done recording. It called this skill just the Beehive NotebookLM debate, right? So now I can say, run this every day at 6 a.m. Right. That's all you have to do. And then every day at 6 a.m., I can open my NotebookLM app, go into the Studio section, and it's waiting for me. All right, so read each skill file, refine preferences, and build your automation library.
All right, let's check in. I don't think I'm gonna have time to rerun this skill, but I do want to check it. All right, so it says done. So it took three minutes. Oh wow, and impressive here. That was a 167-event capture. And I always encourage people: read the chain of thought. So I'm going through here, and I can see exactly what it did. Make sure everything is correct. I'm looking through this, and it looks like it's doing a pretty good job. I don't see anything that's wrong. And then it says validation passed with the official skill validator. The only hiccup was something with PyYAML, so something with the programming language there. And instead it kind of wrote its way around that. So pretty good. And then it says that anytime, I can invoke this later using the Beehive NotebookLM debate skill. So, as an example, let me just open this.
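For the "read each skill before rerunning it" step, a small sanity check can catch an obviously malformed file. The sketch below is not the official skill validator mentioned in the episode. It assumes the skill file opens with a `---` delimited header containing `name` and `description` fields, it reads that header with the standard library only (so it has no PyYAML dependency), and it handles only simple `key: value` lines rather than full YAML. The sample skill text and its name are invented.

```python
# Invented sample; the real file Codex writes is not shown in the episode.
SAMPLE_SKILL = """\
---
name: beehive-notebooklm-debate
description: Collect the five most clicked external stories and build a debate-style audio overview.
---

# Steps
1. Open the newsletter performance page.
"""


def parse_front_matter(text):
    """Parse simple 'key: value' lines between the opening and closing '---'."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("missing opening '---' line")
    fields = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return fields
        if ":" in line:
            key, value = line.split(":", 1)
            fields[key.strip()] = value.strip()
    raise ValueError("missing closing '---' line")


def missing_fields(text, required=("name", "description")):
    """Return the required header fields that are absent or empty."""
    fields = parse_front_matter(text)
    return [key for key in required if not fields.get(key)]


if __name__ == "__main__":
    problems = missing_fields(SAMPLE_SKILL)
    print("ok" if not problems else f"missing: {problems}")
```

A check like this only confirms the header is present. Reading the steps themselves is still the part that catches a wrong click or a missed instruction.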
I'm gonna say, run the Beehive notebook skill right now. All right. So we'll just see, make sure it's gonna kick off. It should start taking over my browser here. So we'll probably see that a little bit in the background, just to show everyone that things are, well, hopefully running and working. And there we go. So it's running it now. I'm not gonna sit around for another three to 10 minutes and blab on, because I wanted to keep this one tight. You know, hey, under 30 minutes. There we go. But it's going, it's working. And now from here, I would validate it. I would run it once, I would make sure everything is correct. I would go listen to it on my NotebookLM app. I would see what I wanted to tweak. Once I get it exactly how I want it, then I schedule it. You schedule it and forget it.
But as we wrap up today's show, I want you to really understand what this unlocks. A couple of things. Number one, it is, I think, now the easiest way to create skills. Still to this day, skills and plugins, which is Codex's version of using skills and apps, are, I think, probably the new measure, right? It's not about how many tokens you and your teams use. It's about how often your agent is running a skill or a scheduled skill. I think that's huge, because that's where you are starting to actually win back time by running repeatable workflows without you, the human, having to go in, type something, find something. Oh, where's this Google Doc? Where's this Obsidian vault? Where's all this at? Right. So, you know, really recording your mundane workflows across different browser tabs. Obviously, you can do things where you're logged in like this, right? I was logged into my Gmail to reach NotebookLM. I was logged into my Beehive account. So I told it to use my Chrome profile because I keep those logged in.
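The "run this every day at 6 a.m." instruction boils down to computing the next run time. The episode does not show how Codex schedules automations internally, so the Python sketch below is only an illustration of the arithmetic, using naive local datetimes and a hypothetical `next_run` helper.

```python
from datetime import datetime, time, timedelta


def next_run(now, at=time(6, 0)):
    """Return the next datetime at clock time `at` that is strictly after `now`."""
    candidate = datetime.combine(now.date(), at)
    if candidate <= now:
        candidate += timedelta(days=1)  # today's slot has passed; use tomorrow
    return candidate


if __name__ == "__main__":
    print(next_run(datetime.now()))
```

For example, a request made at 5:00 a.m. would run at 6:00 a.m. the same day, while one made at 6:00 a.m. or later would wait until the next morning.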
So think about all of those things that you do over and over and over, right, and not having to write something out by hand. Although I kind of did a little bit of that just to ensure that this would hopefully work zero shot. But think of what this unlocks and what this changes. And it doesn't require any technical skill. I'm not an overly technical person. You go in, if you have a ChatGPT account, download Codex, go through, again, assuming you have all permissions, record these workflows, and it's done. Right. And now you have a worker working for you: you just showed it what to do and it did it. And I think this is a breath of fresh air, a very small feature that is going to make a big difference.
So thank you for tuning in. I hope this was helpful in our AI at Work on Wednesday series. Like I said, I've done a couple other Codex walkthroughs. So make sure you go check out episode 781, where we went over Codex sites. Another great one with, I think, a lot of value there. So make sure you go check that out. And then go to youreverydayai.com and sign up for the free daily newsletter. We're going to be recapping the highlights from today's show and a whole lot more. Thank you for tuning in. Hope to see you back tomorrow and every day for more Everyday AI. Thanks, y'all.