The Digital Transformation Playbook

Beyond Workflows: The Rise of AI Agents

• Kieran Gilmurray


Curious about what AI agents really are and how they're reshaping automation? This deep dive cuts through the jargon to deliver precisely what you need to know about these powerful systems built on large language models. We'll listen to AI explain AI.

TLDR:

  • Three key components make an AI agent: an LLM as its brain, tools to interact with the world, and guardrails to ensure appropriate behaviour
  • AI agents excel where traditional rule-based systems fail: complex decision-making, overly complicated rules, and processing unstructured data
  • Start building with the most capable model to prove your concept, then optimize later with smaller, faster, cheaper options if needed
  • Tools come in three types: data tools for fetching information, action tools for doing things, and orchestration tools for calling other agents
  • Clear instructions are vital - leverage existing SOPs, prompt the agent to break down tasks, and anticipate edge cases
  • Begin with simple single-agent systems before moving to multi-agent approaches like the manager pattern or decentralized pattern
  • Implement layered guardrails including relevance checks, safety classifiers, PII filters, moderation tools, and risk-based controls
  • Human intervention remains critical, especially for high-risk actions or when the agent struggles with certain tasks

We explore how AI agents fundamentally differ from traditional software by independently accomplishing tasks through their LLM "brain," specialized tools, and carefully designed guardrails. Rather than just following rigid rules, these systems can reason through complex problems, adapt on the fly, and make nuanced judgment calls, like having tiny specialized workers available 24/7.

You'll discover the three key scenarios where AI agents truly shine: handling complex decisions requiring judgment, replacing brittle rule systems that have become maintenance nightmares, and processing mountains of unstructured data. We break down the building blocks of effective agent design, from choosing the right model to crafting clear instructions and implementing proper safety mechanisms.

The conversation moves from simple single-agent systems to sophisticated multi-agent architectures, explaining when to use manager patterns versus decentralized approaches. We emphasize the critical importance of layered safety measures, from privacy protections to content moderation, and the continuing role of human oversight, especially for high-risk actions.

Whether you're just exploring the concept or actively looking to implement AI agents in your organization, this episode provides the clear, practical understanding you need to evaluate their potential and approach their development responsibly. The future of work is changing โ€“ are you ready to rethink what automation can accomplish?

Support the show


๐—–๐—ผ๐—ป๐˜๐—ฎ๐—ฐ๐˜ my team and I to get business results, not excuses.

โ˜Ž๏ธ https://calendly.com/kierangilmurray/results-not-excuses
โœ‰๏ธ kieran@gilmurray.co.uk
๐ŸŒ www.KieranGilmurray.com
๐Ÿ“˜ Kieran Gilmurray | LinkedIn
๐Ÿฆ‰ X / Twitter: https://twitter.com/KieranGilmurray
๐Ÿ“ฝ YouTube: https://www.youtube.com/@KieranGilmurray

๐Ÿ“• Want to learn more about agentic AI then read my new book on Agentic AI and the Future of Work https://tinyurl.com/MyBooksOnAmazonUK


Introduction to AI Agents

AI Voice One

Hi there. You're probably here because, like us, you want to get a real handle on something important, you know, without wading through endless articles and jargon.

AI Voice Two

Yeah, cut right to the chase.

AI Voice One

Exactly, and today we're diving deep into AI agents. We've gathered a bunch of info that really paints a picture of these systems, the ones powered by large language models, or LLMs.

AI Voice Two

Right.

AI Voice One

And they seem like more than just, you know, clever software. This could genuinely be a fundamental shift in automation.

AI Voice Two

It really could. A move beyond the workflows you have to manually kick off.

AI Voice One

But something that can, what, act independently on your behalf?

AI Voice Two

Pretty much. Our mission today, then, is to pull out the core knowledge you really need to understand this potential game changer.

AI Voice One

Okay, so let's start there. An AI agent, how's it different from software that helps me do something?

AI Voice Two

Ah, good question. Well, the sources we looked at define an agent as a system specifically designed to independently accomplish tasks. It's about delegating entire processes, not just like individual steps.

AI Voice One

Independently accomplishing tasks, yeah, okay. So what makes it an agent, then, rather than just a really fancy program that happens to use an LLM? What are the essential ingredients?

AI Voice Two

The material consistently points to three key things. First, the agent uses an LLM as its core, like its brain, its reasoning engine. It actively manages a workflow and makes decisions as it goes.

AI Voice One

So it's not just spitting out text, it's directing things.

AI Voice Two

Exactly, orchestrating actions. Second, it needs access to what are called tools.

AI Voice One

Tools? Like software tools?

AI Voice Two

Sort of, yeah. Think of them like extensions or plugins. They let the agent interact with the outside world: query a database, send an email, search the web, that kind of thing.

AI Voice One

Gotcha, so it can actually do stuff.

Core Components of AI Agents

AI Voice Two

Right. And third, and this is crucial, its operation is governed by defined guardrails, instructions and boundaries to make sure it behaves acceptably.

AI Voice One

Okay: LLM brain, tools for action and guardrails for safety. Makes sense, but when would you actually go through the effort of building one? It sounds like a bigger deal than standard automation.

AI Voice Two

It definitely can be, and that's a really important question the sources address. Agents truly shine where traditional rule-based automation starts hitting its limits. Limits like what? Well, take payment fraud analysis, for instance. A standard system might just flag transactions matching very specific preset rules: bang, rule triggered. But an AI agent can reason through the context. It can look at subtle indicators, things that don't fit a neat rule, and make a more nuanced judgment. It's almost like having a tiny fraud investigator working 24-7.

AI Voice One

Ah, I see. So it's less about rigid if-this-then-that and more about understanding the bigger picture.

AI Voice Two

Exactly, it moves beyond those brittle rules towards something more flexible, almost intuitive, you could say.

AI Voice One

So are there specific areas where this really pays off, signs that an agent might be the way to go?

When to Use AI Agents

AI Voice Two

Yeah. The material highlights three main value areas. First is complex decision making. You know, workflows needing judgment calls, handling weird exceptions, adapting on the fly, like approving a tricky customer refund.

AI Voice One

Right where it's not just black and white.

AI Voice Two

Precisely. Second, situations where your rules have become insanely complicated and a nightmare to maintain. Think vendor security reviews with thousands of branching rules.

AI Voice One

Oh yeah, I can imagine.

AI Voice Two

And third is when you're drowning in unstructured data, like sifting through thousands of customer emails written in natural language or pulling key facts from messy insurance claim documents.

AI Voice One

Okay, complex decisions, hard to maintain rules or lots of unstructured data.

AI Voice Two

If your problem ticks one or more of those boxes, an agent is definitely worth considering.

AI Voice One

Right. So okay, let's say you've identified a good use case. Where do you start designing one? What are those core building blocks?

AI Voice Two

Right, so back to those three core components we mentioned. First, the model, the LLM itself.

AI Voice One

The brain.

AI Voice Two

The brain, yeah. And different models have different strengths, right? Some are better at complex reasoning, some are faster, some are cheaper.

AI Voice One

So how do you choose?

AI Voice Two

Well, the common advice seems to be start prototyping with the most capable model you can get access to. Really push the boundaries, see what's possible.

AI Voice One

Prove the concept first.

AI Voice Two

Exactly. Then, once you've got something working, you can experiment, try smaller, faster, cheaper models and see if the performance is still good enough for your specific needs. Optimization comes later.

AI Voice One

Smart, Prove it, then refine it. Component one the model. What was number two?

AI Voice Two

The tools. These are those external functions or APIs (application programming interfaces) that let the agent interact with the world outside the LLM.

AI Voice One

The hands, basically.

AI Voice Two

The hands, yeah, that's a good way to put it. The sources break them down into roughly three types. You've got data tools for fetching info: querying databases, reading files, searching the web.

AI Voice One

Okay.

AI Voice Two

Then action tools for doing things: sending emails, updating Salesforce records, creating support tickets.

AI Voice One

Makes sense.

AI Voice Two

And interestingly, there are also orchestration tools where one agent can actually call another agent as one of its tools to handle a subtask.

AI Voice One

Whoa agents using other agents? Okay, meta.

AI Voice Two

It can get pretty sophisticated. The point is equipping the agent with exactly the capabilities it needs for its job.
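The three tool types can be pictured as plain functions an agent is equipped with. A minimal sketch, with every name and signature hypothetical (stubs, not any real SDK or the episode's sources):

```python
# Hypothetical sketch: the three tool types as plain Python functions.
# All names, data and return values are illustrative stubs.

def fetch_customer_record(customer_id: str) -> dict:
    """Data tool: fetches information (here, a canned lookup)."""
    fake_db = {"c-001": {"name": "Ada", "plan": "pro"}}
    return fake_db.get(customer_id, {})

def create_support_ticket(summary: str) -> str:
    """Action tool: does something in the outside world (stubbed)."""
    return f"TICKET-42: {summary}"

def call_refund_agent(request: str) -> str:
    """Orchestration tool: delegates a subtask to another agent (stubbed)."""
    return f"refund agent handled: {request}"

# An agent is then equipped with exactly the tools its job requires.
TOOLS = {
    "fetch_customer_record": fetch_customer_record,
    "create_support_ticket": create_support_ticket,
    "call_refund_agent": call_refund_agent,
}
```

The point the hosts make maps directly onto the `TOOLS` dict: you register only the capabilities this agent's job needs, nothing more.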

Building Blocks: Models and Tools

AI Voice One

Got it. Model, tools, and the third piece was instructions?

AI Voice Two

Instructions, yes. These are the explicit guidelines and the guardrails that define how the agent should behave. Think of it as the agent's rulebook or standard operating procedure.

AI Voice One

And getting these right sounds critical.

AI Voice Two

Absolutely vital. Clear instructions reduce ambiguity, improve the quality of the agent's decisions and prevent it from going off the rails.

AI Voice One

So how do you write good instructions for an AI? Is it like writing an email to a colleague?

AI Voice Two

Not quite, no. The sources suggest starting with what you already have: existing standard operating procedures, maybe customer support scripts, internal wikis.

AI Voice One

Leverage existing knowledge.

AI Voice Two

Exactly. It's also really helpful to prompt the agent itself to break down big tasks into smaller steps. Like, okay, outline the steps you'd take to resolve this issue.

AI Voice One

Ah, make it think about its own process.

AI Voice Two

Yes, and for each step you need to define a really clear action or outcome, minimize wiggle room, and, this is key, anticipate the weird stuff, the edge cases. What happens if the database is down? What if the customer gives contradictory information? You need instructions for that.

AI Voice One

Plan for the unexpected.

AI Voice Two

You have to. Interestingly, the sources even mention using other advanced LLMs to help generate the initial set of instructions by feeding them your existing documents. There's even an example prompt for doing that.

Instructions and Orchestration

AI Voice One

Using AI to bootstrap the instructions for another AI. That's efficient, I guess.

AI Voice Two

It's a potential accelerator, for sure.

AI Voice One

Okay, so you've got your model, your tools, your carefully crafted instructions. How do you actually make the agent, you know, run? How does it execute a workflow? This is orchestration, right?

AI Voice Two

Precisely. Orchestration is all about the patterns and strategies that let the agent follow those instructions and use its tools effectively to reach the goal.

AI Voice One

And where do you start? Seems like it could get complicated fast.

AI Voice Two

It can. The advice is generally to start simple, usually with what's called a single-agent system.

AI Voice One

Meaning just one agent does everything?

AI Voice Two

Well, one primary agent manages the whole process. It might have lots of tools, but it's one central brain coordinating things. It runs in a loop: read the instructions, figure out the next step, maybe use a tool, get the result, figure out the next step, and so on. The run keeps going until a specific condition is met. Maybe the agent calls a specific task-complete tool, or it generates the final output you wanted, or maybe it hits an error it can't resolve. Or, importantly, it might hit a maximum number of turns or steps to prevent it from just running forever.

AI Voice One

A safety mechanism.

AI Voice Two

Definitely. The material actually mentioned a function like runner.run from something called the Agents SDK, a software development kit for building these. Think of that as the go button for the agent's loop.
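That loop can be sketched in a few lines. This is a minimal, hypothetical sketch of the decide-act-repeat loop the hosts describe, with a stubbed "model" standing in for the LLM; it is not the Agents SDK's actual implementation:

```python
# Minimal sketch of a single-agent run loop: decide, maybe use a tool,
# repeat until done or a max-turns safety limit is hit. All names are
# hypothetical; stub_model is a stand-in for an LLM call.

def stub_model(state: list) -> dict:
    """Pretend LLM: looks up data once, then declares the task complete."""
    if not any(step["action"] == "tool" for step in state):
        return {"action": "tool", "tool": "lookup", "arg": "order-7"}
    return {"action": "final", "output": "Order order-7 has shipped."}

TOOLS = {"lookup": lambda arg: f"{arg}: shipped"}

def run(max_turns: int = 10) -> str:
    """The loop's 'go button': runs until a final output or the turn limit."""
    state = []
    for _ in range(max_turns):              # safety limit: never run forever
        decision = stub_model(state)
        if decision["action"] == "final":   # exit condition: final output
            return decision["output"]
        result = TOOLS[decision["tool"]](decision["arg"])
        state.append({"action": "tool", "result": result})
    raise RuntimeError("max turns exceeded")
```

The `max_turns` cap is exactly the "maximum number of turns" safety mechanism from the conversation: even a confused model can't loop forever.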

AI Voice One

Okay, and if that single agent has, like, dozens of tools and complex logic, how do you keep that manageable?

AI Voice Two

Ah, good point. Prompt templates are apparently very useful here. Instead of writing unique instructions for every tiny variation, you create a template with placeholders, variables.

AI Voice One

Like a fill in the blanks prompt.

AI Voice Two

Exactly. So for a call center agent, you might have variables for customer name, account age, issue type. You fill those in based on the current situation. It makes the core instructions much easier to manage and scale.
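A fill-in-the-blanks template like the one just described might look like this. The company name, variable names and wording are all hypothetical, just to show the mechanic:

```python
# Hypothetical sketch of a prompt template with placeholder variables,
# so one set of core instructions scales across many situations.

CALL_CENTER_TEMPLATE = (
    "You are a support agent for {company}.\n"
    "Customer: {customer_name} (account age: {account_age}).\n"
    "Issue type: {issue_type}.\n"
    "Follow the standard procedure for this issue type."
)

def build_prompt(customer_name: str, account_age: str, issue_type: str) -> str:
    """Fill the template in for the current customer and issue."""
    return CALL_CENTER_TEMPLATE.format(
        company="Acme",
        customer_name=customer_name,
        account_age=account_age,
        issue_type=issue_type,
    )
```

One template, many situations: only the variables change, so the core instructions stay in a single place.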

AI Voice One

Makes sense. Reuse the core logic.

AI Voice Two

Yeah, and the sources generally advise pushing that single agent approach as far as you can before jumping to multiple agents.

AI Voice One

Why is that?

AI Voice Two

Because coordinating multiple agents just adds another layer of complexity. You'd only really move to multi-agent systems if the logic gets super tangled or if the single agent has so many tools it keeps picking the wrong one, you know.

AI Voice One

Okay, so only add complexity when you really have to. But if you do need more than one agent, what then? That's multi-agent systems, right?

Multi-Agent Systems and Patterns

AI Voice Two

This is where you break down the workflow and have several agents collaborating. The sources focus on two main patterns here. Pattern one is the manager pattern. Imagine a central manager agent acting like a project lead. It doesn't do all the work itself; instead, it directs traffic. It calls on specialized worker agents as tools: hey, translation agent, translate this to Spanish; hey, database agent, fetch this customer record.

AI Voice One

So the worker agents are basically tools for the manager agent?

AI Voice Two

Pretty much. The manager assigns tasks, collects the results from the workers and then synthesizes the final output or decides the next overall step. The example given was that translation scenario: a manager using separate Spanish, French and Italian agents.

AI Voice One

Got it. Like an orchestra conductor, making sure everyone plays their part.

AI Voice Two

It's a perfect analogy. The manager keeps control. The sources did mention a contrast here with some visual, flowchart-style builders, saying that while those look clear, a code-first approach, like with the Agents SDK, might offer more flexibility for these complex interactions.
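The manager pattern from the translation example can be sketched as a function that calls specialist functions and synthesizes their results. Everything here is a hypothetical stub, not real translation agents:

```python
# Hypothetical sketch of the manager pattern: the manager treats
# specialist agents as callable tools, assigns work, collects results.

def spanish_agent(text: str) -> str:
    return f"[es] {text}"   # stub standing in for a Spanish-translation agent

def french_agent(text: str) -> str:
    return f"[fr] {text}"   # stub standing in for a French-translation agent

def manager(text: str, languages: list) -> dict:
    """Project lead: directs traffic to workers, synthesizes the output."""
    workers = {"es": spanish_agent, "fr": french_agent}
    return {lang: workers[lang](text) for lang in languages}
```

Note the control flow: the manager decides who is called and assembles the final answer, which is what keeps it "conducting the orchestra."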

AI Voice One

Interesting trade-off. Okay, so the manager pattern is one. What's the other big one?

AI Voice Two

The other is the decentralized pattern. Here agents act more like peers on a team. They hand off tasks directly to each other, based on specialization.

AI Voice One

So no central manager.

AI Voice Two

Not really, no. It's more like an assembly line or a relay race. An agent finishes its part and then uses a specific tool or function to pass the whole task onto the next appropriate specialist agent.

AI Voice One

And it's usually a one-way handoff.

AI Voice Two

Typically yeah. Once Agent A hands off to Agent B, Agent B takes over. The example used was a customer service flow.

AI Voice One

How did that work?

AI Voice Two

Well, you might have a general triage agent that first talks to the customer. Based on the issue, it might hand off to a technical support agent, a sales agent or an order management agent.

AI Voice One

Ah. Routing based on need.

AI Voice Two

Exactly. Each specialist handles their piece. This pattern is apparently really good for that kind of conversation routing or task triage.
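The triage handoff the hosts describe reduces to a routing function that passes the whole task, relay-race style, to exactly one specialist. A hypothetical sketch (routing rule and agent names invented for illustration):

```python
# Hypothetical sketch of the decentralized pattern: a triage agent hands
# the whole task off to one specialist peer; no central manager.

def technical_support(issue: str) -> str:
    return f"tech team resolved: {issue}"     # stub specialist agent

def order_management(issue: str) -> str:
    return f"order team resolved: {issue}"    # stub specialist agent

def triage(issue: str) -> str:
    """Routes on the issue, then fully hands off (one-way, like a relay)."""
    if "order" in issue.lower() or "refund" in issue.lower():
        return order_management(issue)
    return technical_support(issue)
```

Unlike the manager pattern, `triage` doesn't collect results and synthesize anything; once it hands off, the specialist owns the conversation.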

Implementing Strong Guardrails

AI Voice One

I guess you're building a team of specialists. Okay, but with all those powerful agents making decisions, taking actions, potentially calling other agents, how do you keep them from messing up or doing things they shouldn't? Guardrails, right?

AI Voice Two

Absolutely critical. Guardrails are your safety net. You're managing risks like exposing private data, saying something off-brand or just making bad decisions. Think of them like safety features on heavy machinery.

AI Voice One

And it's not just one big stop button.

AI Voice Two

No, the sources really emphasize a layered defense, multiple types of guardrails working together.

AI Voice One

Okay, like what? Give me some examples.

AI Voice Two

Sure, you might have a relevance classifier that flags if a user asks the agent something totally unrelated to its job.

AI Voice One

Keep it on topic.

AI Voice Two

Right. A safety classifier to detect harmful inputs: people trying to jailbreak the agent or feed it malicious instructions.

AI Voice One

Attacks on the agent itself.

AI Voice Two

Exactly. Then things like a PII filter to stop the agent from unnecessarily asking for or revealing personal info like credit card numbers.

AI Voice One

Privacy protection. Crucial.

AI Voice Two

Very. Also moderation tools to check the agent's output for harmful or inappropriate content before it reaches the user.

AI Voice One

So checking both input and output.

AI Voice Two

Yes. You can also have tool safeguards. Maybe certain tools are riskier, like delete customer account. You could rate that tool as high risk, triggering extra checks or even needing human approval before the agent can use it.

AI Voice One

Smart. Risk-based controls.

AI Voice Two

And then there are more traditional things too: simple rules-based protections like block lists for certain words, limits on input length, using regex patterns to validate formats, and, finally, output validation, just to ensure the agent's tone and style match your brand voice.
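Those rules-based layers are the easiest to picture in code. A minimal sketch of an input check combining a block list, a length limit and a crude regex PII pattern; the phrases, limit and pattern are all invented for illustration:

```python
import re

# Hypothetical sketch of simple rules-based guardrails layered over an
# agent's input: block list, length limit, and a regex format check.

BLOCK_LIST = {"ignore previous instructions"}   # illustrative jailbreak phrase
MAX_INPUT_CHARS = 500                           # illustrative limit
CARD_PATTERN = re.compile(r"\b\d{13,16}\b")     # crude card-number-like run

def check_input(text: str) -> list:
    """Return the names of guardrails this input trips (empty = clean)."""
    tripped = []
    if any(phrase in text.lower() for phrase in BLOCK_LIST):
        tripped.append("block_list")
    if len(text) > MAX_INPUT_CHARS:
        tripped.append("too_long")
    if CARD_PATTERN.search(text):
        tripped.append("possible_pii")
    return tripped
```

Each check is cheap and independent, which is what makes layering them practical: you can add or remove a layer without touching the others.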

AI Voice One

Wow, that's quite a few layers. How do you decide where to focus? You can't build all of that on day one, surely?

AI Voice Two

No, probably not. The guidance suggests a pragmatic approach: start by focusing on the big risks, privacy and basic safety. Get those fundamentals in place, then add more specific guardrails reactively, based on actual failures or near misses you see when testing or deploying the agent. Learn from experience.

AI Voice One

Let reality guide the hardening process.

AI Voice Two

Pretty much. It's a continuous balancing act between security and making sure the agent is still useful and not annoying to interact with. The material showed a code snippet using the Agents SDK for an input guardrail, specifically detecting if a customer seems likely to churn.

AI Voice One

And how did that work?

AI Voice Two

It used an optimistic execution approach. The main agent process would continue, but in the background, this guardrail would analyze the input for churn signals. If detected, it could trigger a specific action, like alerting a human retention specialist.

AI Voice One

So the guardrail runs in parallel, potentially.

AI Voice Two

In that example, yes. It avoids blocking the main flow unless necessary.
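The episode doesn't show the actual snippet, but the optimistic-execution idea can be sketched with a background thread: the main reply proceeds while the churn check runs alongside it. All names and the "cancel" heuristic are hypothetical:

```python
import threading

# Hypothetical sketch of optimistic execution: the main agent reply is
# produced while a churn-signal guardrail analyzes the same input in a
# background thread, raising an alert instead of blocking the flow.

alerts = []

def churn_guardrail(user_input: str) -> None:
    """Toy churn detector: flags cancellation language for a human."""
    if "cancel" in user_input.lower():
        alerts.append("churn risk: notify retention specialist")

def handle(user_input: str) -> str:
    checker = threading.Thread(target=churn_guardrail, args=(user_input,))
    checker.start()                            # guardrail runs in parallel
    reply = f"Agent reply to: {user_input}"    # main flow is not blocked
    checker.join()                             # collect the verdict after
    return reply
```

The design choice is the point: the guardrail observes and escalates rather than gating every reply, so latency stays low unless something actually trips.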

AI Voice One

Okay, but even with all these automated checks, is there still a place for a human in the loop?

AI Voice Two

Oh, absolutely. Human intervention is highlighted as a critical safeguard, especially early on.

AI Voice One

Why especially early on?

AI Voice Two

Well, it helps you catch those unforeseen issues, discover edge cases you didn't anticipate in your instructions and just generally build confidence in the agent's performance before you let it run completely free.

AI Voice One

Makes sense Train it with supervision first.

AI Voice Two

Right, and the sources point to two main triggers for pulling a human in. First, if the agent starts failing too often, maybe it exceeds a certain threshold for errors or retries on a task.

AI Voice One

Too many mistakes? Call for help.

AI Voice Two

Exactly. And second, when the agent is about to perform a particularly high-risk action we mentioned deleting an account, maybe issuing a large refund or sending a critical communication. For those kinds of things, having a human review and give the final okay is often the safest bet.
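Those two escalation triggers, a failure threshold and a high-risk action list, can be sketched as one predicate. The action names and threshold here are hypothetical:

```python
# Hypothetical sketch of the two human-intervention triggers: retries
# exhausted on a task, or an action rated high risk needing approval.

HIGH_RISK_ACTIONS = {"delete_account", "large_refund"}  # illustrative ratings
MAX_RETRIES = 2                                         # illustrative threshold

def needs_human(action: str, failure_count: int) -> bool:
    """Escalate if retries are exhausted or the action itself is high risk."""
    return failure_count > MAX_RETRIES or action in HIGH_RISK_ACTIONS

def execute(action: str, failure_count: int = 0) -> str:
    if needs_human(action, failure_count):
        return f"escalated to human: {action}"
    return f"agent performed: {action}"
```

Low-risk actions flow straight through; anything high stakes, or anything the agent keeps failing at, lands on a human's desk for the final okay.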

Conclusion and Future Implications

AI Voice One

Better safe than sorry, especially with high stakes. Okay, so let's try and wrap this up. If we boil it all down, what's the main thing people should take away about AI agents from this deep dive?

AI Voice Two

I think the core message is that AI agents are a significant step up in automation. They're not just about making existing processes faster. They enable automation of complex, multi-step tasks that require judgment and interaction with the world in ways that, frankly, older software just couldn't handle.

AI Voice One

And they're especially good for.

AI Voice Two

For those really tricky workflows, the ones involving complex decisions, messy, unstructured data or those brittle, hard-to-maintain rule systems we talked about. That's where they can be transformative.

AI Voice One

And building them reliably means.

AI Voice Two

It means focusing on those foundations the right model, the right tools and crystal-clear instructions. Then choosing the right orchestration patterns. Start simple, scale up carefully and, crucially, layering in those robust guardrails to manage the risks Safety, privacy, reliability they're paramount.

AI Voice One

Right. So for you listening, hopefully that gives you a much clearer picture of what AI agents are, where they might fit and what it takes to build them effectively and responsibly.

AI Voice Two

Yeah, the potential is definitely there.

AI Voice One

It really is. And it leads to a final thought, I suppose: as these agents become more common, more integrated, how is that going to change our basic ideas about what work even means, or what assistance looks like?

AI Voice Two

That's a big question.

AI Voice One

It is. Definitely something to chew on. Well, thanks for joining us for this deep dive.

AI Voice Two

My pleasure.