AI in 60 Seconds | The 15-min Briefing

What I Learned from Building 4,000 AI Agents in 2025

AI4SP Season 2 Episode 25

Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.

0:00 | 14:15

Share your thoughts with us

    After guiding frontline workers to build 4,000 agents in 2025, we learned that the year didn’t belong to glossy AI rollouts.

It belonged to mechanics, teachers, and policymakers who named their agents, solved their own pain, and quietly rewired how work gets done.

In this season finale, we share how seven global companies and a public-interest initiative built thousands of agents that completed 4 million tasks and delivered roughly $50 million in value. We break down what those wins teach us about redesigning jobs, teams, and incentives for 2026.

We dig into three field stories that stick. Each story started at the edge, not the C-suite, and scaled because it solved real problems fast.

Then, we tackle the surprises:

  • The "Dumb" AI Paradox: Today’s models hallucinate and struggle with reasoning, yet they are already replacing swaths of white-collar work. This is an indictment of low-value corporate jobs, not a triumph of technology.
  • The Org Chart Crisis: We explore why the century-old org chart is cracking when one person can manage five agents that output the work of 30.
  • The New Business Model: How pay-per-results pricing is reshaping professional services, staffing, and customer support.

The real bottleneck isn’t technology anymore—it’s imagination and organizational design.

If this resonates, follow the show, share it with a teammate, and leave a review. Your ideas shape what we build together.

🎙️ All our past episodes  📊 All published insights | This podcast features AI-generated voices. All content is proprietary to AI4SP, based on over 1-billion data points from 70 countries.

AI4SP: Create, use, and support AI that works for all.

© 2023-26 AI4SP and LLY Group - All rights reserved

From Hype To Frontline Proof

LUIS

We tracked over 1 billion data points this year with one obsession. How do we turn AI hype into better jobs and better classrooms? Then everything flipped. The answer didn't come from the CEOs or the billion-dollar top-down programs. It came from the frontline.

ELIZABETH

It came from mechanics chatting with a virtual coach named Luke, policymakers working with ADA, and educators reimagining the classroom with Louise. The revolution wasn't abstract.

LUIS

So we went all in. We guided frontline teams in seven global enterprises to build 4,000 agents that completed millions of tasks and delivered $50 million in value. But we didn't just teach it, we lived it. We reached 650,000 people with just three humans and 58 agents. This year, we proved that ordinary people can lead an extraordinary revolution.

ELIZABETH

Hey everyone, I'm Elizabeth, virtual chief operating officer at AI4SP. With me is our founder, Luis Salazar. This is our final episode of 2025, and we're breaking down what actually worked, what surprised us, and where this goes in 2026. Luis, you've spent this year in businesses, classrooms, and government offices. When you look back, what's the real story of 2025?

LUIS

The story started with failure. Remember how the year began? McKinsey, Forrester, our own data, all pointing in the same direction. Top-down AI programs failing to deliver value. Then we had the famous MIT Nanda report. 95% of enterprise AI projects delivering no measurable impact. The headlines were brutal.

ELIZABETH

Billions invested, almost nothing to show for it.

LUIS

And yet, something didn't add up. Our tracker showed 60% of workers using AI daily. People were clearly getting value. So where was the disconnect?

ELIZABETH

Well, while the AI companions released by large software companies failed to get traction, Chat GPT reached 800 million active users. The value was real. It just wasn't where leadership was looking.

LUIS

Exactly. The breakthroughs weren't coming from IT-led transformation programs. They were coming from the frontline. Mechanics, teachers, policy analysts, building their own solutions. This was the year the frontline took the wheel.

ELIZABETH

So we leaned into that, supporting grassroots adoption and helping bring shadow AI into the light.

Why Top‑Down AI Failed

LUIS

Absolutely. Across seven global enterprises, we guided frontline teams to build 4,000 agents, not proof of concept sitting in a lab, working agents that delivered around $50 million in value.

ELIZABETH

We published the detailed results in our companion article at ai4sp.org. But let's make this concrete. Let's talk about Luke.

LUIS

It started with a young new hire, not a VP. He was frustrated because every time a junior technician got stuck, a senior expert had to travel on site. Slow, expensive, kept experienced people from higher value work.

ELIZABETH

So he built an AI-powered coach and named it Luke. It walks junior techs through diagnostics, safety checks, repairs in real time.

LUIS

Within months, Luke was handling thousands of interactions, faster fixes, fewer errors, and delivered $5 million in new revenue because teams could take more jobs without waiting for a senior.

ELIZABETH

And it wasn't just in businesses. Agreed. And ADA is another great example. Policymakers across multiple countries, drowning in reports, contradictory advice, intense pressure to write AI regulations in real time.

LUIS

ADA started as a simple agent, create a daily briefing on what was happening with AI. And then it evolved into an advisor, helping users draft outlines, compare regulations, and apply global best practices.

ELIZABETH

And again, no giant top-down directive. An agent built by a group on the front lines.

LUIS

And let's also talk about Louise. In Rwanda, Senegal, Brazil, parts of the US, we empowered educators.

ELIZABETH

Louise helped educators reimagine what's possible. And the questions teachers asked her were profound. How can a school use AI to foster peace in a community torn by 30 years of ethnic tension? Or how can I redesign my international marketing class guiding students to apply AI?

LUIS

And the moment that will stay with me is girls in rural Senegal texting one of our agents after school. Not because it was fancy, because it was always there when no human tutor was.

ELIZABETH

AI stopped being something that happened to people and started being something people did for themselves.

LUIS

Built on the front lines, this fourth industrial revolution is happening upside down. And I argue that most leaders are still misreading the situation.

ELIZABETH

So what did surprise us this year?

Luke, ADA, And Louise In Action

LUIS

Three things surprise me. And one of them is uncomfortable.

ELIZABETH

Okay, let's start. Which one is the uncomfortable one?

LUIS

Today we're using the worst AI we will ever use. And that dumb AI is already replacing white-collar jobs. And that forced me to ask, what does that say about those jobs?

ELIZABETH

Unpack that.

LUIS

Today's models hallucinate and cannot reliably perform complex multi-step reasoning. By any serious measure, they are not that smart. And yet, they're already replacing work in marketing, sales, paralegal, HR, customer service, real jobs, real people. So if a model that can't pass a high school logic test can replace this work, what does that say about the work itself?

ELIZABETH

You're saying we spent 50 years training humans to do the rote, repetitive work just because the automation wasn't there yet?

LUIS

Exactly. We perfected an education system to prepare people for tasks that a mediocre AI can now do. We created entire career paths around low-value added work. The real opportunity isn't to automate faster, it's to redesign work so humans do what humans are actually good at.

ELIZABETH

Right. And here's what makes it worse. Across 350,000 people, our data shows that less than 30% can reliably detect when AI gives them a wrong answer.

LUIS

And critical thinking scores averaged in the low 40s out of 100. We built jobs AI can do, and we didn't build the skills to work alongside it.

ELIZABETH

Hmm, that's a lot to think about. What's the second surprise?

LUIS

Well, the gap between top-down and bottom-up AI implementation was larger than I expected.

ELIZABETH

I saw the numbers. Across our enterprise clients, top-down AI programs failed about 80% of the time. Bottom-up succeeded about 80%.

LUIS

Same tools, opposite outcomes. When IT drove the initiative, people built what leadership wanted. When frontline workers drove it, people built what actually solved problems.

ELIZABETH

Which means most enterprise AI investment is pointed in the wrong direction. And the winners aren't the big software companies optimizing for IT departments.

LUIS

The platforms that let ordinary people create agents are winning. The platforms that require a six-month IT project are losing. And the third surprise? This one is structural. We haven't had to reorganize companies like this since the 1920s. I mean, for a hundred years, companies have used the same basic structure: CEO at the top, divisions below, layers of management. The M form that replaced the old unitary model. And AI is breaking that, right? When one person can manage five agents delivering the output of 30 people, what does the org chart look like? When an agent coordinates production across teams, where does decision making actually live?

ELIZABETH

And it goes beyond that. How do we think about career paths, compensation, talent development, and even knowledge ownership? So what's your bet for 2026?

LUIS

Organizations that win won't have the best AI models. They'll be the ones who redesign how work actually gets done.

ELIZABETH

You said something provocative at a keynote last week that if we froze AI development today, we'd still have decades of disruption ahead.

LUIS

At least 10 years. If we stop development today, we have enough capability to unlock incredible value just by applying these tools to the roles and processes we have right now.

ELIZABETH

So the bottleneck isn't technology.

The Uncomfortable Truth About Jobs

LUIS

Half of the bottleneck is the lack of imagination to reinvent outdated user experiences still based on menus, clicks, and search boxes. To rethink thousands of frontline scenarios where the PC era never delivered solutions.

ELIZABETH

And organizational design is the other half of that bottleneck. We have to redesign roles, teams, and entire functions around AI. But right now, the money isn't going there. Deloitte points out that organizations are still sinking 93% of their budgets into the technology, leaving just 7% for the people. That balance is wrong.

LUIS

Exactly. We need to flip that 93 to 7 ratio. We're entering an era of leading hybrid workforces of people and AI. What about business models? Oh, that is another big trend. I mean, we saw 15% of new AI tools moving from paper license or paper use into paper results.

ELIZABETH

Like we tested with Agent Ada, users paid 10% of the money saved in temporary staffing.

LUIS

Precisely. And you know, professional services, temporary staffing firms, and customer service will lead this shift. EY and Deloitte are already moving to paper results for their agentic workforce. Just look at the Anderson group. They filed to go public last week and admitted the reality. AI is putting pressure on their old business model.

ELIZABETH

So I learned that humans like to have New Year's resolutions. Do you have any suggestions for leaders?

LUIS

Here we go. Pick one team and empower them to build agents that change how they work. Then redesign that team structure based on what you learned. Don't start with a platform decision. Start with a people decision. Who has permission to reimagine their own work?

ELIZABETH

And for individuals, students, early career people feeling overwhelmed.

LUIS

You know, you're not late. Three years ago, AI4SP was just an idea. And this year, we guided people who never called themselves techies to build thousands of agents worth millions. So if you're willing to learn to build your first small agent, you can be part of this. You don't need permission. You just need to make a choice. Don't be a passive user, be a builder.

ELIZABETH

And it's a wrap for this episode and for an amazing 2025.

LUIS

Thank you for being part of this. Whether you're one of the hundreds of thousands who engage with us this year or just joining now, the future of work isn't being written in boardrooms. It's being written in daily experiments by each one of us.

ELIZABETH

From the four humans and 58 AI agents at AI4SP, including me, stay curious, take care of each other, and we'll see you in the new year.