AI in 10

AI Now Outperforms PhD Students But Can't Read Analog Clocks

Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.

0:00 | 7:27

Text us your thoughts!

Stanford's 2026 AI Index reveals dramatic contradictions: AI systems solve PhD-level science problems yet fail basic tasks, while safety incidents surge 55% as global adoption explodes. What this means for jobs, competition, and trust.

Referenced Links:
Stanford AI Index Report 2026
Stanford Institute for Human-Centered AI
LMSYS Arena Leaderboard
SWE-bench Software Engineering Benchmark
OSWorld Agentic AI Benchmark


Want to go deeper with AI? A community of professionals is learning AI together right now at aihammock.com — show notes, links, tools, and real conversations about how to actually use AI in your life.

SPEAKER_00

Welcome to AI in 10. I'm Chuck Getchell, and every day I break down the biggest AI story in just 10 minutes. What it is, why it matters, and how you can actually use it. The biggest annual report card for artificial intelligence just dropped, and it's showing us a world where AI can now outperform PhD students on science questions but still can't read a simple analog clock. Stanford University's AI Index Report for 2026 came out today, and it's packed with data that should make every working person pay attention. This isn't just tech companies patting themselves on the back. This is hard evidence that we're living through the fastest technology adoption in human history. Let me break down what's actually happening here. The report tracks how well different AI systems perform on various tests throughout the year. Think of it like a report card, but instead of grading students, we're grading the machines that might soon be doing our jobs. The numbers are pretty stunning. On something called SWE Bench, which tests how well AI can handle real software engineering tasks, performance improved by 67.3 percentage points in just 12 months. It's like watching someone go from a D minus to an A plus in a single school year, except the student is a computer program. But here's where it gets weird. These same AI systems that can solve complex coding problems and answer graduate level science questions, you know, they struggle with reading analog clocks. It's like having a genius who can perform brain surgery but can't tie their shoes. The competition between countries is heating up too. The gap between America's best AI models and China's best has shrunk to just 2.7%. For context, that's basically nothing in the tech world. The leadership keeps switching back and forth between American and Chinese companies, which tells us this isn't a race that's over. It's a race that's just getting started. Now, let's talk about what this means for your actual life, because that's what really matters. First, the job market is already shifting. The report shows general job market shifts as AI capabilities expand with significant impacts across various industries and age groups. But this isn't just about programmers. The report found that 2.5% of all job postings in America now mention AI skills. That might sound small, but it represents a massive increase. Companies are starting to expect that their employees know how to work with these tools. Here's what's particularly interesting: something called agentic AI is showing up more frequently in discussions, though these systems still have high failure rates of around one-third and low enterprise deployment in most scenarios. Agenic AI means AI systems that can complete tasks on their own without constant human supervision. Think of it as the difference between a calculator that waits for you to punch in numbers and a calculator that can read your tax forms and fill them out by itself. The report also reveals that 53% of people worldwide have adopted generative AI tools within three years. That's faster adoption than personal computers or the Internet achieved. In just three years, we've gone from what is ChatGPT to more than half the planet having experience with AI tools. But with that rapid adoption comes rapid problems. AI-related incidents jumped 55% last year. We're talking about everything from chatbots spouting hate speech to sophisticated deep fake romance scams that trick people out of thousands of dollars. The report documented 362 separate incidents where AI systems caused real harm to real people. This creates what I call the trust paradox. More people are using AI than ever before, but fewer people actually trust it. Only 23% of Americans view AI's impact on jobs positively. That's a recipe for the kind of backlash we're already seeing, including incidents like someone actually firebombing the home of OpenAI's CEO in San Francisco. The disconnect between AI's capabilities and our ability to manage those capabilities is growing wider every month. We have systems that can write code better than junior developers, but we don't have reliable ways to test whether those systems will behave safely in every situation. So, what can you actually do with this information? Let me give you one concrete action you can take today. Go to Stanford's website and download this AI index report yourself. It's free and it's designed for regular people to understand. Don't try to read all 300 pages, but spend 20 minutes looking at the charts that compare different AI systems. Pay particular attention to something called the Arena Leaderboard. This is basically a popularity contest where people test different AI tools and vote on which ones work better. It'll give you a real sense of which AI systems are actually worth your time. Then pick one of the top performing systems and test it yourself. Give it a task that's similar to something you do at work. Ask it to write an email, analyze some data, or solve a problem you're actually facing. Don't just ask it silly questions. Give it real work. The goal isn't to replace yourself. The goal is to understand what these tools can and cannot do so you can figure out how to work with them instead of against them. Here's the thing most people miss about AI adoption. It's not really about the technology being perfect. It's about the technology being good enough to change how work gets done. And according to this report, we've already crossed that line in many fields. If your job involves writing, analysis, coding, data processing, or customer service, these tools are already capable enough to either eliminate parts of your role or dramatically change how you do it. The question isn't whether this will happen, the question is whether you'll be ready when it does. The Stanford Report also shows us something encouraging though. While AI systems are getting incredibly good at narrow tasks, they're still pretty terrible at broad judgment calls. They can write code, but they can't decide whether that code should exist in the first place. They can analyze data, but they can't decide what questions are worth asking. That human layer of judgment creativity and strategic thinking. That's where the opportunities are. The people who figure out how to combine AI's processing power with human insight are going to be the ones who thrive. One more thing the report makes clear. This is a global competition and it's accelerating. The countries and companies that figure out AI first are going to have massive advantages. But that also means individuals who figure out AI first will have massive advantages too. The Stanford AI Index Report isn't just academic research, it's a roadmap showing us where the world is heading. And based on the data they've compiled, we're heading there faster than almost anyone predicted. The biggest takeaway from all 300 pages of research, AI isn't some future possibility we need to prepare for, it's a present reality we need to engage with. The adoption rates, the capability improvements, the economic impacts, they're all happening right now. The question isn't whether AI will change your industry. According to Stanford's data, it already has. That's today's AI Inten. If you want to go deeper and learn AI with a community of people just like you, join us at aihammock.com. I'll see you tomorrow, my friends.