De Nederlandse Kubernetes Podcast
De Nederlandse Kubernetes Podcast: gemaakt door én voor mensen met een hart voor IT. In deze reeks gaan Ronald Kers en Jan Stomphorst in gesprek over Kubernetes met als doel Kubernetes toegankelijk te maken voor iedereen.
De Nederlandse Kubernetes Podcast
#116 Running AI on Kubernetes: From GPUs to CRO
In this episode of De Nederlandse Kubernetes Podcast, we talk with Carlos Santana, Principal Partner Solution Architect at AWS and long-time contributor to the Kubernetes and AI communities.
Carlos joins us to explore what it really takes to run AI workloads on Kubernetes, from GPU scheduling to scaling inference and training efficiently across clusters. We discuss how AI and machine learning are transforming the cloud-native ecosystem — and why orchestration is becoming just as important as the models themselves.
He shares insights into:
- 💡 The challenges of scheduling and sharing GPUs in multi-tenant Kubernetes clusters
- ⚙️ Why Kubernetes Resource Orchestrator (CRO) could be the next big abstraction layer
- 🚀 The balance between performance, cost efficiency, and developer experience
- 🧠 His hands-on experiments with Jetson devices, edge computing, and model optimization
- 🌐 How open source projects and cloud providers are shaping the future of AI infrastructure
A forward-looking conversation about where AI, Kubernetes, and cloud-native engineering are heading — from someone building that future at scale.
ACC ICT Specialist in IT-CONTINUÏTEIT
Bedrijfskritische applicaties én data veilig beschikbaar, onafhankelijk van derden, altijd en overal
Like and subscribe! It helps out a lot.
You can also find us on:
De Nederlandse Kubernetes Podcast - YouTube
Nederlandse Kubernetes Podcast (@k8spodcast.nl) | TikTok
De Nederlandse Kubernetes Podcast
Where can you meet us:
Events
This Podcast is powered by:
ACC ICT - IT-Continuïteit voor Bedrijfskritische Applicaties | ACC ICT