
Building the Backend: Data Solutions that Power Leading Organizations
Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights on what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics will span big data, AI, ML, governance, visualizations, and best practices for enabling your organization to be data-driven. If you are a chief data officer, data architect, data engineer, data analyst, or anyone else building backend data solutions, then HIT SUBSCRIBE!
Episodes
43 episodes
The Analytics Engine for All Your Data with Justin Borgman @ Starburst
In this episode we speak with Justin Borgman, Chairman & CEO at Starburst, which is based on open source Trino (formerly PrestoSQL) and was recently valued at $3.35 billion after securing its Series D funding. In this episode we dis...
Season 1 • Episode 41 • 36:12

Transform Your Object Storage Into a Git-like Repository With Paul Singman @ LakeFS
In this episode we speak with Paul Singman, Developer Advocate at Treeverse / LakeFS. LakeFS is an open source project that allows you to transform your object storage into a Git-like repository. Top 3 takeaways...
Season 1 • Episode 40 • 27:23

Enable Faster Data Processing and Access with Apache Arrow with Matt Topol @ FactSet
In this episode we speak with Matt Topol, Vice President, Principal Software Architect @ FactSet and dive deep into how they are taking advantage of Apache Arrow for faster processing and data access. Below are the top 3 value bo...
Season 1 • Episode 39 • 49:15

Implementing Amundsen @ Convoy with Chad Sanderson
In this episode we speak with Chad Sanderson, head of data and early stage startup advisor focused on data innovation @ Convoy, and uncover their journey to implementing Amundsen, an open source data catalog. Below are the top 3 v...
Season 1 • Episode 38 • 35:52

The Importance of Treating Your Data Initiatives as Products with Murali Bhogavalli
Your data team should not just be keeping the lights on, but should be building and creating data products to support the business. In this episode we speak with Murali Bhogavalli, a data product manager, and explore what is a data product manage...
Season 1 • Episode 37 • 26:33

Open-Source Data Catalog Amundsen with Mark Grover @ Stemma
In this episode of Building The Backend we hear from Mark Grover, founder @ Stemma and co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen. Below are top 3 value bombs:...
Season 1 • Episode 36 • 41:11

Architecting a Modern Data Lake with Dipti Borkar from Ahana
In this episode of Building The Backend we hear from Dipti Borkar, cofounder @ Ahana, a managed service for Presto on AWS, where we talk all about the data lake, how it should be structured, and where the industry is going. B...
Season 1 • Episode 35 • 39:32

Open Source BI with Apache Superset
What tools are you using for data viz? Are they low cost? One option is Apache Superset. In this episode we speak with Robert Stolz to learn more about Superset and other open source data tools. Top 3 Value Bombs:
Season 1 • Episode 34 • 29:15

Edge Computing and Continuous Intelligence with Swim
In this episode of Building The Backend we hear from Simon Crosby, CTO @ Swim, an open source edge computing operating system, where we talk all about edge computing, event streaming, and much more. Below are top 3 value bombs:...
Season 1 • Episode 33 • 34:17

12 Modern Data Architecture Principles That Should Be Implemented in 2022
This episode is a little different than the usual format. Instead of interviewing a data leader, I share what I consider are the
Season 1 • Episode 32 • 20:24

The Keys to Good Data Quality With Prukalpa Sankar from Atlan
In this episode of Building The Backend we hear from Prukalpa Sankar, co-founder of Atlan, where we talk all about data quality and governance, common issues organizations face when implementing data quality, and much, much more. Below ar...
Season 1 • Episode 31 • 37:21

Designing a Modern Data Architecture – Teradata
This is a podcast episode you do not want to miss with Stephen Brobst, CTO @ Teradata. We discuss all things data warehouses, the shift to the distributed cloud, and key principles for implementing successful DWs. Top 3 Value B...
Season 1 • Episode 30 • 44:29

Exploring Open-Source Data Integration With Airbyte
“The hardest part of ETL is not building the connectors, it is maintaining them.” Truer words were never spoken. Really enjoyed this episode with Michel Tricot, CEO & Co-Founder of Airbyte, where we discuss all things data integration and connecto...
Season 1 • Episode 29 • 35:42

How To Effectively Reduce Data Quality Incidents 10x with Datafold
This episode features Gleb Mezhanskiy, Co-Founder & CEO @ Datafold. During our discussion we talk all about data observability and how to improve your data quality. Before Datafold, Gleb was a founding member of data teams at Lyft and Autode...
Season 1 • Episode 28 • 39:12

Applying Transformations to Streaming Data with Materialize
This episode features Arjun Narayan, Co-Founder & CEO @ Materialize. During our discussion we talk all about transforming streaming data, the do’s, the don’ts, and how Materialize is changing the landscape of streaming. Top 3 Va...
Season 1 • Episode 27 • 32:55

Optimizing Spark in the Cloud - with Jean-Yves Stephan
This episode features Jean-Yves Stephan, Co-Founder & CEO @ Data Mechanics (recently acquired by Spot by NetApp). During our discussion we talk about optimizing Spark to run in the cloud at a low cost. Top 3 Value Bombs:...
Season 1 • Episode 26 • 32:26

How To Achieve Better Observability and Control Over Your Data Pipelines with Josh Benamram
This episode features Josh Benamram, who is the co-founder of Databand. Databand is a company that helps engineering teams achieve better observability and control over their tech stack. Top 3 Value Bombs: When ob...
Season 1 • Episode 25 • 37:03

Unify Your Data Operations with Nexla
Travis welcomes to his podcast Saket Saurabh, who provides a window into the world of data management and the self-service options that are democratizing it. Co-founder and CEO of Nexla, Saket has a passion for data and infrastructure and how t...
Season 1 • Episode 24 • 25:12

A Powerful Open Source Database That Supports Many Storage Needs (MariaDB)
In this episode, we speak with Rob Hedgpeth, a director of developer relations at MariaDB. We explore all things MariaDB, the capabilities it has, and when you should consider it for your next project. To...
Season 1 • Episode 23 • 27:33

Increase the Quality and Reliability of Your Data
In this episode, we speak with Lior Gavish, the co-founder of Monte Carlo, to explore all things data quality. Monte Carlo is a data lineage and observability tool that lowers your data downtime. Top 3 Value Bombs: Data ...
Season 1 • Episode 22 • 31:12

Build Real-Time Data Pipelines in Minutes Not Months with Meroxa
In this episode, we speak with DeVaris Brown, the CEO and co-founder of Meroxa, a data platform that enables organizations to build real-time data pipelines in minutes, not months. Prior to founding Meroxa, DeVaris was a product l...
Season 1 • Episode 21 • 36:33

Launch, Monitor, and Share Data Pipelines In a Matter of Minutes
In this episode, we speak with Blake Burch, co-founder of Shipyard, a data orchestration tool that allows you to create powerful workflows in a matter of minutes. Top 3 Value Bombs: Data tests are often for the assu...
Season 1 • Episode 20 • 32:07

The Data Warehouse for Distributed Clouds - Yellowbrick
In this episode, we speak with Mark Cusack, CTO at Yellowbrick. Yellowbrick is a data warehouse platform that was built from the ground up for performance and cost, and that can be deployed across clouds and on-prem. Top 3 Value Bombs...
Season 1 • Episode 19 • 37:57

What You Should Know Before Getting Started With Data Science with DATA SCIENCE INFINITY
In this episode, we speak with Andrew Jones, who has spent 13 years in data science at companies including Amazon and, more recently, Sony PlayStation, where he developed and prototyped machine-learning-based features for the PlayStation 5, sever...
Season 1 • Episode 18 • 43:42
