Building the Backend: Data Solutions that Power Leading Organizations
Episodes
43 episodes
The Analytics Engine for All Your Data with Justin Borgman @ Starburst
In this episode we speak with Justin Borgman, Chairman & CEO at Starburst, which is based on open source Trino (formerly PrestoSQL) and was recently valued at $3.35 billion after securing their series D funding. In this episode we dis...
Transform Your Object Storage Into a Git-like Repository With Paul Singman @ LakeFS
In this episode we speak with Paul Singman Developer Advocate at Treeverse / LakeFS. LakeFS is an open source project that allows you to transform your object storage into a Git-like repository. Top 3 takeaways<...
Enable Faster Data Processing and Access with Apache Arrow with Matt Topol @ Factset
In this episode we speak with Matt Topol, Vice President, Principal Software Architect @ FactSet and dive deep into how they are taking advantage of Apache Arrow for faster processing and data access. Below are the top 3 value bo...
Implementing Amundsen @ Convoy with Chad Sanderson
In this episode we speak with Chad Sanderson head of data and early stage startup advisor focused on data innovation @ Convoy and uncover their journey to implementing Amundsen, an open source data catalog.Below are the top 3 v...
The Importance of Treating Your Data Initiatives as Products with Murali Bhogavalli
Your data team should not just be keeping the lights on, but should be building and creating data products to support the business. In this episode we speak with Murali Bhogavalli a data product manager and explore what is a data product manage...
Open-Source Data Catalog Amundsen with Mark Grover @ Stemma
In this episode of Building The Backend we hear from Mark Grover founder @ Stemma, co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen.Below are top 3 value bombs:<...
Architecting a Modern Data Lake with Dipti Borkar from Ahana
In this episode of Building The Backend we hear from Dipti Borkar cofounder @ Ahana a managed service for Presto on AWS, where we talk all about the data lake, how it should be structured and where the industry is going. B...
Open Source BI with Apache Superset
What tools are you using for data viz? Are they low cost? One option is Apache Superset, in this episode we speak with Robert Stolz to learn more about Superset and other open source data tools. Top 3 Value Bombs:
Edge Computing and Continuous Intelligence with Swim
In this episode of Building The Backend we hear from Simon Crosby – CTO @ Swim an open source edge computing operating system, where we talk all about edge computing, event streaming and much more. Below are top 3 value bombs:&nb...
12 Modern Data Architecture Principles That Should Be Implemented in 2022
This episode is a little different then the usual format. Instead of interviewing a data leader - I share what I consider are the
The Keys to Good Data Quality With Prukalpa Sankar from Atlan
In this episode of Building The Backend we hear from Prukalpa Sankar – Co-founder of Atlan, where we talk all about data quality/governance, common issues organizations face when implementing data quality and much much more. Below ar...
Designing a Modern Data Architecture – Teradata
This is a podcast episode you do not want to miss with Stephen Brobst, CTO @ Teradata. We discuss all things Data Warehouses, the shift to the distributed cloud and, key principles to implementing successful DW's. Top 3 Value B...
Exploring Open-Source Data Integration With Airbyte
“The hardest part of ETL is not building the connectors, it is maintaining them.” Truer words never spoken. Really enjoyed this episode with Michel Tricot CEO & Co-Founder of Airbyte where we discuss all things data integration and connecto...
How To Effectively Reduce Data Quality Incidents 10x with Datafold
This episode features Gleb Mezhanskiy Co-Founder & CEO @ Datafold, during our discussion we talk all about data observability and how to improve your data quality. Before Datafold, Gleb was a founding member of data teams at Lyft and Autode...
Applying Transformations to Streaming Data with Materialize
This episode features Arjun Narayan Co-Founder & CEO @ Materialize, during our discussion we talk all about transforming streaming data, the do’s the don’ts and how Materialize is changing the landscape of streaming. Top 3 Va...
Optimizing Spark in the Cloud - with Jean-Yves Stephan
This episode features Jean-Yves Stephan Co-Founder & CEO @ Data Mechanics (recently Acq. by Spot by NetApp), during our discussion we talk about optimizing Spark to run in the cloud at a low cost.Top 3 Value Bombs:...
How To Achieve Better Observability and Control Over Your Data Pipelines with Josh Benamram
This episode features Josh Benamrum, who is the co-founder of Databand. Databand is a company that helps engineering teams achieve better observability and control over their tech stack.Top 3 Value Bombs: When ob...
Unify Your Data Operations with Nexla
Travis welcomes to his podcast Saket Saurabh, who provides a window into the world of data management and the self-service options that are democratizing it. Co-founder and CEO of Nexla, Saket has a passion for data and infrastructure and how t...
A Powerful Open Source Database That Supports Many Storage Needs (MariaDB)
In this episode, we speak with Rob Hedgpeth, a director of developer developer relations at Maria DB. We explore all things Maria DB, the capabilities it has and when you should consider it for your next project. To...
Increase the Quality and Reliability of Your Data
In this episode, we speak with Lior Gavish, the co-founder of Monte Carlo to explore all things data quality. Monte Carlo is a data lineage and observability tool that lowers your data downtime.Top 3 Value Bombs:Data ...
Build Real-Time Data Pipelines in Minutes Not Months with Meroxa
In this episode, we speak with DeVaris Brown, he is the CEO and co-founder of Meroxa, which is a data platform that enables organizations to build real time data pipelines in minutes not months. Prior to founding Meroxa, DeVaris was a product l...
Launch, Monitor, and Share Data Pipelines In a Matter of Minutes
In this episode, we speak with Blake Burch, co-founder of Shipyard, a data orchestrator tool that allows you to create powerful workflows in a matter of minutes.Top 3 Value Bombs: Data tests are often for the assu...
The Data Warehouse for Distributed Clouds - Yellowbrick
In this episode, we speak with Mark Cusack, CTO at Yellowbrick. Yellowbrick is a data warehouse platform that was built from the ground up for performance and cost that can be deployed across clouds and on-prem. Top 3 Value Bombs...
What You Should Know Before Getting Started With Data Science with DATA SCIENCE I N F I N I T Y
In this episode, we speak with Andrew Jones who has spent 13 years in Data Science at companies including Amazon & more recently Sony PlayStation where he developed and prototyped Machine Learning based features for the PlayStation 5, sever...