Open-Source Data Catalog Amundsen with Mark Grover @ Stemma Artwork

Building the Backend: Data Solutions that Power Leading Organizations

Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights with what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics will span from big data, AI, ML, governance, visualizations, and best practices for enabling your organization to be data-driven. If you are a chief data officer, data architect, data engineer, data analyst, and those building the backend data solutions then HIT SUBSCRIBE!

All Episodes

Building the Backend: Data Solutions that Power Leading Organizations

Open-Source Data Catalog Amundsen with Mark Grover @ Stemma

January 11, 2022 • Travis Lawrence • Season 1 • Episode 36

0:00 | 41:11

In this episode of Building The Backend we hear from Mark Grover founder @ Stemma, co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen.

Below are top 3 value bombs:

Automated data catalogs are critical to help wrangle the growing data across organizations. (i.e. Being able to identify out of 150 columns on this table only 10 are being used downstream)
Tribal knowledge and context cannot be automated - data catalogs cannot be 100% automated.
Amundsen is an open-source data catalog originally created at Lyft. Stemma has created a managed version of Amundsen.

Help me improve the podcast by completing this 60 second survey: https://buildingthebackend.com/survey