
Building the Backend: Data Solutions that Power Leading Organizations
Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights on what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics span big data, AI, ML, governance, visualizations, and best practices for enabling your organization to be data-driven. If you are a chief data officer, data architect, data engineer, data analyst, or anyone else building backend data solutions, then HIT SUBSCRIBE!
Open-Source Data Catalog Amundsen with Mark Grover @ Stemma
Travis Lawrence • Season 1 • Episode 36
In this episode of Building the Backend, we hear from Mark Grover, founder of Stemma and co-creator of Amundsen. Stemma is a fully managed data catalog powered by the leading open-source data catalog, Amundsen.
Below are the top 3 value bombs:
- Automated data catalogs are critical for wrangling the growing volume of data across organizations (e.g., being able to identify that only 10 of the 150 columns on a table are being used downstream).
- Tribal knowledge and context cannot be automated, so data catalogs cannot be 100% automated.
- Amundsen is an open-source data catalog originally created at Lyft. Stemma has created a managed version of Amundsen.
Help me improve the podcast by completing this 60-second survey: https://buildingthebackend.com/survey