Building the Backend: Data Solutions that Power Leading Organizations

Open-Source Data Catalog Amundsen with Mark Grover @ Stemma

January 11, 2022 Travis Lawrence Season 1 Episode 36
Building the Backend: Data Solutions that Power Leading Organizations
Open-Source Data Catalog Amundsen with Mark Grover @ Stemma
Show Notes

In this episode of Building The Backend we hear from Mark Grover founder @ Stemma, co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen.

Below are top 3 value bombs:

  •  Automated data catalogs are critical to help wrangle the growing data across organizations. (i.e. Being able to identify out of 150 columns on this table only 10 are being used downstream)
  • Tribal knowledge and context cannot be automated - data catalogs cannot be 100% automated. 
  • Amundsen is an open-source data catalog originally created at Lyft. Stemma has created a managed version of Amundsen. 

Help me improve the podcast by completing this 60 second survey: https://buildingthebackend.com/survey