Data Mesh Governance / Policies / Discoverability / Data Catalog

AWS Glue Data Catalog as Data Catalog for Data Products

Category: Discoverability Platform: AWS

Context

We use AWS S3 as Storage for Data Products and AWS Athena as Query Engine. Athena already forces us to define table structure in the metastore AWS Glue Data Catalog.

How can we register data products in a central place for other teams to discover and understand data products?

Decision

We use AWS Glue Data Catalog as catalog for data products.

We use a central AWS governance account that hosts the data catalog that links to data that lives in domain team’s AWS account.

Example of a table in AWS Glue Data Catalog

Consequences

Considered Alternatives

Automation