AWS S3 as Storage for Data Products

Category: Data Platform
Platform: AWS

Context

How do we store analytical data so that it can be processed efficiently and shared with other teams as data products?

Most teams use Apache Kafka to exchange data and publish domain events. We expect about 10 TB of analytical data per team on average.

Many domain teams use AWS services for their operational systems.

Decision

We use AWS S3 as storage for data products.
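
To make the decision more concrete, here is a minimal sketch of how a data product's objects might be addressed in S3. The bucket name, the prefix layout (data product, version, date partition), and the `DataProductLocation` helper are all assumptions for illustration; this decision does not prescribe a naming convention.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class DataProductLocation:
    """Hypothetical addressing scheme for one data product in S3."""

    bucket: str        # assumed: one bucket per domain team
    data_product: str  # assumed: data product name as the top-level prefix
    version: str       # assumed: schema version, e.g. "v1"

    def object_key(self, day: date, filename: str) -> str:
        # Hive-style "dt=" partition so that query engines can prune by day.
        return f"{self.data_product}/{self.version}/dt={day.isoformat()}/{filename}"

    def uri(self, day: date, filename: str) -> str:
        # Full S3 URI that a consuming team could be given.
        return f"s3://{self.bucket}/{self.object_key(day, filename)}"


# Example values are invented for illustration only.
location = DataProductLocation(
    bucket="team-checkout-data-products",
    data_product="orders",
    version="v1",
)
print(location.uri(date(2024, 1, 31), "part-0000.parquet"))
```

With a key built this way, a file could be uploaded using boto3's `upload_file(Filename, Bucket, Key)` on an S3 client, assuming AWS credentials are configured for the team's account.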

Consequences

Considered Alternatives

Automation