Datasets
dlt+
This page is for dlt+, which requires a license. Join our early access program for a trial license.
A dataset is a physical collection of data and dlt metadata, including the schema on a destination. One destination can have multiple datasets; for now, datasets are bound to a physical destination, but this may change in future iterations.
By treating datasets as individual entities, dlt+ enables data cataloging and data governance.
Data cataloging​
Datasets automatically create data catalogs that can be used to discover schema and read and write data.
Data governance​
Datasets are a fundamental unit of governance in the dlt+ Project. Using the declarative interface of dlt+, you can control:
- Where they are materialized: you can specify which destinations you would like to materialize the datasets in.
- Who can access them: you can enable and disable them per profile.
- Ways in which the schema can be modified: you can also set schema contracts per profile.