Skip to main content
Version: 1.7.0 (latest)

Datasets

dlt+

This page is for dlt+, which requires a license. Join our early access program for a trial license.

A dataset is a physical collection of data and dlt metadata, including the schema on a destination. One destination can have multiple datasets; for now, datasets are bound to a physical destination, but this may change in future iterations.

By treating datasets as individual entities, dlt+ enables data cataloging and data governance.

Data cataloging​

Datasets automatically create data catalogs that can be used to discover schema and read and write data.

Data governance​

Datasets are a fundamental unit of governance in the dlt+ Project. Using the declarative interface of dlt+, you can control:

  1. Where they are materialized: you can specify which destinations you would like to materialize the datasets in.
  2. Who can access them: you can enable and disable them per profile.
  3. Ways in which the schema can be modified: you can also set schema contracts per profile.

This demo works on codespaces. Codespaces is a development environment available for free to anyone with a Github account. You'll be asked to fork the demo repository and from there the README guides you with further steps.
The demo uses the Continue VSCode extension.

Off to codespaces!

DHelp

Ask a question

Welcome to "Codex Central", your next-gen help center, driven by OpenAI's GPT-4 model. It's more than just a forum or a FAQ hub – it's a dynamic knowledge base where coders can find AI-assisted solutions to their pressing problems. With GPT-4's powerful comprehension and predictive abilities, Codex Central provides instantaneous issue resolution, insightful debugging, and personalized guidance. Get your code running smoothly with the unparalleled support at Codex Central - coding help reimagined with AI prowess.