Skip to main content

How dlt works

dlt automatically turns JSON returned by any source (e.g. an API) into a live dataset stored in the destination of your choice (e.g. Google BigQuery). It does this by first extracting the JSON data, then normalizing it to a schema, and finally loading it to the location where you will store it.

architecture-diagram

Extractโ€‹

The Python script requests data from an API or a similar source. Once this data is received, the script parses the JSON and provides it to dlt as input, which then normalizes that data.

Normalizeโ€‹

The configurable normalization engine in dlt recursively unpacks this nested structure into relational tables (i.e. inferring data types, linking tables to create parent-child relationships, etc.), making it ready to be loaded. This creates a schema, which will automatically evolve to any future source data changes (e.g. new fields or tables).

Loadโ€‹

The data is then loaded into your chosen destination. dlt uses configurable, idempotent, atomic loads that ensure data safely ends up there. For example, you don't need to worry about the size of the data you are loading and if the process is interrupted, it is safe to retry without creating errors.

This demo works on codespaces. Codespaces is a development environment available for free to anyone with a Github account. You'll be asked to fork the demo repository and from there the README guides you with further steps.
The demo uses the Continue VSCode extension.

Off to codespaces!

DHelp

Ask a question

Welcome to "Codex Central", your next-gen help center, driven by OpenAI's GPT-4 model. It's more than just a forum or a FAQ hub โ€“ it's a dynamic knowledge base where coders can find AI-assisted solutions to their pressing problems. With GPT-4's powerful comprehension and predictive abilities, Codex Central provides instantaneous issue resolution, insightful debugging, and personalized guidance. Get your code running smoothly with the unparalleled support at Codex Central - coding help reimagined with AI prowess.