dlt is the open-source Python library for data pipelines. dltHub Pro is the agentic platform that deploys, monitors, and scales them. Together, they cover every phase of data engineering.
dltHub Pro launches in full this May. The window to move first is now.

"What I didn't expect is how much it unblocks the team. A mid-level engineer can spin up a prototype, browse the raw data in dltHub Pro's local DuckDB workspace, validate the SQL schema - all without pulling in a senior. That loop of prototype, inspect, fix, re-run - that's the real unlock."

Marcello Victorino
Staff Data Engineer, Tasman Analytics

Marcello Victorino
Staff Data Engineer, Tasman Analytics
Same code from local prototype to production. Same governance from raw data to dashboard. Same agents writing every stage.
The bootstrap toolkit gives agents shared rules, secrets handling, and MCP routing while its skills call dlthub ai init --agent claude and dlthub ai mcp install.
Get a governed pipeline brief, local workspace, and live agent context before code is written.
uv run dlthub ai toolkit bootstrap installOpus 4.6 · dltHub · ~/agent-observability
The rest-api-pipeline toolkit guides agents through source discovery, endpoint setup, and schema-safe loading while its skills call dlthub init rest_api duckdb and dlthub pipeline run. So source data lands locally with repeatable pipeline code.
uv run dlthub ai toolkit rest-api-pipeline installOpus 4.6 · dltHub · ~/agent-observability
The data-quality toolkit adds checks and verification steps while its skills call dlthub ai install data-quality and dlthub transform verify --inputs/--outputs.
Schema drift and quality failures become visible before dashboards depend on them.
uv run dlthub ai toolkit data-quality installOpus 4.6 · dltHub · ~/agent-observability
The dlthub-platform toolkit prepares production profiles, jobs, schedules, and logs while its skills call dlthub deploy <pipeline>, dlthub runtime schedule, and dlthub runtime logs. Outcome: the same pipeline runs in managed production with observable jobs.
uv run dlthub ai toolkit dlthub-platform installOpus 4.6 · dltHub · ~/agent-observability
The transformations toolkit turns loaded resources into reusable models with @dlt.hub.transformation while its skills call dlthub transform run and dlthub dbt generate. Outcome: raw tables become governed analytical datasets without leaving the workflow.
uv run dlthub ai toolkit transformations installOpus 4.6 · dltHub · ~/agent-observability
The data-exploration toolkit helps agents inspect datasets and build Marimo views while its skills call dlthub runtime serve --app-type marimo and dlthub dataset head. Outcome: users see fresh, validated data as interactive analysis.
uv run dlthub ai toolkit data-exploration installOpus 4.6 · dltHub · ~/agent-observability
The lifecycle closes where operations begin: every pipeline, transformation, validation, notebook, and shared answer remains traceable from the dltHub workspace to Runtime.
| Run | Started | Duration | Rows | Status |
|---|---|---|---|---|
| #4128 | 13:42 · today | 4.21s | 2,214 | Running |
| #4127 | 13:12 · today | 4.04s | 2,189 | Success |
| #4126 | 12:42 · today | 4.17s | 2,202 | Success |
| #4125 | 12:12 · today | 4.31s | 2,176 | Success |
| #4124 | 11:42 · today | 42.5s | 0 | Failed |
| #4123 | 11:12 · today | 4.08s | 2,164 | Success |
Agentic Workflows
Not autocomplete, not a chatbot on a dashboard. A guided sequence of skills, commands, rules, and MCP - with guardrails agents can't skip. Maintained by dltHub, controlling the infrastructure agents and pipelines operate on.
Agentic Workflows in Detail
See how each workflow guides your agent - step by step, from first prompt to production deployment.
Find a dlt source for a given API or data provider. Use when the user asks about a source, wants to find a connector, or asks to implement a pipeline for a specific data source.
Sonnet 4.6 · REST API Pipeline · ~/pipelines
Find a dlt source for a given API or data provider. Use when the user asks about a source, wants to find a connector, or asks to implement a pipeline for a specific data source.
Sonnet 4.6 · REST API Pipeline · ~/pipelines
dlt (data load tool) is an open-source Python library for building data pipelines. It handles schema inference, incremental loading, nested data normalization, and works with 9,700+ sources. Apache 2.0 licensed and always free to use.
dltHub Pro is the managed agentic platform for running dlt pipelines in production. It bundles a managed runtime (deploy with one command, no infra to patch), Python and SQL transformations orchestrated inside your pipeline, data quality checks that fail fast with actionable errors, a managed Iceberg lakehouse with the option to bring your own storage, and an MCP server so agents can analyze pipelines and datasets directly. The outcome: teams ship trustworthy data faster, without owning the infrastructure. See the full feature list in the dltHub docs.
Tools like Claude skills or Replit are great for writing and running code. But they are not built for data engineering workflows end to end. dltHub Pro gives your team complete agentic workflows that cover every phase: coding, running, deploying, and debugging pipelines, on infrastructure you control.
dlt is the perfect match between standardization and customization. You get the automation that matters: schema inference, incremental state, normalization, and loading, while keeping the full flexibility and portability of plain Python. And with agentic dltHub workflows, your team can code, run, deploy, and debug pipelines faster.
dltHub Pro is currently in public preview. Fill out the form above and we'll get you set up.