The fastest way for agentic teams to build and run trusted data pipelines, from any source.

dlt is the open-source Python library 50,000+ developers use to build data pipelines. When AI makes it easy to build, dltHub becomes the fastest way to put trusted dlt data pipelines into production.

dltHub

Agents build your dlt pipelines from a prompt. Paste this prompt into Claude/Codex/Cursor:

Run uvx dlthub-start@latest to build my first example pipeline and run it on dlthub

"What I didn't expect is how much it unblocks the team. A mid-level engineer can spin up a prototype, browse the raw data in dltHub's local DuckDB workspace, validate the SQL schema - all without pulling in a senior. That loop of prototype, inspect, fix, re-run - that's the real unlock."

Marcello Victorino

Staff Data Engineer, Tasman Analytics

Read the Tasman case study

THE AGENTIC DATA ENGINEERING LIFECYCLE ON DLTHUB

From the outcome you define to the answer you ship, end to end.

Same code from local prototype to production. Same governance from raw data to dashboard. Same agents writing every stage.

Dashboard preview: rows per hour (+18% vs yesterday) · active pipelines (2 added this week) · success rate (+0.6%) · average run time in seconds (-12%)

Pipeline throughput: 2.84M rows in the last 24h

Top pipelines, sorted by rows loaded · last 24h

Pipeline	Rows	Status
github_events	1.24M	Success
stripe_payouts	482k	Running
hubspot_contacts	218k	Success

1 · DEFINE

Turn the business outcome into an agent-ready build plan

The bootstrap toolkit gives agents shared rules, secrets handling, and MCP routing while its skills call dlthub ai init --agent claude and dlthub ai mcp install.

Get a governed pipeline brief, local workspace, and live agent context before code is written.

Read the docs

dltHub toolkit·bootstrap

1 skill

uv run dlthub ai toolkit bootstrap install

/init-workspace

2 · INGEST

Load source data with a reusable dlt pipeline

The rest-api-pipeline toolkit guides agents through source discovery, endpoint setup, and schema-safe loading while its skills call dlthub init rest_api duckdb and dlthub pipeline run. Outcome: source data lands locally with repeatable pipeline code.
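As a rough illustration of what such repeatable pipeline code can look like, here is a minimal sketch built on dlt's declarative REST API source and a local DuckDB destination. The repository URL, the resource names, and the helper names build_rest_api_config and run_pipeline are assumptions chosen for the example, not output of the toolkit; the pipeline and dataset names are borrowed from the dashboard example on this page.

```python
from typing import Any


def build_rest_api_config(base_url: str, resources: list[str]) -> dict[str, Any]:
    """Build a declarative config for dlt's REST API source (hypothetical helper)."""
    return {
        "client": {"base_url": base_url},
        # Each name is used as both the table name and the endpoint path.
        "resources": list(resources),
    }


def run_pipeline(config: dict[str, Any]) -> Any:
    """Load the configured endpoints into a local DuckDB file."""
    # Imported lazily so the config can be inspected without dlt installed.
    import dlt
    from dlt.sources.rest_api import rest_api_source

    pipeline = dlt.pipeline(
        pipeline_name="github_events",
        destination="duckdb",
        dataset_name="github_data",
    )
    return pipeline.run(rest_api_source(config))


# Assumed example repository; swap in your own API.
config = build_rest_api_config(
    base_url="https://api.github.com/repos/dlt-hub/dlt/",
    resources=["issues", "pulls"],
)

if __name__ == "__main__":
    print(config)
    # Uncomment to actually load data; needs dlt with the duckdb extra and network access.
    # print(run_pipeline(config))
```

Keeping the endpoint list in a plain config dictionary is one way to make a pipeline easy for an agent to review and extend: adding an endpoint is a one-line change.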

Read the docs

dltHub toolkit·rest-api-pipeline

5 skills

uv run dlthub ai toolkit rest-api-pipeline install

/find-source · /create-rest-api-pipeline · /new-endpoint · /adjust-endpoint · /debug-pipeline

3 · VALIDATE

Catch drift and data issues before they reach consumers

The data-quality toolkit adds checks and verification steps while its skills call dlthub ai install data-quality and dlthub transform verify --inputs/--outputs.

Schema drift and quality failures become visible before dashboards depend on them.
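To make "schema drift" concrete, the sketch below compares incoming rows against an expected column-to-type mapping and reports new columns, type changes, and missing columns. It is a standard-library illustration of the idea only, not the data-quality toolkit's implementation; the function name detect_drift and the sample payout rows are hypothetical.

```python
from typing import Any


def detect_drift(expected: dict[str, str], rows: list[dict[str, Any]]) -> dict[str, list[str]]:
    """Report columns that are new, changed type, or never appeared."""
    new_columns: set[str] = set()
    type_changes: set[str] = set()
    seen: set[str] = set()
    for row in rows:
        for column, value in row.items():
            seen.add(column)
            if column not in expected:
                new_columns.add(column)
            # None carries no type information, so it is skipped.
            elif value is not None and type(value).__name__ != expected[column]:
                type_changes.add(column)
    return {
        "new_columns": sorted(new_columns),
        "type_changes": sorted(type_changes),
        "missing_columns": sorted(set(expected) - seen),
    }


# Hypothetical expected schema and incoming batch.
expected_schema = {"id": "int", "amount": "float", "currency": "str"}
batch = [
    {"id": 1, "amount": 9.5, "currency": "EUR"},
    {"id": 2, "amount": "12.00", "fee": 0.3},  # amount arrives as text, fee is new
]

report = detect_drift(expected_schema, batch)
has_drift = any(report.values())
print(report, has_drift)
```

A check like this can fail the run before downstream dashboards read the changed table.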

Read the docs

dltHub toolkit·data-quality

1 skill

uv run dlthub ai toolkit data-quality install

/setup-data-quality

4 · DEPLOY

Run the pipeline on the dltHub platform

The dlthub-platform toolkit prepares production profiles, jobs, schedules, and logs while its skills call dlthub deploy <pipeline>, dlthub runtime schedule, and dlthub runtime logs. Outcome: the same pipeline runs in managed production with observable jobs.

Read the docs

dltHub toolkit·dlthub-platform

4 skills

uv run dlthub ai toolkit dlthub-platform install

/setup-runtime · /prepare-deployment · /deploy-workspace · /debug-deployment

5 · TRANSFORM

Promote raw loads into governed models

The transformations toolkit turns loaded resources into reusable models with @dlt.hub.transformation while its skills call dlthub transform run and dlthub dbt generate. Outcome: raw tables become governed analytical datasets without leaving the workflow.

Read the docs

dltHub toolkit·transformations

4 skills

uv run dlthub ai toolkit transformations install

/annotate-sources · /create-ontology · /generate-cdm · /create-transformation

6 · VISUALIZE

Explore fresh data in notebooks and dashboards

The data-exploration toolkit helps agents inspect datasets and build Marimo views while its skills call dlthub runtime serve --app-type marimo and dlthub dataset head. Outcome: users see fresh, validated data as interactive analysis.

Read the docs

dltHub toolkit·data-exploration

2 skills

uv run dlthub ai toolkit data-exploration install

/explore-data · /build-notebook

7 · SHARE

Hand off a live answer your team can use

Ship the answer with pipeline-level confidence

The lifecycle closes where operations begin: every pipeline, transformation, validation, notebook, and shared answer remains traceable from the dltHub workspace to Runtime.

Explore the full feature list in the dltHub docs

github_events · Running

Source: github (rest_api) · 4 resources · destination bigquery://github_data

Schedule: every 30 min
Success rate: 99.4%
Avg duration: 4.21s
Last run: just now
Rows loaded · last 24 hours: 54,128 (↑ 6.2% vs yesterday)

Resources

Resource	Rows
issues	1,284
pulls	612
comments	318
releases	0

Recent runs

Run	Started	Duration	Rows	Status
#4128	13:42 · today	4.21s	2,214	Running
#4127	13:12 · today	4.04s	2,189	Success
#4126	12:42 · today	4.17s	2,202	Success
#4125	12:12 · today	4.31s	2,176	Success
#4124	11:42 · today	42.5s	0	Failed
#4123	11:12 · today	4.08s	2,164	Success

Complete agentic workflows for every phase of data engineering

Not autocomplete, not a chatbot on a dashboard. A guided sequence of skills, commands, rules, and MCP - with guardrails agents can't skip. Maintained by dltHub, controlling the infrastructure agents and pipelines operate on.

Discover individual skills per agentic workflow

See how each workflow guides your agent - step by step, from first prompt to production deployment.

REST API Pipeline · 1/7

Find a dlt source for a given API or data provider. Use when the user asks about a source, wants to find a connector, or asks to implement a pipeline for a specific data source.

REST API Pipeline · dlt

Connect to any API and load data automatically

7 skills · 1 rule · MCP

dltHub Platform · dltHub

Deploy to production with one command

4 skills · 2 rules

Data Exploration · dlt

Explore data locally, build notebooks, ship Marimo dashboards

2 skills · 1 rule

Transformations · dltHub

Transform raw data into a Canonical Data Model

4 skills · 1 rule · MCP

Init · dlt

Cross-toolkit rules, secrets management, and agent routing

3 skills · 1 cmd · 1 rule · MCP

Blueprints for data workflows

dltHub is a composable data platform. Blueprints are its ready-made builds: each one dltHub assembled for a specific use case, end to end, from the sources you already use to a production dashboard or API.

Browse every dltHub Blueprint

Trace pipelines

Pydantic Logfire
Arize
Langfuse
LangChain

dltHub

Ingest and standardize traces into the OpenAI messages format as a training-ready dataset.

API

distil labs

Fine-tune a specialist model, served as a drop-in replacement via API to distil labs customers.

View the Agent distillation with distil labs blueprint

Frequently Asked Questions

What is dlt?

dlt (data load tool) is an open-source Python library for building data pipelines. It handles schema inference, incremental loading, nested data normalization, and works with 10,100+ sources. Apache 2.0 licensed and always free to use.
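To give a feel for what nested data normalization and schema inference mean in practice, here is a simplified standard-library sketch: nested objects become double-underscore columns, nested lists become child tables, and column types are inferred from the values. It illustrates the idea only and is not dlt's implementation; the helper names and the bookkeeping columns _row_id, _parent_id, and _position are placeholders invented for this example.

```python
from typing import Any

Row = dict[str, Any]


def flatten(record: Row, prefix: str = "") -> tuple[Row, dict[str, list]]:
    """Flatten nested dicts into double-underscore columns; collect lists separately."""
    flat: Row = {}
    lists: dict[str, list] = {}
    for key, value in record.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            sub_flat, sub_lists = flatten(value, prefix=f"{name}__")
            flat.update(sub_flat)
            lists.update(sub_lists)
        elif isinstance(value, list):
            lists[name] = value
        else:
            flat[name] = value
    return flat, lists


def normalize(table: str, records: list[Row]) -> dict[str, list[Row]]:
    """Split records into a parent table plus one child table per nested list."""
    tables: dict[str, list[Row]] = {table: []}
    for parent_id, record in enumerate(records):
        flat, lists = flatten(record)
        flat["_row_id"] = parent_id
        tables[table].append(flat)
        for column, items in lists.items():
            child = tables.setdefault(f"{table}__{column}", [])
            for position, item in enumerate(items):
                row = item if isinstance(item, dict) else {"value": item}
                child_flat, _ = flatten(row)  # deeper lists are ignored in this sketch
                child.append({**child_flat, "_parent_id": parent_id, "_position": position})
    return tables


def infer_schema(rows: list[Row]) -> dict[str, str]:
    """Map each column to the type name of its first non-null value."""
    schema: dict[str, str] = {}
    for row in rows:
        for column, value in row.items():
            if value is not None:
                schema.setdefault(column, type(value).__name__)
    return schema


# Hypothetical API payload.
issues = [
    {"id": 1, "user": {"login": "alice"}, "labels": [{"name": "bug"}, {"name": "p1"}]},
    {"id": 2, "user": {"login": "bob"}, "labels": []},
]
tables = normalize("issues", issues)
schema = infer_schema(tables["issues"])
print(tables)
print(schema)
```

In this sketch the nested user object ends up as a user__login column on the issues table, while the labels list becomes a separate issues__labels table linked back by _parent_id.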

What is dltHub?

dltHub is the managed agentic platform for running dlt pipelines in production. It bundles a managed runtime (deploy with one command, no infra to patch), Python and SQL transformations orchestrated inside your pipeline, data quality checks that fail fast with actionable errors, a managed Iceberg lakehouse with the option to bring your own storage, and an MCP server so agents can analyze pipelines and datasets directly. The outcome: teams ship trustworthy data faster, without owning the infrastructure. See the full feature list in the dltHub docs.

How is dltHub different from a Claude skill or tools like Replit?

Tools like Claude skills or Replit are great for writing and running code. But they are not built for data engineering workflows end to end. dltHub gives your team complete agentic workflows that cover every phase: coding, running, deploying, and debugging pipelines, on infrastructure you control.

How is dlt different from Fivetran or a Python script that uses the requests library?

dlt balances standardization and customization. You get the automation that matters (schema inference, incremental state, normalization, and loading) while keeping the full flexibility and portability of plain Python. And with agentic dltHub workflows, your team can code, run, deploy, and debug pipelines faster.
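For a sense of what incremental state saves you from writing by hand, the sketch below tracks a cursor value between runs so that only newer records are loaded. It is a standard-library illustration of the concept, not dlt's code (dlt exposes this through dlt.sources.incremental); the function take_new_records, the updated_at field, and the sample records are assumptions made for the example.

```python
from typing import Any

Record = dict[str, Any]


def take_new_records(
    records: list[Record],
    state: dict[str, Any],
    cursor_field: str = "updated_at",
) -> list[Record]:
    """Return records newer than the stored cursor and advance the cursor."""
    last_value = state.get("last_value")
    fresh = [r for r in records if last_value is None or r[cursor_field] > last_value]
    if fresh:
        state["last_value"] = max(r[cursor_field] for r in fresh)
    return fresh


# State would normally be persisted between runs; a dict stands in for it here.
state: dict[str, Any] = {}

# ISO 8601 timestamps in one format and timezone compare correctly as strings.
page_1 = [
    {"id": 1, "updated_at": "2024-05-01T10:00:00Z"},
    {"id": 2, "updated_at": "2024-05-01T11:00:00Z"},
]
page_2 = page_1 + [{"id": 3, "updated_at": "2024-05-02T09:30:00Z"}]

first_run = take_new_records(page_1, state)
second_run = take_new_records(page_2, state)
print([r["id"] for r in first_run], [r["id"] for r in second_run])
```

A hand-rolled script also has to persist this state safely, handle retries, and keep it in step with what was actually loaded, which is the bookkeeping a pipeline library takes over.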

How do I get access to dltHub?

dltHub is available now. Book a demo with our team to get set up, or see our pricing page for plans and what's included.
