From First Pipeline to Production at Scale
dlt is the open-source Python library for data pipelines. dltHub Pro is the agentic platform that deploys, monitors, and scales them. Together, they cover every phase of data engineering.
dltHub Pro launches in full this May. The window to move first is now.

"What I didn't expect is how much it unblocks the team. A mid-level engineer can spin up a prototype, browse the raw data in dltHub Pro's local DuckDB workspace, validate the SQL schema - all without pulling in a senior. That loop of prototype, inspect, fix, re-run - that's the real unlock."

Marcello Victorino
Staff Data Engineer, Tasman Analytics

Agentic Workflows
Complete agentic workflows for every phase of data engineering
Not autocomplete, not a chatbot on a dashboard. A guided sequence of skills, commands, rules, and MCP servers - with guardrails agents can't skip. Maintained by dltHub, running on infrastructure you control and that your agents and pipelines operate on.
Agentic Workflows in Detail
Discover individual skills per agentic workflow
See how each workflow guides your agent - step by step, from first prompt to production deployment.
Find a dlt source for a given API or data provider. Use when the user asks about a source, wants to find a connector, or asks to implement a pipeline for a specific data source.
Sonnet 4.6 · REST API Pipeline · ~/pipelines
Connect to any API and load data automatically
Deploy to production with one command
Explore data locally, build notebooks, ship Marimo dashboards
Transform raw data into a Canonical Data Model
Cross-toolkit rules, secrets management, and agent routing
THE AGENTIC DATA WORKFLOW
Describe it. The agent ships it. You verify it.
Build a pipeline that loads CRM contacts and deals into my warehouse using dlt
Agent-deployed pipelines are growing 10x year over year
Pipelines built and shipped by AI agents are on an exponential trajectory. Human-authored pipelines still grow steadily - but agents are scaling production workloads 10x faster.
Learn agentic data engineering
Free, self-paced course. From first prompt to production deployment.
Frequently Asked Questions
What is dlt?
dlt (data load tool) is an open-source Python library for building data pipelines. It handles schema inference, incremental loading, and nested data normalization, and works with 9,700+ sources. Apache 2.0 licensed and always free to use.
What is dltHub Pro?
dltHub Pro is the agentic platform for deploying and operating dlt pipelines in production. It provides scheduling, alerting, observability, and complete agentic workflows - from coding to deployment. Agents build your pipelines from a prompt, and Pro handles the rest.
How is dltHub Pro different from a Claude skill or tools like Replit?
Tools like Claude skills or Replit are great for writing and running code, but they are not built for end-to-end data engineering workflows. dltHub Pro gives your team complete agentic workflows that cover every phase - coding, running, deploying, and debugging pipelines - on infrastructure you control.
How is dlt different from Fivetran or a Python script that uses the requests library?
dlt strikes the right balance between standardization and customization. You get the automation that matters - schema inference, incremental state, normalization, and loading - while keeping the full flexibility and portability of plain Python. And with dltHub's agentic workflows, your team can code, run, deploy, and debug pipelines faster.
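The "incremental state" that dlt automates can be sketched in plain Python - this is not dlt's actual API, just the cursor bookkeeping a hand-rolled script would otherwise need (the function and field names here are hypothetical):

```python
# Plain-Python sketch of incremental loading state - NOT dlt's API.
# dlt's incremental cursors persist and advance this state for you.

def load_new_deals(all_deals, state):
    """Return only deals updated since the cursor stored in `state`."""
    cursor = state.get("last_updated_at", "")
    fresh = [d for d in all_deals if d["updated_at"] > cursor]
    if fresh:
        # Advance the cursor to the newest record we just loaded.
        state["last_updated_at"] = max(d["updated_at"] for d in fresh)
    return fresh

deals = [
    {"id": 1, "updated_at": "2024-01-01"},
    {"id": 2, "updated_at": "2024-02-01"},
]
state = {}
first_run = load_new_deals(deals, state)   # loads both deals
second_run = load_new_deals(deals, state)  # nothing new, returns []
```

In a hand-written script you also have to store this state somewhere durable and handle failures mid-load; dlt keeps it alongside the pipeline so re-runs pick up exactly where the last successful load stopped.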
How do I get access to dltHub Pro?
dltHub Pro is currently in public preview. Fill out the form above and we'll get you set up.