Skip to main content
Version: 1.27.0 (latest) View Markdown

dlt and dltHub

note

dltHub offers two products: dlt (open source) and dltHub (commercial). This page explains both products and how they relate.

The two products at a glance

dlt is free and open source under Apache 2.0, dltHub is a paid product.

CapabilitydltdltHub
Build ingestion pipelines using dlt verified sources (except premium sources)
Build pipelines with the dltHub AI Workbench
Data quality metrics & checks
Build transformation pipelines (dltHub/dbt)
Managed runtime: deploy, run/schedule pipelines, serve data apps, monitor jobs
Premium sources (MS SQL) and destinations (Iceberg, Delta, Snowflake + Iceberg/Open Catalog)

dlt (OSS)

dlt is the open source ingestion library, distributed under Apache 2.0.

ComponentDescriptionHow to access itGet started
dlt libraryThe Python pipeline engine: extract, normalize, load. Includes schema inference and evolution, incremental loading, write dispositions, pipeline state, and the dlt pipeline … CLI for inspection.dlt — Apache 2.0, on PyPIIntro · Tutorial · pip install dlt
Core sourcesThree flexible, generic sources shipped with the library that cover most ingestion scenarios out of the box:
  • rest_api (any REST API via declarative config for endpoints, pagination, and auth)
  • sql_database (30+ SQL databases via SQLAlchemy / PyArrow / pandas / ConnectorX with table reflection)
  • filesystem (local and cloud storage — S3, GCS, Azure Blob, Google Drive, SFTP — with native CSV / Parquet / JSONL support)
Distributed with dlt libraryREST API · SQL Database · Filesystem & Cloud Storages
Verified sourcesA curated set of dltHub-maintained connectors (for example, Kafka, MongoDB, Postgres CDC, Stripe, Hubspot, …) pulled into your project with dlt init <source> <destination> .dlt-hub/verified-sourcesVerified sources docs · dlt init -l to list available sources

dlt is a good fit if: You want a lightweight, code-first ingestion library, are comfortable managing orchestration, scheduling, and operations yourself, or you need to deploy on-prem, on a VPS, or in any environment where managed cloud solution is not an option. dlt runs anywhere Python runs, with no platform dependency.

dltHub

dltHub is a managed cloud platform for running your dlt pipelines, transformations, and notebooks. You can work with dltHub in two complementary ways:

  • Web UI at dlthub.app — sign up to deploy, schedule, monitor pipelines, manage profiles, browse datasets.
  • Locally, from the CLI or Python — bootstrap a new workspace in one command:
    uvx create-dlthub-workspace my-project
    This creates a runnable workspace with the AI Workbench, example pipelines, and the dlt[hub] extra installed. To add dltHub to an existing project instead, run:
    pip install "dlt[hub]"
    dlthub init
    Either way, you get the dltHub workspace + dashboard, the AI development tooling (dlthub ai, MCP server, AI Workbench), per-source contexts, and the dlthub library that adds data quality, transformations, and premium sources/destinations.

Every component below is part of dltHub and requires a license. Most components are source-available under their own licenses; all are distributed through the dlthub PyPI package or the dltHub repositories.

ComponentWhat it isHow to access itGet started
dltHub PlatformThe hosted Web UI and managed runtime at dlthub.app — deploy and schedule pipelines, monitor runs, manage workspaces and profiles, browse datasets, collaborate.dlthub.appPlatform · Runtime overview · Runtime tutorial
AI ToolkitsThe dltHub AI Workbench: a collection of toolkits made of skills, rules, workflows, and MCP wiring that drive agentic pipeline development inside Claude Code, Cursor, and Codex.dlt-hub/dlthub-ai-workbench — source-available under its own licenseLLM-native workflow walkthrough
dltHub ContextPer-source LLM contexts (specs, endpoint documentation, prompts) that prime your coding assistant for thousands of APIs. Automatically used by AI WorkbenchBrowse and copy contexts at dlthub.com/contextBuild a source with AI
dlthub libraryPython package shipped via dlt[hub]. Adds the production capabilities: data quality, Python transformations (@dlt.hub.transformation) and dbt transformations, and premium sources/destinations such as Iceberg / DuckLake and MSSQL Change Tracking.On PyPIInstallation

dltHub is a good fit if: You are running pipelines in production, want a coding agent to do the heavy lifting with tooling that supports the generation of production-grade code, need transformations or data quality checks, if you want managed infrastructure, or if you are working as a team.

tip

For tier breakdown (Pro / Scale / Enterprise), see the intro. If you have a specific question, feature request, or unique use case, feel free to reach out.

Getting started

We recommend starting with the dltHub AI workbench to get started quickly while following dltHub best practices.

This demo works on codespaces. Codespaces is a development environment available for free to anyone with a Github account. You'll be asked to fork the demo repository and from there the README guides you with further steps.
The demo uses the Continue VSCode extension.

Off to codespaces!

DHelp

Ask a question

Welcome to "Codex Central", your next-gen help center, driven by OpenAI's GPT-4 model. It's more than just a forum or a FAQ hub – it's a dynamic knowledge base where coders can find AI-assisted solutions to their pressing problems. With GPT-4's powerful comprehension and predictive abilities, Codex Central provides instantaneous issue resolution, insightful debugging, and personalized guidance. Get your code running smoothly with the unparalleled support at Codex Central - coding help reimagined with AI prowess.