Lightweight Python code to move data
We focus on the needs & constraints of Python-first data platform teams: how to write any data source, achieve data democracy, modernise legacy systems and reduce cloud costs.
OPEN SOURCE
Pip install dlt and go
With over 600k downloads per month, dlt 1.0 is the most popular production-ready Python library for moving data. You can add dlt to your Python scripts to load data from various and often messy data sources into well-structured, live datasets. Unlike other non-Python solutions, with dlt, there's no need to use any backends or containers. We do not replace your data platform, deployments, or security models. Simply import dlt in a Python file or a Jupyter Notebook cell. You can load data from any source that produces Python data structures, including APIs, files, databases, and more.
DLT+ FOR DATA PLATFORM TEAMS
The Portable Data Lake
With dlt+ we propose a new approach to building data platforms that better meet the needs of diverse data teams working primarily in Python.
dlt+ works where you work - whether that’s local storage, data lakes, open storage formas, Pythonic engines & AI supported workfows.
Want to be the among the first to try the Portable Data Lake?
OPEN SOURCE
Access any data you want in Python
Today it is easier to pip install dlt and write a custom pipeline than to setup and configure a traditional ETL platform. In June '24 we crossed 5,000 dlt total custom sources created by the community since we launched dlt in summer '23. Because dlt is code we continue to automate engineering work and pass on productivity gains to organisations using dlt. Our new REST API Source toolkit is a short, declarative configuration driven way of creating sources. dlt-init-openapi is a a new tool that generates pipelines code out of any OpenAPI spec.