Command Line Interface Reference
This page contains all commands available in the dlt CLI and is generated automatically from the fully populated Python argparse object of dlt.
Flags and positional arguments are inherited from the parent command, and position within the command string is important. For example, if you want to enable debug mode on the pipeline command, you need to add the debug flag to the base `dlt` command:
dlt --debug pipeline
Adding the flag after the `pipeline` keyword will not work.
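For instance, a quick sketch of correct vs. incorrect flag placement (the pipeline name `chess_pipeline` is illustrative):

```sh
# Works: global flags come before the subcommand
dlt --debug pipeline chess_pipeline info

# Does not work: the pipeline subcommand does not define --debug
dlt pipeline --debug chess_pipeline info
```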
dlt
Creates, adds, inspects and deploys dlt pipelines. Further help is available at https://dlthub.com/docs/reference/command-line-interface.
Usage
dlt [-h] [--version] [--disable-telemetry] [--enable-telemetry]
[--non-interactive] [--debug]
{telemetry,schema,pipeline,init,render-docs,deploy} ...
Options
-h, --help
- Show this help message and exit
--version
- Show the program's version number and exit
--disable-telemetry
- Disables telemetry before the command is executed
--enable-telemetry
- Enables telemetry before the command is executed
--non-interactive
- Non-interactive mode; default choices are automatically made for confirmations and prompts
--debug
- Displays full stack traces on exceptions; useful for debugging if the output is not clear enough
Available subcommands
telemetry
- Shows telemetry status
schema
- Shows, converts, and upgrades schemas
pipeline
- Operations on pipelines that were run locally
init
- Creates a pipeline project in the current folder by adding an existing verified source or creating a new one from a template
render-docs
- Renders a markdown version of the CLI docs
deploy
- Creates a deployment package for a selected pipeline script
dlt telemetry
Shows telemetry status.
Usage
dlt telemetry [-h]
Description
The `dlt telemetry` command shows the current status of dlt telemetry. Learn more about telemetry and what we send in our telemetry docs.
dlt schema
Shows, converts and upgrades schemas.
Usage
dlt schema [-h] [--format {json,yaml}] [--remove-defaults] file
Description
The `dlt schema` command will load, validate, and print out a dlt schema: `dlt schema path/to/my_schema_file.yaml`.
Inherits arguments from `dlt`.
Positional arguments
file
- Schema file name, in YAML or JSON format; the format is autodetected based on the extension
Options
-h, --help
- Show this help message and exit
--format {json,yaml}
- Display the schema in this format
--remove-defaults
- Does not show default hint values
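For example, to print a schema as JSON without default hint values (the file path is illustrative):

```sh
dlt schema --format json --remove-defaults path/to/my_schema_file.yaml
```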
dlt pipeline
Operations on pipelines that were run locally.
Usage
dlt pipeline [-h] [--list-pipelines] [--hot-reload] [--pipelines-dir
PIPELINES_DIR] [--verbose] [pipeline_name]
{info,show,failed-jobs,drop-pending-packages,sync,trace,schema,drop,load-package}
...
Description
The `dlt pipeline` command provides a set of commands to inspect the pipeline working directory, tables, and data in the destination, and to check for problems encountered during data loading.
Inherits arguments from `dlt`.
Positional arguments
pipeline_name
- Pipeline name
Options
-h, --help
- Show this help message and exit
--list-pipelines, -l
- List local pipelines
--hot-reload
- Reload the Streamlit app (for core development)
--pipelines-dir PIPELINES_DIR
- Pipelines working directory
--verbose, -v
- Provides more information for certain commands
Available subcommands
info
- Displays the state of the pipeline; use -v or -vv for more info
show
- Generates and launches a Streamlit app with the loading status and dataset explorer
failed-jobs
- Displays information on all the failed loads in all completed packages: failed jobs and their associated error messages
drop-pending-packages
- Deletes all extracted and normalized packages, including those that are partially loaded
sync
- Drops the local state of the pipeline, resets all schemas, and restores them from the destination; the destination state, data, and schemas are left intact
trace
- Displays the last run trace; use -v or -vv for more info
schema
- Displays the default schema
drop
- Selectively drop tables and reset state
load-package
- Displays information on a load package; use -v or -vv for more info
dlt pipeline info
Displays the state of the pipeline; use -v or -vv for more info.
Usage
dlt pipeline [pipeline_name] info [-h]
Description
Displays the content of the working directory of the pipeline: dataset name, destination, list of schemas, resources in schemas, list of completed and normalized load packages, and optionally a pipeline state set by the resources during the extraction process.
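For example, to inspect a pipeline with extra verbosity (the pipeline name is illustrative):

```sh
dlt pipeline -v chess_pipeline info
```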
dlt pipeline show
Generates and launches a Streamlit app with the loading status and dataset explorer.
Usage
dlt pipeline [pipeline_name] show [-h]
Description
Generates and launches a Streamlit (https://streamlit.io/) app with the loading status and dataset explorer.
This is a simple app that you can use to inspect the schemas and data in the destination as well as your pipeline state and loading status/stats. It should be executed from the same folder from which you ran the pipeline script to access destination credentials.
Requires `streamlit` to be installed in the current environment: `pip install streamlit`.
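A typical invocation might look like this (pipeline name illustrative):

```sh
pip install streamlit
dlt pipeline chess_pipeline show
```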
dlt pipeline failed-jobs
Displays information on all the failed loads in all completed packages: failed jobs and their associated error messages.
Usage
dlt pipeline [pipeline_name] failed-jobs [-h]
Description
This command scans all the load packages looking for failed jobs and then displays information on files that got loaded and the failure message from the destination.
dlt pipeline drop-pending-packages
Deletes all extracted and normalized packages, including those that are partially loaded.
Usage
dlt pipeline [pipeline_name] drop-pending-packages [-h]
Description
Removes all extracted and normalized packages in the pipeline's working directory. `dlt` keeps extracted and normalized load packages in the pipeline working directory; when the `run` method is called, it will attempt to normalize and load pending packages first. This command removes such packages. Note that the pipeline state is not reverted to the state at which the deleted packages were created. Using `dlt pipeline ... sync` is recommended if your destination supports state sync.
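For example, to discard pending packages and then restore the local state from the destination (pipeline name illustrative):

```sh
dlt pipeline chess_pipeline drop-pending-packages
dlt pipeline chess_pipeline sync
```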
dlt pipeline sync
Drops the local state of the pipeline, resets all schemas, and restores them from the destination. The destination state, data, and schemas are left intact.
Usage
dlt pipeline [pipeline_name] sync [-h] [--destination DESTINATION]
[--dataset-name DATASET_NAME]
Description
This command will remove the pipeline working directory with all pending packages, unsynchronized state changes, and schemas, and retrieve the last synchronized data from the destination. If you drop the dataset the pipeline is loading to, this command results in a complete reset of the pipeline state.
In the case of a pipeline without a working directory, the command may be used to create one from the destination. In order to do that, you need to pass the dataset name and destination name to the CLI and provide the credentials to connect to the destination (i.e., in `.dlt/secrets.toml`) placed in the folder where you execute the `pipeline sync` command.
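A minimal sketch of restoring a working directory from the destination, assuming credentials are present in `.dlt/secrets.toml` (pipeline, destination, and dataset names are illustrative):

```sh
dlt pipeline chess_pipeline sync --destination duckdb --dataset-name chess_data
```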
Inherits arguments from `dlt pipeline`.
Options
-h, --help
- Show this help message and exit
--destination DESTINATION
- Sync from this destination when local pipeline state is missing
--dataset-name DATASET_NAME
- Dataset name to sync from when local pipeline state is missing
dlt pipeline trace
Displays the last run trace; use -v or -vv for more info.
Usage
dlt pipeline [pipeline_name] trace [-h]
Description
Displays the trace of the last pipeline run, containing the start date of the run, elapsed time, and the same information for all the steps (`extract`, `normalize`, and `load`). If any of the steps failed, you'll see the message of the exceptions that caused that problem. Successful `load` and `run` steps will display the load info instead.
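For example (pipeline name illustrative):

```sh
dlt pipeline -vv chess_pipeline trace
```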
dlt pipeline schema
Displays default schema.
Usage
dlt pipeline [pipeline_name] schema [-h] [--format {json,yaml}]
[--remove-defaults]
Description
Displays the default schema for the selected pipeline.
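For example, to print the default schema as YAML without default hint values (pipeline name illustrative):

```sh
dlt pipeline chess_pipeline schema --format yaml --remove-defaults
```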
Inherits arguments from `dlt pipeline`.
Options
-h, --help
- Show this help message and exit
--format {json,yaml}
- Display the schema in this format
--remove-defaults
- Does not show default hint values
dlt pipeline drop
Selectively drop tables and reset state.
Usage
dlt pipeline [pipeline_name] drop [-h] [--destination DESTINATION]
[--dataset-name DATASET_NAME] [--drop-all] [--state-paths [STATE_PATHS ...]]
[--schema SCHEMA_NAME] [--state-only] [resources ...]
Description
Selectively drop tables and reset state.
dlt pipeline <pipeline name> drop [resource_1] [resource_2]
Drops tables generated by selected resources and resets the state associated with them. Mainly used to force a full refresh on selected tables. In the example below, we drop all tables generated by the `repo_events` resource in the GitHub pipeline:
dlt pipeline github_events drop repo_events
`dlt` will inform you of the names of dropped tables and the resource state slots that will be reset:
About to drop the following data in dataset airflow_events_1 in destination dlt.destinations.duckdb:
Selected schema:: github_repo_events
Selected resource(s):: ['repo_events']
Table(s) to drop:: ['issues_event', 'fork_event', 'pull_request_event', 'pull_request_review_event', 'pull_request_review_comment_event', 'watch_event', 'issue_comment_event', 'push_event__payload__commits', 'push_event']
Resource(s) state to reset:: ['repo_events']
Source state path(s) to reset:: []
Do you want to apply these changes? [y/N]
As a result of the command above, the following will happen:
- All the indicated tables will be dropped in the destination. Note that `dlt` drops the nested tables as well.
- All the indicated tables will be removed from the indicated schema.
- The state for the resource `repo_events` was found and will be reset.
- New schema and state will be stored in the destination.
The `drop` command accepts several advanced settings:
- You can use regexes to select resources. Prepend the `re:` string to indicate a regex pattern. The example below will select all resources starting with `repo`:
dlt pipeline github_events drop "re:^repo"
- You can drop all tables in the indicated schema:
dlt pipeline chess drop --drop-all
- You can indicate additional state slots to reset by passing a JsonPath to the source state. In the example below, we reset the `archives` slot in the source state:
dlt pipeline chess_pipeline drop --state-paths archives
This will select the `archives` key in the `chess` source.
{
  "sources": {
    "chess": {
      "archives": [
        "https://api.chess.com/pub/player/magnuscarlsen/games/2022/05"
      ]
    }
  }
}
This command is still experimental and the interface will most probably change.
Inherits arguments from `dlt pipeline`.
Positional arguments
resources
- One or more resources to drop. Can be exact resource name(s) or regex pattern(s). Regex patterns must start with `re:`
Options
-h, --help
- Show this help message and exit
--destination DESTINATION
- Sync from this destination when local pipeline state is missing
--dataset-name DATASET_NAME
- Dataset name to sync from when local pipeline state is missing
--drop-all
- Drop all resources found in the schema; supersedes the [resources] argument
--state-paths [STATE_PATHS ...]
- State keys or JSON paths to drop
--schema SCHEMA_NAME
- Schema name to drop from (if other than the default schema)
--state-only
- Only wipe state for matching resources without dropping tables
dlt pipeline load-package
Displays information on a load package; use -v or -vv for more info.
Usage
dlt pipeline [pipeline_name] load-package [-h] [load-id]
Description
Shows information on a load package with a given `load_id`. The `load_id` parameter defaults to the most recent package. Package information includes its state (`COMPLETED`/`PROCESSED`) and a list of all jobs in the package with their statuses, file sizes, types, and, in the case of failed jobs, the error messages from the destination. With the verbose flag set (`dlt pipeline -v ...`), you can also see the list of all tables and columns created at the destination during the loading of that package.
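For example, to show verbose information on the most recent load package (pipeline name illustrative):

```sh
dlt pipeline -v chess_pipeline load-package
```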
Inherits arguments from `dlt pipeline`.
Positional arguments
load-id
- Load id of a completed or normalized package. Defaults to the most recent package.
Options
-h, --help
- Show this help message and exit
dlt init
Creates a pipeline project in the current folder by adding an existing verified source or creating a new one from a template.
Usage
dlt init [-h] [--list-sources] [--location LOCATION] [--branch BRANCH] [--eject]
[source] [destination]
Description
The `dlt init` command creates a new dlt pipeline script that loads data from `source` to `destination`. When you run the command, several things happen:
- Creates a basic project structure if the current folder is empty by adding `.dlt/config.toml`, `.dlt/secrets.toml`, and `.gitignore` files.
- Checks if the `source` argument matches one of our verified sources and, if so, adds it to your project.
- If the `source` is unknown, uses a generic template to get you started.
- Rewrites the pipeline scripts to use your `destination`.
- Creates sample config and credentials in `secrets.toml` and `config.toml` for the specified source and destination.
- Creates `requirements.txt` with dependencies required by the source and destination. If one exists, prints instructions on what to add to it.
This command can be used several times in the same folder to add more sources, destinations, and pipelines. It will also update the verified source code to the newest version if run again with an existing `source` name. You will be warned if files will be overwritten or if the `dlt` version needs an upgrade to run a particular pipeline.
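For example, to create a pipeline that loads the chess verified source into duckdb:

```sh
dlt init chess duckdb
```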
Inherits arguments from `dlt`.
Positional arguments
source
- Name of the data source for which to create a pipeline. Adds an existing verified source or creates a new pipeline template if a verified source for your data source is not yet implemented.
destination
- Name of a destination, e.g., bigquery or redshift
Options
-h, --help
- Show this help message and exit
--list-sources, -l
- Shows all available verified sources and their short descriptions. For each source, it checks if your local `dlt` version requires an update and prints the relevant warning.
--location LOCATION
- Advanced. Uses a specific url or local path to the verified sources repository.
--branch BRANCH
- Advanced. Uses a specific branch of the verified sources repository to fetch the template.
--eject
- Ejects the source code of core sources like sql_database or rest_api so that you can edit them.
dlt render-docs
Renders a markdown version of the CLI docs.
Usage
dlt render-docs [-h] [--compare] file_name
Description
The `dlt render-docs` command renders a markdown version of the CLI docs by parsing the argparse help output and generating a markdown file.
If you are reading this on the docs website, you are looking at the rendered version of the cli docs generated by this command.
Inherits arguments from `dlt`.
Positional arguments
file_name
- Output file name
Options
-h, --help
- Show this help message and exit
--compare
- Compare the changes and raise if the output would be updated
dlt deploy
Creates a deployment package for a selected pipeline script.
Usage
dlt deploy [-h] pipeline-script-path {github-action,airflow-composer} ...
Description
The `dlt deploy` command prepares your pipeline for deployment and gives you step-by-step instructions on how to accomplish it. To enable this functionality, please first execute `pip install "dlt[cli]"`, which will add additional packages to the current environment.
Inherits arguments from `dlt`.
Positional arguments
pipeline-script-path
- Path to a pipeline script
Options
-h, --help
- Show this help message and exit
Available subcommands
github-action
- Deploys the pipeline to GitHub Actions
airflow-composer
- Deploys the pipeline to Airflow
dlt deploy github-action
Deploys the pipeline to GitHub Actions.
Usage
dlt deploy pipeline-script-path github-action [-h] [--location LOCATION]
[--branch BRANCH] --schedule SCHEDULE [--run-manually] [--run-on-push]
Description
Deploys the pipeline to GitHub Actions.
GitHub Actions (https://github.com/features/actions) is a CI/CD runner with a large free tier which you can use to run your pipelines.
You must specify when the GitHub Action should run using a cron schedule expression. The command also takes additional flags: `--run-on-push` (default is False) and `--run-manually` (default is True). Remember to put the cron schedule expression in quotation marks.
For the chess.com API example from our docs, you can deploy it with `dlt deploy chess.py github-action --schedule "*/30 * * * *"`.
Follow the guide on how to deploy a pipeline with GitHub Actions in our documentation for more information.
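For example, to schedule runs every 30 minutes and additionally allow runs on every push (the script name follows the chess.com example above):

```sh
dlt deploy chess.py github-action --schedule "*/30 * * * *" --run-on-push
```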
Inherits arguments from `dlt deploy`.
Options
-h, --help
- Show this help message and exit
--location LOCATION
- Advanced. Uses a specific url or local path to the pipelines repository.
--branch BRANCH
- Advanced. Uses a specific branch of the deploy repository to fetch the template.
--schedule SCHEDULE
- A schedule with which to run the pipeline, in cron format. Example: '*/30 * * * *' will run the pipeline every 30 minutes. Remember to enclose the schedule expression in quotation marks!
--run-manually
- Allows the pipeline to be run manually from the GitHub Actions UI.
--run-on-push
- Runs the pipeline with every push to the repository.
dlt deploy airflow-composer
Deploys the pipeline to Airflow.
Usage
dlt deploy pipeline-script-path airflow-composer [-h] [--location LOCATION]
[--branch BRANCH] [--secrets-format {env,toml}]
Description
Google Composer (https://cloud.google.com/composer?hl=en) is a managed Airflow environment provided by Google. Follow the guide in our docs on how to deploy a pipeline with Airflow to learn more. This command will:
- Create an Airflow DAG for your pipeline script that you can customize. The DAG uses the `dlt` Airflow wrapper (https://github.com/dlt-hub/dlt/blob/devel/dlt/helpers/airflow_helper.py#L37) to make this process trivial.
- Provide you with the environment variables and secrets that you must add to Airflow.
- Provide you with a cloudbuild file to sync your GitHub repository with the `dags` folder of your Airflow Composer instance.
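For example, to generate the deployment files with secrets rendered as environment variables (the script name follows the earlier example; `--secrets-format` accepts `env` or `toml`):

```sh
dlt deploy chess.py airflow-composer --secrets-format env
```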
Inherits arguments from `dlt deploy`.
Options
-h, --help
- Show this help message and exit
--location LOCATION
- Advanced. Uses a specific url or local path to the pipelines repository.
--branch BRANCH
- Advanced. Uses a specific branch of the deploy repository to fetch the template.
--secrets-format {env,toml}
- Format of the secrets