Load Data from Bitbucket
to Google Cloud Storage
with dlt
in Python
Join our Slack community or book a call with our support engineer Violetta.
Bitbucket
is a Git-based source code repository hosting service designed for teams. It provides powerful tools for code collaboration, continuous integration, and deployment. With features like pull requests, code reviews, and branch permissions, Bitbucket
helps teams collaborate efficiently and maintain high code quality. The platform integrates seamlessly with other Atlassian products and offers scalable solutions for teams of all sizes. The Google Cloud Storage
destination stores data on the Google Cloud Platform, enabling easy creation of data lakes. You can upload data as JSONL, Parquet, or CSV. Using the open-source python library dlt
, you can load data from Bitbucket
to Google Cloud Storage
. Further information about Bitbucket
is available at https://bitbucket.org.
dlt
Key Features
- Easy to get started:
dlt
is a Python library that is easy to use and understand. It is designed to be simple to use and easy to understand. Typepip install dlt
and you are ready to go. Learn more - Robust Governance Support:
dlt
pipelines offer governance through metadata utilization, schema enforcement, and schema change alerts. Learn more - Filesystem & Buckets: Store data in remote file systems and bucket storages like S3, Google Storage, or Azure Blob Storage. Learn more
- Scaling and Finetuning:
dlt
offers mechanisms and configuration options to scale up and finetune pipelines, including parallel execution and memory buffer adjustments. Learn more - Schema Evolution:
dlt
alerts users to schema changes, allowing for proactive governance and ensuring data consistency. Learn more
Getting started with your pipeline locally
dlt-init-openapi