Data quality 🧪
dltHub
This page is for dltHub Feature, which requires a license. Join our early access program for a trial license.
warning
🚧 This feature is under development. Interested in becoming an early tester? Join dltHub early access.
dltHub will allow you to define data validation rules at the YAML level or using Pydantic models. This ensures your data meets expected quality standards at the ingestion step.
Example: Defining a quality contract in YAML​
You can specify quality contracts to enforce constraints on your data, such as expected value ranges and nullability.
engine_version: 10
name: scd_type_3
tables:
customers:
columns:
category:
data_type: bigint
nullable: false
quality_contracts:
expect_column_max_to_be_between:
min_value: 1
max_value: 100
Key features​
With dltHub, you will be able to:
- Define data tests and quality contracts using YAML configuration or Pydantic models.
- Apply both row-level and batch-level validation.
- Enforce constraints on distributions, boundaries, and expected values.
Stay tuned for updates as we expand these capabilities! 🚀