dltHub Blog

Emmanuel's production-ready Kafka framework: extending dlt the right way

  • Aman Gupta,
    Data Engineer

Kafka in production isn’t “hello world.” It’s oh no world.

Think: messy topics, shifting schemas, and the occasional midnight panic when offsets decide to cosplay as Schrödinger’s cat.

Enter Emmanuel Ogunwede. Instead of rage-quitting Kafka (tempting) or spinning up a Spark cluster the size of a minor moon, he built a slim framework on top of dlt that levels up the vanilla Kafka source into something you’d actually trust to run in production.

👉 Check out the repo on GitHub.

What vanilla dlt gives you (and where it stops short)

dlt’s built-in Kafka source is great if you just need a simple pipeline up and running:

  • You point it at specific topics
  • It happily ingests UTF-8 text (JSON out of the box)
  • But there's no Schema Registry integration
  • And topics added after the first run need manual handling

Perfect for getting started, and honestly way easier than most first tries at Kafka.

But in production, Kafka throws curveballs, and that's where Emmanuel's framework comes in.

Emmanuel’s upgrades 🚀

Instead of reinventing the wheel, Emmanuel identified the specific gaps and filled them systematically:

  • Dynamic topic discovery via regex patterns (.*_events finds all event topics automatically)
  • Avro + Schema Registry support with proper deserialization and schema evolution
  • Clean CLI interface that feels like a real tool
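The regex discovery bullet is easy to picture. A minimal, stdlib-only sketch of the idea (the real source would pull the topic list from broker metadata via the Kafka admin client; here it's a hardcoded list):

```python
import re

def discover_topics(all_topics: list[str], pattern: str) -> list[str]:
    """Return broker topics whose names fully match the regex pattern."""
    rx = re.compile(pattern)
    return sorted(t for t in all_topics if rx.fullmatch(t))

# Hypothetical broker metadata; in practice this comes from the Kafka admin API.
topics = ["orders_events", "clicks_events", "users",
          "payments_events", "__consumer_offsets"]

print(discover_topics(topics, r".*_events"))
# ['clicks_events', 'orders_events', 'payments_events']
```

Because matching runs on every pipeline invocation, topics created after the first run are picked up automatically on the next one, which is exactly the gap in the vanilla source.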

The genius? He built on top of dlt, not around it.
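Under the hood, "Avro + Schema Registry support" mostly comes down to two things: parsing Confluent's wire format (a 0x00 magic byte plus a 4-byte big-endian schema ID in front of the Avro body) and caching schema lookups so the registry isn't hit once per message. A stdlib-only sketch of that pattern; the actual Avro decoding and the registry HTTP call are stubbed out here with a fake fetch, so this is illustrative, not Emmanuel's exact code:

```python
import struct

MAGIC_BYTE = 0

def parse_confluent_frame(message: bytes) -> tuple[int, bytes]:
    """Split a Confluent wire-format message into (schema_id, avro_payload).

    Confluent's framing: 1 magic byte (0x00) + 4-byte big-endian schema ID,
    followed by the Avro-encoded body.
    """
    magic, schema_id = struct.unpack(">bI", message[:5])
    if magic != MAGIC_BYTE:
        raise ValueError("not a Confluent-framed message")
    return schema_id, message[5:]

class SchemaCache:
    """Memoize schema lookups so the registry is hit once per schema ID."""
    def __init__(self, fetch):
        self._fetch = fetch  # e.g. a call to GET /schemas/ids/{id}
        self._cache: dict[int, str] = {}

    def get(self, schema_id: int) -> str:
        if schema_id not in self._cache:
            self._cache[schema_id] = self._fetch(schema_id)
        return self._cache[schema_id]

# Demo with a stubbed registry fetch (records how often it's called).
calls = []
def fake_fetch(sid):
    calls.append(sid)
    return f'{{"type": "record", "id": {sid}}}'

cache = SchemaCache(fake_fetch)
frame = bytes([MAGIC_BYTE]) + (42).to_bytes(4, "big") + b"payload"
sid, body = parse_confluent_frame(frame)
cache.get(sid)  # first call hits the "registry"
cache.get(sid)  # second call is served from cache
print(sid, body, len(calls))
# 42 b'payload' 1
```

Schema evolution then falls out naturally: a new schema ID in the frame triggers one fetch, and dlt's own schema inference absorbs the resulting column changes downstream.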

See it in action 🎥

Emmanuel even made a video walking through it (watch at 2× speed if you’re impatient).

The takeaway: the framework handles topic discovery, schema fetching, offset management, and loading for you.

Why this matters

Most Kafka setups live on two extremes:

  • Too complex: Spark/Flink sized for Mars missions
  • Too hacky: cron + script duct-taped together

Emmanuel found the middle ground: micro‑batch ingestion that’s production‑ready and maintainable.

It builds on what dlt already does well (schema evolution, normalization, datatype inference) and extends it to handle the messy realities of Kafka:

  • Topics appearing and disappearing
  • Offset management across restarts
  • Multiple serialization formats
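Offset management across restarts is the trickiest of the three. The pattern: after each successful load, checkpoint the next offset to read per (topic, partition), and resume from that checkpoint on startup. A toy stdlib sketch of the resume logic; the framework presumably persists this through dlt's pipeline state rather than a JSON file on disk:

```python
import json
from pathlib import Path

class OffsetStore:
    """Persist per-(topic, partition) offsets so restarts resume, not re-read."""
    def __init__(self, path: Path):
        self.path = path
        self.offsets: dict[str, int] = (
            json.loads(path.read_text()) if path.exists() else {}
        )

    def next_offset(self, topic: str, partition: int) -> int:
        # Unknown partitions start from offset 0 on the very first run.
        return self.offsets.get(f"{topic}:{partition}", 0)

    def commit(self, topic: str, partition: int, last_offset: int) -> None:
        # Record the *next* offset to read, then flush to disk.
        self.offsets[f"{topic}:{partition}"] = last_offset + 1
        self.path.write_text(json.dumps(self.offsets))

# Simulated run: read through offset 99 on partition 0, "crash", restart.
import tempfile
state = Path(tempfile.mkdtemp()) / "offsets.json"
store = OffsetStore(state)
assert store.next_offset("orders_events", 0) == 0
store.commit("orders_events", 0, last_offset=99)

store2 = OffsetStore(state)  # a fresh process after restart
print(store2.next_offset("orders_events", 0))
# 100
```

Committing only after a load succeeds gives you at-least-once delivery; dlt's merge write disposition can then deduplicate any replayed messages at the destination.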

Build smarter, not bigger

Don't fight your tools; extend them thoughtfully.

Emmanuel didn’t rage‑quit dlt because it lacked Avro. He bridged the gaps and ended up with something that feels like a natural extension, not a replacement.

Try it now

📖 Read Emmanuel’s implementation and technical design doc.

⭐ Don’t forget to give the repo a star.

Emmanuel showed us that production-grade doesn't have to mean complex. Sometimes it just means being thoughtful about the details that matter.