pipeline@prod:~$cat this_week.md
Issue #47 — Live Now

The signal for people who keep the data flowing_

One curated weekly issue: orchestration war stories, dbt deep dives, and the DataOps tooling takes your Slack channel won't shut up about — with the vendor noise filtered out.

12,847engineers subscribed
pipeline.emailVol. 47
Feb 27, 2026
⚡ FEATURED

When Airflow Lies: Debugging Zombie DAGs in Production

INCIDENT TEARDOWNOrchestration

The 2 AM page that wasn't Airflow's fault (it was)

A scheduler heartbeat timeout masked a silent executor memory leak for 6 hours.

8 min read
TOOL REVIEWTesting

dbt-unit-testing vs. dbt-expectations: A field guide

After running both in prod for 90 days, here's what the benchmarks don't tell you.

6 min read
PLATFORM PATTERNSObservability

Monte Carlo vs. Great Expectations: Total cost of ownership

The license fee is the smallest line item. Engineering time is where it hurts.

5 min read
748 readers active
$ ls -la ./archive

Browse the archive

47 issues. Every tool, failure, and pattern that matters to people who build production data systems.

Topics
Orchestration(38 issues)
Showing latest 3
#44Feb 069 min read

Airflow 2.7 KubernetesExecutor: What changed and why it broke your DAGs

Pod template changes in 2.7 silently invalidate resource configs you've had in prod for two years.

#41Jan 1611 min read

Prefect vs. Dagster in 2026: An honest comparison from someone who ran both

Not a feature matrix. A real account of migration cost, observability gaps, and on-call experience.

#38Dec 267 min read

The hidden cost of dynamic task mapping in Airflow

XCom bloat, scheduler lag, and the memory patterns nobody puts in the docs.

$ whoami --contributors

The people who keep this going

Pipeline isn't one person. It's a community of engineers who annotate issues, flag vendor BS, and share what actually worked in their stack.

1,847
Active contributors
340+
Community answers / week
47
Issues discussed
< 2h
Avg. response time
#ContributorStreak
01
MO
Marcus Okonkwo
Staff Data Engineer @ Stripe
12w streak
02
PK
Priya Krishnamurthy
Analytics Eng Lead @ Shopify
8w streak
03
DF
Daniel Ferreira
DataOps Lead @ Nubank
6w streak
04
YT
Yuki Tanaka
Senior DE @ Mercari
5w streak
05
SC
Sarah Chen
Platform DE @ Airbnb
4w streak
06
AD
Amara Diallo
Analytics Engineer @ Spotify
3w streak
1,841 more contributors in the communityJoin the community →
$ cat stack_of_the_week.yaml

Stack of the Week

Each week we diagram a real production stack, annotate the failure modes, and explain the trade-offs nobody puts in the vendor docs.

Issue #47 — Stack Teardown: Mid-Market SaaS Analytics Platformannotated
SourcePostgres · Kafka · S3IngestFivetran · AirbyteOrchestrateAirflow 2.7Transformdbt Core · SparkStoreSnowflake · IcebergObserveMonte Carlo · OLServeLooker · dbt Semantic
Failure Mode

Airflow XCom serialization at >50k rows causes scheduler memory pressure

💡The Fix

Move large payloads to S3, pass only the key through XCom. Cuts memory 80%.

📋Full Teardown

Issue #47 covers the full migration path with Helm config and DAG rewrites.

pipeline@prod:~$echo "your_email" | subscribe_

Join 12,000+ data engineers

Every Thursday. No vendor fluff. Just the signal your on-call rotation actually needs.

RN
AK
MB
JL
+4 engineers subscribed this hour

No spam. Unsubscribe with one click. Read by engineers at Stripe, Shopify, Airbnb, and 200+ more.

Engineers at these companies read Pipeline

StripeShopifyAirbnbSpotifyMercariNubank