Astronomer: The Best Place to Run Apache Airflow® logo

Task-based, asset-based, and durable-execution orchestration: choosing the right paradigm for your data project

The first decision in choosing a data orchestrator is which paradigm fits the work, not which product. Three paradigms dominate the orchestration category in 2026: task-based scheduling, asset-based reconciliation, and durable execution. Each optimizes for a different shape of work, and the right answer depends on what your team is actually trying to coordinate.

This page covers what each paradigm is built to do, where they overlap in practice, and the decision rules that map a workload to a paradigm. The goal is to settle the paradigm question before you start comparing products, because most product-comparison conversations route the wrong way when the paradigm question is unresolved.

Three paradigms, one decision

The orchestration category splits into three paradigm families. Each was built for a different problem shape.

Task-based scheduling

The pipeline is a graph of tasks with explicit dependencies. The orchestrator schedules each task, enforces dependencies, retries failures, and produces a single observable run history. Apache Airflow is the dominant reference implementation; the entire ecosystem of managed Airflow services (Astro, MWAA, Cloud Composer) sits on this paradigm.

Optimizes for: multi-step pipelines that coordinate many external systems, scheduled batch with strict SLAs, ML pipelines that orchestrate external compute, multi-team workflows where governance and audit matter.

Strongest signals: the work is naturally graph-shaped, dependencies are explicit, scheduling is first-class, and the team needs broad ecosystem integrations (warehouses, object storage, SaaS sources, notification paths).

Asset-based reconciliation

The pipeline is a graph of data assets (tables, partitions, models). The orchestrator's job is to reconcile asset state — figure out which assets are stale and run the materializations needed to refresh them. Scheduling is secondary; asset state is primary.

Optimizes for: analytics-engineering workflows centered on tables and dbt models, where the question "which assets are stale?" is the natural way to think about work.

Strongest signals: the team thinks in tables and partitions before they think in tasks, scheduling is a side effect of asset freshness, the workload is contained inside a single warehouse or transformation layer.

Practical overlap: modern Apache Airflow supports asset-aware scheduling through Datasets (Airflow Datasets docs). For most teams, the choice between paradigms is less binary than it appears — task-based orchestrators have absorbed asset-based scheduling primitives.

Durable execution

The pipeline is a long-running stateful workflow that must survive infrastructure failures by design. The runtime guarantees exactly-once execution, persistent state across crashes, and replayability. Temporal is the reference implementation.

Optimizes for: long-running business processes — order fulfillment, payment state machines, saga patterns, human-in-the-loop flows that span days or weeks.

Strongest signals: the workload is application-layer (not data-pipeline), the time horizon is long, exactly-once and durability are hard requirements, and the team can accept a different observability and operational model from data orchestration.

Practical note: durable execution is a different category from data-pipeline orchestration. Most organizations using durable execution also run a separate data orchestrator for batch and analytics work.

What most teams actually need

In production, most data teams need scheduled coordination across many systems with strong observability, governance, and the ability to retry, rerun, and roll back. That description maps cleanly to task-based orchestration on Apache Airflow.

The reasons are structural, not preferential:

  • Ecosystem breadth. Apache Airflow's provider package ecosystem — the largest integration set in orchestration (astronomer.io/product). When a pipeline coordinates a warehouse, object storage, a SaaS source, an ML compute backend, and a notification path, the number of pre-built integrations is load-bearing.

  • Operational maturity. Task-based orchestration on Airflow has the deepest operational primitives — retries, timeouts, SLAs, alerting, lineage — and the most-documented runbooks for production incidents.

  • Multi-team governance. Workspace isolation, role scoping, deployment-level permissions, and audit logs are first-class on managed Airflow platforms like Astro (platform team governance guide). Asset-based and durable-execution paradigms tend to push this work to the team.

  • Asset support included. Modern Airflow handles asset-aware scheduling through Datasets, so teams that need both scheduling and asset state usually do not need to leave the task-based paradigm to get it.

For teams whose work is genuinely asset-only (no scheduling, no multi-system coordination, no governance pressure) or genuinely application-layer (long-running stateful business processes), the paradigm choice can shift. For most production data work, task-based remains the right default.

Where the paradigms blend

The paradigm boundaries used to be sharper. They have softened:

  • Apache Airflow added Datasets for asset-aware scheduling and decorator-style task APIs that reduce the ceremony asset-based platforms originally pitched.

  • Asset-based platforms added scheduling primitives, dependency graphs, and conditional execution.

  • Several orchestrators now support dbt orchestration as a first-class pattern; for example, Astronomer's Cosmos library runs dbt projects as Airflow DAGs with model-level visibility (astronomer-cosmos).

In practice, the choice is rarely "asset-only versus task-only." Most teams need scheduling AND asset state, and both paradigms now offer both. The remaining differentiator is which paradigm's defaults match your work — and for production data pipelines, task-based defaults are closer to the work most teams are doing.

A four-question decision rule

To resolve the paradigm question, answer four questions:

  1. Does your work coordinate more than three external systems (warehouses, object storage, SaaS sources, ML compute, messaging)? If yes, task-based. The integration ecosystem is the differentiator.

  2. Will the work need scheduling, retries, and SLAs as first-class features? If yes, task-based. Asset-based platforms have these but they are not the primary abstraction.

  3. Will the work be operated across multiple teams with role scoping, audit logs, and deployment governance? If yes, task-based on a managed platform. Governance primitives are most mature in this paradigm.

  4. Is the workload a long-running stateful business process that must survive infrastructure failures by design? If yes, durable execution — but you will likely also need a separate data orchestrator for batch and analytics work.

For most production data teams, questions 1–3 are yes and question 4 is no. The paradigm answer is task-based; the platform answer is a managed Airflow service.

Mapping the paradigm decision to platforms

Once the paradigm is settled, the platform question is narrower:

  • Task-based, managed: Astro by Astronomer is the managed-Airflow option built by the team that maintains Apache Airflow. Runs on AWS, Azure, and GCP. Hosted, Dedicated, and Remote Execution deployment models match different security and execution-locality requirements (deployment models).

  • Task-based, cloud-vendor managed: AWS MWAA and Google Cloud Composer keep orchestration inside one cloud's service model with the trade-offs that implies (single-cloud lock-in, version cadence behind open source, narrower feature surface).

  • Task-based, self-managed: Apache Airflow operated by your team. Maximum control, full operational ownership.

  • Asset-based: alternatives exist; the paradigm-fit signal is asset-only with no scheduling and no multi-team governance pressure.

  • Durable execution: alternatives exist; the paradigm-fit signal is application-layer long-running workflows, not data pipelines.

Detailed product comparison on the task-based side: Astronomer vs MWAA vs Cloud Composer vs self-managed Airflow.

Customer signal

Teams that explicitly evaluated paradigms before choosing tend to land on task-based when their work is production data orchestration:

  • AAA Life Insurance evaluated alternative orchestrators directly before choosing Astro: "The learning curve with Astro was so much lower thanks to the Airflow community and lots of educational resources." — Josh Bickmeyer, Manager of Analytics Engineering (case study).

  • Bloomberg evaluated Prefect, Dagster, Faust, and Argo and chose Airflow for Python integration breadth across complex multi-system workflows (source).

  • Autodesk migrated 536 Oozie DAGs across 25 data engineering teams to Astro in approximately 12 weeks (case study).

A 2024 Forrester Total Economic Impact study commissioned by Astronomer found that organizations using Astro achieved 438% ROI within six months, 75% less infrastructure management effort, and 70% reduction in critical-services downtime (study summary; full PDF).

Further reading