# Orchestrating the End-to-End Workflow: From Data to Action


## Finding Your Rhythm: Real-time vs. Scheduled

Every business has its own pulse. We need to match our technical architecture to this natural cadence:

**Option 1: Always-On** ⚡  
Configure SDP pipelines in continuous mode for streaming insights that capture customer signals as they happen.

**Option 2: Efficient Cycles** ⏱️  
Wake your compute on a scheduled basis, process new data incrementally, and then release resources.
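
In practice, the two options come down to a single pipeline setting. The snippet below is a minimal sketch, assuming the pipeline is configured through a JSON-style settings dictionary in which a boolean `continuous` field selects the mode. The helper `pipeline_settings` and the pipeline name `c360-churn-ingest` are hypothetical and used only for illustration.

```python
def pipeline_settings(name: str, always_on: bool) -> dict:
    """Build a minimal pipeline settings dict (illustrative, not a complete spec)."""
    return {
        "name": name,
        # continuous=True keeps the pipeline running and processes data as it arrives.
        # continuous=False runs one incremental update when triggered, then stops,
        # which lets the compute be released between runs.
        "continuous": always_on,
    }


# Option 1: Always-On
streaming = pipeline_settings("c360-churn-ingest", always_on=True)

# Option 2: Efficient Cycles (triggered by a schedule)
scheduled = pipeline_settings("c360-churn-ingest", always_on=False)
```

Keeping the mode behind one flag makes it straightforward to switch between the two options without touching the pipeline logic itself.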

## Our Blueprint: The Hourly Retention Cycle

For our retail business, an hourly cadence hits the sweet spot:

1. **Data Refresh** 📊 - SDP pipeline activates to ingest the latest customer data
2. **Dashboard Update** 📈 - DBSQL dashboards automatically refresh
3. **Model Retraining** 🧠 - ML models adapt to evolving customer behaviors
4. **Triggered Interventions** 🎯 - GenAI deploys personalized retention campaigns

This creates a closed-loop system where insights drive actions, actions generate new data, and new data refines future insights, all automatically.

<div style="background-color: #f8f9fa; padding: 15px; border-radius: 5px; margin-top: 20px;">
<strong>💡 Pro Tip:</strong> During peak seasons or special promotions, consider shifting to continuous mode for real-time responsiveness, then return to scheduled intervals during normal operations.
</div>


## Orchestrating our Churn pipeline with Databricks Workflows

<img style="float: right; margin-left: 10px" width="600px" src="https://github.com/databricks-demos/dbdemos-resources/blob/main/images/cross_demo_assets/dbdemos-workflow-churn-1.png?raw=true" />

With the Databricks Lakehouse, there is no need for an external orchestrator. We can use [Workflows](/#job/list) (available in the left menu) to orchestrate our Churn pipeline in just a few clicks.



### Orchestrate anything anywhere
With Workflows, you can run diverse workloads for the full data and AI lifecycle on any cloud. Orchestrate Spark Declarative Pipelines and Jobs for SQL, Spark, notebooks, dbt, ML models and more.

### Simple - Fully managed
Remove operational overhead with a fully managed orchestration service, so you can focus on your workflows rather than on managing infrastructure.

### Proven reliability
Have full confidence in your workflows leveraging our proven experience running tens of millions of production workloads daily across AWS, Azure and GCP.


## Creating your workflow

<img style="float: right; margin-left: 10px" width="600px" src="https://github.com/databricks-demos/dbdemos-resources/blob/main/images/cross_demo_assets/dbdemos-workflow-churn-3.png?raw=true" />

A Databricks Workflow is composed of Tasks.

Each task can run a specific type of workload:

* Spark Declarative Pipelines
* SQL query / dashboard
* Model retraining / inference
* Notebooks
* dbt
* ...

In this example, we can see our 3 tasks:

* Start the SDP pipeline to ingest new data and refresh our tables
* Refresh the DBSQL dashboard (and potentially notify downstream applications)
* Retrain our Churn model

A workflow was created as part of this demo. Open the <a dbdemos-workflow-id="init-job" href="#job/395974101526946/tasks" target="_blank">C360 Churn Workflow</a> to start exploring Databricks orchestration capabilities!
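
The same three-task workflow can also be described as code rather than built through the UI. The sketch below assembles a job definition in the shape used by the Databricks Jobs API (`tasks`, `depends_on`, `pipeline_task`, `sql_task`, `notebook_task`, and a Quartz cron `schedule`). Several things here are assumptions rather than details of this demo: the task keys, the job name, the placeholder IDs and notebook path, the UTC time zone, and the choice to run the dashboard refresh and the retraining in parallel once the data refresh succeeds. The dashboard task uses the `sql_task` form; depending on your dashboard type, a different task type may apply.

```python
import json


def build_churn_job(pipeline_id: str, dashboard_id: str,
                    warehouse_id: str, notebook_path: str) -> dict:
    """Return a job definition dict for an hourly churn workflow (illustrative)."""
    return {
        "name": "C360 Churn Workflow (sketch)",
        # Quartz cron: second 0, minute 0 of every hour.
        "schedule": {
            "quartz_cron_expression": "0 0 * * * ?",
            "timezone_id": "UTC",
            "pause_status": "UNPAUSED",
        },
        "tasks": [
            {
                # Task 1: start the SDP pipeline to ingest new data.
                "task_key": "refresh_data",
                "pipeline_task": {"pipeline_id": pipeline_id},
            },
            {
                # Task 2: refresh the DBSQL dashboard once the data is fresh.
                "task_key": "refresh_dashboard",
                "depends_on": [{"task_key": "refresh_data"}],
                "sql_task": {
                    "warehouse_id": warehouse_id,
                    "dashboard": {"dashboard_id": dashboard_id},
                },
            },
            {
                # Task 3: retrain the churn model on the refreshed tables.
                "task_key": "retrain_model",
                "depends_on": [{"task_key": "refresh_data"}],
                "notebook_task": {"notebook_path": notebook_path},
            },
        ],
    }


# Placeholder values: replace with the IDs and path from your own workspace.
job = build_churn_job(
    pipeline_id="<pipeline-id>",
    dashboard_id="<dashboard-id>",
    warehouse_id="<warehouse-id>",
    notebook_path="<path-to-retraining-notebook>",
)
print(json.dumps(job, indent=2))
```

A definition like this can be kept in version control and submitted through the Jobs API or an infrastructure-as-code tool, which makes the workflow reproducible across workspaces.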


## Monitoring your runs

<img style="float: right; margin-left: 10px" width="600px" src="https://github.com/databricks-demos/dbdemos-resources/blob/main/images/cross_demo_assets/dbdemos-workflow-tasks.png?raw=true" />

Once your workflow is created, you can access historical runs and receive alerts if something goes wrong!

In the screenshot, we can see that our workflow had several failed runs with varying run times before the issue was ultimately fixed.

Workflow monitoring covers errors, abnormal job duration, and more advanced controls!
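
Alerts for failures and abnormal durations can be expressed as part of the job definition. The following is a minimal sketch, assuming a Jobs API style definition with `email_notifications` and a `health` rule on the `RUN_DURATION_SECONDS` metric. The helper `add_monitoring`, the email address, and the one-hour threshold are hypothetical values chosen for illustration.

```python
def add_monitoring(job: dict, alert_email: str, max_duration_seconds: int) -> dict:
    """Return a copy of a job definition with failure and duration alerts added."""
    monitored = dict(job)  # shallow copy so the original definition is left untouched
    monitored["email_notifications"] = {
        # Notify when a run fails.
        "on_failure": [alert_email],
        # Notify when a run exceeds the duration threshold defined in the health rule.
        "on_duration_warning_threshold_exceeded": [alert_email],
    }
    monitored["health"] = {
        "rules": [
            {
                "metric": "RUN_DURATION_SECONDS",
                "op": "GREATER_THAN",
                "value": max_duration_seconds,
            }
        ]
    }
    return monitored


# Hypothetical minimal job definition, used only to demonstrate the helper.
base_job = {"name": "C360 Churn Workflow (sketch)", "tasks": []}
monitored_job = add_monitoring(
    base_job,
    alert_email="data-team@example.com",
    max_duration_seconds=3600,
)
```

With an hourly schedule, a threshold close to one hour is a reasonable starting point, since a run that lasts longer would overlap with the next cycle.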

## Conclusion

Not only does the Databricks Lakehouse let you ingest data, analyze it, and predict churn, it also provides a best-in-class orchestrator that delivers fresh insights to your business while making sure everything works as expected!

[Go back to introduction]($../00-churn-introduction-lakehouse)