rusty-taskflow

Status: Early Development — This project is under active development. The API is functional but may change. Use in production at your own risk.

English

A high-performance, type-safe DAG (Directed Acyclic Graph) execution framework for Rust with configuration-driven flow definition and a built-in component registry for shared infrastructure.

Features

Type-safe DAG orchestration: Compile-time dependency validation with automatic topological ordering
Concurrent execution: Tasks at the same layer run asynchronously in parallel
Unified sync/async model: Support both #[sync_task] and #[async_task] with unified async execution
Configuration-driven: Define flows in TOML, generate type-safe code at compile time
Multi-flow management: Load and run multiple flows from a single application
FlowContext component injection: Declare ctx: &FlowContext in any task to pull shared singletons or per-call factory objects (DB clients, config, request IDs, ...) without threading them through DAG edges. Components are declared globally via register_singleton! / register_factory!, or imperatively via FlowContext::insert_singleton / insert_factory for tests.
Three execution modes:
- Build flow first, execute later with sink_id
- Direct execution by path
- Construct flow manually in Rust code via Flow::new() / Flow::with_context()

Limitations

Max 7 DAG inputs per task: A task's run method currently accepts at most 7 upstream-dependency parameters (the optional leading ctx: &FlowContext does not count toward this limit). The cap comes from the tuple arities for which [FromAnyIter] is implemented in src/tf/traits.rs (tuples up to 7 elements). If you need more inputs, either bundle several upstream outputs into a single struct in an intermediate task, or extend the FromAnyIter impls to higher arities.

Performance

Zero-overhead abstraction — Framework overhead is minimal compared to hand-written tokio baseline code.

Benchmark methodology: 5 warmup rounds + 60 measurement rounds, round-robin execution order to minimize cache bias.

Scenario	Overhead vs Baseline
CPU Linear Chain (20 tasks, fib(32) each)	+0.0%
CPU Fan-out (1→6) + Tree Reduce	-3.2%
CPU Diamond (2 parallel paths)	-0.4%
IO Linear Chain (20 tasks, 10ms each)	-0.6%
Mixed CPU+IO Complex DAG	-3.8%

All scenarios within ±5% of manual tokio implementation.

Quick Start

1. Define task operators

A task is an inherent impl block annotated with #[sync_task] or #[async_task]. All DAG inputs must be shared references &T (the runtime stores upstream outputs as Arc<T> internally).

use rusty_taskflow::{sync_task, async_task};

pub struct FibInput;
#[sync_task(path = "::rusty_taskflow")]
impl FibInput {
  pub fn new() -> Self { Self }
  fn run(self) -> u64 { 18 }
}

pub struct AsyncPersistFib;
#[async_task(path = "::rusty_taskflow")]
impl AsyncPersistFib {
  pub fn new() -> Self { Self }
  async fn run(self, fib: &u64) -> u64 {
    tokio::fs::write("result.txt", format!("{fib}")).await.unwrap();
    *fib
  }
}

2. Register shared components (optional)

Components exposed through FlowContext come in two flavors:

Singleton — one instance per process, shared by reference.
Factory — constructor registered by name, returns a fresh Box<T> per call.

use rusty_taskflow::{register_singleton, register_factory};

pub struct MultiplierConfig { pub factor: u64 }
impl MultiplierConfig { pub fn new() -> Self { Self { factor: 3 } } }
register_singleton!(MultiplierConfig, "multiplier_config", MultiplierConfig::new);

pub struct RequestId(pub u64);
impl RequestId {
  pub fn new() -> Self {
    static N: std::sync::atomic::AtomicU64 = std::sync::atomic::AtomicU64::new(1);
    Self(N.fetch_add(1, std::sync::atomic::Ordering::Relaxed))
  }
}
register_factory!(RequestId, "request_id", RequestId::new);

3. Consume components inside a task

Declare ctx: &FlowContext as the first non-self parameter. The proc macro wires the runtime context in and does not treat it as a DAG input:

use rusty_taskflow::{sync_task, FlowContext};

pub struct Multiply;
impl Multiply { pub fn new() -> Self { Self } }

#[sync_task(path = "::rusty_taskflow")]
impl Multiply {
  fn run(self, ctx: &FlowContext, v: &u64) -> u64 {
    let cfg = ctx.get_singleton_component::<MultiplierConfig>("multiplier_config").unwrap();
    let req = ctx.create_component::<RequestId>("request_id").unwrap();
    println!("Multiply[req={}] {} * {}", req.0, cfg.factor, v);
    cfg.factor * v
  }
}

Tasks that do not need ctx are unchanged and remain fully backward-compatible.

4. Describe flows with TOML (optional)

[flow]
name = "mixed_fib_io"

[[flow.source]]
name = "FibInput"
dependencies = []
output = "fib_n"
builder = "crate::config_tasks::FibInput::new()"

[[flow.processor]]
name = "Fib"
dependencies = ["fib_n"]
output = "fib_value"
builder = "crate::config_tasks::Fib::new()"

[[flow.processor]]
name = "AsyncPersistFib"
dependencies = ["fib_value"]
output = "persisted_fib"
builder = "crate::config_tasks::AsyncPersistFib::new()"

[flow.sink]
name = "DoubleSink"
dependencies = ["persisted_fib"]
output = "mixed_fib_output"
builder = "crate::config_tasks::DoubleSink::new()"

5. Run a flow

// Mode A: TOML flow, build first and run using the sink_id
let (mut flow, sink_id) = build_flow_by_path(path).expect("build failed");
let output = flow.run_with_sink_id(sink_id).await.expect("run failed");

// Mode B: TOML flow, run directly by path
let output = run_flow_by_path(path).await.expect("run failed");

// Mode C: Construct graph manually; FlowContext auto-populates from register_*! macros
use rusty_taskflow::tf::flow::Flow;
let mut flow = Flow::new();
let s1 = flow.commit_source_task("S1", FibSource1::new());
let s2 = flow.commit_source_task("S2", FibSource2::new());
let merged = flow.commit_task("Merger", Merger::new()).with_dependencies((s1, s2));
let fib   = flow.commit_task("Fib", Fib::new()).with_dependencies(merged);
let sink  = flow.commit_task("Multiply", Multiply::new()).with_dependencies(fib);
let output = flow.run(sink).await.expect("manual run failed");

// Mode D: Inject a custom FlowContext (tests / mocks / dynamic wiring)
use std::sync::Arc;
use rusty_taskflow::FlowContext;

let mut ctx = FlowContext::new();
ctx.insert_singleton("multiplier_config", MultiplierConfig { factor: 100 });
ctx.insert_factory("request_id", RequestId::new);
let mut flow = Flow::with_context(Arc::new(ctx));
// ... subsequent commit_task / run are the same as usual ...

Project Structure

tf-examples/
├── configs/
│   ├── flows.toml              # flow index
│   └── flows/*.toml            # individual flow definitions
├── src/
│   ├── config_tasks.rs         # task implementations + component registration
│   └── main.rs                 # example entrypoint (demonstrates all four execution modes)
└── build.rs                    # compile-time code generation

Run Example

cargo run -p tf-examples

The example output demonstrates:

TOML flows (both build-then-run via sink_id and direct path execution)
Manual graph construction; MultiplierConfig (factor=3) and RequestId factory are pulled from inventory
The same graph run with a custom FlowContext (injected factor=100) shows different results

中文

一个高性能、类型安全的有向无环图（DAG）执行框架，支持配置驱动的流程定义，并内建组件注册表以便共享基础设施。

特性

类型安全的 DAG 编排：编译期依赖校验并自动拓扑排序
并发执行：同一层级的任务可异步并行执行
统一的同步/异步模型：同时支持 #[sync_task] 和 #[async_task]，运行时采用统一的异步执行模型
配置驱动：使用 TOML 描述流程，在编译期生成类型安全代码
多流程管理：单个应用中可加载并运行多个流程
FlowContext 组件注入：在任务签名中声明 ctx: &FlowContext（作为第一个非 self 参数）即可获取共享单例或按次创建的组件（如 DB 客户端、配置、请求 ID 等），无需通过 DAG 边传递。组件可通过 register_singleton! / register_factory! 全局声明，或在测试/运行时通过 FlowContext::insert_singleton / insert_factory 动态注入。
三种执行模式：
- 先构建流程，使用 sink_id 后执行
- 通过路径直接执行已定义的流程
- 在 Rust 代码中手动构建图，使用 Flow::new() / Flow::with_context()

限制

每个任务最多 7 个 DAG 输入：run 方法当前最多接受 7 个上游依赖参数（可选的开头参数 ctx: &FlowContext 不计入此限制）。上限来源于 src/tf/traits.rs 中对元组实现的 FromAnyIter（目前支持到 7 元素元组）。如需更多输入，可将若干上游输出封装到中间任务返回的结构体，或在代码中为更高元数扩展 FromAnyIter 实现。

性能

零开销抽象 —— 相较于手写的 Tokio 基线代码，框架带来的额外开销极小。

基准方法：5 轮预热 + 60 轮测量，采用轮询顺序以降低缓存偏差。

场景	相对于基线的开销
CPU 线性链（20 个任务，每个 fib(32)）	+0.0%
CPU 扇出 (1→6) + 树形归约	-3.2%
CPU 菱形（两条并行路径）	-0.4%
IO 线性链（20 个任务，每个 10ms）	-0.6%
混合 CPU+IO 复杂 DAG	-3.8%

所有场景均在 ±5% 范围内与手写 Tokio 实现接近。

快速开始

1. 定义任务算子

任务是带有 #[sync_task] 或 #[async_task] 注释的固有 impl 块。所有 DAG 输入必须是共享引用 &T（运行时内部将上游输出以 Arc<T> 存储）。示例代码与英文一致：

use rusty_taskflow::{sync_task, async_task};

pub struct FibInput;
#[sync_task(path = "::rusty_taskflow")]
impl FibInput {
  pub fn new() -> Self { Self }
  fn run(self) -> u64 { 18 }
}

pub struct AsyncPersistFib;
#[async_task(path = "::rusty_taskflow")]
impl AsyncPersistFib {
  pub fn new() -> Self { Self }
  async fn run(self, fib: &u64) -> u64 {
    tokio::fs::write("result.txt", format!("{fib}")).await.unwrap();
    *fib
  }
}

2. 注册共享组件（可选）

组件通过 FlowContext 暴露，分为两类：

Singleton（单例） — 进程级共享实例。
Factory（工厂） — 按名注册的构造器，每次调用返回一个新的 Box<T>。

示例代码与英文一致：

use rusty_taskflow::{register_singleton, register_factory};

pub struct MultiplierConfig { pub factor: u64 }
impl MultiplierConfig { pub fn new() -> Self { Self { factor: 3 } } }
register_singleton!(MultiplierConfig, "multiplier_config", MultiplierConfig::new);

pub struct RequestId(pub u64);
impl RequestId {
  pub fn new() -> Self {
    static N: std::sync::atomic::AtomicU64 = std::sync::atomic::AtomicU64::new(1);
    Self(N.fetch_add(1, std::sync::atomic::Ordering::Relaxed))
  }
}
register_factory!(RequestId, "request_id", RequestId::new);

3. 在任务中使用组件

将 ctx: &FlowContext 声明为第一个非 self 参数。过程宏会自动将运行时上下文传入，且不会将其视为 DAG 输入：

use rusty_taskflow::{sync_task, FlowContext};

pub struct Multiply;
impl Multiply { pub fn new() -> Self { Self } }

#[sync_task(path = "::rusty_taskflow")]
impl Multiply {
  fn run(self, ctx: &FlowContext, v: &u64) -> u64 {
    let cfg = ctx.get_singleton_component::<MultiplierConfig>("multiplier_config").unwrap();
    let req = ctx.create_component::<RequestId>("request_id").unwrap();
    println!("Multiply[req={}] {} * {}", req.0, cfg.factor, v);
    cfg.factor * v
  }
}

不需要 ctx 的任务保持原样，向后兼容。

4. 使用 TOML 描述流程（可选）

示例 TOML 与英文一致：

[flow]
name = "mixed_fib_io"

[[flow.source]]
name = "FibInput"
dependencies = []
output = "fib_n"
builder = "crate::config_tasks::FibInput::new()"

[[flow.processor]]
name = "Fib"
dependencies = ["fib_n"]
output = "fib_value"
builder = "crate::config_tasks::Fib::new()"

[[flow.processor]]
name = "AsyncPersistFib"
dependencies = ["fib_value"]
output = "persisted_fib"
builder = "crate::config_tasks::AsyncPersistFib::new()"

[flow.sink]
name = "DoubleSink"
dependencies = ["persisted_fib"]
output = "mixed_fib_output"
builder = "crate::config_tasks::DoubleSink::new()"

5. 执行流程

运行方式与英文一致：

// 方式 A：TOML 流程，先构建后执行
let (mut flow, sink_id) = build_flow_by_path(path).expect("构建失败");
let output = flow.run_with_sink_id(sink_id).await.expect("执行失败");

// 方式 B：TOML 流程，直接执行
let output = run_flow_by_path(path).await.expect("执行失败");

// 方式 C：手动构图，FlowContext 自动从 register_*! 宏初始化
use rusty_taskflow::tf::flow::Flow;
let mut flow = Flow::new();
let s1 = flow.commit_source_task("S1", FibSource1::new());
let s2 = flow.commit_source_task("S2", FibSource2::new());
let merged = flow.commit_task("Merger", Merger::new()).with_dependencies((s1, s2));
let fib   = flow.commit_task("Fib", Fib::new()).with_dependencies(merged);
let sink  = flow.commit_task("Multiply", Multiply::new()).with_dependencies(fib);
let output = flow.run(sink).await.expect("手动执行失败");

// 方式 D：注入自定义 FlowContext（测试 / Mock / 动态装配）
use std::sync::Arc;
use rusty_taskflow::FlowContext;

let mut ctx = FlowContext::new();
ctx.insert_singleton("multiplier_config", MultiplierConfig { factor: 100 });
ctx.insert_factory("request_id", RequestId::new);
let mut flow = Flow::with_context(Arc::new(ctx));
// ... 后续 commit_task / run 与常规流程一致 ...

项目结构

与英文一致：

tf-examples/
├── configs/
│   ├── flows.toml              # 流程索引
│   └── flows/*.toml            # 单个流程定义
├── src/
│   ├── config_tasks.rs         # 任务实现 + 组件注册
│   └── main.rs                 # 业务入口（演示全部四种执行模式）
└── build.rs                    # 编译期代码生成

运行示例

cargo run -p tf-examples

输出会依次演示：

TOML 流程（sink_id 与路径直连两种调用方式）
手动构图，自动拉取 inventory 中的 MultiplierConfig（factor=3）和 RequestId 工厂
同一张图改用 Flow::with_context 注入 factor=100 的自定义 ctx，结果随之变化

Changelog

FlowContext component injection

Flow::new() now auto-populates a FlowContext from every register_singleton! / register_factory! declaration compiled into the binary.
Flow::with_context(Arc<FlowContext>) accepts a custom context for tests and dynamic wiring; FlowContext::insert_singleton / insert_factory allow runtime insertion (including capturing closures for factories).
#[sync_task] / #[async_task] detect a leading ctx: &FlowContext parameter and forward the runtime context automatically. Tasks that do not declare one are unchanged — fully backward-compatible.
ComponentEntry::Factory is now stored as Box<dyn Fn> so that both inventory-registered factories and runtime-inserted capturing factories share a single code path. The extra vtable indirection is dominated by the Box::new(T) allocation that every factory performs anyway.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
src		src
taskflow-build		taskflow-build
taskflow-macros		taskflow-macros
tf-examples		tf-examples
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rusty-taskflow

English

Features

Limitations

Performance

Quick Start

1. Define task operators

2. Register shared components (optional)

3. Consume components inside a task

4. Describe flows with TOML (optional)

5. Run a flow

Project Structure

Run Example

中文

特性

限制

性能

快速开始

1. 定义任务算子

2. 注册共享组件（可选）

3. 在任务中使用组件

4. 使用 TOML 描述流程（可选）

5. 执行流程

项目结构

运行示例

Changelog

FlowContext component injection

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

rusty-taskflow

English

Features

Limitations

Performance

Quick Start

1. Define task operators

2. Register shared components (optional)

3. Consume components inside a task

4. Describe flows with TOML (optional)

5. Run a flow

Project Structure

Run Example

中文

特性

限制

性能

快速开始

1. 定义任务算子

2. 注册共享组件（可选）

3. 在任务中使用组件

4. 使用 TOML 描述流程（可选）

5. 执行流程

项目结构

运行示例

Changelog

FlowContext component injection

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages