example-pipeline-project

Example data pipeline project structure with modern Python tooling.

Subject

CSV file for simplicity
Invalid data needing corrections
Target format requirement data manipulation
Task failure due to unexpected data

This project follows a traditional data flow pattern.

graph
    A[Sources] --> B[Staging]
    B --> C[Managed Storage]
    C --> D[Task]
    C --> E[Task]
    C --> F[Task]
    D --> G[Compute]
    E --> H[Compute]
    F --> I[Compute]
    G --> J[Managed Storage]
    H --> K[Managed Storage]
    I --> L[Managed Storage]
    J --> M[Users]
    K --> M
    L --> M

The Data

This project processes shipment data from a denormalized, queried format.

route_id	order_id	sku_id	origin_id	origin_city	origin_state	origin_zip	origin_country	dest_id	dest_city	dest_state	dest_zip	dest_country	weight	weight_uom	quantity	quantity_uom	linehaul_cost	linehaul_cost_uom
72	465	292	1	Philadelphia	PA	20134	US	2	Vancouver	BC	ABC DFG	CA	279.429	LBS	3.2372	PLT	-344.4967	USD

Usage

Run Jobs

python run.py jobs --offline

For Help

# dev commands
make help

# general run.py help
python run.py --help

# check help messages for each subcommand
python run.py jobs --help

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.github		.github
.vscode		.vscode
business		business
config		config
core		core
data		data
tasks		tasks
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
README.md		README.md
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

example-pipeline-project

Subject

The Data

Usage

Run Jobs

For Help

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Uh oh!

Uh oh!

cnpryer/example-pipeline-project

Folders and files

Latest commit

History

Repository files navigation

example-pipeline-project

Subject

The Data

Usage

Run Jobs

For Help

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages