packflow is a software development kit (SDK) that simplifies the development process and standardizes packaging of AI/ML
running on streaming data sources.
What does Packflow do? Packflow provides a framework for writing, running, and packaging inference code. It offers a standardized structure (the InferenceBackend class) for writing model execution logic, provides the runtime backbone that executes models with built-in profiling and preprocessing, and includes tools to bundle code into portable zip archives. This means you write your inference code once using Packflow's structure, and Packflow handles both the optimized execution and creation of shareable packages for transfer between systems.
Many existing packaging frameworks are catered towards inference APIs and often require custom preprocessing steps. This can be particularly challenging when dealing with data sources that typically generate data one row at a time in key-value pairs (e.g., firewall logs or message streams).
Packflow, however, is optimized to run models on either individual events or batches of events, streamlining development and reducing the need for additional preprocessing. By leveraging Packflow, teams can focus on building and deploying models with custom out-of-the-box workflows and utilities, significantly reducing the time and effort required to onboard new capabilities.
The following instructions quickly walk through how to install Packflow and serve user documentation.
- Python (version 3.10+)
- Packflow can be installed directly from PyPI:
pip install packflowNote
If contributing to Packflow, it is recommended to install packflow from source in editable mode: pip install -e .
Packflow documentation is hosted at https://dow-cdao.github.io/packflow/. Pre-built HTML is also available on the gh-pages branch.
To build and serve documentation locally:
- Install system prerequisites: Pandoc (required in addition to the
pandocPython package) andmake1 - Navigate to the docs folder:
cd docs - Install Python dependencies:
pip install -r requirements.txt - Run
make devto serve with live reloading at https://127.0.0.1:8000/
Packflow provides a flexible base class called an InferenceBackend that allows users to build highly scalable platform- and tool-agnostic inference code, enabling simplified sharing across environments.
Additionally, Packflow's CLI can assist with creating projects, gathering environmental information, and creating distributable code packages for sharing reproducible inference code between disconnected environments.
To create a dummy Inference Backend, update the inference.py file to the following:
from packflow import InferenceBackend
class Backend(InferenceBackend):
def execute(self, inputs):
"""
Simply print 'Hello, world!' then return the input data
"""
print('Hello, world!')
return inputsIn a different Python file or from the command line in the same directory, execute the following:
from packflow.loaders import LocalLoader
backend = LocalLoader('inference:Backend').load()
backend({"sample": "data"})
# >> {"sample": "data"}Contributions to Packflow are welcomed and highly encouraged! Please refer to the CONTRIBUTING.md guide for more information and guidelines for contributing to Packflow.
Packflow is developed and maintained by Booz Allen Hamilton on behalf of the Federal Government of the United States of America and the Department of War's Chief Digital and Artificial Intelligence Office (CDAO).
packflow is distributed under the terms of the MIT license. Please refer to the LICENSE for more information of acceptable usage and distribution of Packflow.
Footnotes
-
Installation of
makevaries by operating system. On MacOS, installxcode-select. On Windows, it is recommend to use Windows Subsystem for Linux (WSL). On Debian/Ubuntu,makecan be installed viaaptpackage manager:sudo apt update sudo apt install make build-essential ↩