PEPPER: Profiling-based Edge Placement and Partitioning for Deep Learning Execution

This repository provides the code for a pipeline is for profiling and partitioning ONNX models, to enhance inference efficiency across heterogeneous hardware platforms. Optimal split points within the deep learning models are identified through the application of Tarjan’s Bridge-Finding Algorithm, and the inference times of the models are predicted per device based on the respective characteristics and CPU load. For the prediction of inference times, the XGBoost algorithm is employed. The effectiveness of the proposed approach is validated through experiments conducted on real-world edge devices, demonstrating that highly efficient and adaptable deployment of complex deep learning models can be achieved in such environments.

Architecture

Components

Cite Us

If you use the above code for your research, please cite our paper:

PEPPER: Profiling-based Edge Placement and Partitioning for Deep Learning Execution

@inproceedings{10.1145/3770501.3770528,
author = {Korontanis, Ioannis and Kontopoulos, Ioannis and Zacharia, Athina and Makris, Antonios and Chronis, Christos and Pateraki, Maria and Tserpes, Konstantinos and Varlamis, Iraklis},
title = {PEPPER: Profiling-based Edge Placement and Partitioning for Deep Learning Execution},
year = {2025},
isbn = {9798400715952},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3770501.3770528},
doi = {10.1145/3770501.3770528},
abstract = {Unlocking the full potential of AI at the edge requires overcoming the fundamental challenge of running complex models efficiently on devices with limited computational power. In this work, the challenge of optimizing the deployment of deep learning models in resource-constrained environments is addressed. A novel pipeline is proposed for profiling and partitioning ONNX models, to enhance inference efficiency across heterogeneous hardware platforms. Optimal split points within the deep learning models are identified through the application of Tarjan’s Bridge-Finding Algorithm, and the inference times of the models are predicted per device based on the respective characteristics and CPU load. For the prediction of inference times, the XGBoost algorithm is employed. The effectiveness of the proposed approach is validated through experiments conducted on real-world edge devices, demonstrating that highly efficient and adaptable deployment of complex deep learning models can be achieved in such environments.},
booktitle = {Proceedings of the 15th International Conference on the Internet of Things},
pages = {228–236},
numpages = {9},
keywords = {Profiling, Distributed Inference, Edge, Placement, IoT},
location = {},
series = {IOT '25}
}

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
evaluation		evaluation
model_characteristics_extractor		model_characteristics_extractor
model_splitter		model_splitter
profiler		profiler
.gitignore		.gitignore
README.md		README.md
architecture.png		architecture.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PEPPER: Profiling-based Edge Placement and Partitioning for Deep Learning Execution

Architecture

Components

Cite Us

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PEPPER: Profiling-based Edge Placement and Partitioning for Deep Learning Execution

Architecture

Components

Cite Us

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages