Enhancing Yolo-v4 Performance using Scalar Matrix Multiplication in oneAPI

Introduction

This project explores the enhancement of Yolo-v4 performance through Scalar Matrix Multiplication (MM) in oneAPI. We focus on optimizing convolution layers in the Yolo-v4 model, integrating oneAPI with Python.

Background

We utilize oneAPI to optimize deep learning computations in the Yolo-v4 model, aiming for improved efficiency and accuracy in object detection.

Methodology

Our approach covers:

Scalar Matrix Multiplication in oneAPI
Python-C++ Integration
Convolution Layer Wrapper
Yolo-v4 Modification and Implementation
Performance Analysis

Commands to Run

Scalar MM: icpx -fsycl smm.cpp -o smm
Python-C++ Integration: icpx -fsycl -fPIC -shared -o libsmm.so shared.cpp
Convolution Wrapper: python3 wrapper.py
Yolo-v4: python3 yoto.py, python3 yolo.py
Performance Analysis: python3 compare.py

Results

Demonstrates minimal speedup in convolution layers of the Yolo-v4 model on CPU devices.

Challenges

Discusses challenges in FLOPs calculation for complex models with custom implementations.

Conclusion

Highlights the potential of Scalar MM in oneAPI for deep learning optimization.

Authors

Vikash Singh (vxs465)
Thomas Bornhorst (thb34)

Institution

Case Western Reserve University

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Enhancing Yolo-v4 Performance using Scalar Matrix Multiplication in oneAPI

Introduction

Background

Methodology

Commands to Run

Results

Challenges

Conclusion

Authors

Institution

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github/workflows		.github/workflows
README.md		README.md
annotated-CSDS451_REPORT.pdf		annotated-CSDS451_REPORT.pdf
compare.py		compare.py
libsmm.so		libsmm.so
shared.cpp		shared.cpp
smm.cpp		smm.cpp
smm.h		smm.h
wrapper.py		wrapper.py
yolo.py		yolo.py
yoto.py		yoto.py

vicky157/Enhanced-YOLOV4-sycl-python-integration-

Folders and files

Latest commit

History

Repository files navigation

Enhancing Yolo-v4 Performance using Scalar Matrix Multiplication in oneAPI

Introduction

Background

Methodology

Commands to Run

Results

Challenges

Conclusion

Authors

Institution

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages