Enhancements to Inference Benchmarking #6

DimaBir · 2023-10-16T14:07:01Z

Description:

This pull request introduces several improvements to the inference benchmarking code. The primary changes include:

Updated the benchmarking function to handle batches of images instead of a single image.
Introduced throughput measurement alongside inference time.
Fixed issues related to TensorRT precision modes (FP16 and FP32).
Enhanced the plotting function to visualize both inference time and throughput.

Changes:

Modified the TensorRTInference class to handle precision-specific input data.
Updated the plot_benchmark_results function to display two side-by-side plots for inference time and throughput.
Addressed various bugs and errors encountered during the benchmarking process.

- Introduced object-oriented design for inference: - Created base inference class `InferenceBase`. - Derived classes for different inference modes: `ONNXInference`, `OVInference`, `PyTorchCPUInference`, `PyTorchCUDAInference`, and `TensorRTInference`. - Integrated `ModelLoader` to handle model loading and caching: - Models are now loaded once and saved locally under the `common/model` directory. - Checks for the existence of the model locally before loading to avoid redundant loads. - Enhanced benchmarking: - Integrated benchmarking logic into each inference class. - Added a `benchmark` method to each inference class to handle model-specific benchmarking. - Collected benchmark results for all models when `args.mode` is set to "all" and plotted the results using the `plot_benchmark_results` function. - Updated `main.py`: - Integrated the new inference classes and their methods. - Modified argument parsing to support different inference modes and other options. - Added logic to collect and plot benchmark results for all models when `args.mode` is set to "all". - Removed post-processing logic as prediction methods in inference classes now handle result printing. - Updated file hierarchy to better organize the codebase and support the new classes.

DimaBir added 30 commits October 15, 2023 21:45

Fixing import modules path

dc10c78

Fixing import modules path

17ef63c

Fixing import modules path

8ec05ba

Fixing import modules path

338dae5

Fixing import modules path

c022e06

Fixing import modules path

d3e862e

Fixing import modules path

42ea101

Fixing import modules path

fab8073

Fixing import modules path

949803f

Fixing import modules path

eaf2d83

Fixing import modules path

9569e0f

Fixing import modules path

6594c02

Fixing import modules path

f84745f

Fixing import modules path

826e25f

Fixing import modules path

78c7fb2

Fixing import modules path

4664081

Fixing import modules path

75e60b7

Fixing import modules path

e3731b8

Fixing import modules path

075076c

Fixing import modules path

f7399a1

Fixing import modules path

96e5c0e

Fixing import modules path

56348c6

Fixing import modules path

3c4ef7a

Fixing import modules path

5fe420a

Fixing import modules path

6ac019c

Fixing import modules path

c3ffbfe

Fixing import modules path

df27f19

Fixing import modules path

28d9499

Fixing import modules path

3d09fba

DimaBir added 29 commits October 16, 2023 12:48

Refactored tensorrt_inference.py

ed142f7

Refactored tensorrt_inference.py

0a92651

Refactored tensorrt_inference.py

c0491fb

Fixing errors for gpu

2c9c0e5

Dded debug prints

cda859f

Added debug prints

a947d78

Added debug prints

72f672e

Added debug prints

dbbb6df

Added debug prints

3572a62

Fixed tensorrt_inference.py

8ee038b

Fixed tensorrt_inference.py

21bd36b

Fixed tensorrt_inference.py

0e84803

Fixed tensorrt_inference.py

e3cd401

Fixed tensorrt_inference.py

61fe8fe

Fixed tensorrt_inference.py

5b8329d

Fixed tensorrt_inference.py

8034ff9

Expanded batch to be dynamic size

e1de139

Expanded batch to be dynamic size

72a90a7

Expanded batch to be dynamic size

6653ad0

Expanded batch to be dynamic size

d2a81b6

Expanded batch to be dynamic size

3bc7029

Expanded batch to be dynamic size

02e9fda

Expanded batch to be dynamic size

f24ed81

Expanded batch to be dynamic size

e23a24f

Updated README.md

fe70cfd

Updated README.md

f03f3a5

Updated README.md

7253b4e

Updated README.md

63c5938

Updated README.md

5bc6e59

DimaBir merged commit 2fd7dfe into master Oct 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhancements to Inference Benchmarking #6

Enhancements to Inference Benchmarking #6

Uh oh!

DimaBir commented Oct 16, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Enhancements to Inference Benchmarking #6

Enhancements to Inference Benchmarking #6

Uh oh!

Conversation

DimaBir commented Oct 16, 2023

Description:

Changes:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants