Improved float `workspace` arg for TRT exports #9407

zldrobit · 2024-03-29T09:53:46Z

I would like to export a TensorRT model with a workspace of about 250-500 MB, but the current exporter does not accept a fractional (float) workspace value:

ultralytics/ultralytics/engine/exporter.py, lines 676 to 679 in 4a7ccba:

    builder = trt.Builder(logger)
    config = builder.create_builder_config()
    config.max_workspace_size = self.args.workspace * 1 << 30
    # config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, workspace << 30)  # fix TRT 8.4 deprecation notice

So I tweaked the code to accept a float workspace. A workspace smaller than 1 GB should be helpful on resource-constrained devices such as NVIDIA Jetson.
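For readers following along, here is a minimal sketch of why the quoted expression cannot take a float and one way to convert a fractional size to bytes. In Python, `*` binds tighter than `<<`, so `workspace * 1 << 30` parses as `(workspace * 1) << 30`, and shifting a float raises `TypeError`. The helper name `workspace_bytes` and the constant `GIB` are hypothetical and used only for illustration; this is not necessarily the exact line merged in this PR.

```python
GIB = 1 << 30  # number of bytes in one GiB


def workspace_bytes(workspace_gib: float) -> int:
    """Convert a workspace size in GiB (int or float) to a whole number of bytes.

    Hypothetical helper for illustration; not part of the Ultralytics API.
    """
    if workspace_gib <= 0:
        raise ValueError("workspace size must be positive")
    # Multiply first, then truncate to an int, so fractional sizes are supported.
    return int(workspace_gib * GIB)


def original_expression(workspace):
    """Mirror the quoted expression: parsed as (workspace * 1) << 30."""
    return workspace * 1 << 30


if __name__ == "__main__":
    print(workspace_bytes(0.25))   # about 256 MiB, expressed in bytes
    print(workspace_bytes(4))      # integer sizes still work
    try:
        original_expression(0.5)   # a float cannot be bit-shifted
    except TypeError as exc:
        print(f"float workspace fails with the original expression: {exc}")
```

With a helper like this, a 250-500 MB workspace corresponds to passing roughly 0.25 to 0.5.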

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Improved configuration and memory handling in Ultralytics software components.

📊 Key Changes

Added "workspace" to CFG_FLOAT_KEYS for better configuration management.
Removed "workspace" from a list where it no longer belongs, streamlining configuration clarity.
Updated memory workspace size calculation in the TensorRT exporter for enhanced precision and compatibility.
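The key-list change listed above can be pictured with a small type-check sketch. It assumes a config loader that groups argument names by accepted type and rejects mismatches. Only the name `CFG_FLOAT_KEYS` comes from this PR; the other set, the `batch` key, and the function `check_cfg_types` are hypothetical stand-ins, and the real Ultralytics validation may differ.

```python
# Illustrative key groups; only CFG_FLOAT_KEYS is named in the PR summary.
CFG_FLOAT_KEYS = {"workspace"}  # values may be int or float
EXAMPLE_INT_KEYS = {"batch"}    # hypothetical: values must be int


def check_cfg_types(cfg: dict) -> None:
    """Raise TypeError if a known key holds a value of the wrong type.

    Hypothetical helper for illustration; not the Ultralytics implementation.
    """
    for key, value in cfg.items():
        if value is None:
            continue  # unset values are skipped
        is_bool = isinstance(value, bool)  # bool is a subclass of int, so exclude it
        if key in CFG_FLOAT_KEYS and (is_bool or not isinstance(value, (int, float))):
            raise TypeError(f"'{key}={value!r}' must be an int or float")
        if key in EXAMPLE_INT_KEYS and (is_bool or not isinstance(value, int)):
            raise TypeError(f"'{key}={value!r}' must be an int")


if __name__ == "__main__":
    check_cfg_types({"workspace": 0.5, "batch": 1})  # accepted
    check_cfg_types({"workspace": 4})                # ints remain valid
    try:
        check_cfg_types({"workspace": "0.5"})        # strings are rejected
    except TypeError as exc:
        print(exc)
```

Under this scheme, moving `workspace` from an int-only group into the float group is what lets a value like 0.5 pass validation.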

🎯 Purpose & Impact

Enhanced Clarity and Management: The changes make it easier to understand and manage configuration parameters, enhancing user experience. 🌈
Improved Memory Handling: The more precise memory workspace size calculation means better performance and efficiency, especially for users leveraging TensorRT for neural network inference. 💾🚀
Broad Compatibility: Adjusting the memory size calculation ensures compatibility with newer versions of TensorRT, ensuring users can stay up-to-date with the latest technology. 🆕

github-actions · 2024-03-29T09:54:00Z

CLA Assistant Lite bot All Contributors have signed the CLA. ✅

zldrobit · 2024-03-29T09:54:28Z

I have read the CLA Document and I sign the CLA

github-actions

👋 Hello @zldrobit, thank you for submitting an Ultralytics YOLOv8 🚀 PR! To allow your work to be integrated as seamlessly as possible, we advise you to:

✅ Verify your PR is up-to-date with ultralytics/ultralytics main branch. If your PR is behind you can update your code by clicking the 'Update branch' button or by running git pull and git merge main locally.
✅ Verify all YOLOv8 Continuous Integration (CI) checks are passing.
✅ Update YOLOv8 Docs for any new or updated features.
✅ Reduce changes to the absolute minimum required for your bug fix or feature addition. "It is not daily increase but daily decrease, hack away the unessential. The closer to the source, the less wastage there is." — Bruce Lee

See our Contributing Guide for details and let us know if you have any questions!

codecov · 2024-03-29T09:57:20Z

Codecov Report

Attention: Patch coverage is 50.00%, with 1 line in your changes missing coverage. Please review.

Project coverage is 76.94%. Comparing base (4a7ccba) to head (339ddf7).

❗ Current head 339ddf7 differs from pull request most recent head e946f63. Consider uploading reports for the commit e946f63 to get more accurate results

Files	Patch %	Lines
ultralytics/engine/exporter.py	0.00%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #9407   +/-   ##
=======================================
  Coverage   76.94%   76.94%           
=======================================
  Files         117      117           
  Lines       14850    14850           
=======================================
+ Hits        11426    11427    +1     
+ Misses       3424     3423    -1

Flag	Coverage Δ
Benchmarks	`36.73% <50.00%> (ø)`
GPU	`38.85% <50.00%> (ø)`
Tests	`72.07% <50.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

glenn-jocher · 2024-03-29T15:19:59Z

@zldrobit good change!

Co-authored-by: Glenn Jocher <glenn.jocher@ultralytics.com>

Add support for float type workspace

339ddf7

github-actions bot reviewed Mar 29, 2024


Update exporter.py

e946f63

glenn-jocher changed the title ~~Add support for float type workspace (exporting a model)~~ Improved float workspace arg for TRT exports Mar 29, 2024

glenn-jocher merged commit 03d0ffd into ultralytics:main Mar 29, 2024
10 checks passed

hmurari pushed a commit to hmurari/ultralytics that referenced this pull request Apr 17, 2024

Improved float workspace arg for TRT exports (ultralytics#9407)

5bfb37c

Co-authored-by: Glenn Jocher <glenn.jocher@ultralytics.com>
