Boosting DL Service Throughput 1.5–4x via Ensemble Pipeline Serving with Concurrent CUDA Streams, Using a PyTorch/LibTorch Frontend and TensorRT/CVCUDA (etc.) Backends
Topics: deployment, inference, pytorch, ray, serve, tensorrt, serving, pipeline-parallelism, torch2trt, triton-inference-server, ray-serve, cvcuda
Updated: Jun 5, 2024 · Language: C++