Fixing issue with sample_async failing on machines with multiple GPUs #1379

1tnguyen · 2024-03-12T20:10:02Z

Description

In library mode, we need to run a tracing pass before actual execution.
With multi-qpu, we set_exec_ctx on specific QPU id but didn't reset the context on that particular qpu id (fall back to reset QPU 0 context all the time).

Resolves #1374

In library mode, we need to run a tracing pass before actual execution. With multi-qpu, we set_exec_ctx on specific qpu id but didn't reset the context on that particular qpu id (fall back to reset QPU 0 context all the time).

bmhowe23

👍

1tnguyen requested review from amccaskey and bmhowe23 March 12, 2024 20:10

Merge branch 'main' into tnguyen/sample_async_bug

8c75756

bmhowe23 approved these changes Mar 12, 2024

View reviewed changes

Merge branch 'main' into tnguyen/sample_async_bug

0e1f544

1tnguyen enabled auto-merge (squash) March 12, 2024 20:15

1tnguyen merged commit 0c8e28b into NVIDIA:main Mar 12, 2024
133 checks passed

github-actions bot locked and limited conversation to collaborators Mar 12, 2024

bettinaheim changed the title ~~Fix a bug in sample.h affecting sample_async in library mode~~ Fixing issue with sample_async failing on machines with multiple GPUs Apr 17, 2024

bettinaheim added the bug fix To be listed under Bug Fixes in the release notes label Apr 17, 2024

bettinaheim added this to the release 0.7.1 milestone Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing issue with sample_async failing on machines with multiple GPUs #1379

Fixing issue with sample_async failing on machines with multiple GPUs #1379

1tnguyen commented Mar 12, 2024 •

edited

Loading

bmhowe23 left a comment

Fixing issue with sample_async failing on machines with multiple GPUs #1379

Fixing issue with sample_async failing on machines with multiple GPUs #1379

Conversation

1tnguyen commented Mar 12, 2024 • edited Loading

Description

bmhowe23 left a comment

Choose a reason for hiding this comment

1tnguyen commented Mar 12, 2024 •

edited

Loading