[xla:gpu] Add runtime optimization using frontend attribute #63430

copybara-service · 2024-03-11T17:28:19Z

[xla:gpu] Add runtime optimization using frontend attribute
_xla_send_recv_validation.

A collective-permute instruction inside a loop may not send or receive data
that affect the output of the whole module in every iteration. When this
information is encoded in the frontend attribute _xla_send_recv_validation
attached to the Send and Recv instructions decomposed from such a
collective-permute instruction, the runtime can use it to skip the invocation
of the NCCL API that performs the Send and Recv operations.

Add tests.
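
The skip logic described above can be sketched roughly as follows. This is a
minimal illustration, not the XLA implementation. It assumes a hypothetical
encoding in which the attribute holds one inclusive [lower, upper] iteration
range per source-target pair, written like "{{0,1},{1,2}}". The commit message
does not state the encoding, and the names parse_validation, should_run and
run_loop are invented for this example.

```python
import re
from typing import Callable, List, Tuple

Bounds = Tuple[int, int]


def parse_validation(attr: str) -> List[Bounds]:
    """Parse the ASSUMED encoding "{{0,1},{1,2}}" into [(0, 1), (1, 2)]."""
    pairs = re.findall(r"\{\s*(\d+)\s*,\s*(\d+)\s*\}", attr)
    return [(int(lo), int(hi)) for lo, hi in pairs]


def should_run(bounds: List[Bounds], pair_index: int, iteration: int) -> bool:
    """True if the given source-target pair carries useful data this iteration."""
    lo, hi = bounds[pair_index]
    return lo <= iteration <= hi


def run_loop(attr: str, pair_index: int, num_iterations: int,
             send: Callable[[int], None]) -> List[int]:
    """Call `send` (a stand-in for the NCCL call) only in validated iterations."""
    bounds = parse_validation(attr)
    executed = []
    for iteration in range(num_iterations):
        if should_run(bounds, pair_index, iteration):
            send(iteration)
            executed.append(iteration)
        # Otherwise the transfer is skipped: its data would not affect the output.
    return executed
```

For example, with the attribute "{{0,1},{1,2}}" and a four-iteration loop, the
second pair (index 1) would invoke the send only in iterations 1 and 2.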

PiperOrigin-RevId: 614751869

copybara-service bot force-pushed the test_611192382 branch from 648dc02 to 717fcbc Compare March 11, 2024 18:55

copybara-service bot force-pushed the test_611192382 branch from 717fcbc to 6924b9d Compare March 11, 2024 19:57

copybara-service bot merged commit 6924b9d into master Mar 11, 2024

copybara-service bot deleted the test_611192382 branch March 11, 2024 19:57
