Add support for permute operation to Direct Python Bindings #4701
!test
!build
This PR adds multi-GPU support to Direct Python Bindings. PR Stack: - NVIDIA#4689 **<<< This PR.** - NVIDIA#4697 - NVIDIA#4698 - NVIDIA#4704 - NVIDIA#4701 cc: @kshitij12345
This PR adds support for cast operations to Direct Python Bindings. PR Stack: - NVIDIA#4689 - NVIDIA#4697 **<<< This PR.** - NVIDIA#4698 - NVIDIA#4704 - NVIDIA#4701 - NVIDIA#4809
…IA#4698) This PR adds support for matmul and linear ops to Direct Python Bindings. PR Stack: - NVIDIA#4689 - NVIDIA#4697 - NVIDIA#4698 **<<< This PR.** - NVIDIA#4704 - NVIDIA#4701 - NVIDIA#4809
…ings (NVIDIA#4704) This PR adds size, shape, define_vector, and reshape ops to direct bindings. PR Stack: - NVIDIA#4689 - NVIDIA#4697 - NVIDIA#4698 - NVIDIA#4704 **<<< This PR.** - NVIDIA#4701 - NVIDIA#4809
) This PR adds support for replacing linear layers with TensorParallel NvFuser layer in deepseek model using Direct Python Bindings. PR Stack: - NVIDIA#4689 - NVIDIA#4697 - NVIDIA#4698 - NVIDIA#4704 - NVIDIA#4701 **<<< This PR.** - NVIDIA#4809
…ings (NVIDIA#4809) This PR changes `test_dtensor.py` and `test_deepseek_v3.py` to use direct bindings. It modifies `tests/python/multidevice/conftest.py` to provide a `multidevice_test` fixture for legacy tests and a `multidevice_direct_test` fixture for tests using direct bindings. Included quality-of-life improvements: * Fixes NVIDIA#4560 by supporting basic printing of multi-device scheduled fusions. The schedule operations are not created in the definition. PR Stack: - NVIDIA#4697 - NVIDIA#4698 - NVIDIA#4704 - NVIDIA#4701 - NVIDIA#4809 **<<< This PR.**
This PR adds support for replacing linear layers with the TensorParallel NvFuser layer in the DeepSeek model using Direct Python Bindings.
PR Stack:
`test_dtensor.py` and `test_deepseek_v3.py` to direct bindings #4809