[pull] master from tensorflow:master by pull[bot] · Pull Request #265 · IrvanFza/tensorflow

pull · 2025-07-30T15:56:01Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.3)


… the constraints. The original issue manifested as CUDA_ERROR_ILLEGAL_INSTRUCTION on microbenchmarks that used TMA. The constraint is not documented in NVIDIA's docs; we learned of it via chat when the issue was raised. PiperOrigin-RevId: 788827881

This is a somewhat tricky case because scatter reuses each operand's buffer for the corresponding output. So if the same operand is repeated, we should instead transform the scatter so that each repetition gets its own copy of the operand. PiperOrigin-RevId: 788836845
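As a rough illustration of the transformation described above, the sketch below replaces every repeated operand with an explicit copy so that no two output slots share an input buffer. It is not XLA code: `uniquify_operands` is a hypothetical helper, and operands are modeled as plain strings rather than HLO instructions.

```python
def uniquify_operands(operands):
    """Replace repeated operand names with explicit copies.

    Hypothetical helper for illustration only; operands are plain strings
    here, not real HLO instructions.
    """
    seen = set()
    copies_made = 0
    result = []
    for name in operands:
        if name in seen:
            # A repeated operand would alias two outputs to one buffer,
            # so give this repetition its own (symbolic) copy.
            copies_made += 1
            result.append(f"copy.{copies_made}({name})")
        else:
            seen.add(name)
            result.append(name)
    return result


if __name__ == "__main__":
    print(uniquify_operands(["a", "b", "a", "a"]))
```

The first occurrence of each operand is left untouched, so a scatter without repeated operands passes through unchanged.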

We were calling ComposePermutations where a permutation was applied to a non-permutation. This should be done with the method Permute() instead. This is a non-functional change in opt builds, but before this change, a DCHECK was triggered. PiperOrigin-RevId: 788860614
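The distinction can be sketched in a few lines. The helpers below are hypothetical Python stand-ins, not XLA's C++ functions, and the indexing convention (`output[i] = values[permutation[i]]`) is an assumption made for the example. The point is only that composing requires both arguments to be permutations, while permuting accepts arbitrary values such as dimension sizes.

```python
def is_permutation(p):
    """True if p contains each of 0..len(p)-1 exactly once."""
    return sorted(p) == list(range(len(p)))


def permute(values, permutation):
    """Reorder arbitrary values: output[i] = values[permutation[i]]."""
    if not is_permutation(permutation) or len(values) != len(permutation):
        raise ValueError("permutation must be a permutation of the value indices")
    return [values[j] for j in permutation]


def compose_permutations(p1, p2):
    """Compose two permutations: output[i] = p1[p2[i]].

    Both inputs must be permutations; this check plays the role of the
    debug-only check mentioned in the commit message.
    """
    if not (is_permutation(p1) and is_permutation(p2)) or len(p1) != len(p2):
        raise ValueError("both arguments must be permutations of equal length")
    return [p1[j] for j in p2]


if __name__ == "__main__":
    dims = [8, 128, 32]   # dimension sizes: not a permutation
    perm = [2, 0, 1]
    print(permute(dims, perm))
    print(compose_permutations([1, 2, 0], perm))
```

With the check removed, both helpers compute the same indexing expression, which matches the commit's description of the change as non-functional in opt builds.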

PiperOrigin-RevId: 788869943

…has passed. PiperOrigin-RevId: 788871192

PiperOrigin-RevId: 788871838

…if no layout has been set. Sometimes an `HloModule` has its `entry_computation_layout` field set but with no layouts specified, so we shouldn't exit early and return the current field: we may still need to tuplify it if `useTupleArgs` is true. PiperOrigin-RevId: 788875871

PiperOrigin-RevId: 788876921

Imported from GitHub PR openxla/xla#29342

The XLA Linux x86 GPU ONEAPI pre-submit is failing because certain flags are being overridden, which is blocking the hermetic build from running successfully.

Copybara import of the project:

-- 9e4190e4c627e853ddd9dbcde936f733101c1211 by mraunak <mayank.kumar.raunak@intel.com>: Create ci_build_xla.sh Build with Clang
-- bdc183da860a31f1be2f6dd3858a8ac44a53f9d2 by mraunak <mayank.kumar.raunak@intel.com>: Update configure.py
-- 1a874cf69fd84d69b26ac4e37843887ac8291bb8 by mraunak <mayank.kumar.raunak@intel.com>: Add Clang as host compiler for oneAPI
-- 342b1d67645f6fedc9ad48ea699fe558e273530b by mraunak <mayank.kumar.raunak@intel.com>: Update sycl_configure.bzl
-- 2f38e0d154977ecac50bdb220060789a0f835c22 by mraunak <mayank.kumar.raunak@intel.com>: Update BUILD.sycl.tpl
-- 7df8e577fea5861b2b28cc18906b289b4e9ac5df by mraunak <mayank.kumar.raunak@intel.com>: Update ci_build_xla.sh
-- bef3a5ca6faf1378636de287996a70ce8177cb95 by mraunak <mayank.kumar.raunak@intel.com>: Update sycl_configure.bzl
-- a960ff7855f5dcefebd0913d7b7712e579028cac by mraunak <mayank.kumar.raunak@intel.com>: Update tensorflow.bazelrc
-- 2d60209a0333204c5abb907501abff4d0af21271 by mraunak <mayank.kumar.raunak@intel.com>: Update ci_test_xla.sh
-- 8a5db012f9f1e664711c3fbb90c7215f21effd2d by mraunak <mayank.kumar.raunak@intel.com>: Address the comments
-- 735f4591654ee377f808001959cd566b4b6a0544 by mraunak <mayank.kumar.raunak@intel.com>: Address the comments
-- e7c412d19b25804c4fa4fb0b65329146c983a6e3 by mraunak <mayank.kumar.raunak@intel.com>: Address the comment
-- b456c8f8dca013bb730a733b1f988646a0a744bb by mraunak <mayank.kumar.raunak@intel.com>: Update tensorflow.bazelrc
-- cedc8586494f789e5ec92aef30e64e866a6b022d by mraunak <mayank.kumar.raunak@intel.com>: Apply patch to PR
-- d2dc3846a7fdd55f35fc66f314ad56265e963c59 by mraunak <mayank.kumar.raunak@intel.com>: fix the test failure
-- 33a3ca34306b6047881f499aa76fc69b4f5003a6 by mraunak <mayank.kumar.raunak@intel.com>: Resolve merge conflicts
-- 329b71e7c72e1df001f9d104c25f44032739cc02 by mraunak <mayank.kumar.raunak@intel.com>: Update sycl_configure.bzl
-- 253cdd753760e8718973b4493f6aafc07205bb2d by mraunak <mayank.kumar.raunak@intel.com>: Fix the pre-submit failure
-- 326fb3a4bbaf6e62caaf8f2135405540b0901739 by mraunak <mayank.kumar.raunak@intel.com>: Update configure.py
-- fea0523b3afc170ba7a9a7a7b72d7327c5aa2637 by mraunak <mayank.kumar.raunak@intel.com>: Update sycl_configure.bzl
-- e841d90370dc2c45187df8a8965dde5d137a8455 by mraunak <mayank.kumar.raunak@intel.com>: Update configure.py
-- 9d6e9d7512c4f6bd803d4ee0297dbb2306179c67 by mraunak <mayank.kumar.raunak@intel.com>: Fix the indentation issue

Merging this change closes #29342

PiperOrigin-RevId: 788890366

Imported from GitHub PR openxla/xla#29332

A bug reported at NVIDIA showed GpuAotCompilationTest failing, specifically these two tests:

```
[ FAILED ] GpuAotCompilationTest.ExportAndLoadExecutable
[ FAILED ] GpuAotCompilationTest.AotCompilationWithoutGpuDevice
```

with the error message

```
NOT_FOUND: Could not find registered platform with name: "CUDA". Available platform names are: Interpreter
```

The cause was that the CUDA platform was not properly registered. The BUILD file now links the CUDA platform so that it is registered before the test executes.

Copybara import of the project:

-- a245bd2958e9794d19ecd87306eb298e62015f75 by Chenhao Jiang <chenhaoj@nvidia.com>: Fix the gpu_aot_compilation_test

Merging this change closes #29332

PiperOrigin-RevId: 788893566

PiperOrigin-RevId: 788895133

…ise int4 A100 tests are failing. In this CL we pass the device description to the rewriter and check exactly which device we have. PiperOrigin-RevId: 788908460

PiperOrigin-RevId: 788911526

PiperOrigin-RevId: 788914574

asus4 and others added 15 commits July 10, 2025 09:42

Enable 16kb page size alignment on libtensorflowlite_gpu_gl.so

ec74f8d

[XLA:CPU] Run inline pass multiple times.

1688645

PiperOrigin-RevId: 788869943

Remove PJRT v70 checks since the 3 week forward compatibility window …

df656de

…has passed. PiperOrigin-RevId: 788871192

[XLA:CPU] Add logging for LLVM compilation time.

03f861c

PiperOrigin-RevId: 788871838

Add a missing symbol to unbreak Windows.

0adf105

PiperOrigin-RevId: 788876921

Add unit test with non-flat call graph on shardy xla pass.

4717b2c

PiperOrigin-RevId: 788895133

[XLA:GPU] Fix: Check for bf16x2 enabled when using inline asm. Otherw…

0383dc7

…ise int4 A100 tests are failing. In this CL we pass the device description to the rewriter and check exactly which device we have. PiperOrigin-RevId: 788908460

[XLA:CPU] Don't fuse non-scalar constants.

bf820da

PiperOrigin-RevId: 788911526

Merge pull request #96702 from asus4:lite-gldelegate-16kb-support

fe21e74

PiperOrigin-RevId: 788914574

pull bot locked and limited conversation to collaborators Jul 30, 2025

pull bot added the ⤵️ pull label Jul 30, 2025

pull bot merged commit fe21e74 into IrvanFza:master Jul 30, 2025

pull[bot] merged 15 commits into IrvanFza:master from tensorflow:master
