forked from tensorflow/tensorflow
-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop upstream sync 230515 #2106
Merged
wenchenvincent
merged 349 commits into
develop-upstream
from
develop-upstream-sync-230515
May 19, 2023
Merged
Develop upstream sync 230515 #2106
wenchenvincent
merged 349 commits into
develop-upstream
from
develop-upstream-sync-230515
May 19, 2023
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
PiperOrigin-RevId: 530719240
PiperOrigin-RevId: 530726332
…encies. PiperOrigin-RevId: 530727072
PiperOrigin-RevId: 530729537
Updates LLVM usage to match [e9569748def8](llvm/llvm-project@e9569748def8) PiperOrigin-RevId: 530732122
PiperOrigin-RevId: 530734898
PiperOrigin-RevId: 530736714
PiperOrigin-RevId: 530740817
We check whether the PjRtClient has changed or not before compilation. We delete and create a new PJRT DeviceCompiler if it has changed. Ideally the cache should be invalidated upon shutdown, although getting a hold of the TpuDevice in TpuSystemShutdown() doesn't seem straightforward (the cache is stored in the TpuDevice's resource manager). PiperOrigin-RevId: 530751696
PiperOrigin-RevId: 530752262
…more clear that callers should pass them explicitly. PiperOrigin-RevId: 530752496
PiperOrigin-RevId: 530753496
PiperOrigin-RevId: 530763995
PiperOrigin-RevId: 530766976
PiperOrigin-RevId: 530775581
…e_id_tensor_int32_t PiperOrigin-RevId: 530776070
…evice information. The previous implementation was non-deterministic, as it relied on absl traversal order and functional control flow operations having a particular arrangement of nodes. PiperOrigin-RevId: 530781619
…t_id` helper. Compression is an input to `register_dataset`, not `from_dataset_id`. PiperOrigin-RevId: 530783906
PiperOrigin-RevId: 530795290
…p_outside_compilation can be used in the SPMD case and there are num_cores_per_replica host regions. Host regions are located on the host near the corresponding TPU core. Host regions are created by cloning the first region. A tensor that is sent or received to the host is split sharded. This tensor needs to be converted to MANUAL sharding before send/recv with _XlaHostComputeMlir. PiperOrigin-RevId: 530801371
This uses an ApproxTopK custom-call, which we add support for in supported by MHLO, by including a lowering to XLA's PartialReduce custom_call via the Client XLA ApproxTopK function. PiperOrigin-RevId: 530805966
PiperOrigin-RevId: 530811048
PiperOrigin-RevId: 530814881
See openxla/xla#2845 The modified code triggered: `warning C5101: use of preprocessor directive in function-like macro argument list is undefined behavior` Due to macro guards inside of a function-like macro invocation. The solution here was to merely clone the macro call into each directive region. PiperOrigin-RevId: 530835539
PiperOrigin-RevId: 530845246
PiperOrigin-RevId: 530845249
- use-after-free - double-free - detect more out of bounds accesses PiperOrigin-RevId: 530847579
PiperOrigin-RevId: 530855903
Use optimized 4bit eval when weight shape is larger than [4x32] PiperOrigin-RevId: 530859000
… experiment to debug why some jobs got slower. PiperOrigin-RevId: 532120751
PiperOrigin-RevId: 532132008
PiperOrigin-RevId: 532135421
…sformers/ have strict dependencies. PiperOrigin-RevId: 532136646
PiperOrigin-RevId: 532138742
PiperOrigin-RevId: 532141388
PiperOrigin-RevId: 532143105
…pendencies. PiperOrigin-RevId: 532143181
…ta/experimental/kernel_tests/optimization/ have strict dependencies. PiperOrigin-RevId: 532143351
…ncies. PiperOrigin-RevId: 532146642
PiperOrigin-RevId: 532148739
PiperOrigin-RevId: 532150448
PiperOrigin-RevId: 532153598
PiperOrigin-RevId: 532153784
PiperOrigin-RevId: 532157885
PiperOrigin-RevId: 532161692
When one worker has finished, the snapshot manager is in the "WindingDown" state. It doesn't need to create new streams. If it does, a worker is assigned two streams. PiperOrigin-RevId: 532171233
This helps with better visualizing Triton GEMMs as HLO ops in profiling tools PiperOrigin-RevId: 532173667
PiperOrigin-RevId: 532176002
PiperOrigin-RevId: 532178377
…ncies. PiperOrigin-RevId: 532182453
…rotos from AbstractStackTrace and StackTracesMap objects. Its API has options that cover various typical use cases. PiperOrigin-RevId: 532183689
… are created. PiperOrigin-RevId: 532194430
… into develop-upstream-sync-230515
…nto develop-upstream-sync-230515
…pen does not support fp64.
jayfurmanek
approved these changes
May 19, 2023
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.