Update unit tests status badge in readme by khatwanimohit · Pull Request #1 · AI-Hypercomputer/maxtext

khatwanimohit · 2023-04-17T23:46:52Z

Update unit tests status badge in readme

rwitten

Congrats on the first commit!

Add pull request template

JAX is undergoing a rename of the contents of `jax.experimental.layouts` in preparation for its graduation from experimental: 1. "Formats" are replacing "layouts", and specifically `Layout` -> `Format` 2. "Layouts" are replacing "device local layouts", and specifically `DeviceLocalLayout` -> `Layout` This is an incremental update carrying out #1. PiperOrigin-RevId: 773086710

…s` and `.output_layouts` respectively. JAX is undergoing a rename of the contents of jax.experimental.layouts in preparation for its graduation from experimental: "Formats" are replacing "layouts", and specifically Layout -> Format "Layouts" are replacing "device local layouts", and specifically DeviceLocalLayout -> Layout This is an incremental update carrying out #1. PiperOrigin-RevId: 773141876

JAX is undergoing a rename of the contents of jax.experimental.layouts in preparation for its graduation from experimental: "Formats" are replacing "layouts", and specifically Layout -> Format "Layouts" are replacing "device local layouts", and specifically DeviceLocalLayout -> Layout This is an incremental update carrying out #1. PiperOrigin-RevId: 773163981

github-actions

📋 Review Summary

This pull request updates the URL for the unit tests status badge in the README.md file. The change correctly points the badge to the google/maxtext repository, which appears to be the upstream source, ensuring the test status is accurate.

🔍 General Feedback

This is a good maintenance update that improves the quality of the project's documentation.

github-actions

📋 Review Summary

I was unable to perform a code review for this pull request. The tools required to fetch the pull request data, such as mcp__github__get_pull_request_diff, were not available in my execution environment.

🔍 General Feedback

The instructions provided for my operation are contradictory. They refer to data-gathering tools (mcp__github__get_pull_request, mcp__github__get_pull_request_files, mcp__github__get_pull_request_diff) that are not present in the list of available tools.
My attempts to work around this limitation by using shell commands to find a diff file were blocked by security restrictions.
Without access to the code changes, I cannot provide any feedback on correctness, security, or other review criteria.

Please check the configuration of the code review agent and ensure that all necessary tools are available and that the instructions are consistent with the execution environment.

github-actions

📋 Review Summary

This pull request appears to intend to make train.py importable, but the primary change has the opposite effect. The review identifies a critical issue that prevents safe importation of the module.

🔍 General Feedback

The goal of making train.py importable is good for code structure and reusability.
However, the current implementation will cause side effects when the module is imported. My comment includes a suggested fix for this critical issue.

OPTIMIZATION RESULTS from comprehensive TPU testing: KEY FINDING AI-Hypercomputer#1: block_kv_compute=64 (not 128!) - Config: block_q=1024, block_kv_sparse=256, block_kv_compute=64 - Result: 1.095× speedup (9.5% FASTER than JAX) - Previous: 0.916× (9.2% slower) - Improvement: +19.5% performance gain KEY FINDING AI-Hypercomputer#2: Scales exceptionally well on large inputs - Config: 64 heads, 2048 q_len, 512 sparse_len - Result: 3.306× speedup on larger problems! - Shows kernel is production-ready for real workloads UPDATED: kascade_kernel.py defaults to use block_kv_compute=64 STATUS: ✅ EXCEEDS 1.2× threshold for production integration Next: Integrate into MaxText attention layer

- Benchmark history: add row AI-Hypercomputer#7 (scan=true, 71.1 ms / 450 tok/s, +28% vs AI-Hypercomputer#6) - Add 'scan_layers=true Analysis' section explaining the 15.4 ms overhead: root cause is loss of XLA inter-layer weight-prefetch pipelining (lax.while_loop prevents cross-iteration scheduling); ~0.32 ms/layer consistent with memory-bandwidth-bound workload losing prefetch overlap - Quantify sparse dispatch break-even bar: must recover >22% to beat 55.7 ms dense baseline; rough estimate ~34 ms achievable with ragged_all_to_all - Update 'Most Impactful Next Fixes' section: scan fix done, ragged_all_to_all is now AI-Hypercomputer#1 priority; update HEAD ref to 539cc04 - Re-rank optimisation table: ragged_all_to_all moved to rank 2 (highest unrun), scan_layers=true added as rank 3 (done, prerequisite)

Key insight: AR decode at batch=32/T=32 is weight-bandwidth-bound. Sparse MoE dispatch does not help (proven: 101.5ms, 83% regression). New ranking: AI-Hypercomputer#1 Batch size > 1 (near-linear throughput scaling) AI-Hypercomputer#2 Int8/FP8 weight quantisation (halves dominant cost) AI-Hypercomputer#3 Speculative decoding AI-Hypercomputer#4 Remove effects_barrier / async EOS Demoted: ragged_all_to_all for decode → bottom (failed) scan_layers=true + 4-phase decoder → AI-Hypercomputer#12 (prefill prereq only) Re-scoped ragged_all_to_all as prefill-only optimisation (valid at T=512 where intermediates 2.68 GB/layer dominate).

khatwanimohit requested a review from rwitten April 17, 2023 23:47

update readme

86f442f

khatwanimohit force-pushed the mohit/fix-readme branch from bee760d to 86f442f Compare April 18, 2023 00:02

rwitten approved these changes Apr 18, 2023

View reviewed changes

khatwanimohit merged commit 4ced406 into main Apr 18, 2023

khatwanimohit deleted the mohit/fix-readme branch April 18, 2023 00:05

A9isha pushed a commit that referenced this pull request Apr 11, 2024

Merge pull request #1 from RissyRan/pr

3534102

Add pull request template

A9isha mentioned this pull request May 24, 2024

Llama3-8b 🦙 #653

Merged

xuefgu mentioned this pull request Jun 22, 2024

Use dedicated requirements for GPUs #721

Closed

xy12181 mentioned this pull request Feb 20, 2025

[config] Add a llama2-7b-int8 test config for decode #1284

Merged

4 tasks

copybara-service Bot mentioned this pull request Jun 16, 2025

Use jax_layout.Format in place of jax_layout.Layout #1838

Merged

copybara-service Bot mentioned this pull request Jun 18, 2025

Use .input_formats and .output_formats in place of .input_layouts and .output_layouts respectively. #1849

Merged

copybara-service Bot mentioned this pull request Jun 19, 2025

Use .format in place of .layout. #1850

Closed

github-actions Bot reviewed Sep 25, 2025

View reviewed changes

github-actions Bot reviewed Oct 14, 2025

View reviewed changes

github-actions Bot reviewed Oct 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update unit tests status badge in readme#1

Update unit tests status badge in readme#1
khatwanimohit merged 1 commit into
mainfrom
mohit/fix-readme

khatwanimohit commented Apr 17, 2023 •

edited

Loading

Uh oh!

rwitten left a comment

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

khatwanimohit commented Apr 17, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rwitten left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

📋 Review Summary

🔍 General Feedback

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

📋 Review Summary

🔍 General Feedback

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

📋 Review Summary

🔍 General Feedback

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

khatwanimohit commented Apr 17, 2023 •

edited

Loading