-
Notifications
You must be signed in to change notification settings - Fork 362
Pull requests: AI-Hypercomputer/maxtext
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support subslice shapes in maxtext when using Pathways
#1854
opened Jun 20, 2025 by
copybara-service
bot
Loading…
Refactor: Decouple Core Transformer Blocks
#1852
opened Jun 19, 2025 by
parambole
Loading…
4 tasks done
Allow moe_test.py to be run on internal tools.
#1847
opened Jun 18, 2025 by
copybara-service
bot
Loading…
Enable Checkpoint Conversion from Huggingface to Maxtext
#1839
opened Jun 16, 2025 by
YixuanWang-99
Loading…
4 tasks
Integrate Multi-Token Prediction (MTP) Training objective
#1837
opened Jun 16, 2025 by
parambole
Loading…
4 tasks done
Refactor profiler in trainers
pull ready
#1833
opened Jun 14, 2025 by
SurbhiJainUSC
Loading…
4 tasks done
Upgrade dependencies (now supports and defaults to Python 3.12 and Ubuntu 24.04)
#1826
opened Jun 12, 2025 by
SamuelMarks
Loading…
4 tasks done
Moving device_put for the kernel earlier to avoid a host convert op
#1824
opened Jun 12, 2025 by
zhenying-liu
Loading…
Use the extended
jax.experimental.colocated_python.colocated_cpu_devices
API
#1822
opened Jun 11, 2025 by
copybara-service
bot
Loading…
fix runtime errors in colocated python dataloader
#1819
opened Jun 11, 2025 by
sadikneipp
Loading…
4 tasks done
Refactor: Recording and logging training and evaluation metrics in all trainers
#1815
opened Jun 10, 2025 by
SurbhiJainUSC
Loading…
4 tasks done
Add total memory usage logging in GB and unit tests for activation offload
pull ready
#1813
opened Jun 10, 2025 by
zhenying-liu
Loading…
Add Llama4VisionModel for multimodal decoding
#1809
opened Jun 5, 2025 by
hengtaoguo
Loading…
4 tasks done
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.