Pull requests: AI-Hypercomputer/maxtext
Pull requests list
Refactor: Decouple Core Transformer Blocks
#1852
opened Jun 19, 2025 by parambole
4 tasks done
Allow moe_test.py to be run on internal tools.
#1847
opened Jun 18, 2025 by copybara-service bot
Enable Checkpoint Conversion from Huggingface to Maxtext
#1839
opened Jun 16, 2025 by YixuanWang-99 (draft)
4 tasks
Integrate Multi-Token Prediction (MTP) Training objective
#1837
opened Jun 16, 2025 by parambole
4 tasks done
Refactor profiler in trainers
pull ready
#1833
opened Jun 14, 2025 by SurbhiJainUSC
4 tasks done
Upgrade dependencies (now supports and defaults to Python 3.12 and Ubuntu 24.04)
#1826
opened Jun 12, 2025 by SamuelMarks
4 tasks done
Moving device_put for the kernel earlier to avoid a host convert op
#1824
opened Jun 12, 2025 by zhenying-liu
Use the extended jax.experimental.colocated_python.colocated_cpu_devices API
#1822
opened Jun 11, 2025 by copybara-service bot
fix runtime errors in colocated python dataloader
#1819
opened Jun 11, 2025 by sadikneipp
4 tasks done
Refactor: Recording and logging training and evaluation metrics in all trainers
#1815
opened Jun 10, 2025 by SurbhiJainUSC
4 tasks done
Add total memory usage logging in GB and unit tests for activation offload
pull ready
#1813
opened Jun 10, 2025 by zhenying-liu
[NVIDIA] Add packing support for GPU flash attention
#1802
opened Jun 4, 2025 by kocchop
4 tasks done