Skip to content

Issues: pytorch/torchtitan

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Question about custom cuda operators for tensor parallelism question Further information is requested
#434 opened Jun 28, 2024 by vermouth1992
Question about Pipeline parallelism question Further information is requested
#431 opened Jun 27, 2024 by vermouth1992
improve memory profiler to not to profile every iteration enhancement New feature or request
#422 opened Jun 24, 2024 by tianyu-l
ImportError in LLaMA Training Script question Further information is requested
#412 opened Jun 19, 2024 by viai957
Some testing from me
#407 opened Jun 17, 2024 by ad8e
How to use nsys? enhancement New feature or request
#399 opened Jun 13, 2024 by vedantroy
benchmark perf numbers on H100 GPUs and update performance.md documentation Improvements or additions to documentation
#394 opened Jun 12, 2024 by tianyu-l torchtitan release 1.0
add compiled RMSNorm into the norm config enhancement New feature or request
#374 opened May 30, 2024 by tianyu-l
Add torchdata to requirements after release better_engineering Repo code quality improvements
#351 opened May 21, 2024 by gokulavasan
freqs_cis in llama model should be a non-persistent buffer bug Something isn't working
#316 opened May 8, 2024 by tianyu-l
Question on Model Init question Further information is requested
#312 opened May 6, 2024 by XinDongol
add doc for adding custom dataset documentation Improvements or additions to documentation enhancement New feature or request
#311 opened May 5, 2024 by lessw2020
freezeing some part of the model enhancement New feature or request
#306 opened May 3, 2024 by tianyu-l
reload existing llama checkpoints enhancement New feature or request
#305 opened May 3, 2024 by tianyu-l
[Feature] Add gradient accumulation enhancement New feature or request
#292 opened May 1, 2024 by XinDongol
[Feature] Plan to add model_register enhancement New feature or request
#282 opened Apr 28, 2024 by XinDongol
numerical issue when running SDPA with DTensor bug Something isn't working help wanted Extra attention is needed
#267 opened Apr 24, 2024 by tianyu-l
Fused RMSNorm incompatible with PP tracing (dynamic stride) bug Something isn't working
#217 opened Apr 10, 2024 by wconstab
Verify that we can do eval / inference enhancement New feature or request
#192 opened Apr 4, 2024 by gnadathur
Add support for MoE model architecture enhancement New feature or request
#184 opened Apr 2, 2024 by gnadathur
ProTip! What’s not been updated in a month: updated:<2024-06-02.