-
Notifications
You must be signed in to change notification settings - Fork 115
Issues: pytorch/torchtitan
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Question about custom cuda operators for tensor parallelism
question
Further information is requested
#434
opened Jun 28, 2024 by
vermouth1992
Question about Pipeline parallelism
question
Further information is requested
#431
opened Jun 27, 2024 by
vermouth1992
improve memory profiler to not to profile every iteration
enhancement
New feature or request
#422
opened Jun 24, 2024 by
tianyu-l
Llama models with custom configurations and uploading to Hugging Face
#420
opened Jun 24, 2024 by
bkchang
ImportError in LLaMA Training Script
question
Further information is requested
#412
opened Jun 19, 2024 by
viai957
benchmark perf numbers on H100 GPUs and update performance.md
documentation
Improvements or additions to documentation
add compiled RMSNorm into the norm config
enhancement
New feature or request
#374
opened May 30, 2024 by
tianyu-l
Add torchdata to requirements after release
better_engineering
Repo code quality improvements
#351
opened May 21, 2024 by
gokulavasan
numerical difference for SDPA between non-dtensor vs dtensor, when math attention and fp16 are used
bug
Something isn't working
#317
opened May 8, 2024 by
tianyu-l
freqs_cis
in llama model should be a non-persistent buffer
bug
#316
opened May 8, 2024 by
tianyu-l
Question on Model Init
question
Further information is requested
#312
opened May 6, 2024 by
XinDongol
add doc for adding custom dataset
documentation
Improvements or additions to documentation
enhancement
New feature or request
#311
opened May 5, 2024 by
lessw2020
freezeing some part of the model
enhancement
New feature or request
#306
opened May 3, 2024 by
tianyu-l
reload existing llama checkpoints
enhancement
New feature or request
#305
opened May 3, 2024 by
tianyu-l
[Feature] Add gradient accumulation
enhancement
New feature or request
#292
opened May 1, 2024 by
XinDongol
[Feature] Plan to add New feature or request
model_register
enhancement
#282
opened Apr 28, 2024 by
XinDongol
numerical issue when running SDPA with DTensor
bug
Something isn't working
help wanted
Extra attention is needed
#267
opened Apr 24, 2024 by
tianyu-l
Fused RMSNorm incompatible with PP tracing (dynamic stride)
bug
Something isn't working
#217
opened Apr 10, 2024 by
wconstab
add unit test for ongoing numerical verification of fusedRMSNorm
better_engineering
Repo code quality improvements
#205
opened Apr 5, 2024 by
lessw2020
Verify that we can do eval / inference
enhancement
New feature or request
#192
opened Apr 4, 2024 by
gnadathur
Add support for MoE model architecture
enhancement
New feature or request
#184
opened Apr 2, 2024 by
gnadathur
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-06-02.