Issues: hpcaitech/ColossalAI

Issues list

[PROPOSAL]: FP8 with block-wise amax (label: enhancement)
#6105 opened Oct 28, 2024 by Edenzzzz
1 task
[FEATURE]: Windows wheel needed (label: enhancement)
#6103 opened Oct 27, 2024 by nitinmukesh
[BUG]: assert grad_chunk.l2_norm is not None (label: bug)
#6102 opened Oct 25, 2024 by liangzz1991
1 task done
[BUG]: weird stuck while training (label: bug)
#6095 opened Oct 19, 2024 by ericxsun
1 task done
[BUG]: Got nan during backward with zero2 (label: bug)
#6091 opened Oct 16, 2024 by flymin
1 task done
[BUG]: Unable to train on H20 machine (label: bug)
#6079 opened Oct 6, 2024 by kaixinbear
1 task done
FasterMoE shadow expert implement
#6076 opened Sep 29, 2024 by Guodanding
[DOC]: Environment installation failed (label: documentation)
#6066 opened Sep 21, 2024 by eccct
[BUG]: remove .github/workflows/submodule.yml (label: bug)
#6039 opened Aug 28, 2024 by BoxiangW
1 task done
[FEATURE]: Support Zerobubble pipeline (label: enhancement)
#6037 opened Aug 28, 2024 by duanjunwen
[BUG]: errror Colossalai 0.4.0/0.4.2 /usr/bin/supervisord (label: bug)
#6032 opened Aug 23, 2024 by Storm0921
1 task done
[BUG]: AttributeError: 'GeminiDDP' object has no attribute 'module' (label: bug)
#6021 opened Aug 20, 2024 by dheerj188
1 task done
[BUG]: Torch compile causes multi-process to hang with python 3.9 (label: bug)
#5987 opened Aug 10, 2024 by Edenzzzz
1 task done
qwen2 fp8 forward/backward
#5971 opened Aug 7, 2024 by wangbluo
[BUG]: Pytest with a specific config failed after PR #5868 (labels: bug, shardformer)
#5949 opened Jul 29, 2024 by GuangyaoZhang
1 task done
[FEATURE]: Request updates for pretraining roberta (label: enhancement)
#5948 opened Jul 29, 2024 by jiahuanluo
[BUG]: pip install . error: identifier "__hsub" is undefined (label: bug)
#5929 opened Jul 19, 2024 by jtmer
1 task done
[BUG]: Shardformer FP8 communication training accuracy degradation (label: bug)
#5920 opened Jul 18, 2024 by GuangyaoZhang
1 task done
[BUG]: Low_Level_Zero plugin crashes with LoRA (label: bug)
#5909 opened Jul 15, 2024 by Fallqs
1 task done
[BUG]: run opt inference but failed with No module named 'energonai' (label: bug)
#5906 opened Jul 13, 2024 by munger1985
1 task done
training issue
#5890 opened Jul 5, 2024 by MaleekaA