Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable peer access code #2082

Merged
merged 1 commit into from
Jul 19, 2024
Merged

Conversation

lzhangzz
Copy link
Collaborator

Disable peer access code because we don't have custom all-reduce yet and it slows down allocation in TP mode like A LOT.

@lvhan028 lvhan028 merged commit 263e8cf into InternLM:main Jul 19, 2024
8 of 9 checks passed
@zhyncs
Copy link
Collaborator

zhyncs commented Jul 19, 2024

ref https://github.com/zhyncs/lmdeploy-build/releases/tag/6a6e7f9

irexyc added a commit to irexyc/lmdeploy that referenced this pull request Jul 30, 2024
irexyc added a commit to irexyc/lmdeploy that referenced this pull request Aug 1, 2024
lvhan028 pushed a commit that referenced this pull request Aug 8, 2024
* add session_ids arg for multithread use of pipeline.stream_infer

* Revert "disable peer access code (#2082)"

This reverts commit 263e8cf.

* Revert "Revert "disable peer access code (#2082)""

This reverts commit 2b74d46.

* update

* add peer allocator

* fix lint

* check cuda error

* fix comments

* fix wrong allocator

---------

Co-authored-by: Li Zhang <lzhang329@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants