[Feature] Support offload and wake up of SGLang Diffusion #19152
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,4 +1,8 @@ | ||
| """Request/response data structures for post-training APIs.""" | ||
| """Request/response data structures for post-training APIs. | ||
| | ||
| TODO(Shuwen, Chenyang): Split RL-oriented request types and serving-oriented | ||
| request types into dedicated files. | ||
| """ | ||
| | ||
| from dataclasses import dataclass | ||
| | ||
| @@ -17,3 +21,19 @@ class GetWeightsChecksumReqInput: | ||
| """Compute SHA-256 checksum of loaded module weights for verification.""" | ||
| | ||
| module_names: list[str] | None = None | ||
| | ||
|
Collaborator
We need to separate RL-related types from serving ones later.
Collaborator
What do you mean by this?
Collaborator
I mean to use dedicated files for type definitions.
Collaborator
All the types in this file ( |
||
| | ||
| @dataclass | ||
| class ReleaseMemoryOccupationReqInput: | ||
| """Request to release (sleep) GPU memory occupation for the diffusion engine.""" | ||
| | ||
| # TODO (Kun, Chenyang): We shall have rather dedicated | ||
| # control of the Diffusion model's memory occupation. | ||
| pass | ||
| | ||
| | ||
| @dataclass | ||
| class ResumeMemoryOccupationReqInput: | ||
| """Request to resume (wake) GPU memory occupation for the diffusion engine.""" | ||
| | ||
| pass | ||
| pass | ||
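
Both new request types are empty dataclasses, so the request's type is the only signal a handler receives. The following is a minimal sketch of how such payload-free requests could be dispatched. `ToyEngine`, its `handle` method, and the returned status strings are hypothetical names invented for illustration; they are not part of this PR or of SGLang, and no real GPU memory is touched.

```python
from dataclasses import dataclass


@dataclass
class ReleaseMemoryOccupationReqInput:
    """Request to release (sleep) GPU memory occupation."""


@dataclass
class ResumeMemoryOccupationReqInput:
    """Request to resume (wake) GPU memory occupation."""


class ToyEngine:
    """Hypothetical stand-in for an engine; not an SGLang class."""

    def __init__(self) -> None:
        # Tracks whether memory is currently released (assumed state model).
        self.sleeping = False

    def handle(self, req: object) -> str:
        # The request classes carry no fields, so dispatch on type alone.
        if isinstance(req, ReleaseMemoryOccupationReqInput):
            if self.sleeping:
                return "already sleeping"
            self.sleeping = True
            return "released"
        if isinstance(req, ResumeMemoryOccupationReqInput):
            if not self.sleeping:
                return "already awake"
            self.sleeping = False
            return "resumed"
        raise TypeError(f"unsupported request type: {type(req).__name__}")


if __name__ == "__main__":
    engine = ToyEngine()
    print(engine.handle(ReleaseMemoryOccupationReqInput()))
    print(engine.handle(ResumeMemoryOccupationReqInput()))
```

Making repeated release or resume calls idempotent is one possible design choice in this sketch; the PR itself does not state how duplicate requests are treated.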