How does Zero Stage 3 free up weight memory? Won't the computation graph reference it? #7186

Answered by tjruwase
Laiyi97 asked this question in Q&A
@Laiyi97, assuming I understand your question correctly: the computation graph holds references to the tensor container, but not to the `tensor.data` payload that holds the weight values.
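
To make the container/payload distinction concrete, here is a minimal sketch in plain PyTorch. It is not DeepSpeed's actual code: the names `weight` and `saved_payload` are made up for illustration, and the clone stands in for wherever the real weight values would live while released. It only shows that the payload behind `.data` can be swapped for an empty tensor while the parameter object itself, which is what other code refers to, stays the same.

```python
import torch

# Hypothetical parameter standing in for a model weight.
weight = torch.nn.Parameter(torch.ones(4, 4))
container_id = id(weight)

# Assumption for this sketch: the real values are kept somewhere else
# (here, just a clone) while the local payload is released.
saved_payload = weight.data.clone()

# Release the payload: the container now points at an empty tensor.
weight.data = torch.empty(0)
released_numel = weight.numel()
same_container_after_release = id(weight) == container_id

# Restore the payload before the weight is needed again.
weight.data = saved_payload
restored_shape = tuple(weight.shape)
same_container_after_restore = id(weight) == container_id
```

Anything that referenced `weight` before the swap still refers to the same object afterwards; only the memory behind `.data` changed.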

Answer selected by Laiyi97