This repository was archived by the owner on Jun 4, 2025. It is now read-only.

@KSGulin KSGulin commented Jun 29, 2022

Currently, running quantized transfer learning fails with errors because the checkpoint recipe includes a QAT modifier.

The primary fix needs to be implemented in SparseML and ZooModels. This PR provides a temporary patch that works around the issue by removing the QAT modifier from the checkpoint recipe when a model is loaded for training. Additional logic is added to handle model saving and one-shot sparsification in this regime.
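As a rough illustration of the workaround (not the code in this PR), the sketch below drops QAT modifiers from a recipe that has already been parsed into plain Python data. The dict layout, the `"type"` key, and the `strip_qat_modifiers` helper are assumptions made for the example; only the `QuantizationModifier` name is taken from SparseML recipes.

```python
from copy import deepcopy

# Modifier class name as it appears in SparseML recipes.
QAT_MODIFIER = "QuantizationModifier"


def strip_qat_modifiers(recipe: dict) -> dict:
    """Return a copy of `recipe` with every QAT modifier removed.

    Assumes (hypothetically) that the recipe is a dict mapping group names
    to lists of modifier dicts, each carrying its class name under "type".
    """
    cleaned = deepcopy(recipe)  # leave the checkpoint's own recipe untouched
    for group, modifiers in cleaned.items():
        if isinstance(modifiers, list):
            cleaned[group] = [
                m for m in modifiers
                if not (isinstance(m, dict) and m.get("type") == QAT_MODIFIER)
            ]
    return cleaned


checkpoint_recipe = {
    "pruning_modifiers": [{"type": "ConstantPruningModifier", "params": "__ALL__"}],
    "quantization_modifiers": [{"type": "QuantizationModifier", "start_epoch": 0}],
}
training_recipe = strip_qat_modifiers(checkpoint_recipe)
```

Working on a copy keeps the original checkpoint recipe available, which matters if the QAT modifier has to be restored when the model is saved.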

In addition, this PR fixes a bug where using the "--resume" flag would increase the total number of epochs.
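The intended behavior on resume can be sketched as follows. This is a minimal illustration under assumed names (`epochs_to_run`, `last_completed_epoch`), not the training script's actual logic: the total epoch count stays fixed and only the starting epoch moves forward.

```python
def epochs_to_run(total_epochs: int, last_completed_epoch: int, resume: bool) -> range:
    """Return the epochs still to be trained.

    Hypothetical helper: when resuming, training continues from the epoch
    after the last completed one, and `total_epochs` is left unchanged.
    """
    start = last_completed_epoch + 1 if resume else 0
    return range(start, total_epochs)


# Resuming a 10-epoch run after epoch 3 leaves epochs 4 through 9.
remaining = list(epochs_to_run(total_epochs=10, last_completed_epoch=3, resume=True))
```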
