Skip to content

Explosive variance observed in latents and noise_pred when using torch.autocast() #119

Pinned Answered by raulc0399
AlezHibali asked this question in Q&A
Discussion options

You must be logged in to vote

@lawrence-cj 2 small changes in the PR, i added params for dora and rslora
https://huggingface.co/docs/peft/package_reference/lora#peft.LoraConfig.use_rslora
https://huggingface.co/docs/peft/package_reference/lora#peft.LoraConfig.use_dora

both of them show better results on my dataset.

@AlezHibali might give you better results as well.

Replies: 3 comments 18 replies

Comment options

You must be logged in to vote
1 reply
@AlezHibali
Comment options

Comment options

You must be logged in to vote
13 replies
@AlezHibali
Comment options

@lawrence-cj
Comment options

@raulc0399
Comment options

@raulc0399
Comment options

Answer selected by lawrence-cj
Comment options

You must be logged in to vote
4 replies
@raulc0399
Comment options

@lawrence-cj
Comment options

@raulc0399
Comment options

@raulc0399
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
3 participants