Replace embedding layer if necessary: torch.nn.Embedding(..) -> bnb.nn.Embedding(..)
Does this assume that users create custom classes to replace (for example) Hugging Face Transformers' GPT2DoubleHeadsModel?
Or is there something like bnb.optim.GlobalOptimManager that changes a provided model instance to use bitsandbytes embeddings instead of the torch ones?
Currently, a replacement is required, since the StableEmbedding layer also adds a layer norm. This is critical for pretraining models. If you are fine-tuning, you do not need the StableEmbedding layer (this holds for GLUE; I am not sure about fine-tuning GPT-2 or seq-to-seq models).
If you want to use 32-bit optimizers for the embedding, but without layer norm, you can add the following code after the embedding class is defined in the GPT2DoubleHeadsModel:
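The code block from this comment was not preserved in the page capture; the following is a sketch of what it plausibly looked like, based on the GlobalOptimManager override API in bitsandbytes. The checkpoint name and the model.transformer.wte attribute path are illustrative assumptions:

```python
import bitsandbytes as bnb
from transformers import GPT2DoubleHeadsModel

model = GPT2DoubleHeadsModel.from_pretrained("gpt2")  # checkpoint name is illustrative

# Force 32-bit optimizer states for the token-embedding weights while the
# rest of the model can still be optimized in 8-bit. Unlike
# bnb.nn.StableEmbedding, this does not add a layer norm.
manager = bnb.optim.GlobalOptimManager.get_instance()
manager.register_module_override(model.transformer.wte, "weight", {"optim_bits": 32})

model = model.cuda()
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-4)
```

Registering the override before the first optimizer step ensures it is picked up when the optimizer state for that parameter is initialized.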
This will add further stability, especially for seq-to-seq or language-model fine-tuning. I would recommend replacing the embedding with the StableEmbedding layer if you pretrain from scratch.
A standard Embedding layer has been added that is very easy to use in place of torch.nn.Embedding. The bnb.nn.Embedding class ensures that optimization happens in 32-bit for the embedding layer, even if the rest of the model is optimized with 8-bit optimizers. Thank you for this suggestion!
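As a minimal usage sketch of that drop-in replacement (the vocabulary and embedding sizes here are illustrative):

```python
import bitsandbytes as bnb

# Drop-in replacement for torch.nn.Embedding; the optimizer will keep
# 32-bit states for this layer's weights even under an 8-bit optimizer.
emb = bnb.nn.Embedding(num_embeddings=50257, embedding_dim=768)

# For pretraining from scratch, the variant that additionally applies a
# layer norm is recommended above:
stable_emb = bnb.nn.StableEmbedding(num_embeddings=50257, embedding_dim=768)
```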