int8 support #488

mitchellnw · 2023-04-14T06:58:23Z

This PR introduces beta support for int8 training and inference. You can enable int8 training with --use-bnb-linear SwitchBackLinearGlobal or --use-bnb-linear SwitchBackLinearGlobalMemEfficient. For CLIP VIT-Huge this should currently correspond to a 10% training speedup with negligable accuracy difference.

More speedups coming when the attention layer is refactored so that linear layers may be replaced there, too. However, that will require more changes so best to get this merged first.

rwightman · 2023-04-16T15:23:19Z

@mitchellnw looks low risk, ready to merge?

mitchellnw · 2023-04-16T17:47:54Z

@rwightman yep, ready to merge!

rwightman · 2023-04-16T18:15:10Z

src/training/main.py

@@ -277,6 +287,8 @@ def main(args):
        if args.ddp_static_graph:
            # this doesn't exist in older PyTorch, arg only added if enabled
            ddp_args['static_graph'] = True
+        if args.use_bnb_linear is not None:


couldn't this be put right after replace_linear() in the previous if use_bnb_linear block?

@mitchellnw one q here

yep good point that simplifies -- i'll make the change and run a quick test to make sure all good.

fixed, thanks @rwightman!

int8 support

248e8e9

rwightman reviewed Apr 16, 2023

View reviewed changes

simplifying as per Ross comment

098dbc8

rwightman merged commit c48111d into mlfoundations:main Apr 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

int8 support #488

int8 support #488

mitchellnw commented Apr 14, 2023 •

edited

Loading

rwightman commented Apr 16, 2023

mitchellnw commented Apr 16, 2023

rwightman Apr 16, 2023

rwightman Apr 16, 2023

mitchellnw Apr 16, 2023

mitchellnw Apr 16, 2023

int8 support #488

int8 support #488

Conversation

mitchellnw commented Apr 14, 2023 • edited Loading

rwightman commented Apr 16, 2023

mitchellnw commented Apr 16, 2023

rwightman Apr 16, 2023

Choose a reason for hiding this comment

rwightman Apr 16, 2023

Choose a reason for hiding this comment

mitchellnw Apr 16, 2023

Choose a reason for hiding this comment

mitchellnw Apr 16, 2023

Choose a reason for hiding this comment

mitchellnw commented Apr 14, 2023 •

edited

Loading