Hi,

Thanks a lot for this clear and fat-free code base!

I'm training Falcon-7B with adapter v2 and an Alpaca-formatted dataset of mine. As usual, I'm trying to max out VRAM usage for the best training time, but in this case there is no significant gain, since the step time is almost proportional to the micro-batch size.

Step times:
micro_batch_size 1: 159 ms
micro_batch_size 2: 293 ms
micro_batch_size 4: 560 ms

Is this expected, or can it be optimized?

Note: as advised, I'll also open a new issue about my attempt at batch inference, which exhibits the same lack of gain from batching; see Lightning-AI/lit-llama#188 (comment).
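For reference, converting the reported step times above into per-sample cost makes the point concrete (this is just arithmetic on the numbers reported in this issue, not output from the repo):

```python
# Per-sample cost derived from the step times reported above.
# If batching scaled perfectly, per-sample time would halve with each doubling.
step_times_ms = {1: 159, 2: 293, 4: 560}  # micro_batch_size -> step time (ms)

for bs, ms in step_times_ms.items():
    per_sample = ms / bs
    print(f"micro_batch_size={bs}: {per_sample:.1f} ms/sample "
          f"({1000 / per_sample:.1f} samples/s)")
# micro_batch_size=1: 159.0 ms/sample (6.3 samples/s)
# micro_batch_size=2: 146.5 ms/sample (6.8 samples/s)
# micro_batch_size=4: 140.0 ms/sample (7.1 samples/s)
```

So going from micro-batch size 1 to 4 only improves throughput by roughly 12%.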
This would be expected in a memory-bound regime, where you are limited by data transfer rather than compute. However, you would need to profile the code on your hardware to verify this.
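A minimal sketch of how one could check this with `torch.profiler`; the toy linear model and random batch below are placeholders, and you would substitute the actual forward/backward from the finetuning script:

```python
import torch
from torch.profiler import profile, ProfilerActivity

# Toy stand-in for one optimization step; replace with the real model and batch
# from the finetuning script when profiling for real.
device = "cuda" if torch.cuda.is_available() else "cpu"
model = torch.nn.Linear(4096, 4096).to(device)
batch = torch.randn(4, 4096, device=device)

with profile(
    activities=[ProfilerActivity.CPU, ProfilerActivity.CUDA],
    record_shapes=True,
) as prof:
    loss = model(batch).sum()
    loss.backward()

# If total time is dominated by memory-bound kernels (copies, elementwise ops)
# rather than matmuls, larger micro-batches will not speed up steps much.
print(prof.key_averages().table(sort_by="cuda_time_total", row_limit=20))
```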