Remove FloatN & simplify adam/reduce with BF16 LayerNorms #295

ademeure · 2024-04-29T19:40:44Z

The MULTI_GPU path is untested, but everything else seems to work fine. I kept the per-tensor "param_sizeof" because it is still used elsewhere (in test_gpt2.cu, for example); it's not much code and may be useful again in the future.

This will create merge conflicts with #289, but hopefully they won't be hard to fix.

karpathy · 2024-04-29T20:33:25Z

train_gpt2.cu

@@ -1871,45 +1851,18 @@ void gpt2_update(GPT2 *model, float learning_rate, float beta1, float beta2, flo
        cudaCheck(cudaMalloc((void**)&model->v_memory, model->num_parameters * sizeof(float)));
        cudaCheck(cudaMemset(model->m_memory, 0, model->num_parameters * sizeof(float)));
        cudaCheck(cudaMemset(model->v_memory, 0, model->num_parameters * sizeof(float)));
-        printf0("allocated %d MiB for AdamW optimizer state m\n", (int)round(model->num_parameters * sizeof(float) / (1024 * 1024)));
-        printf0("allocated %d MiB for AdamW optimizer state v\n", (int)round(model->num_parameters * sizeof(float) / (1024 * 1024)));
+        printf("allocated %zu MiB for AdamW optimizer state m\n", (model->num_parameters * sizeof(float)) >> 20);


Fixing this in my local PR before pushing; this should be printf0, not printf.
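For reference, a minimal host-side sketch of the pattern in this hunk, not the repository's actual code. `bytes_to_mib` and `print_adamw_alloc` are hypothetical names, and the explicit `rank` argument is an assumed stand-in for whatever gating `printf0` does internally. It shows the two points under discussion: `>> 20` matches the old integer division by `1024 * 1024` (where `round` had no effect, since the division had already truncated), and `%zu` is the matching format specifier for a `size_t` value.

```c
#include <stddef.h>
#include <stdio.h>

// Whole MiB contained in a byte count, rounded down (1 MiB = 2^20 bytes).
static size_t bytes_to_mib(size_t bytes) {
    return bytes >> 20;
}

// Hypothetical stand-in for a rank-gated print: only rank 0 writes the message.
// Returns the number of characters written, or 0 when the print is skipped.
static int print_adamw_alloc(int rank, const char *state_name, size_t num_parameters) {
    if (rank != 0) {
        return 0; // other ranks stay silent so the message appears once
    }
    size_t bytes = num_parameters * sizeof(float);
    // %zu is the printf conversion for size_t; %d would need a cast to int.
    return printf("allocated %zu MiB for AdamW optimizer state %s\n",
                  bytes_to_mib(bytes), state_name);
}
```

Calling `print_adamw_alloc(0, "m", model->num_parameters)` on rank 0 would print the line once, while every other rank returns without printing.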

ademeure added 3 commits April 29, 2024 20:21

Remove FloatN and associated code for adam & allreduce

1d4effd

avoid warning for cublas_compute_type in BF16 mode

66c4548

Add back FP16 and disable multi_gpu all reduce when there is only 1 GPU

c24bb88

karpathy reviewed Apr 29, 2024


karpathy merged commit c24bb88 into karpathy:master Apr 29, 2024
5 checks passed
