
[Feature Request] support dtype in mlx.core's module initialization #1232

Closed
pshishodia-kgp opened this issue Jun 25, 2024 · 1 comment
Labels: wontfix (This will not be worked on)

Comments

@pshishodia-kgp

Describe the bug
I want to create a linear layer with dtype=bfloat16.

In MLX, the constructors of the mlx.nn modules (Linear, Embedding, LayerNorm, etc.) don't accept a dtype argument, so I end up doing the following:

import mlx.nn as nn
import mlx.core as mx

bf16_layer = nn.Linear(10, 10)
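# The constructor has no dtype argument, so cast each parameter manually.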
bf16_layer.weight = bf16_layer.weight.astype(mx.bfloat16)
bf16_layer.bias = bf16_layer.bias.astype(mx.bfloat16)

Expected behavior
Expecting dtype support at initialization, similar to PyTorch:

import torch
import torch.nn as nn

bf16_layer = nn.Linear(10, 10, dtype=torch.bfloat16)

Desktop (please complete the following information):

  • OS Version: macOS 14.5
@angeloskath
Member

The standard way in MLX would be to write model.set_dtype(mx.bfloat16), which is syntactic sugar over model.apply(lambda x: x.astype(mx.bfloat16) if mx.issubdtype(x.dtype, mx.floating) else x).

See more at https://ml-explore.github.io/mlx/build/html/python/nn/_autosummary/mlx.nn.Module.set_dtype.html.
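
A minimal sketch of that approach applied to the layer from the issue (assuming the same shapes as above):

import mlx.core as mx
import mlx.nn as nn

# Build the layer in the default dtype, then cast every floating-point
# parameter to bfloat16 in a single call.
bf16_layer = nn.Linear(10, 10)
bf16_layer.set_dtype(mx.bfloat16)

print(bf16_layer.weight.dtype)  # expected: bfloat16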

awni closed this as completed Jun 25, 2024
awni added the wontfix label Jun 25, 2024