Float16 compatibility #15

Closed
d-monnet opened this issue Jul 6, 2023 · 4 comments
Labels
question Further information is requested

Comments

d-monnet commented Jul 6, 2023

Hi there,

I would like to know whether Float16 is supported. I followed this tutorial https://jso.dev/FluxNLPModels.jl/dev/tutorial/ and naively tried

w16 = Float16.(nlp.w)
obj(nlp, w16)

but got a Float32 back. I therefore assume at least some computations are performed in Float32 when evaluating the objective. I also tried modifying the function get_data() as follows:

function get_data(bs)
  ENV["DATADEPS_ALWAYS_ACCEPT"] = "true"

  # Loading Dataset
  xtrain, ytrain = MLDatasets.MNIST(Tx = Float16, split = :train)[:]
  xtest, ytest = MLDatasets.MNIST(Tx = Float16, split = :test)[:]
  # ...
end

but still got a Float32 when evaluating the objective.
Any idea how to run in Float16 (or any other format)?

@d-monnet d-monnet changed the title Unstable type with obj() Float16 compatibility Jul 6, 2023
@tmigot tmigot added the question Further information is requested label Jul 13, 2023

d-monnet (Author) commented Jul 31, 2023

OK, I found the issue: obj calls set_vars!(), which is not type stable. The problem comes from nlp.w .= new_w in

function set_vars!(nlp::AbstractFluxNLPModel{T, S}, new_w::AbstractVector{T}) where {T <: Number, S}

The operator .= converts the right-hand-side values to the element type of the left-hand-side vector.
For example:

x32 = ones(Float32, 10)
x16 = ones(Float16, 10)
x32 .= x16 # x32 still has element type Float32

That is, even if the argument of obj is a Vector{Float16}, it is cast to whatever the parameter type S of FluxNLPModel{T, S} is.

In fact, this is not even the bottom of the issue: Flux.destructure does not allow changing the floating-point format via the restructure mechanism. From the destructure documentation: "Such restoration follows the rules of ChainRulesCore.ProjectTo, and thus will restore floating point precision."
Since restructure is called in set_vars!(), we still can't switch floating-point formats.
Any workaround would be welcome!
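
A minimal sketch of that restriction (the small Chain below is only an illustration, not code from this package): even if the flattened vector is converted to Float16, restructure projects the values back to the model's original precision.

using Flux

model = Chain(Dense(2 => 2))           # Flux initializes parameters in Float32
flat, re = Flux.destructure(model)

flat16 = Float16.(flat)                # try to switch the flat vector to Float16
model16 = re(flat16)                   # restructure follows ChainRulesCore.ProjectTo ...

eltype(Flux.destructure(model16)[1])   # ... so this is Float32 again, not Float16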

farhadrclass (Collaborator) commented Oct 12, 2023

I can change the backend to convert the model every time.

A quick change would be:

f64(m) = Flux.paramtype(Float64, m) # similar to https://github.com/FluxML/Flux.jl/blob/d21460060e055dca1837c488005f6b1a8e87fa1b/src/functor.jl#L217

Then, to convert our model, we use:

fluxnlp.model = f64(fluxnlp.model)
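
A quick way to check the conversion (a hedged sketch: the toy Chain is only for illustration, and Flux.paramtype is assumed to be available as in the linked functor.jl source):

using Flux

f64(m) = Flux.paramtype(Float64, m)   # helper from the comment above (assumed available)

model = Chain(Dense(2 => 1))          # toy model; Flux initializes it in Float32
model64 = f64(model)

eltype(Flux.destructure(model)[1])    # Float32
eltype(Flux.destructure(model64)[1])  # expected: Float64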

farhadrclass (Collaborator)

Flux just recently added support for this
https://fluxml.ai/Flux.jl/stable/utilities/#Flux.f16
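
For reference, a minimal sketch of the built-in converters (the toy Chain is an illustration, not code from this repository):

using Flux

model = Chain(Dense(4 => 2, relu), Dense(2 => 1))  # Float32 parameters by default

model16 = Flux.f16(model)   # convert all floating-point parameters to Float16
model64 = Flux.f64(model)   # or to Float64

eltype(Flux.destructure(model16)[1])  # Float16
eltype(Flux.destructure(model64)[1])  # Float64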

farhadrclass added a commit that referenced this issue Oct 30, 2023
farhadrclass added a commit that referenced this issue Oct 30, 2023
farhadrclass added a commit to Farhad-phd/FluxNLPModels.jl that referenced this issue Oct 31, 2023
farhadrclass added a commit to Farhad-phd/FluxNLPModels.jl that referenced this issue Oct 31, 2023
New Flux Update
## v0.14.0 (July 2023)
* Flux now requires julia v1.9 or later.
* CUDA.jl is not a hard dependency anymore. Support is now provided through the extension mechanism, by loading `using Flux, CUDA`.
  The package cuDNN.jl also needs to be installed in the environment. (You will get instructions if this is missing.)
* After a deprecations cycle, the macro `@epochs` and the functions `Flux.stop`, `Flux.skip`, `Flux.zeros`, `Flux.ones` have been removed.
farhadrclass added a commit that referenced this issue Nov 23, 2023
tmigot added a commit that referenced this issue Dec 16, 2023