
Small negative values cause the gradient of sigmoid to become NaN #1139

Closed
wcshds opened this issue Jan 12, 2024 · 1 comment · Fixed by #1140

Comments

wcshds (Contributor) commented Jan 12, 2024

Here is the code:

use burn::{
    backend::{Autodiff, NdArray},
    tensor::{activation, Data, Tensor},
};

fn main() {
    // A single small negative input; sigmoid(-90) underflows to 0 in f32.
    let data = Data::<f32, 1>::from([-90.0]);

    let device = Default::default();
    let tensor_1 = Tensor::<Autodiff<NdArray>, 1>::from_data(data, &device).require_grad();

    let tensor_2 = activation::sigmoid(tensor_1.clone());
    let grads = tensor_2.backward();

    // Expected: a tiny but finite gradient, since d/dx sigmoid(x) = s * (1 - s).
    let grad_1 = tensor_1.grad(&grads).unwrap();
    println!("{}", grad_1);
}

The result is NaN.
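The NaN is consistent with how a chain-rule backward pass can break down at extreme inputs. Below is a minimal plain-Rust sketch of one such mechanism (an assumption for illustration, not necessarily burn's exact internal graph): at x = -90, exp(-x) overflows f32 to infinity, the forward output underflows to 0, and a gradient assembled from the intermediate terms multiplies infinity by 0, which is NaN.

fn main() {
    let x: f32 = -90.0;

    // Forward pass, written as s = 1 / (1 + exp(-x)).
    // exp(90) ≈ 1.2e39 exceeds f32::MAX (≈ 3.4e38), so it overflows to +inf.
    let e = (-x).exp();      // +inf
    let s = 1.0 / (1.0 + e); // 1 / +inf = 0
    println!("exp(-x) = {e}, sigmoid(x) = {s}");

    // Chain rule through the intermediates: ds/dx = exp(-x) * s^2.
    // With e = +inf and s = 0, this evaluates to inf * 0 = NaN.
    let grad = e * s * s;
    println!("naive grad = {grad}");
}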

wcshds (Contributor, Author) commented Jan 12, 2024

Would it be possible to define the derivatives manually in the backward pass of the activation functions, so that log and exp don't have to be differentiated automatically?
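For illustration, here is a minimal sketch of that idea in plain Rust (hypothetical helpers, not the actual change merged in #1140): compute the forward with an overflow-safe branch, and write the backward in closed form as s * (1 - s), so nothing differentiates through log or exp.

/// Numerically stable sigmoid: branch on the sign of x so exp never overflows.
fn sigmoid(x: f32) -> f32 {
    if x >= 0.0 {
        1.0 / (1.0 + (-x).exp())
    } else {
        let e = x.exp(); // exp of a negative number cannot overflow
        e / (1.0 + e)
    }
}

/// Hand-written backward pass: d/dx sigmoid(x) = s * (1 - s),
/// expressed in terms of the forward output, so no log/exp is differentiated.
fn sigmoid_backward(x: f32, grad_output: f32) -> f32 {
    let s = sigmoid(x);
    grad_output * s * (1.0 - s)
}

fn main() {
    // At x = -90 the closed-form gradient underflows cleanly to ~8.2e-40
    // instead of producing NaN.
    println!("{}", sigmoid_backward(-90.0, 1.0));
}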
