What activation functions for regression neural networks? #113927

Beluker · 2024-03-24T00:21:15Z

I have recently started coding my first neural network and want to train it on a regression problem, e.g. computing the sine of a number. However, I am not sure which activation functions to use, or what range my initial weights and biases should be in. Also, do I need to train it any differently? Any help appreciated : )

Answered by megamiii

megamiii · 2024-03-24T15:04:06Z

For activation functions, ReLU is generally a solid choice for the hidden layers because it helps avoid some common training issues, such as vanishing gradients. For the output layer of a regression problem, use no activation function at all (equivalently, a linear activation), so the output range stays flexible and unbounded.
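
To make the layer layout concrete, here is a minimal sketch of a forward pass in plain Python, assuming a single scalar input and one hidden layer. The function names and the hand-picked parameter values are hypothetical and only there to show that the linear output can be negative or larger than 1.

```python
def relu(x):
    """ReLU: pass positive values through, clamp negatives to zero."""
    return x if x > 0.0 else 0.0


def forward(x, hidden_w, hidden_b, out_w, out_b):
    """Scalar input -> ReLU hidden layer -> linear (identity) output."""
    hidden = [relu(w * x + b) for w, b in zip(hidden_w, hidden_b)]
    # No activation on the output, so the prediction can be any real number.
    return sum(w * h for w, h in zip(out_w, hidden)) + out_b


# Hand-picked toy parameters (hypothetical, for illustration only).
hidden_w = [1.0, -1.0]
hidden_b = [0.0, 0.0]
out_w = [2.0, -3.0]
out_b = 0.5

y_pos = forward(2.0, hidden_w, hidden_b, out_w, out_b)   # 4.5
y_neg = forward(-1.0, hidden_w, hidden_b, out_w, out_b)  # -2.5
print(y_pos, y_neg)
```

If the output layer used ReLU as well, the second prediction could never be negative, which would be a problem for a target like sine.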

Initial weights work best as small random numbers. Xavier or He initialization are good starting points; He initialization is the usual match for ReLU layers. Biases are commonly initialized to zero.
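
As a rough sketch of what those two schemes look like when written by hand, assuming the common formulations (He: Gaussian with standard deviation sqrt(2 / fan_in); Xavier: uniform within ±sqrt(6 / (fan_in + fan_out))). The helper names and the 1-16-1 layer sizes are assumptions for illustration, not something from the question.

```python
import math
import random


def he_normal(fan_in, fan_out, rng):
    """He initialization: zero-mean Gaussian with std = sqrt(2 / fan_in)."""
    std = math.sqrt(2.0 / fan_in)
    return [[rng.gauss(0.0, std) for _ in range(fan_in)] for _ in range(fan_out)]


def xavier_uniform(fan_in, fan_out, rng):
    """Xavier (Glorot) initialization: uniform in [-limit, limit]."""
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-limit, limit) for _ in range(fan_in)] for _ in range(fan_out)]


rng = random.Random(0)  # fixed seed so runs are repeatable

# Hypothetical 1 -> 16 -> 1 network: ReLU hidden layer, linear output.
hidden_weights = he_normal(fan_in=1, fan_out=16, rng=rng)        # 16 rows of 1 weight
output_weights = xavier_uniform(fan_in=16, fan_out=1, rng=rng)   # 1 row of 16 weights
hidden_biases = [0.0] * 16
output_bias = 0.0
```

Most frameworks ship these initializers already, so writing them by hand is mainly useful when building a network from scratch.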

When it comes to training your model, you'll likely use Mean Squared Error (MSE) as your loss function, since it's the standard choice for regression problems. Adam is a popular optimizer choice. Beyond that, training involves a lot of experimentation, especially with hyperparameters like batch size and learning rate; adjust them based on your model's performance.
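
For a from-scratch picture of that training loop, here is a small sketch that fits sine with MSE. To keep it short it uses plain full-batch gradient descent rather than Adam, and the network size, learning rate, epoch count, and variable names are all assumptions to experiment with rather than recommended values.

```python
import math
import random

rng = random.Random(0)
n_hidden = 8

# He-style Gaussian weights for both layers; biases start at zero.
w1 = [rng.gauss(0.0, math.sqrt(2.0 / 1)) for _ in range(n_hidden)]
b1 = [0.0] * n_hidden
w2 = [rng.gauss(0.0, math.sqrt(2.0 / n_hidden)) for _ in range(n_hidden)]
b2 = 0.0

# Training data: 32 evenly spaced points on [-pi, pi] and their sines.
xs = [-math.pi + 2.0 * math.pi * i / 31 for i in range(32)]
ys = [math.sin(x) for x in xs]


def predict(x):
    """Return (prediction, hidden activations) for one scalar input."""
    hidden = [max(0.0, w * x + b) for w, b in zip(w1, b1)]
    return sum(w * h for w, h in zip(w2, hidden)) + b2, hidden


def mse():
    """Mean squared error over the whole training set."""
    return sum((predict(x)[0] - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


learning_rate = 0.001  # hyperparameter: worth experimenting with
initial_loss = mse()

for epoch in range(300):
    # Accumulate gradients of the MSE over the full batch.
    g_w1 = [0.0] * n_hidden
    g_b1 = [0.0] * n_hidden
    g_w2 = [0.0] * n_hidden
    g_b2 = 0.0
    for x, y in zip(xs, ys):
        pred, hidden = predict(x)
        d_pred = 2.0 * (pred - y) / len(xs)  # dMSE / dpred
        g_b2 += d_pred
        for j in range(n_hidden):
            g_w2[j] += d_pred * hidden[j]
            if hidden[j] > 0.0:  # ReLU gradient is 1 where the unit is active
                d_pre = d_pred * w2[j]
                g_w1[j] += d_pre * x
                g_b1[j] += d_pre
    # Gradient descent step on every parameter.
    for j in range(n_hidden):
        w1[j] -= learning_rate * g_w1[j]
        b1[j] -= learning_rate * g_b1[j]
        w2[j] -= learning_rate * g_w2[j]
    b2 -= learning_rate * g_b2

final_loss = mse()
print(f"MSE before: {initial_loss:.4f}, after: {final_loss:.4f}")
```

Watching how the printed loss changes as you vary `learning_rate`, `n_hidden`, or the number of epochs is a quick way to get a feel for the hyperparameters mentioned above.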

Good luck and enjoy!

1 reply

Beluker Mar 25, 2024
Author

Thanks, this really helps!
