-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Regression ability of gaussian process #16
Comments
Hi, Sorry for the delayed response. One possibility is that the default hyperparameters for the GP model are not good for your application. For example, when there isn't much data the length scale prior will tend give longer length scales and a function distribution which may not fit your discontinuous data well. I would suggest that you play with the GP hyperparameters or build your optimizer using edbo.bro.BO_express which attempts to select priors appropriate for the dimensionality of the data. In addition, you could use a random forest as the surrogate model for edbo (edbo.models.RF_Model). |
@shenggong1996 any update on this? Did you try adjusting the hyperparameters or using a random forest? Would be great to see any results (e.g. before and after parity plots) and/or an example of the "jumps" you were describing if you have them available. |
Hi Sterling, I played around with hyperparameters and random forest, and no significant improvement appeared. My solution is using a neural network instead, but since it is not straightforward to use NN to estimate uncertainty, I use NN in the exploitations and GP in explorations. Our data will be presented in a preprint soon, with the possible title "Bayesian Optimization Assisted Laser Reduction of poly(acrylonitrile) for Electrochemical Applications". |
Hi Sheng, thanks for the info! That seems like an interesting approach. When the preprint is posted, I'm interested in checking it out. |
Hi Ben,
I am using EDBO to study results of one type of experiments. There are four dimensions in the parameter space. I find that the gaussian process in EDBO has very poor regression performance even for the training set (4 dimensions, around 200 data points in total), and the dataset can be well learned by neural networks as a comparison. I think the reason for the failure of gaussian process is that my dataset is highly "discontinous" with many delta-function like jump between 0 and some finite values, and around 1/3 of data are zeros. Do you think it is reasonable that gaussian process cannot perform very well for a four-dimensional dataset? Thank you very much!
The text was updated successfully, but these errors were encountered: