Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model Training Error #36

Open
Coaasim opened this issue Nov 6, 2023 · 0 comments
Open

Model Training Error #36

Coaasim opened this issue Nov 6, 2023 · 0 comments

Comments

@Coaasim
Copy link

Coaasim commented Nov 6, 2023

Hi! Thanks for this great drug perturbation prediction approach.
I am trying to apply the CPA model on my RNA-Seq data but unfortunately while training the data with my data I get the error:
"ValueError: Input X contains NaN.
NearestNeighbors does not accept missing values encoded as NaN natively. For supervised learning, you might want to consider sklearn.ensemble.HistGradientBoostingClassifier and Regressor which accept missing values encoded as NaNs natively. Alternatively, it is possible to preprocess the data, for instance by using an imputer transformer in a pipeline or drop samples with missing values. See https://scikit-learn.org/stable/modules/impute.html You can find a list of all estimators that handle NaN values at the following page: https://scikit-learn.org/stable/modules/impute.html#estimators-that-handle-nan-values"

When I checked my data again for NaNs, I couldn't find any missing values. Do you have any idea what could be wrong? How important is the introduction of adata.uns or the split columns in the adata.obs?
I would be grateful for any help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant