The test data runs differently than the example #21

lifan2022 · 2024-03-20T13:14:57Z

Hello,

Thank you for such a good piece of software. I'm having a small problem with it.

I ran TOSICA on the test data, but in the new_adata returned by the prediction step, 2874 cells were assigned a cell type different from their original Celltype label.

import scanpy as sc
import TOSICA

# Reference data
ref_adata = sc.read('./demo_train.h5ad')
# Indexing by its own var_names keeps every gene (a no-op here)
ref_adata = ref_adata[:, ref_adata.var_names]
print(ref_adata)
print(ref_adata.obs.Celltype.value_counts())

# Query data, restricted to the reference genes in the same order
query_adata = sc.read('./demo_test.h5ad')
query_adata = query_adata[:, ref_adata.var_names]
print(query_adata)
print(query_adata.obs.Celltype.value_counts())

# Training
TOSICA.train(ref_adata, gmt_path='./GO_bp.gmt', label_name='Celltype', epochs=3, project='hGOBP_demo')

# Prediction
model_weight_path = './hGOBP_demo/model-0.pth'
new_adata = TOSICA.pre(query_adata, model_weight_path=model_weight_path, project='hGOBP_demo')
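
One way to quantify the disagreement described above is to compare the predicted labels with the original ones, cell by cell. The following is a minimal, library-free sketch rather than part of TOSICA: the helper name `summarize_mismatches` is hypothetical, and the idea of reading predictions from a column such as `new_adata.obs['Prediction']` is an assumption that should be checked against the installed TOSICA version.

```python
from collections import Counter


def summarize_mismatches(true_labels, predicted_labels):
    """Return overall accuracy and a count of (true, predicted) pairs that disagree."""
    true_labels = list(true_labels)
    predicted_labels = list(predicted_labels)
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")

    # Count only the pairs where the prediction differs from the original label.
    mismatches = Counter(
        (t, p) for t, p in zip(true_labels, predicted_labels) if t != p
    )
    n_wrong = sum(mismatches.values())
    accuracy = 1 - n_wrong / len(true_labels)
    return accuracy, mismatches


# Toy labels for illustration only. With real data one might pass
# new_adata.obs['Celltype'] and new_adata.obs['Prediction'] (column name assumed).
true = ["alpha", "alpha", "beta", "delta"]
pred = ["beta", "alpha", "beta", "Unknown"]
accuracy, mismatches = summarize_mismatches(true, pred)
for (t, p), n in mismatches.most_common():
    print(f"{t} -> {p}: {n}")
print(f"accuracy: {accuracy:.2f}")
```

Listing the mismatched pairs by frequency shows whether the errors are spread evenly or concentrated in one cell type, which is usually more informative than the total count alone.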


lifan2022 · 2024-03-20T13:20:48Z

Here's the result of my final visualization

apologize66 · 2024-03-26T12:58:00Z

Hello!
I encountered an error when running the 9th cell of the notebook: "items in new_categories are not the same as in old categories." When I changed the order of the cell types defined by the original author to match new_categories in order to work around it, I got the same result as the one you show. Did you encounter the same error as well?
Also, if you are a Chinese student too, perhaps we can communicate further.
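
For context, this message matches the `ValueError` that pandas raises from `reorder_categories` when the new list does not contain exactly the same labels as the existing categories. Assuming that is the call failing in the notebook, a quick check is to diff the two label sets before reordering. The helper `diff_categories` and the labels below are hypothetical and only illustrate the comparison.

```python
def diff_categories(old_categories, new_categories):
    """Report which labels appear in only one of the two category lists."""
    old_set, new_set = set(old_categories), set(new_categories)
    return {
        "missing_from_new": sorted(old_set - new_set),
        "unexpected_in_new": sorted(new_set - old_set),
    }


# Hypothetical labels for illustration; not taken from the demo data.
old = ["beta", "delta", "gamma", "Unknown"]
new = ["alpha", "beta", "delta", "gamma"]
report = diff_categories(old, new)
print(report)
```

If either list in the report is non-empty, changing the order alone cannot fix the error, because the two sets of labels themselves differ.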

lifan2022 · 2024-03-27T02:35:01Z

> Hello! I encountered an error when running the 9th cell of the notebook: "items in new_categories are not the same as in old categories." When I changed the order of the cell types defined by the original author to match new_categories in order to work around it, I got the same result as the one you show. Did you encounter the same error as well? Also, if you are a Chinese student too, perhaps we can communicate further.

Yes, I'm getting the same error.

IvyYang00 · 2024-05-24T08:00:15Z

Hi! I ran into a similar error. After working around it the way apologize66 did, I got a different result, but the predictions still differ a lot from the original cell types and the accuracy is relatively low.

IvyYang00 · 2024-05-24T11:19:26Z

> Hi! I ran into a similar error. After working around it the way apologize66 did, I got a different result, but the predictions still differ a lot from the original cell types and the accuracy is relatively low.

I tried to use TOSICA to train my own model on a human lung scRNA-seq dataset with epochs=20. The validation accuracy was 0.993 during training, but when I used the model to predict an internal test dataset, the accuracy was only about 0.29. I don't know why.

JiaweiChenGo · 2024-06-20T09:20:02Z

> Here's the result of my final visualization

Thank you for your interest in TOSICA.
Unfortunately, I cannot tell where the problem is from what has been shown here. If I ran into this problem, I would first check whether the var_names of ref_adata and query_adata are consistent and in the same order. Then I would check whether the pre-trained model is loaded correctly.
Besides, I noticed that the different cell types were correctly separated in the attention space, yet no cells were predicted to be alpha cells, which are the most abundant cell type and should have the highest prediction accuracy. So I suspect something may be wrong with label_dictionary.csv.
If the prediction is still poor and you are willing to share your demo dataset and code, I would be happy to help you analyze and examine what happened here!
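
The first check suggested above (same genes, same order) can be sketched without any dependencies. This is only an illustration: the helper name `compare_gene_names` is hypothetical, and the gene symbols are toy values rather than the demo data.

```python
def compare_gene_names(ref_names, query_names):
    """Compare two gene-name sequences for membership and for order."""
    ref_names, query_names = list(ref_names), list(query_names)
    ref_set, query_set = set(ref_names), set(query_names)
    return {
        # Genes the model was trained on but the query lacks.
        "missing_in_query": [g for g in ref_names if g not in query_set],
        # Genes present in the query that the reference never saw.
        "extra_in_query": [g for g in query_names if g not in ref_set],
        # True only if both sequences are identical element by element.
        "same_order": ref_names == query_names,
    }


# Toy gene lists for illustration only.
ref = ["INS", "GCG", "SST"]
query = ["GCG", "INS", "SST"]
result = compare_gene_names(ref, query)
print(result)
```

With AnnData objects, the same function could be called as `compare_gene_names(ref_adata.var_names, query_adata.var_names)`. A result with no missing or extra genes but `same_order` set to false would mean the query columns need to be reindexed before prediction.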

JiaweiChenGo · 2024-06-20T09:30:25Z

> Hello! I encountered an error when running the 9th cell of the notebook: "items in new_categories are not the same as in old categories." When I changed the order of the cell types defined by the original author to match new_categories in order to work around it, I got the same result as the one you show. Did you encounter the same error as well? Also, if you are a Chinese student too, perhaps we can communicate further.

Maybe you masked alpha cells in the training process, which made the categories of the predicted cell types differ from those in tutorial.ipynb.
I am glad to communicate further; here is my email: jiaweichen@pku.edu.cn and WeChat: chenjiawei9667

JiaweiChenGo · 2024-06-20T09:39:03Z

> Hi! I ran into a similar error. After working around it the way apologize66 did, I got a different result, but the predictions still differ a lot from the original cell types and the accuracy is relatively low.

Thank you for your interest in TOSICA.
Similarly, I noticed that the different cell types were correctly separated, yet no cells were predicted to be alpha cells, which are the most abundant cell type and should have the highest prediction accuracy. Perhaps you masked alpha cells in the training process; with the default prediction cutoff of 0.1, that would result in low accuracy.
As for the human lung scRNA-seq dataset, I am glad to help you analyze and examine what happened.
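
To illustrate why the cutoff matters, here is a minimal sketch of a confidence threshold. It assumes the cutoff works by reporting a cell as unknown when its predicted probability falls below the threshold; that is an assumption about TOSICA's behaviour, not something confirmed in this thread, and `apply_cutoff` is a hypothetical helper.

```python
def apply_cutoff(labels, probabilities, cutoff=0.1, unknown_label="Unknown"):
    """Keep a predicted label only if its probability reaches the cutoff."""
    if len(labels) != len(probabilities):
        raise ValueError("labels and probabilities must have the same length")
    return [
        label if prob >= cutoff else unknown_label
        for label, prob in zip(labels, probabilities)
    ]


# Toy predictions for illustration only.
labels = ["beta", "beta", "delta"]
probs = [0.95, 0.40, 0.08]
low = apply_cutoff(labels, probs, cutoff=0.1)
high = apply_cutoff(labels, probs, cutoff=0.5)
print(low)
print(high)
```

Under this assumption, a low cutoff forces cells of a cell type that was masked during training into one of the known classes, which counts against accuracy, while a higher cutoff would mark more of them as unknown.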
