
wrong input_dims when use_img #2

Closed · frickyinn opened this issue May 8, 2022 · 3 comments

@frickyinn

Hello,
I noticed that several layers had wrong input dimensions when I set use_img to True, and I have corrected them in my repo forked from yours: frickyinn/conST.
But with the MAE image features, the ARI results were a little lower than when merely using gene expression. I think I may have used the wrong hyper-parameters when I changed the dimensions. Could you help me solve this?

Thank you!

@ys-zong
Owner

ys-zong commented May 8, 2022

Hi,

Thanks for your interest. Can you tell me which dataset you are using and what dimensions you have changed? It looks a bit strange that the ARI of the major training is even lower than that of pretraining.

@frickyinn
Author

I was using spatialLIBD/151673, just as conST_cluster.ipynb does. I cropped the tissue image myself according to the spatial positions and extracted the features with run_mae_extract_feature.py.
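The cropping was roughly like this (a minimal sketch; the file names, coordinate array, and 224-pixel patch size are illustrative, not my exact script):

import numpy as np
from PIL import Image

img = Image.open("151673_full_image.tif")      # full-resolution tissue image (placeholder path)
coords = np.load("spot_pixel_coords.npy")      # (n_spots, 2) pixel coordinates of the spots (placeholder)
patch_size = 224                               # assumed MAE input size

patches = []
for x, y in coords:
    half = patch_size // 2
    box = (int(x) - half, int(y) - half, int(x) + half, int(y) + half)
    patches.append(np.asarray(img.crop(box)))  # square patch centered on the spot
np.save("patches_151673.npy", np.stack(patches))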

Because I trained the model with the MAE features, several layers needed input dimensions different from those used with gene expression alone, including:

  • self.latent_dim
  • self.cluster_layer
  • self.fc1
  • self.fc2
  • self.disc_c
  • self.disc

For details, please see: compare. Basically, z = torch.cat((feat_x, gnn_z), 1) changes to z = torch.cat((feat_x, gnn_z, feat_img), 1) when the image is used, and the dimensions of the following layers change accordingly.
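For illustration, the change is roughly the following (a sketch with placeholder dimensions, not conST's actual values):

import torch
import torch.nn as nn

dim_feat_x, dim_gnn_z, dim_img = 32, 8, 768   # placeholder dimensions

# Without image features the latent vector is just feat_x ++ gnn_z ...
latent_dim = dim_feat_x + dim_gnn_z

# ... but with use_img=True, feat_img is concatenated as well, so every layer
# that consumes z (fc1, fc2, cluster_layer, disc, disc_c) must grow by dim_img.
latent_dim_img = dim_feat_x + dim_gnn_z + dim_img
fc1 = nn.Linear(latent_dim_img, 256)          # hidden size 256 is a placeholder

feat_x = torch.randn(10, dim_feat_x)
gnn_z = torch.randn(10, dim_gnn_z)
feat_img = torch.randn(10, dim_img)
z = torch.cat((feat_x, gnn_z, feat_img), 1)   # shape: (10, latent_dim_img)
out = fc1(z)                                  # works only because fc1 expects the enlarged dimension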

Instead of using eval_resolution = res_search_fixed_clus(adata_conST, n_clusters) to find the best resolution in every epoch, I kept the resolution fixed at 1, so the results were a little lower.
But the problem of a lower ARI after major training does exist. When I ran conST_cluster.ipynb with use_pretrained=False and commented out the major-training line:

conST_net.pretraining()
# conST_net.major_training()

to compare the results, pretraining alone gave an ARI of 0.499, which was higher than the 0.439 obtained with both pretraining and major training.

I am wondering if I have set something wrong.

Thank you!

@ys-zong
Owner

ys-zong commented May 12, 2022

It's a bit difficult for me to tell from this code directly, but I suggest trying to debug it in two steps.

First, can you obtain similar results without using image features? For spatialLIBD/151673, there is only a slight improvement when using the MAE image features, because, as you can see from the histology, the image patches look similar across spots. So if you can't obtain similar results at this step, there is probably something wrong with your initial settings, environment, etc. From your result, I suspect this step is the problem, because I never came across a case where performance after major training was worse than after pretraining.

If the first step is okay, you can then try reducing the input dimension of the image features from 748 to a lower dimension, e.g. 100 with PCA (or even smaller), and see how the performance changes. By reducing the proportion of image features, you can check whether the image features are being successfully extracted by MAE.
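Something along these lines should work (a minimal sketch; the file names are placeholders):

import numpy as np
from sklearn.decomposition import PCA

feat_img = np.load("mae_features_151673.npy")   # (n_spots, 748) MAE features, placeholder path
pca = PCA(n_components=100)                      # try 100, or even smaller
feat_img_low = pca.fit_transform(feat_img)       # (n_spots, 100)
np.save("mae_features_151673_pca100.npy", feat_img_low)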

P.S. Note that you may not get the exact same results every time you run the experiment, due to the CUDA non-deterministic behavior of PyTorch Geometric.
