Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pretty low GRN scores obtained #8

Open
JianmeiZhong opened this issue Apr 20, 2018 · 14 comments
Open

pretty low GRN scores obtained #8

JianmeiZhong opened this issue Apr 20, 2018 · 14 comments

Comments

@JianmeiZhong
Copy link

Hi,
I work through most part of the analysis pipeline but the results of GRN scores obtained via grnscores() in specified paths SP_10.SP_8 as tSNE and barplot shows.the grn scores have different levels compared to yours presented in several exemplary analysis,that's level of 0.0* versus 0.* .Sometimes,the GRNscores plot has one or two genes with low scores in both directions.I don't know if there's some wrong configuration in my pipeline.If my source and endpoints are correct and reliable,low scores level are possible for some datasets and it's available?
barplot
another questions asking for your guide :)
known knowledge this project based on are less subpopulations and increasing number of cells along differentiation process.So,k=120 was set according to my understanding in function buildGRN() and creatKNN() to obtain relatively less subpopulations as tSNE plot shows.k value is different a lot from yours.It might be a cause of low GRN scores?And if a path didn't go through every subpopulation in non-branched cells differentiation,gene expression dynamics with this path can represent entire process of changes?
tsne

Thank a lot !

Jm Zhong

@JianmeiZhong
Copy link
Author

add:every time running the grnscores(),warning messages were generated as follows.Hope this helps.
[1] 0
There were 20 warnings (use warnings() to see them)

warnings()
Warning messages:
1: In if (length(bla) == 1 & names(bla) == r) { ... :
the condition has length > 1 and only the first element will be used
2: In if (length(bla) == 1 & names(bla) == r) { ... :
the condition has length > 1 and only the first element will be used
3: In if (length(bla) == 1 & names(bla) == r) { ... :
the condition has length > 1 and only the first element will be used
4: In if (length(bla) == 1 & names(bla) == r) { ... :

@edroaldo
Copy link
Owner

edroaldo commented Apr 23, 2018 via email

@edroaldo
Copy link
Owner

edroaldo commented Apr 23, 2018 via email

@JianmeiZhong
Copy link
Author

Thanks! I got some very useful answers from your nice explications. :)
i'll try different neigh value.and yes,the dataset contains 5000~6000 cells.It's a large one.but larger k value,longer time it takes.if k value don't affect the grnscore that much,lower k value I'll set referring to your setting.And I called back some expression of genes using scImpute(WV Li et.al,2018) before and used normalized expression as input.
As for the monotonically increase or decrease genes,many genes are observed as up or down with obvious trends along the non-brached process of differentiation(five cell-stages,for example).I just cannot figure out how the numbers of genes represented in GRNscore plot so less in some samples.
Maybe the enpoint subpopulations SP_8 are too small to represent the whole last cell-stage(SP_1,SP_2,SP_5,SP_7,SP_8,S_12,but no marker genes are found for this cell-stage)?I don't know...

@JianmeiZhong
Copy link
Author

or if there's any way that subpopulation id of each cell can be customized coercively?That will be a absolutely sole path which all cells can be included. grin:

@JianmeiZhong
Copy link
Author

I tried neigh=2 in another sample,nothing seems changed.
sample2

@edroaldo
Copy link
Owner

edroaldo commented Apr 24, 2018 via email

@edroaldo
Copy link
Owner

edroaldo commented Apr 24, 2018 via email

@JianmeiZhong
Copy link
Author

mmmm....my apologies for missing your reply and my delayed respond.:)
I tried to put all genes with non-zero variance into calculation of GRN as you suggested and more genes were presented in GRNscores plot.increasing number of input genes generated more nodes and more GRN genes shown but with pretty low GRN scores mentioned before.more closer endpoint to source subpopulation was tested to have higher GRN scores,for example 0.1 for SNCA from SP_10 to SP_3.

Here are commands generating the low-scores plot and several previous steps.
### selecting genes to use as regulated along developmental trajectories
pca <- prcomp(t(ndata),scale=TRUE,center=TRUE)
loadings <- pca$rotation
num_pc <- 5
quantile <- 0.975
genes2use <- unique(as.vector(unlist(apply(loadings[,1:num_pc],2,function(x){names(x[which(abs(x) >= quantile(x,quantile))])}))))
ggrn <- buildGRN('Hs',ndata,genes2use,2,'results/GRN.R')
cellrouter <- findsubpopulations(cellrouter,90,'jaccard','results/kNN_network.gml')
cellrouter <- createKNN(cellrouter,90,'jaccard','results/paths/kNN_network_trajectory.gml')
tfs <- find_tfs(species = 'Hs')
t <- c('SP_10.SP_8')
x <- grnscores(cellrouter,tfs,t,direction='both',dir.targets='up',columns=1,width=8,height=5,flip=T,filename=paste('results/',t,sep=''))

@edroaldo
Copy link
Owner

edroaldo commented May 8, 2018 via email

@edroaldo
Copy link
Owner

edroaldo commented May 9, 2018 via email

@JianmeiZhong
Copy link
Author

Ok,I'll try it and let you know how it goes.

@JianmeiZhong
Copy link
Author

I tried increasing k in 3 samples but there are no better results than before.they are still pretty low when k=200 .

@edroaldo
Copy link
Owner

edroaldo commented May 24, 2018 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants