Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Expected runtimes for XDE/TDE test? #28

Open
DanielYuhangLi opened this issue Feb 22, 2024 · 2 comments
Open

Expected runtimes for XDE/TDE test? #28

DanielYuhangLi opened this issue Feb 22, 2024 · 2 comments

Comments

@DanielYuhangLi
Copy link

DanielYuhangLi commented Feb 22, 2024

Hi Dr. Hou, it's great to have an alternative to try in relation to tradeseq - I thought the paper was clear and an interesting read. I was wondering what expected runtimes may be for the lamian_test function for both XDE and TDE. I have been trying in various iterations with a dataset consisting of 42k cells and one single pseudotime lineage extracted from slingshot. So far even with just 5 permutations and 5 cores (although the 5 cores don't seem to be actively used but I don't see any clear issues with the mcapply function from the code) with the TDE test, I have not been able to finish a run after 12+ hours. I've tried to also create an h5 file using the built in h5 function but run into similar issues, after overnight, the h5 file is still not written which is very odd.
I was looking for some advice on what to look into regarding troubleshooting and let me know what information would help most in regards to troubleshooting.

For reference
#perform TDE test
Res <- lamian_test(
expr = seurat_object3@assays$RNA@data,
cellanno = seurat_meta2,
pseudotime = seurat_object3$pseudotime,
design = design,
test.type = 'time',
testvar = 2,
permuiter = 5,
test.method = 'chisq',

This is for permutation test only.

We suggest that users use default permuiter = 100.

Alternatively, we can use test.method = 'chisq' to swich to the chi-square test.

ncores = 5
)

seurat_object3@assays$RNA@data is a 23815x42416 matrix
seurat_object3$pseudotime is a vector of length 42416
seurat_meta2 is a metadata table that is 42416x2, columns are intercept (could you clarify what this refers to? I just set it as 1) and group (0,1)

Thanks!

Dan

@ahnchi
Copy link

ahnchi commented Mar 2, 2024

I am having a similar issue when I try to run seurat object. I subsetted my object, so has ~6000 cells with ~50000 genes. I used ncores=10, but it's not being completed after many hours.

@Winnie09
Copy link
Owner

Winnie09 commented Apr 5, 2024

Hi,

Thanks for your interest in this package! The time complexity depends heavily on the number of samples, which is fixed for a dataset. We can filter out some lowly expressed and lowly variable genes to reduce the running time.

Also, if you are using slingshot for the pseudotime inference, please refer to the manual section "Apply other pseudotime inference methods in Lamian: take slingshot as an example" on page https://winnie09.github.io/Wenpin_Hou/pages/Lamian.html#apply-other-pseudotime-inference-methods-in-lamian-take-slingshot-as-an-example. The following lines are important:

pt <- pt[!is.na(pt)] ## important
selectedCell <- names(pt)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants