
Parallelism fix to reduce errors on large datasets like scRNAseq #92

Merged · 2 commits · Mar 6, 2020

Conversation

TheAustinator (Contributor)

This fix circumvents pickling errors raised at gseapy/algorithm.py line 374, which large datasets (e.g. scRNAseq) are particularly vulnerable to. The GSEA tensor computation can generate blocks larger than 4 GB, which do not fit in the 'i' struct format used by Python's built-in multiprocessing module when serializing data for worker processes.
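For context, here is a minimal sketch (not the actual patch) of the general chunking idea: splitting a large matrix into smaller row blocks before dispatching them to a multiprocessing pool, so that no single pickled payload approaches the size limit. The block size, function names, and the placeholder scoring step are illustrative assumptions, not gseapy's API.

```python
import numpy as np
from multiprocessing import Pool

# Illustrative cap: keep each pickled payload well under the limit imposed by
# the 'i' struct format in older multiprocessing serialization paths.
MAX_BLOCK_BYTES = 2**30  # 1 GiB per chunk (arbitrary illustrative choice)

def split_into_blocks(matrix: np.ndarray, max_bytes: int = MAX_BLOCK_BYTES):
    """Yield row-wise chunks of `matrix` whose serialized size stays small."""
    bytes_per_row = matrix[0].nbytes
    rows_per_block = max(1, max_bytes // bytes_per_row)
    for start in range(0, matrix.shape[0], rows_per_block):
        yield matrix[start:start + rows_per_block]

def score_block(block: np.ndarray) -> np.ndarray:
    """Placeholder for the per-block enrichment-score computation."""
    return block.sum(axis=1)

if __name__ == "__main__":
    data = np.random.rand(10_000, 500)  # stand-in for a large expression matrix
    with Pool(processes=4) as pool:
        results = pool.map(score_block, split_into_blocks(data))
    scores = np.concatenate(results)
```

Recent Python releases have relaxed parts of this serialization limit, but chunking the work also keeps per-worker memory bounded regardless of interpreter version.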

zqfang (Owner) commented on Mar 6, 2020

Thank you very much for your patch. I will check it in more detail later.

zqfang merged commit adb3742 into zqfang:master on Mar 6, 2020
zqfang (Owner) commented on Apr 19, 2020

See the related bug report in #94.
