New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rebin_peaks
: `Error in data.table::rbindlist(rebinned_peaks, use.names = TRUE, idcol = "assay") :
#103
Comments
This seems to be a limit size of what rbindlist can handle but I'm not 100%. I'd first try running it on just one thread but I think this might require a per chromosome approach to the binning (like we discussed previously) |
Ohhh, that makes a lot of sense. I'll play around with this function a bit and see if I can improve the efficiency #101 |
I'm not sure that's a good idea as then you are losing places where all peak files have noted 'no peak'. This will also mean your correlation values for a pair could change by adding other peak datasets which isn't desirable. So I'd avoid doing this |
Good point, this is also something I was concerned about. But from the perspective of a single-cell dataset, when you construct a Seurat object one of the initial steps is to filter out any features (bins) without any counts. Same goes for generating CTDs in EWCE, the idea being that they don't contribute to differentiating your samples. But I agree, it does affect the correlation structure and is a bit arbitrary since it's dependent on the samples you're including. |
Resolved by implementing improvements described here: #101 |
1. Bug description
Error message prevents rebinning of peaks, but only with certain datasets and certain bin sizes.
bin_size=400
seemed to run fine, butbin_size=200
causes the error.Console output
Expected behaviour
Peaks rebin regardless of peak file inputs or bin size.
2. Reproducible example
Code
Data
Unfortunately, this does require access to our HPC lab folder, as I haven't seen this error come up with any other datasets. As such, happy to take the lead on fixing this, perhaps with input from @Al-Murphy if necessary.
I'll also bump the version as mentioned in #102, but @serachoi1230 you'll need to push upstream to Bioc devel once all the changes are ready.
3. Session info
The text was updated successfully, but these errors were encountered: