-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with Gencode_TSS_pc_lincRNA_antisense.bed file #2
Comments
Hello friend, have you finished running this code?
|
Hello @plbaldoni & @caokai001, To get raw matrices from raw count, you'll have to download the pipeline we developed : https://github.com/vallotlab/scChIPseq_DataEngineering/tree/devel (make sure to use the devel version). You will have to manually change the paths towards all the tools in the first lines of the 'CONFIG_TEMPLATE' file, at the root of the directory. (If you don't have bowtie1 or BWA, you can just omit it as they are not used by default) You will also have to modify in the 'species_design_configs.csv', at the root of the directory :
/path/to/source being the root of the GitHub directory. For all the samples described in the paper, the barcode design is 'Hifibio'
You might want to omit the coverage step which produces bigwigs but needs a bioconda environment which path is hardcoded in the pipeline. You would run, after downloading the fastq files from SRA:
You need to have at least 40-60Gb RAM and 8 cores availables. This will produce BAM files as well as count matrices in the output directory, that you can input in this R downstream analysis script. If you have issues with the pipelines, don't hesitate to post on the respective page. Thank you for noticing that the bed file is corrupted, I will correct this. |
Hi,
the file Gencode_TSS_pc_lincRNA_antisense.bed has an extra trailing tab at the end of every line which makes the R pipeline to fail when computing the annotation.
Also, it would be great if the authors could make the code to process the raw data available for the community (from SRA files to count matrices). It is unclear how to obtain the count matrices out of the SRA files posted at https://www.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA481734.
Best,
Pedro Baldoni
The text was updated successfully, but these errors were encountered: