-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problems when building the Reference(T2T-genome) #1
Comments
Hi @MG-DYM Sorry for coming late. From the error messages, there are multiple issues.
Please let me know if any of the above solution fix your issue. |
@MG-DYM Now this is a new yet much clearer error message to help us understand what is going on. In this step, IRFinder extracts 100% mapped reads from a bam file that is successfully generated in the previous step and saves them into compressed BED files. IRFinder does it chromosome by chromosome, so that each of the output Then, in the step you encountered error, IRFinder carries out a trivial merge of those We need to figure out why file incompletion happens:
|
@MG-DYM Thank you for the updates!
|
@MG-DYM I didn't see the problem of chromosomes' names. The 1st column of BAM/SAM refers to read names, in this case, Here are my questions:
Does this generate any error? If not, this will write many files into the |
@MG-DYM Looks good to me. The last line indicates your paired-end reads have a reverse-forward directionality for R1 and R2, indicating your RNASeq is likely from Illumina. |
Thanks for your advice.It's very nice of you! |
Reposted from the original question of @MG-DYM
Hi Sir,I want to use T2T-genome and its transcripts.gtf to builde the Reference.It's going wrong at <Phase 2: Mapability Calculation>:
<Phase 1: STAR Reference Preparation>
Jul 04 21:37:17 ..... started STAR run
Jul 04 21:37:17 ... starting to generate Genome files
Jul 04 21:38:30 ... starting to sort Suffix Array. This may take a long time...
Jul 04 21:38:46 ... sorting Suffix Array chunks and saving them to disk...
Jul 04 21:51:05 ... loading chunks from disk, packing SA...
Jul 04 21:52:30 ... finished generating suffix array
Jul 04 21:52:30 ... generating Suffix Array index
Jul 04 21:56:23 ... completed Suffix Array index
Jul 04 21:56:23 ..... processing annotations GTF
Jul 04 21:56:58 ..... inserting junctions into the genome indices
Jul 04 21:59:54 ... writing Genome to disk ...
Jul 04 22:00:24 ... writing Suffix Array to disk ...
Jul 04 22:04:27 ... writing SAindex to disk
Jul 04 22:04:41 ..... finished successfully
<Phase 2: Mapability Calculation>
Jul 04 22:04:41 ... mapping genome fragments back to genome...
Jul 04 22:21:57 ... sorting aligned genome fragments...
[bam_sort_core] merging from 36 files and 36 in-memory blocks...
Jul 04 22:29:45 ... indexing aligned genome fragments...
Jul 04 22:30:24 ... filtering aligned genome fragments by chromosome/scaffold...
Jul 04 22:31:05 ... merging filtered genome fragments...
lzop: : not a lzop file
Mapability build: Failed!
I checked the software version and FAST's and GTF's chromosome name,both are ok.Is there something wrong with T2T-geonme's transcripts.gtf?I first used the origin T2T-geonme fasta and its transcripts.gtf,whose chromosome name is like 'NC_060925.1',it also has a problem:
Could you help me find out the problem?
Thanks for your help,Best wishes!
The text was updated successfully, but these errors were encountered: