-
Notifications
You must be signed in to change notification settings - Fork 0
summary OfAllSmash
yvancouver edited this page Apr 7, 2014
·
3 revisions
summary of all the wiki/issues for Smash:
-
Link to the original fast files for:
-
Link to Smash Download and QC
-
Link to Smash Downloadable Material
- Just links to the amazonaws
-
A summary of which files belong to what sample
-
Methodology we will use for the project.
Downloading BAM files (whole genome sequencing data) from SMaSH data- Extracting reads from coding exon regions (+/- 20 bp)
- Extracting truth variants from the same regions as step2
- Converting BAM back to fastq files
- Running Variant Calling pipeline
- Evaluating results
##Issues
-
NA19240.bam
1. `java -Xmx6g -jar ValidateSamFile.jar INPUT=NA19240.bam MODE=SUMMARY \` `VALIDATION_STRINGENCY=STRICT` 2. ` java -Xmx6g -jar CleanSam.jar INPUT=NA19240.bam OUTPUT=NA19240_cleaned.bam ` 3. `java -Xmx6g -jar ValidateSamFile.jar INPUT=NA19240_cleaned.bam MODE=SUMMARY \` `VALIDATION_STRINGENCY=STRICT` 4. ` java -Xmx6g -jar FixMateInformation.jar INPUT=NA19240_cleaned.bam \` `OUTPUT=NA19240_cleaned_fixed.bam VALIDATION_STRINGENCY=LENIENT \` `TMP_DIR=/Volumes/ ` 5. `java -Xmx6g -jar ValidateSamFile.jar INPUT=NA19240_cleaned_fixed.bam\ ` ` MODE=SUMMARY VALIDATION_STRINGENCY=STRICT` 6. `samtools view -b -L uscs_all_exons_sorted+-20.bed NA19240_cleaned_fixed.bam > NA19240_cleaned_fixed+-20.bam` 7. `java -Xmx6g -jar ValidateSamFile.jar INPUT=NA19240_cleaned_fixed+-20.bam\ ` ` MODE=SUMMARY VALIDATION_STRINGENCY=STRICT`
| File | ERROR INVALID INDEXING_BIN | ERROR INVALID MAPPING_QUALITY | ERROR MISMATCH FLAG_MATE_NEG_STRAND | ERROR MISMATCH FLAG_MATE_UNMAPPED | | :----: | :----: | :----: | :----: | :----: | :----: | | NA19240.bam | 58471 | 2009 | 13980965 | 2045 | |NA19240_cleaned.bam | 58471 | 0 | 13980965 | 2045 | |NA19240_cleaned_fixed.bam | 0 | 0 | 0 | 0 | |NA19240_cleaned_fixed+-20.bam| ? | ? | ? | ? |
- Contaminated_NA12878