Skip to content

Latest commit

 

History

History
38 lines (24 loc) · 1.25 KB

README.md

File metadata and controls

38 lines (24 loc) · 1.25 KB

README

ExtractFlankingSequence : Grab reference genome sequence 5' flanking or at 5' end of short reads mapped to a reference genome

Extract1stNBasesFromMappedReads.py: Grab 1st N bp from reads mapped to reference genome

CompareExtractedFiles.py: Compare results from the two scripts listed above

DEPENDENCIES

Python 2.7 pyfasta numpy

USAGE

python ExtractFlankingSequence.py sam_file reference_sequence.fasta #_bp_to_extract output_file_name python Extract1stNBasesFromMappedReads.py <# bp to extract> python CompareExtracts.py <extract1.fa> <extract2.fa>

e.g.

python ExtractFlankingSequence.py hits.sam dmel-4-chromosome-r5.33.fasta -20 flanks.fasta python ExtractFlankingSequence.py hits.sam dmel.genome.fasta 20 out.fasta GATGGCAT python ExtractFlankingSequence.py flanks.fasta out.fasta compare.fa

David L. Stern Janelia Farm Research Campus 5 February 2013