genomic_data_science

Python code written for EdX course CSE181 (Genomic Data Science): analyzing biological sequences, identifying motifs and consensus sequences, and applying statistical and random search methods to genomic data

week1_frequent_words: constructing frequency arrays and identifying frequent sub-sequences in genomes (analyze e_coli.txt and vibrio_cholerae_genome.txt)
week2_frequent_words_w_mismatches: identify lagging and leading strands by G-C content, identify mismatches and reverse complements of sub-sequences to more realistically determine most frequently repeated sub-sequences in genome in order to locate ori
week3_motif_matrices: construct numpy arrays of regulatory motifs, probability distributions of motif matrices, construct consensus motif, and identify possible regulatory motif sets from brute force and "greedy" searches
week4_random_motif_search: use more accurate random search with iterative and repeated searches to identify likely regulatory motifs

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
e_coli.txt		e_coli.txt
vibrio_cholerae_genome.txt		vibrio_cholerae_genome.txt
week1_frequent_words.py		week1_frequent_words.py
week2_frequent_words_w_mismatches.py		week2_frequent_words_w_mismatches.py
week3_motif_matrices.py		week3_motif_matrices.py
week4_random_motif_search.py		week4_random_motif_search.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

e_coli.txt

e_coli.txt

vibrio_cholerae_genome.txt

vibrio_cholerae_genome.txt

week1_frequent_words.py

week1_frequent_words.py

week2_frequent_words_w_mismatches.py

week2_frequent_words_w_mismatches.py

week3_motif_matrices.py

week3_motif_matrices.py

week4_random_motif_search.py

week4_random_motif_search.py

Repository files navigation

genomic_data_science

About

Releases

Packages

Languages

0916kj/genomic_data_science

Folders and files

Latest commit

History

Repository files navigation

genomic_data_science

About

Resources

Stars

Watchers

Forks

Languages