Daniel Norman edited this page Jun 5, 2015 · 8 revisions
Clone this wiki locally

#Haplotype Assembly Haplotype information is needed for many biological purposes such as predicting diseases, determining ancestry, predicting susceptibility to medicine, and inferring family relationships. This information can be read from our DNA by machines, but these machines only produce short reads of the DNA rather than an entire haplotype. Furthermore, we are unsure which haplotype (of the two in humans) a particular read originated from. The purpose of this project is to reassemble haplotypes from ambiguous read data in an accurate and efficient manner.

Graphical representation of the problem:

Creating reads from haplotypes Result of the algorithm

##Status Reports See weekly status updates for week-by-week status reports, goals, and personal gradings.

See the commits list for a list of all commits to the code.

##Algorithm For a detailed description of the algorithm along with graphs of results, see the presentation slides.