Third-generation sequencing offers several advantages over next-generation sequencing, its predecessor, but at the cost of a much higher error rate. Clustering related sequences has accordingly become an essential task in modern biology. To cluster error-rich sequences accurately, both the type and the frequency of errors must be accounted for. Levenshtein distance is a well-established measure of the edit distance between two strings, and it can assign separate weights to insertions, deletions and substitutions. It has drawbacks in a biological context, however, and has therefore rarely been used for this purpose. 3GOLD is a novel modification of the Levenshtein distance algorithm for clustering error-rich biological sequencing data.
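To illustrate how insertions, deletions and substitutions can be weighted separately, here is a minimal sketch of the textbook weighted Levenshtein distance. It is not the 3GOLD modification itself, which is not described in this text. The function name `weighted_levenshtein` and the parameters `ins_cost`, `del_cost` and `sub_cost` are hypothetical names chosen for this example.

```python
def weighted_levenshtein(a, b, ins_cost=1.0, del_cost=1.0, sub_cost=1.0):
    """Cost of turning sequence `a` into sequence `b`.

    Standard dynamic-programming Levenshtein distance with a separate
    weight for each edit type. Illustrative only; this is NOT 3GOLD.
    """
    # prev[j] holds the cost of turning the empty prefix of `a`
    # into the first j characters of `b` (j insertions).
    prev = [j * ins_cost for j in range(len(b) + 1)]

    for i, char_a in enumerate(a, start=1):
        # Turning the first i characters of `a` into "" takes i deletions.
        curr = [i * del_cost]
        for j, char_b in enumerate(b, start=1):
            # Match costs nothing; a mismatch costs one substitution.
            diagonal = prev[j - 1] + (0.0 if char_a == char_b else sub_cost)
            # Drop char_a from `a`.
            deletion = prev[j] + del_cost
            # Add char_b to `a`.
            insertion = curr[j - 1] + ins_cost
            curr.append(min(diagonal, deletion, insertion))
        prev = curr

    return prev[-1]


if __name__ == "__main__":
    # One base is missing from the second read; deletions are made cheap
    # here purely as an assumed example of an error-type-specific weight.
    print(weighted_levenshtein("ACGTTAGC", "ACGTAGC", del_cost=0.5))
```

Lowering the weight of an error type that a sequencing platform produces frequently makes reads that differ only by such errors appear closer together, which is the general idea behind accounting for error type and frequency when clustering. The specific weights above are placeholders, not measured error rates.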
roblogan6/3GOLD