Analysing different pattern-searching algorithms when finding for substring occurrences in a given DNA main string
Give a long DNA string, find all occurrences of a chosen substring
- Implemented three different algorithms: Finite Automata (FA) algorithm, Boyer Moore Horspool algorithm and the Naive/Brute Force algorithm
- Calculated and compared time complexities for the three different algorithms on different sizes of main DNA string and substrings
- FA algorithm is more suitable to be used for general pattern searching and has a stable time complexity regardless of the size of the main DNA string and the substrings
- Boyer Moore Horspool algorithm is more suitable for substrings with a few types of characters. Otherwise, this has a similar time complexity to the Naive/Brute Force algorithm
- Full detailed report of the project: CZ2001 Project 1 Report.pdf
- Summary and presentation of the project: Project 1 Presentation.pptx
- Ang Shu Hui
- Bachhas Nikita
- Kundu Koushani
- Leonardo Irvin Pratama