shayanaamir/DNA-Analysis-with-FM-Index

CS 201 Data Structures II Project, Spring 2021

Team: Amortized Properties

Aim

To apply what we learned about the data structures studied in the course, and to research and implement an additional one: the FM-Index. We do this by building an application with which a bioinformatics scientist can search, analyse, and alter genomic data.

Overview

The project is written mainly in Python. Besides preinstalled packages, it uses the numpy, matplotlib, and PyQt libraries, which you must install before running the application. To install them, open a command prompt in the Resources folder (for example, by typing cmd in that folder's navigation bar) and run:
pip install -r requirements.txt
The GUI is built mainly with the PyQt library.
The application uses the FM-Index for all of its features. To compare the efficiency of the FM-Index with the other approaches that were studied, a Suffix Tree, Rabin-Karp (using hashing), and linear search have also been implemented.
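For readers unfamiliar with the FM-Index, the following is a minimal sketch of how pattern search over it can work, not the code used in this project. The function names `build_fm_index` and `fm_search` are hypothetical, the suffix array is built naively (fine for a demo, too slow for real genomes), and the text is assumed not to contain the `$` sentinel.

```python
from collections import Counter


def build_fm_index(text):
    """Build a small FM-index: suffix array, C table, and Occ table."""
    text += "$"  # sentinel, sorts before A, C, G, T in ASCII order
    # Naive suffix array: sort suffix start positions by the suffix itself.
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    # Burrows-Wheeler transform: the character preceding each sorted suffix.
    bwt = "".join(text[i - 1] for i in sa)
    # c_table[ch]: number of characters in the text strictly smaller than ch.
    counts = Counter(text)
    c_table, total = {}, 0
    for ch in sorted(counts):
        c_table[ch] = total
        total += counts[ch]
    # occ[ch][i]: number of occurrences of ch in bwt[:i].
    occ = {ch: [0] * (len(bwt) + 1) for ch in counts}
    for i, b in enumerate(bwt):
        for ch in occ:
            occ[ch][i + 1] = occ[ch][i] + (1 if ch == b else 0)
    return sa, c_table, occ


def fm_search(pattern, index):
    """Return the sorted start positions of pattern using backward search."""
    sa, c_table, occ = index
    lo, hi = 0, len(sa)  # half-open range of matching suffix-array rows
    for ch in reversed(pattern):
        if ch not in c_table:
            return []
        lo = c_table[ch] + occ[ch][lo]
        hi = c_table[ch] + occ[ch][hi]
        if lo >= hi:
            return []
    return sorted(sa[lo:hi])


index = build_fm_index("GATTACAGATTACA")
print(fm_search("TTA", index))
```

The search loop runs once per pattern character regardless of text length, which is the property that makes the FM-Index attractive compared with scanning the text.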

Running the Application

Run MainWindow.py to start the application.

Back End Testing

In addition to testing the application through the GUI, you can test the back-end algorithms separately in Testing.py.
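As an illustration of what a back-end check might look like, the sketch below implements two of the comparison algorithms, linear search and Rabin-Karp, and compares their results. It is not the content of Testing.py; the function names, the base-4 encoding of A, C, G, T, and the modulus are all assumptions chosen for this example.

```python
def linear_search(text, pattern):
    """Return every start position where pattern occurs, by direct comparison."""
    m = len(pattern)
    return [i for i in range(len(text) - m + 1) if text[i:i + m] == pattern]


def rabin_karp(text, pattern, base=4, mod=1_000_000_007):
    """Return every start position of pattern using a rolling hash."""
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    code = {"A": 0, "C": 1, "G": 2, "T": 3}  # assumes plain DNA input
    high = pow(base, m - 1, mod)  # weight of the window's leading symbol
    p_hash = t_hash = 0
    for i in range(m):
        p_hash = (p_hash * base + code[pattern[i]]) % mod
        t_hash = (t_hash * base + code[text[i]]) % mod
    hits = []
    for i in range(n - m + 1):
        # Confirm with a direct comparison to rule out hash collisions.
        if p_hash == t_hash and text[i:i + m] == pattern:
            hits.append(i)
        if i + m < n:
            # Slide the window: drop text[i], append text[i + m].
            t_hash = ((t_hash - code[text[i]] * high) * base
                      + code[text[i + m]]) % mod
    return hits


text = "GATTACAGATTACA"
for pattern in ("TTA", "ACA", "GGG"):
    # Both methods should agree on every pattern.
    assert rabin_karp(text, pattern) == linear_search(text, pattern)
```

Cross-checking a faster algorithm against a simple, obviously correct one like this is a cheap way to catch indexing mistakes before timing anything.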

Data Sets

All DNA data sets used for testing were taken from http://www.faculty.ucr.edu/~mmaduro/random.htm

Resources and References