Sequence Alignment Using Dynamic Programming and Divide and Conquer

Overview

This project implements two solutions for the Sequence Alignment problem:

Basic version using Dynamic Programming (DP)
Memory-efficient version that combines DP with Divide-and-Conquer

The project aims to align two sequences of symbols (A, C, G, T) by minimizing the cost of alignment, which includes gap penalties and mismatch costs.

Project Description

Problem Review

Given two strings X and Y, where X consists of symbols x1, x2, ..., xm and Y consists of symbols y1, y2, ..., yn, the goal is to find the optimal alignment between these strings. The alignment cost includes:

Gap Penalty (δ): A fixed cost for each unmatched position.
Mismatch Costs (αpq): Costs for matching different symbols p and q.

The task is to implement the basic DP solution and a memory-efficient version, run them on provided test sets, and compare their performance.

Input String Generator

The input strings are generated from a base string and a series of steps that iteratively insert copies of the string within itself at specified indices.

Delta and Alpha Values

Gap Penalty (δ): 30
Mismatch Costs:
- A: [0, 110, 48, 94]
- C: [110, 0, 118, 48]
- G: [48, 118, 0, 110]
- T: [94, 48, 110, 0]

Implementation Details

Basic Algorithm

Uses a dynamic programming approach to compute the optimal alignment cost and sequences.

Memory-efficient Algorithm

Combines dynamic programming with divide-and-conquer to reduce memory usage while maintaining the correctness of the alignment.

How to Run

Prerequisites

Python 3.x
Required Python packages: psutil

Running the Basic Algorithm

./basic.sh input.txt output.txt

Running the Memory-efficient Algorithm

./efficient.sh input.txt output.txt

Results

Summary.pdf file with:

Data points output table generated from provided input files.
Line graphs comparing CPU time and memory usage vs. problem size for both solutions.
Insights and observations from the results.

Name	Name	Last commit message	Last commit date
Latest commit darshanrao Create README.md May 16, 2024 d48b5b7 · May 16, 2024 History 18 Commits
SampleTestCases	SampleTestCases	Default files	Apr 15, 2024
datapoints	datapoints	Default files	Apr 15, 2024
outputs_basic	outputs_basic	Changed path	May 7, 2024
outputs_efficient	outputs_efficient	Changed path	May 7, 2024
.DS_Store	.DS_Store	Default files	Apr 15, 2024
.gitattributes	.gitattributes	Initial commit	Apr 15, 2024
CSCI570_Spring24_Project.pdf	CSCI570_Spring24_Project.pdf	Default files	Apr 15, 2024
README.md	README.md	Create README.md	May 16, 2024
Summary.docx	Summary.docx	Default files	Apr 15, 2024
Summary.pdf	Summary.pdf	Add files via upload	May 9, 2024
basic.sh	basic.sh	updated shell file	May 9, 2024
basic_3.py	basic_3.py	Shell file	May 9, 2024
efficient.sh	efficient.sh	updated shell file	May 9, 2024
efficient_3.py	efficient_3.py	Shell file	May 9, 2024
output.txt	output.txt	updated shell file	May 9, 2024
outy.txt	outy.txt	updated shell file	May 9, 2024
problem_size.txt	problem_size.txt	Stats	May 6, 2024
stats.py	stats.py	Changed path	May 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sequence Alignment Using Dynamic Programming and Divide and Conquer

Overview

Project Description

Problem Review

Input String Generator

Delta and Alpha Values

Implementation Details

Basic Algorithm

Memory-efficient Algorithm

How to Run

Prerequisites

Running the Basic Algorithm

Contributors

About

Releases

Packages

Contributors 3

Languages

darshanrao/CSCI-Algo-Sequence-Alignment

Folders and files

Latest commit

History

Repository files navigation

Sequence Alignment Using Dynamic Programming and Divide and Conquer

Overview

Project Description

Problem Review

Input String Generator

Delta and Alpha Values

Implementation Details

Basic Algorithm

Memory-efficient Algorithm

How to Run

Prerequisites

Running the Basic Algorithm

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages