AptaZ and AptaFastZ

Repository of code associated with the article "A high-dimensional microfluidic approach for selection of aptamers with programmable binding affinities", published in Nature Chemistry.

AptaZ is an algorithm that generates aptamers with a desired binding affinity on the basis of a high-content sequencing dataset. AptaFastZ is a faster version of AptaZ with the same precision. We recommend using AptaFastZ.

(Sample dataset is available at https://zenodo.org/record/8106594)

Installation

• Make sure your computer has at least 8 GB of RAM
• Install MATLAB 2021a or a later version

Running

• Convert the raw sequencing data (fastq) into a txt file with the following format using USEARCH or a similar tool. Make sure to use a semicolon (;) as the separator.

#sequence_name;count;sequence

For example, #seq1;size=7;TGTAGCAGCACAGAGGTCAGATGTGTAGCAGCACAGAGGTCAGATG
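The conversion can also be done without USEARCH. The following Python snippet is a minimal sketch, assuming an uncompressed FASTQ file with four lines per record and assuming that identical reads should be collapsed into one entry whose count is written as `size=N`, as in the example above. The function names, the `seq<N>` naming scheme, and the ordering by descending count are illustrative assumptions and not part of AptaZ. Primer trimming and quality filtering are not shown.

```python
from collections import Counter
from typing import Iterable, Iterator, List


def read_fastq_sequences(lines: Iterable[str]) -> Iterator[str]:
    """Yield the sequence line of each 4-line FASTQ record."""
    for i, line in enumerate(lines):
        if i % 4 == 1:  # the second line of every record holds the bases
            seq = line.strip().upper()
            if seq:
                yield seq


def to_aptaz_lines(sequences: Iterable[str]) -> List[str]:
    """Collapse identical reads and format them as '#name;size=N;sequence'."""
    counts = Counter(sequences)
    # Most abundant first; ties are broken alphabetically for a stable output.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [f"#seq{n};size={count};{seq}"
            for n, (seq, count) in enumerate(ranked, start=1)]


def convert_fastq_to_txt(fastq_path: str, txt_path: str) -> None:
    """Hypothetical helper: write one formatted line per unique sequence."""
    with open(fastq_path) as src:
        lines = to_aptaz_lines(read_fastq_sequences(src))
    with open(txt_path, "w") as dst:
        dst.write("\n".join(lines) + "\n")
```

A call such as `convert_fastq_to_txt("sample.fastq", "16-10p-z1.txt")` (both file names are placeholders) would produce one txt file per sample.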

• Place all sequence txt files from the SORTED samples in one folder and name the files using the following format. Make sure the folder contains only the txt files of the SORTED samples.

flow_rate-target_concentration-zone_number.txt

For example, 16-10p-z1.txt. This name indicates the sequences were sorted under the condition of 16 mL/hr, 10p target concentration, and recovered from the first zone (zone 1).
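To catch misnamed files before a long run, the naming convention can be checked with a short script. This is a sketch only: it assumes the three fields are separated by plain hyphens and that the zone is written as a lowercase `z` followed by a number, as in `16-10p-z1.txt`. The `SortCondition` type and `parse_sample_name` function are hypothetical names, and how the MATLAB scripts themselves read the file name is not shown here.

```python
import re
from typing import NamedTuple


class SortCondition(NamedTuple):
    flow_rate: str             # e.g. "16" (mL/hr in the example above)
    target_concentration: str  # e.g. "10p", kept as text
    zone: int                  # e.g. 1 for "z1"


# flow rate, concentration, and "z" + zone number, separated by hyphens
_NAME_PATTERN = re.compile(r"^([^-]+)-([^-]+)-z(\d+)\.txt$")


def parse_sample_name(filename: str) -> SortCondition:
    """Split a name like '16-10p-z1.txt' into its three fields."""
    match = _NAME_PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unexpected sample file name: {filename!r}")
    flow_rate, concentration, zone = match.groups()
    return SortCondition(flow_rate, concentration, int(zone))
```

Names that do not follow the pattern raise a `ValueError`, so a loop over the folder contents stops at the first file that needs renaming.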

• Place the reference (unsorted) txt file in any folder other than the one containing the SORTED samples.
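Because a run can take many hours, it may be worth verifying the folder layout first. The sketch below checks the two requirements stated above: the SORTED folder contains only txt files, and the reference file lives somewhere else. The function name `check_sample_layout` and the wording of the messages are illustrative assumptions, not part of AptaZ.

```python
from pathlib import Path
from typing import List


def check_sample_layout(sorted_dir: str, reference_file: str) -> List[str]:
    """Return a list of problems found; an empty list means the layout looks fine."""
    folder = Path(sorted_dir).resolve()
    reference = Path(reference_file).resolve()
    problems = []

    # Every entry in the SORTED folder should be a txt file.
    for entry in sorted(folder.iterdir()):
        if entry.is_dir() or entry.suffix.lower() != ".txt":
            problems.append(f"not a txt file: {entry.name}")

    # The reference (unsorted) file must not sit in the SORTED folder.
    if reference.parent == folder:
        problems.append(f"reference file is inside the SORTED folder: {reference.name}")
    if not reference.is_file():
        problems.append(f"reference file not found: {reference}")
    return problems
```

Hidden system files in the SORTED folder are reported as well, since they are not txt files.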

• Run ‘Z_score_calculation_fast.m’ and follow the prompts to select the reference file (txt format) and the folder containing the txt files of the SORTED samples.

The code will create a new folder named ‘Z-results’ in the location of ‘Z_score_calculation.m’ and store the calculated Z scores for each condition there in mat format.

The calculation typically takes 6 to 24 hours, depending on the size of the dataset, and uses about 4 GB of RAM.

• Run ‘Sum_Z_calculation.m’ and select the ‘Z-results’ folder (or the renamed folder, if you renamed it).

The code will create a new folder named ‘Sum-Z-results’ in the location of ‘Sum_Z_calculation.m’ and store the SumZ score of each sequence there in csv format.

• Interpret the data using Excel, UltraEdit, or other software.

Contact

Please send your inquiry to zongjie.wang@northwestern.edu and cc shana.kelley@northwestern.edu
