This is the official repository for the HAYSTAC paper.
haystack
is an easy to use pipeline for metagenomic identifications
You can easily:
- Construct a database
- Prepare your sample for analysis, including trimming sequencing adapters, collapsing paired end reads and also downloading data from the SRA
- Perform species identification
- Perform a chemical damage pattern analysis
All the code, setup script, run scripts and config files for the performance tests can be found under the performance_test directory.
You will need to clone the repository and then sequentially run the setup and run scripts from inside their directory.
For the simulation datasets you can fetch the database that was used in the apper and then follow the code examples. For the simulation we use gargammel (REF), so you will need to install that program first.
The relevant scripts can be found under the simulating_datasets directory.
For every analysis we have performed in this paper, we include the commands used in scripts under the analysis directory.
MIT
Happening soon