Skip to content

All files relevant to perform paired-end NGS data analysis. The reported codes are associated with Zuercher, Kleefeldt, Funke, Birnbaum, Fredens et al. (2023).

JWChin-Lab/NGS-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

iSeq

This package can:

  1. automatically run the alignment using Bowtie2 (which can be visualised using IGV viewer);
  2. show you a report of the alignment in a txt file;
  3. call out all mutations in a csv file;
  4. generate a consensus sequence file in .txt format;
  5. generate a report called depth_check.csv that reports positions where read depth increases by more than 100% or drops by more than 50% in the space of one base. This is indicative of potential deletion or insertion.

Key advantages:

  1. Takes only a few seconds to drag-and-drop your alignment files and the reference file into the terminal. Minimal input required;
  2. Run multiple alignments in multiple terminal tabs at once, all using just drag-and-drop commands. Super fast to setup and particularly suitable when you have many small alignments to do regularly;
  3. The source code is open and users can customise the setting and parameters and even add thier own commands;
  4. All packages required are contained in a single folder (IGVtools, Samtools, Bowtie2) and save users a lot of time to download and install them individually;
  5. No unzipping required. The program can handle both gzipped or unzipped fastq files.

Motivation:

I couldn’t find any easy-to-use, either commercially available or open source, alignment tool that can align many files quickly and easily. Ugene is a free software that uses graphic user interface for alignment but users need to manually enter each alignment files and set parameters. It also offers limited customization and does not generate a mutation list, a consensus file and an alignment summary. The motivation of this package is to combine all of these into a single easy-to-use drag-and-drag alignment package.

Limitations:

  1. Currently, it takes only paired-end inputs;
  2. While I have satisfactory alignment results using the default setting and parameters, users who prefer to customise the setting would need to modify the run.sh file. Some understanding of Bowtie2 command and programming skills are required (Please refer to Bowtie2 manual for command);
  3. So far only tested on iOS

Requirement:

  1. Python 3 (Google it and install)
  2. Java 7 or above (Google it and install)
  3. Java Development Kit (JDK) (Google it and install)
  4. Python package: Pandas (To install, open the terminal and type 'pip3 install pandas' after you install Python 3)
  5. Python package: Biopython (To install, open the terminal and type 'pip3 install biopython' after you install Python 3)

To-Use:

  1. Download all contents of the repository, and place in folder called ‘iSeq-master’. Place iSeq-master on the Desktop. Unzip samtools, IGVtools and bowtie2 within the folder.

  2. Open the terminal (you can use the Search function in Mac to locate it, just search for ‘terminal’)

  3. Inside the terminal, type ‘nano ~/.bash_profile’ (this opens up a word editor called nano and edit a hidden file called .bash_profile)

  4. At the bottom of the script, type: export PATH=$PATH:./Desktop/iSeq-master (the purpose is to add ‘./Desktop/iSeq-master’ into your default PATH for easy access)

  5. Press ‘Ctrl + X’ to exit. When asked whether to save it or not, press ‘y’ and then ‘Enter’.

  6. Now you are ready to use the program. In the terminal, type ‘run.sh’ follow by a space, then drag and drop the following three items in order:

  7. first_paired_end_file.fastq (or in .gz format, output from BaseSpace without unzipping)

  8. second_paired_end_file.fastq (or in .gz format, output from BaseSpace without unzipping)

  9. reference_file_in_fasta_format.fasta

Example: run.sh paired1.fastq paired2.fastq ref.fasta

& press Enter.

The program will run automatically. When it is done, a message ‘Done’ will appear at the terminal. In the folder where you have the reference.fasta file, you will see a number of new files. The key files are:

  1. a filename ends with …fa_sorted.bam - this file can be opened and visualised in IGV Viewer (…fa_sorted.bam.bai must also be present in the same folder, do not delete it)
  2. a filename ends with …fa_call.csv - this file contains a list of all mutations
  3. a filename ends with …fa.stat.txt. - this file contains the stat for the alignment. Refer to Bowtie2 manual for details
  4. a filename ends with …_consensus_seq_txt - this file contains the consensus sequence of the alignment

To run multiple alignment in parallel, press ‘Ctrl + T’ in the terminal to open a new tab and repeat step 6. You can open as many terminals as you like and leaves them running overnight.

Disclaimer:

Some packages from other sources are included sorely for convenience to the users. The credits for these packages belong to the original content creators. The packages are:

  1. Bowtie2-2.3.2-legacy URL: http://bowtie-bio.sourceforge.net/bowtie2/index.shtml
  2. IGVtools URL: https://software.broadinstitute.org/software/igv/igvtools
  3. Samtools-1.3.1 URL: http://samtools.sourceforge.net

About

All files relevant to perform paired-end NGS data analysis. The reported codes are associated with Zuercher, Kleefeldt, Funke, Birnbaum, Fredens et al. (2023).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published