Skip to content
/ ASSIS Public

# ASSIS - Assignment Syntenyc Score Integrated System: A Platform to Evaluate Eukaryotic Whole Genome Annotation

Notifications You must be signed in to change notification settings

Juassis/ASSIS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


#Hardware/Software Requirements - 64 bit Linux or Mac OS X - x86-64 compatible processors

#Tutorial https://github.com/Juassis/iedb PDF

RELEASEnotes contains detailed information about the latest major release

#Directory Contents main dir

	assis.pl, params_model

bin: scripts

 compare_orthogroups_by_synteny.pl, export_rnaseq.pl, fix_gff_from_bedtools.sh, gene_overlap_report.pl, gff2fasta.pl, overlap_gff.pl, protein_check.pl, protein_coding_report.pl, rnaseq_report.pl

tests: DataSet Example

D_simulans_Assessment

	FASTA: Drosophila_simulans.fasta, Drosophila_melanogaster.fasta
	GFF:   Drosophila_simulans.gff3, Drosophila_melanogaster.gff3

	OUTPUT:
		PROTEINS: Drosophila_simulans_proteins.fasta
		REPORT: report_overlap, report_protein_coding

doc: documentation

Tutorial:PDF
Paper: Link

#Installation

 Interpreters: Perl, Python

Install dependencies: OrthoFinder, BedTools

####OrthoFinder:

Installation - Latest Development Version

All SO

####BedTools

Installation - Latest Version

  • MAC: brew install bedtools
  • Linux: apt-get install bedtools

ASSIS Software Installation

##ASSIS

# Get latest ASSIS releases from github

wget https://github.com/Juassis.tar.gz
tar -xzf XXX.tar.gz
cd ASSIS

# Alternatively, get ASSIS using git

git clone https://github.com/Juassis/iedb.git

Running the program:

To Run ASSIS use the Example DataSet:

Project/
	bin
	params_model
	assis.pl
	D_simulans_Assessment
	README.md
	Tutorial.md

	1- cd D_simulans_Assessment		

		FASTA/ Drosophila_simulans.fasta, Drosophila_melanogaster.fasta
		GFF/  Drosophila_simulans.gff3, Drosophila_melanogaster.gff3
		OUTPUT/

	2- perl assis.pl -d D_simulans_Assessment -p params_model

Running with the own DataSet:

Command Line and Options:

1- mkdir Project

2- cd Project

	3- mkdir FASTA
		3.1 ln -s genome.fasta (Evaluated and Reference (Closely related species)) FASTA;

	4- mkdir GFF
		4.1 ln -s genome.gff (Evaluated and Reference) GFF;

		* FASTA and GFF should have the same suffix (See Example DataSet for fruit fly genome); *

	 	4.2 This Only for RNASeq Mapped Data (See the tutorial to generate a bam/bed file)


5- Edit the params file (see the tutorial)

6- mkdir OUTPUT

7- perl bin/Assis.pl -d Project -p params_model

###Limitations

Splicing leader/ Trans splicing

Fragmented genomes (Finished genomes annotation to help)

This release was tested with the default parameters for drosophila, mouse and parasites genomes. Mammal genomes require at least 4GB of RAM, ideally 16GB.

About

# ASSIS - Assignment Syntenyc Score Integrated System: A Platform to Evaluate Eukaryotic Whole Genome Annotation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages