Skip to content

Commit

Permalink
Merge pull request #10 from luispedro/fix_md
Browse files Browse the repository at this point in the history
BUG+DOC Fix markdown notation
  • Loading branch information
psj1997 committed Jun 8, 2020
2 parents 1f4c3c0 + ac02cae commit fa1c8e6
Showing 1 changed file with 9 additions and 10 deletions.
19 changes: 9 additions & 10 deletions docs/output.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,8 +2,7 @@

Explanation of the files in the output


## prodigal\_out.faa , prodigal\_out.fna , gene.coords.gbk
## Prodigal output

These three files are the output prodigal.

Expand All @@ -12,17 +11,17 @@ These three files are the output prodigal.
- `gene.coords.gbk` gene information in Genebank format


## hit\_table.tsv :
## Hit Table (`hit_table.tsv`)

The results of the queries to the GMGC.

There are five columns in the file.

- `query_name`: the name/id of the input genome contig
- `gene_id`: the gener\_id with the best score in GMGC
- `gene_id`: the Unigene with the best score in GMGC
- `align_category: there are four different classes of alignment (see below)
- `gene\_dna`: the DNA sequence of the best hit in GMGC
- `gene\_protein`: the protein sequence of the best hit in GMGC
- `gene_dna`: the DNA sequence of the best hit in GMGC
- `gene_protein`: the protein sequence of the best hit in GMGC

### Alignment category

Expand All @@ -35,21 +34,21 @@ There are five columns in the file.
- `NO MATCH`: no match in GMGC.


## `genome\_bin.tsv`
## Genome bins (`genome_bin.tsv`)

Genome bins (MAGs) found in the results (and a count of how often many genes
are contained in them).

There are two columns in the file.

- `genome\_bin`: the name of genome bins in GMGC
- `times\_gene\_hit`: the times of input genes hitting it
- `genome_bin`: the name of genome bins in GMGC
- `times_gene_hit`: the times of input genes hitting it

Note that GMGC unigenes can while not all GMGC unigenes are contained in a
genome bin, some are contained in many. Thus, the total counts will not (except
by coincidence) correspond to the number of genes queried.

## summary.txt
## Summary (`summary.txt`)

Human-readable summary of the results.

0 comments on commit fa1c8e6

Please sign in to comment.