Skip to content

Commit

Permalink
fixed typo
Browse files Browse the repository at this point in the history
  • Loading branch information
qiyunzhu committed Aug 2, 2021
1 parent 7dfb4ec commit 94d9667
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions doc/wol.md
Original file line number Diff line number Diff line change
Expand Up @@ -187,7 +187,7 @@ Make an abbreviation:
mcdir=function/metacyc
```

Collapse ORFs into MetaCyc proteins.
Collapse ORFs into MetaCyc proteins:

```bash
woltka tools collapse -i gene.biom -m $mc/protein.map.xz -n $mc/protein_name.txt -o protein.biom
Expand Down Expand Up @@ -220,7 +220,7 @@ So on so forth. See [here](metacyc.md) for a graph of all available collapsing d

`classify` only supports a tree structure, in which one child unit has exactly one parent unit. This is typical in taxonomic classification. If multiple parents are present, all but the first parent will be discarded. In contrast, `collapse` supports **one-to-multiple** mappings, therefore it is more suitable when this is the norm instead of exception, especially in functional classification (where one gene can be involved in multiple metabolic pathways).

`classify` always ensures the [**compositionality**](https://en.wikipedia.org/wiki/Compositional_data) of the feature table, in which the frequencies match the numbers of aligned sequences. `collapse` however does not by default: In a one-to-multiple mapping, all parents will be counted once. But one can add `-d` to the `collapse` command to divide the counts by the number of parents so that the compositionality is retained. See [here](#collapse.md#considerations) for a detailed discussion.
`classify` always ensures the [**compositionality**](https://en.wikipedia.org/wiki/Compositional_data) of the feature table, in which the frequencies match the numbers of aligned sequences. `collapse` however does not by default: In a one-to-multiple mapping, all parents will be counted once. But one can add `-d` to the `collapse` command to divide the counts by the number of parents so that the compositionality is retained. See [here](collapse.md#considerations) for a detailed discussion.

However, if the mappings are **unique** (one-to-one), the two methods produce mutually identical results (minor differences may arise during number rounding), and the concern of compositionality is no longer relevant.

Expand Down

0 comments on commit 94d9667

Please sign in to comment.