@@ -9,10 +9,10 @@ Axe Usage
9
9
did not change.
10
10
11
11
Axe has several usage modes. The primary distinction is between the two
12
- alternate barcoding schemes, single and combinatorial barcoding . Single barcode
13
- matching is used when only the first read contains barcode sequences.
14
- Combinatorial barcoding is used when both reads in a read pair contain
15
- independent (typically different) barcode sequences.
12
+ alternate indexing schemes, single and combinatorial indexing . Single index
13
+ matching is used when only the first read contains index sequences.
14
+ Combinatorial indexing is used when both reads in a read pair contain
15
+ independent (typically different) index sequences.
16
16
17
17
For concise reference, the command-line usage of ``axe-demux `` is reproduced
18
18
below:
@@ -37,8 +37,8 @@ default) and 9, where 0 indicates plain text output (``gzopen`` mode "wT"), and
37
37
fastest and 9 is most compact.
38
38
39
39
The output flags should be prefixes that are used to generate the output file
40
- name based on the barcode 's (or barcode pair's) ID. The names are generated as:
41
- ``prefix `` + ``_ `` + ``barcode ID `` + ``_ `` + ``read number `` + ``.extension ``.
40
+ name based on the index 's (or index pair's) ID. The names are generated as:
41
+ ``prefix `` + ``_ `` + ``index ID `` + ``_ `` + ``read number `` + ``.extension ``.
42
42
The output file for reads that could not be demultiplexed is ``prefix `` + ``_ ``
43
43
+ ``unknown `` + ``_ `` + ``read number `` + ``.extension ``. The read number is
44
44
omitted unless the paired read file scheme is used, and is "il" for interleaved
@@ -51,65 +51,65 @@ The corresponding CLI flags are:
51
51
- ``-r `` and ``-R ``: Paired R2 file input and output.
52
52
- ``-i `` and ``-I ``: Interleaved paired input and output.
53
53
54
- The barcode file
54
+ The index file
55
55
----------------
56
56
57
- The barcode file is a tab-separated file with an optional header. It is
57
+ The index file is a tab-separated file with an optional header. It is
58
58
mandatory, and is always supplied using the ``-b `` command line flag. The exact
59
- format is dependent on barcoding mode, and is described further in the sections
59
+ format is dependent on indexing mode, and is described further in the sections
60
60
below. If a header is present, the header line must start with either
61
- `Barcode ` or ``barcode ``, or it will be interpreted as a barcode line, leading
61
+ `Barcode ` or ``index ``, or it will be interpreted as a index line, leading
62
62
to a parsing error. Any line starting with ';' or '#' is ignored, allowing
63
- comments to be added in line with barcodes . Please ensure that the software
64
- used to produce the barcode uses ASCII encoding, and does not insert a
63
+ comments to be added in line with indexes . Please ensure that the software
64
+ used to produce the index uses ASCII encoding, and does not insert a
65
65
Byte-order Mark (BoM) as many text editors can silently use Unicode-based
66
66
encoding schemes. I recommend the use of
67
67
`LibreOffice Calc <www.libreoffice.org >`_ (part of a free and open source
68
- office suite) to generate barcode tables; Microsoft Excel can also be used.
68
+ office suite) to generate index tables; Microsoft Excel can also be used.
69
69
70
70
Mismatch level selection
71
71
------------------------
72
72
73
- Independent of barcode mode, the ``-m `` flag is used to select the maximum
74
- allowable hamming distance between a read's prefix and a barcode to be
75
- considered as a match. As "mutated" barcodes must be unique, a hamming distance
76
- of one is the default as typically barcodes are designed to differ by a hamming
73
+ Independent of index mode, the ``-m `` flag is used to select the maximum
74
+ allowable hamming distance between a read's prefix and a index to be
75
+ considered as a match. As "mutated" indexes must be unique, a hamming distance
76
+ of one is the default as typically indexes are designed to differ by a hamming
77
77
distance of at least two. Optionally, (using the ``-p `` flag), axe will allow
78
- selective mismatch levels, where, if clashes are observed, the barcode will
79
- only be matched exactly. This allows one to process datasets with barcodes that
78
+ selective mismatch levels, where, if clashes are observed, the index will
79
+ only be matched exactly. This allows one to process datasets with indexes that
80
80
don't have a sufficiently high distance between them.
81
81
82
- Single barcode mode
82
+ Single index mode
83
83
-------------------
84
84
85
- Single barcode mode is the default mode of operation. Barcodes are matched
86
- against read one (hereafter the forward read), and the barcode is trimmed from
85
+ Single index mode is the default mode of operation. Barcodes are matched
86
+ against read one (hereafter the forward read), and the index is trimmed from
87
87
only the forward read, unless the ``-2 `` command line flag is given, in which
88
- case a prefix the same length as the matched barcode is also trimmed from the
88
+ case a prefix the same length as the matched index is also trimmed from the
89
89
second or reverse read. Note that sequence of this second read is not checked
90
90
before trimming.
91
91
92
- In single barcode mode, the barcode file has two columns: ``Barcode `` and
92
+ In single index mode, the index file has two columns: ``Barcode `` and
93
93
``ID ``.
94
94
95
- Combinatorial barcode mode
95
+ Combinatorial index mode
96
96
--------------------------
97
97
98
- Combinatorial barcode mode is activated by giving the ``-c `` flag on the
99
- command line. Forward read barcodes are matched against the forward read, and
100
- reverse read barcodes are matched against the reverse read. The optimal
101
- barcodes are selected independently, and the barcode pair is selected from
102
- these two barcodes . The respective barcodes are trimmed from both reads; the
103
- ``-2 `` command line flag has no effect in combinatorial barcode mode.
98
+ Combinatorial index mode is activated by giving the ``-c `` flag on the
99
+ command line. Forward read indexes are matched against the forward read, and
100
+ reverse read indexes are matched against the reverse read. The optimal
101
+ indexes are selected independently, and the index pair is selected from
102
+ these two indexes . The respective indexes are trimmed from both reads; the
103
+ ``-2 `` command line flag has no effect in combinatorial index mode.
104
104
105
- In combinatorial barcode mode, the barcode file has three columns:
106
- ``Barcode1 ``, ``Barcode2 `` and ``ID ``. Individual barcodes can occur many times
107
- within the forward and reverse barcodes , but barcode pairs must be unique
105
+ In combinatorial index mode, the index file has three columns:
106
+ ``Barcode1 ``, ``Barcode2 `` and ``ID ``. Individual indexes can occur many times
107
+ within the forward and reverse indexes , but index pairs must be unique
108
108
combinations.
109
109
110
- The Demultipexing Statistics File
111
- ---------------------------------
110
+ The Demultiplexing Statistics File
111
+ ----------------------------------
112
112
113
113
The ``-t `` option allows the output of per-sample read counts to a
114
114
tab-separated file. The file will have a header describing its format, and
115
- includes a line for unbarcoded reads.
115
+ includes a line for reads which could not be demultiplexed .
0 commit comments