Skip to content

Commit

Permalink
Merge pull request #21 from arnavm/master
Browse files Browse the repository at this point in the history
Updated calling card track documentation
  • Loading branch information
lidaof committed Jan 29, 2020
2 parents e6015d2 + 4e29b48 commit 59030fc
Show file tree
Hide file tree
Showing 2 changed files with 32 additions and 4 deletions.
27 changes: 27 additions & 0 deletions docs/datahub.rst
Original file line number Diff line number Diff line change
Expand Up @@ -268,6 +268,33 @@ Example genomealign track
}
}
Example callingcard track
-------------------------

.. code-block:: json
{
"type":"callingcard",
"url":"https://htcf.wustl.edu/files/RdNgrGeQ/HCT116-PBase.ccf.gz",
"name":"piggyBac insertions",
"showOnHubLoad":"true",
"options":{
"color":"#D12134",
"height":100,
"logScale":"log10",
"show":"sample",
"sampleSize":1000,
"markerSize":5,
"opacity":[50],
},
}
.. note:: Default callingcard track options are ``"logScale":"none"``, ``"show":"all"``, ``"markersize":3``, and ``"opacity":[100]``.
Log-scaling can be set by specifying ``"logScale":"log10"``. To change opacity, pass a single value in brackets, as in the above example.
Calling card tracks will, by default, plot all entries in view. For information-dense regions, this can lead to excessive memory consumption.
To plot a random subsample instead, specify ``"show":"sample"`` and pass the number of points to visualize to ``"sampleSize"``, e.g.
``"sampleSize":1000``

Example matplot track
---------------------

Expand Down
9 changes: 5 additions & 4 deletions docs/tracks.rst
Original file line number Diff line number Diff line change
Expand Up @@ -391,7 +391,7 @@ for a full list of available cool tracks please check http://higlass.io/api/v1/t
Calling card track
------------------

Calling card data must be stored in a tab-delimited, plain text format. This format requires a minimum of four columns and can support up to six. The four required columns are CHROM, START, STOP, and COUNT, where COUNT refers to the number of reads for that insertion. The START and STOP columns can be either 0- or 1-indexed. The fifth and sixth columns are optional and represent STRAND and BARCODE, respectively. Here is an example of a four-column calling card file::
Calling Card Format (CCF) is tab-delimited, plain text format for discrete genomic data, such as transposon insertions. This format requires a minimum of four columns and supports up to six. The four required columns are CHROM, START, END, and VALUE, where VALUE is a numeric value (i.e. an int or float). As with BED files, the START and END coordinates are 0-indexed. The fifth and sixth columns are optional and represent STRAND and ANNOTATION, respectively. The ANNOTATION column can be used to store sample- or entry- specific information, such as a replicate barcode. Here is an example of a four-column CCF file::

chr1 41954321 41954325 1
chr1 41954321 41954325 18
Expand All @@ -405,7 +405,7 @@ Calling card data must be stored in a tab-delimited, plain text format. This for
chr1 54747055 54747059 4
chr1 60748489 60748493 2

Here is an example of a six-column calling card file::
Here is an example of a six-column CCF file::

chr1 51441754 51441758 1 - CTAGAGACTGGC
chr1 51441754 51441758 21 - CTTTCCTCCCCA
Expand All @@ -418,11 +418,12 @@ Here is an example of a six-column calling card file::
chr1 61106283 61106287 7 - CGTTTTTCACCT
chr1 61542006 61542010 1 - CTGAGAGACTGG

our text file must be sorted by the first three columns. If your filename is example.ccf, you sort it with the following command: ``sort -k1V -k2n -k3n example.ccf > example_sorted.ccf``
Your text file must be sorted by the first three columns. If your filename is example.ccf, you can sort it with the following command: ``sort -k1V -k2n -k3n example.ccf > example_sorted.ccf``
Alternatively, with ``bedops``: ``sort-bed example.ccf > example_sorted.ccf``

Note that you can have strand information without a barcode, but you cannot have barcode information without a strand column.

Place your sorted text file in the public folder. Since genomic data is often large, we must compress and index it for fast retrieval. Use the following commands to do so::
Place your sorted CCF file in a web-accessible directory, then compress and index as follows::

bgzip example_sorted.ccf
tabix -p bed example_sorted.ccf.gz
Expand Down

0 comments on commit 59030fc

Please sign in to comment.