Skip to content

Commit

Permalink
Merge pull request #485 from brentp/docs
Browse files Browse the repository at this point in the history
docs on inheritance tools
  • Loading branch information
arq5x committed Jun 19, 2015
2 parents b701c92 + bf62c0f commit 25dc4bc
Show file tree
Hide file tree
Showing 2 changed files with 34 additions and 24 deletions.
8 changes: 5 additions & 3 deletions docs/content/history.rst
Original file line number Diff line number Diff line change
Expand Up @@ -4,16 +4,18 @@ Release History

(Future)
=======================================
#. Speed up and generalize database loading for multiple genome builds and species.
#. Support for both SO and HGVS terms.
#. Update parsing for snpEff >= 4.0
#. Speed up and generalize database loading for multiple genome builds and species.
#. Add an `is_splicing` column.

0.16.0
=======================================
1. The built-in inheritance model tools (``auto_rec``, etc.) have been modified to be more restrictive in order to remove false positive candidates. The strictness can be reduced by using the ``--lenient`` option.
1. The built-in inheritance model tools (``auto_rec``, etc.) have been modified to be more
restrictive in order to remove false positive candidates. The strictness can be reduced by using the ``--lenient`` option.
2. Leverage bcolz indexing for the built-in inheritance model tools to dramatically improve speed.
3. Support for multi-generational pedigrees for the built in inheritance model tools.
3. Support for multi-generational pedigrees for the built in inheritance model tools. (thanks to Jessica, Andrew,
and jmcelwee for extensive discussion https://github.com/arq5x/gemini/issues/388)
4. Leverage genotype likelihoods in tools other than ``mendel_errors`` as a means to filter variants.
5. Automatically phase genotypes by transmission on the fly for the `comp_hets` tool.
6. Further performance improvements for bcolz queries
Expand Down
50 changes: 29 additions & 21 deletions docs/content/tools.rst
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,11 @@ Other tools such as `mendel_errors` additional columns at the end.
unaffected samples share the same variants. The default will only
report variants that are unique to affected samples.

The design decisions for this change are described here:
https://github.com/arq5x/gemini/issues/388
a visual representation is here:
https://github.com/arq5x/gemini/blob/master/inheritance.ipynb


==================================
``common_args``: common arguments
Expand Down Expand Up @@ -124,6 +129,7 @@ heterozygotes for each sample/gene.
unphased genotypes. Any candidate that could be one element of a comp_het
will also be phaseable as long as the parents and their genotypes are known.


---------------------
Genotype Requirements
---------------------
Expand All @@ -133,6 +139,8 @@ Genotype Requirements
- No unaffected can be homozygous alterate at either site.
- Sites are automatically phased by transmission when parents are present in order to remove false positive candidates.

if `--lenient` is used then unknowns can be HET.


.. note::

Expand Down Expand Up @@ -226,6 +234,8 @@ Genotype Requirements
- (plausible de novo) kid is het. parents are same homozygotes
- (implausible de novo) kid is homozygoes. parents are same homozygotes and opposite to kid.

If allow `--only-affected` is used, then the tools will only consider samples that have parents
**and** are affected. The default is to consider any sample with parents.

This tool will report the probability of a mendelian error in the final column
that is derived from the genotype likelihoods if they are available.
Expand Down Expand Up @@ -270,7 +280,6 @@ Arguments are similar to the other tools:
variant in a gene.
--families FAMILIES Restrict analysis to a specific set of 1 or more
(comma) separated) families
--lenient Loosen the restrictions on family structure
-d MIN_SAMPLE_DEPTH The minimum aligned sequence depth (genotype DP) req'd
for each sample (def. = 0)
--gt-pl-max GT_PHRED_LL
Expand All @@ -294,16 +303,16 @@ Genotype Requirements
---------------------
- all affecteds must be het
- all unaffected must be homref or homalt
- [affected] all unaffected must be homref or homalt
- at least 1 affected kid must have unaffected parents
- [strict] if an affected has affected parents, it's not de_novo
- [strict] all affected kids must have unaffected (or no) parents
- [strict] warning if none of the affected samples have parents.

++++++
strict
++++++
The last 3 items, prefixed with [strict] can be turned off with `--lenient`

- if an affected has affected parents, it's not de_novo
- all affected kids must have unaffected (or no) parents
- warning if none of the affected samples have parents.
If `--allow-unaffected` is specified, then the item prefixed [affected] is not
required.


`Example PED file format for GEMINI`
Expand Down Expand Up @@ -428,14 +437,13 @@ Genotype Requirements
---------------------

- all affecteds must be hom_alt
- no unaffected can be hom_alt (can be unknown)
- [affected] no unaffected can be hom_alt (can be unknown)
- [strict] if parents exist they must be unaffected and het for all affected kids
- [strict] if there are no affecteds that have a parent, a warning is issued.

++++++
strict
++++++
if `--lenient` is specified, the 2 points prefixed with "[strict]" are not required.

- if parents exist they must be unaffected and het for all affected kids
- if there are no affecteds that have a parent, a warning is issued.
if `--allow-unaffected` is specified, the point prefix with "[affected]" is not required.


---------------------
Expand Down Expand Up @@ -552,16 +560,16 @@ Genotype Requirements
---------------------

- all affecteds must be het
- no unaffected can be het or homalt (can be unknown)
- [affected] no unaffected can be het or homalt (can be unknown)
- de_novo mutations are not auto_dom (at least not in the first generation)
- [strict] parents of affected cant have unknown phenotype.
- [strict] all affected kids must have at least 1 affected parent
- [strict] if no affected has a parent, a warning is issued.


++++++
strict
++++++
If `--lenient` is specified, the items prefixed with "[strict]" are not required.

- parents of affected cant have unknown phenotype.
- all affected kids must have at least 1 affected parent
- if no affected has a parent, a warning is issued.
If `--allow-unaffected` is specified, the item prefix with "[affected]" is not required.


---------------------
Expand Down

0 comments on commit 25dc4bc

Please sign in to comment.