Independent researcher and data analyst. Mostly working in statistical genetics.
- Ithaca, NY
Fast ordered sampling of rows from large text or binary files. Special cases for DNA variant files (.bed, VCF, HapMap, etc).
Efficient polymorphism summaries using hashes
Multitrait Gaussian mixture models for phenotype-based grouping of individuals
A collection of numerical methods and data structures