luizirber/phd

The PhD repo


Qualifying Exam

The exam took place on 2019-04-17.

Dissertation

The dissertation is built with aggiedown and uses GitHub Actions for CI. Tagged versions are available on the Releases page.

Experiments

smol gather

experiments/smol_gather Binder

Comparison of containment approaches using MinHash:

  • CMash (containment minhash)
  • mash screen
  • smol (scaled minhash)
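As a rough illustration of the scaled MinHash idea compared here, the sketch below keeps only the k-mer hashes that fall under a threshold of `MAX_HASH / scaled` and estimates containment from the overlap of two such sketches. It is a minimal sketch under stated assumptions, not the code used in the experiment: the function names, the use of `blake2b` as the hash function, and the defaults for `k` and `scaled` are all hypothetical, and canonical (reverse-complement aware) k-mers are ignored.

```python
import hashlib

MAX_HASH = 2**64  # size of the 64-bit hash space (assumption for this sketch)


def kmers(seq, k):
    """Yield every overlapping k-mer of seq."""
    return (seq[i:i + k] for i in range(len(seq) - k + 1))


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer (blake2b is a stand-in hash function)."""
    digest = hashlib.blake2b(kmer.encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def scaled_minhash(seq, k=21, scaled=1000):
    """Keep only the hashes below MAX_HASH // scaled (about 1/scaled of them)."""
    threshold = MAX_HASH // scaled
    return {h for h in map(hash_kmer, kmers(seq, k)) if h < threshold}


def containment(query, reference):
    """Estimate the fraction of the query sketch that is found in the reference."""
    if not query:
        return 0.0
    return len(query & reference) / len(query)
```

Because both sketches use the same threshold, a hash kept for a query that is a substring of the reference is also kept for the reference, which is what makes the containment estimate possible without a fixed sketch size.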

To regenerate the results (after completing the steps under Setup):

conda activate thesis
cd experiments/smol_gather && snakemake --use-conda

Scaled MinHash sizes

experiments/sizes Binder

Analysis of Scaled MinHash sizes (number of hashes) across domains in GenBank.
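To give a feel for why sketch size varies with the dataset, the sketch below assumes hashes are uniformly distributed, so a Scaled MinHash keeps roughly one hash per `scaled` distinct k-mers. The function names and the numbers in the usage lines are illustrative assumptions, not results from this experiment.

```python
import random

MAX_HASH = 2**64  # size of the 64-bit hash space (assumption for this sketch)


def expected_sketch_size(distinct_kmers, scaled):
    """Expected number of retained hashes under uniform hashing."""
    return distinct_kmers / scaled


def simulated_sketch_size(distinct_kmers, scaled, seed=42):
    """Draw one random 64-bit 'hash' per distinct k-mer and count those kept."""
    rng = random.Random(seed)
    threshold = MAX_HASH // scaled
    return sum(1 for _ in range(distinct_kmers) if rng.getrandbits(64) < threshold)


# Illustrative input only: a hypothetical genome with 5 million distinct k-mers.
print(expected_sketch_size(5_000_000, 1000))
print(simulated_sketch_size(100_000, 100))
```

Under this assumption the sketch grows linearly with the number of distinct k-mers, so larger genomes produce proportionally larger sketches at the same `scaled` value.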

Inverted index and shared hashes

experiments/lca Binder

Analyzing unique and shared hashes in an inverted index.
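One way to picture this analysis is an inverted index that maps each hash to the set of datasets containing it; hashes seen in exactly one dataset are unique, and the rest are shared. The following is a minimal sketch with hypothetical function names and toy data, not the experiment's actual implementation.

```python
from collections import defaultdict


def build_inverted_index(sketches):
    """Map each hash to the set of dataset names whose sketch contains it."""
    index = defaultdict(set)
    for name, hashes in sketches.items():
        for h in hashes:
            index[h].add(name)
    return index


def split_unique_shared(index):
    """Separate hashes found in exactly one dataset from those found in several."""
    unique = {h for h, names in index.items() if len(names) == 1}
    shared = {h for h, names in index.items() if len(names) > 1}
    return unique, shared


# Toy sketches (made-up hash values, for illustration only).
toy_sketches = {"A": {1, 2, 3}, "B": {3, 4}, "C": {1, 3, 5}}
toy_unique, toy_shared = split_unique_shared(build_inverted_index(toy_sketches))
print(sorted(toy_unique), sorted(toy_shared))  # prints [2, 4, 5] [1, 3]
```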

Setup

All processing and analysis scripts were run using the conda environment specified in environment.yml. To build and activate this environment, run:

conda env create --force --file environment.yml

conda activate thesis