-
Notifications
You must be signed in to change notification settings - Fork 0
Advanced Customization
Jie Huang edited this page Nov 17, 2021
·
3 revisions
git clone https://github.com/jielab/pageant.git cd pageant git pull # for getting an updated version conda env create -f environment_linux.yml # "environment_macos.yml" for Mac-OS, "environment_win.yml" for Windows
Download 1000 genomes project (G1K) genotype, to be used as population reference. The basic version of PAGEANT comes with a subset of the G1K data that includes all SNPs used in the genetic report.
- For Windows user using Ubuntu Linux, an upgrade to Windows 11 is recommended, which supports WSL GUI. In Windows Powershell, run "wsl --update" to get the latest version.
conda activate pageant python ./GUI.py
conda activate pageant python ./pageant.py -n test -i ./personal_genome/HG001.vcf.gz -o output
- for umap, first download the sample information file if using 1000 genomes as reference.
- for add_rsid, first download the rsids-v*-hg*.tsv.gz file towards the end of pheweb resource page. Make sure that the chromosome and positions in the input GWAS file is sorted first, by using sort -k 1,1n -k 2,2n.
- for liftOver, first download the chain files, for hg19 and/or for hg38.
- for help: python pageant.py [API-NAME] --help
- conda activate pageant - python pageant.py umap -s HG001.vcf.gz -p g1k.vcf.gz -m g1k_samples.txt - python pageant.py add_rsid -i test.gwas.gz --chr chr --pos pos -d rsids-v154-hg19.tsv.gz -o new.gwas.gz - python pageant.py liftOver -i test.gwas.gz --chr chr --pos pos -c hg38ToHg19.over.chain.gz -o new.GWAS.gz - python pageant.py qr_code -s fingerprint_snps.txt -k key
The folder structures of PAGEANT is shown below. Advanced users could also follow this structure to customize the genetic report.
For example, under "algorithm database" folder, there are 3 files for each trait folder: TRAIT.desc.txt for description text, TRAIT.jpg for a representative picture, TRAIT.snps.ref for a list of SNPs used and the relevant calculation rules.
For qualitative traits, the TRAIT.snps.ref has four columns: SNP, genotype, matched, unmatched; For quantitative trait, the TRAIT.snps.ref file requires three columns: SNP, EA (effect allele), BETA or OR (effect size).