veQTL

Variable expression quantitative trait loci, an R script which identifies genes that are variably expressed between two or more genotypes. The veQTL_engine.R requires multiple edits to make it usable. Firstly, the expr (line 8) needs to be mapped to the expression data (IDs as rownames), secondly the thresholds can be estimated to limit the p-value computating.

Input data

Samples must be matched in order between two matrices (genotype and expression). Requires genotype to be a 012 matrix where genotypes are represented as the count of the minor allele (-1 = no call), genotypes must be rownames. Currently can only handle rownames with unique SNP IDs and no other annotations. For expression data this needs to be a matrix of expression values transcript ID (or other unique identifiers) as rownames.

veQTL_engine

Currently, this computes only the Brown-Forsythe test (a robust Levene's tests) on all genotypes compared to all transcripts. Any statistic meet the threshold (lines 85-90) will be retained and have p-values calculated.

Estimate W threshold

Use 1-pf(1:100, 3-1, N-1) to estimate p-value for W statistic from 1-100, set N at the smallest number of called genotypes. Repeat for two genotypes e.g( 1-pf(1:100,2-1,N-1) ), find the smallest W the meets your desired p-value.

Output

Data is output as a list of SNP with transcript statistics and pvalues, this is messy but can use veQTL_wrangler.R to tidy up and add annotation as desired.

Implementation of large genotypes

Genotypes can be split into chunks and run in parallel using split in unix and passing each chunk into a new instance of R with Rscript and a for loop (Example in loop_veQTL). The veQTL_wrangler script is able to stitch these together.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
pre_processing		pre_processing
LeadVariant_LD.R		LeadVariant_LD.R
README.md		README.md
loop_veQTL		loop_veQTL
veQTL_engine.R		veQTL_engine.R
veQTL_wrangler.R		veQTL_wrangler.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

veQTL

Input data

veQTL_engine

Estimate W threshold

Output

Implementation of large genotypes

About

Releases

Packages

Languages

wigge206/veQTL

Folders and files

Latest commit

History

Repository files navigation

veQTL

Input data

veQTL_engine

Estimate W threshold

Output

Implementation of large genotypes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages