# Meta-analysis of GWAS data

Results from different GWAS can be combined with the standard-error-based (inverse-variance weighted) meta-analysis method implemented in METAL, with each study corrected by its own genomic inflation factor. Cochran's Q test for heterogeneity and I² estimates are generated to evaluate the potential effect of between-study heterogeneity on the results.

Note: this template has not been run because only one GWAS dataset is available.

METAL documentation: https://genome.sph.umich.edu/wiki/METAL_Documentation

In [1]:
%load_ext rpy2.ipython

**Input File Columns**

Each input file should include the following information:

- A column with marker name, which should be consistent across studies
- A column indicating the tested allele
- A column indicating the other allele


If you are carrying out a sample size weighted analysis (based on p-values), you will also need:

- A column indicating the direction of effect for the tested allele
- A column indicating the corresponding p-value
- An optional column indicating the sample size (if the sample size varies by marker)


If you are carrying out a meta-analysis based on standard errors, you will need:

- A column indicating the estimated effect size for each marker
- A column indicating the standard error of this effect size estimate

The header for each of these columns must be specified so that METAL knows how to interpret the data. Additional columns including allele frequency information, strand information, and others can also be present.
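Before running METAL, it can help to confirm that each input file has the columns the chosen scheme needs. The sketch below is a minimal check under stated assumptions: the role names, the `missing_columns` helper, and the example header are hypothetical, and the column labels (`SNP`, `A1`, `A2`, `OR`, `P`, `SE`) simply mirror the ones used in the METAL scripts in this notebook.

```python
# Roles every input file needs, plus the roles specific to each scheme.
COMMON_ROLES = ("marker", "tested_allele", "other_allele")
SCHEME_ROLES = {
    "SAMPLESIZE": ("effect", "pvalue"),  # direction of effect and p-value
    "STDERR": ("effect", "stderr"),      # effect size and its standard error
}


def missing_columns(header_line, column_map, scheme, separator="\t"):
    """Return (role, label) pairs whose label is absent from the header.

    header_line: first line of the input file
    column_map:  dict mapping a role name to the column label in the file
    scheme:      "SAMPLESIZE" or "STDERR"
    """
    present = set(header_line.rstrip("\n").split(separator))
    roles = COMMON_ROLES + SCHEME_ROLES[scheme]
    return [
        (role, column_map.get(role))
        for role in roles
        if column_map.get(role) not in present
    ]


# Hypothetical header that lacks a standard error column.
header = "SNP\tA1\tA2\tFRQ\tOR\tP\n"
labels = {
    "marker": "SNP",
    "tested_allele": "A2",
    "other_allele": "A1",
    "effect": "OR",
    "pvalue": "P",
    "stderr": "SE",
}
print(missing_columns(header, labels, "SAMPLESIZE"))
print(missing_columns(header, labels, "STDERR"))
```

An empty list means the file has everything the scheme requires; optional columns such as allele frequency or sample size are deliberately not checked here.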

**Selecting an Analysis Scheme**

     1) SCHEME SAMPLESIZE        - default approach, uses p-value and direction of effect, weighted according to sample size
     
     The weight for each MARKER can be stored in a column in the table (specified with the WEIGHTLABEL or WEIGHT commands). Most commonly, the weight will be the number of individuals contributing to that particular p-value.
     
     WEIGHTLABEL     N
     
     Alternatively, the same weight can be used for every marker in an input file, in which case the fixed weight is set with the DEFAULTWEIGHT command. Because the WEIGHTLABEL command takes precedence over the DEFAULTWEIGHT command, the weight column label in use must not match any column in the input file for the default weight to take effect.
     
     WEIGHTLABEL     DONTUSECOLUMN
     DEFAULTWEIGHT   1000
    
    
    2) SCHEME STDERR            - classical approach, uses effect size estimates and standard errors

     For this approach, you need to specify the label for the standard error column:

     STDERR SE                
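To make the sample-size scheme concrete, the sketch below converts each study's p-value and direction of effect into a signed Z-score and combines them with weights equal to the square root of the sample size, which is the approach described in the METAL documentation. The function name `sample_size_meta` and the example numbers are hypothetical, and this is an illustration of the formula rather than METAL's implementation.

```python
import math
from statistics import NormalDist


def sample_size_meta(pvalues, directions, sample_sizes):
    """Sample-size weighted Z-score meta-analysis of one marker.

    pvalues:      two-sided p-values, each in (0, 1]
    directions:   +1 or -1, direction of effect for the tested allele
    sample_sizes: per-study weights (typically the number of individuals)
    Returns (combined_z, combined_two_sided_p).
    """
    norm = NormalDist()
    # Two-sided p-value -> |Z|, then attach the direction of effect.
    z_scores = [
        math.copysign(-norm.inv_cdf(p / 2.0), d)
        for p, d in zip(pvalues, directions)
    ]
    weights = [math.sqrt(n) for n in sample_sizes]

    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / math.sqrt(
        sum(w * w for w in weights)
    )
    combined_p = 2.0 * norm.cdf(-abs(combined_z))
    return combined_z, combined_p


# Hypothetical example: a fixed weight of 1000 per study,
# which is what DEFAULTWEIGHT 1000 would assign to every marker.
z, p = sample_size_meta([0.05, 0.05], [+1, +1], [1000, 1000])
print(z, p)
```

Because only p-values and directions enter the calculation, this scheme does not need effect sizes on a common scale, but it also does not produce a pooled effect estimate.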

## SCHEME STDERR

In [None]:
%%bash
metal

MARKER SNP
ALLELE A2 A1
FREQ FRQ
# Odds ratios are log-transformed so that the effect is signed (log(OR) < 0 when OR < 1)
EFFECT log(OR)
PVAL P
SEPARATOR TAB
MINMAXFREQ ON
GENOMICCONTROL ON
SCHEME STDERR
STDERR SE


PROCESS dataset1
PROCESS dataset2


ANALYZE HETEROGENEITY

## SCHEME SAMPLESIZE

In [9]:
%%bash
metal

MARKER SNP
ALLELE A2 A1
FREQ FRQ
# Odds ratios are log-transformed so that the effect is signed (log(OR) < 0 when OR < 1)
EFFECT log(OR)
PVAL P
SEPARATOR TAB
MINMAXFREQ ON
GENOMICCONTROL ON
SCHEME SAMPLESIZE
DEFAULTWEIGHT   1000


PROCESS dataset1
PROCESS dataset2


ANALYZE HETEROGENEITY
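The `GENOMICCONTROL ON` option in the METAL scripts asks for each study to be corrected by its own inflation factor before the results are combined. The sketch below shows the usual way such a factor is estimated, as the median observed chi-square statistic divided by the median of a chi-square distribution with one degree of freedom, and how it would inflate a standard error. The helper names are hypothetical, and METAL's exact handling may differ in detail.

```python
import math
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def genomic_control_lambda(pvalues):
    """Estimate the genomic inflation factor from two-sided p-values."""
    norm = NormalDist()
    # A two-sided p-value corresponds to a 1-df chi-square statistic Z^2.
    chi2_stats = [norm.inv_cdf(p / 2.0) ** 2 for p in pvalues]
    return median(chi2_stats) / CHI2_1DF_MEDIAN


def corrected_stderr(se, lam):
    """Inflate a standard error by sqrt(lambda); no change when lambda <= 1."""
    return se * math.sqrt(max(lam, 1.0))


# Hypothetical p-values from one study (a real GWAS would have many more).
lam = genomic_control_lambda([0.9, 0.6, 0.4, 0.2, 0.01])
print(lam, corrected_stderr(0.05, lam))
```

A value of lambda close to 1 suggests little inflation; values well above 1 can indicate population stratification or other confounding within that study.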