This is a curiosity driven project to impute my 23andme data using 1000 Genomes project.
The scripts work sequentially as suggested by the numbers, the overall.sh has information of the 1st step.
Future to-do list:
Run EIGENSOFT to compute PCs using overlapping variants between my genotype and 1000G genotypes
Plot PCA plot to visualize genetic popluation assignment