jyu-theartofml/DNA_genotype_analysis

Analysis of genotype data from 23andMe DNA testing.

This R script is based on a blog post by Vince Buffalo. Some of his code was outdated, so updated code is used here. The basic workflow uses Bioconductor libraries such as TxDb.Hsapiens.UCSC.hg18.knownGene for transcript annotation and org.Hs.eg.db for converting annotated Entrez gene IDs to actual gene names, which is helpful for browsing the reference database.

After getting the database of gene names, I queried for the gene symbol "ADH1B", the protein-coding gene for Alcohol Dehydrogenase II. This enzyme, or the lack of it, is the reason behind my red face when I consume alcohol. From my 23andMe genotype data, I obtained three hits on Chromosome 4, each with its corresponding rsid and genotype.

Next, I merged metadata from the gwascat library with my genotype data. The merged table includes the field "Strongest.SNP.Risk.Allele", which shows which allele at each SNP is most risky (most strongly associated with the disease in GWAS). I calculated my own risk by applying a function to every row of the data table (mapply()) to check whether the strongest risk allele appears in my genotype. This creates a logical column (FALSE, TRUE) for my risk factors.

Since I have hypothyroidism, I found the rsids for everything that matched hypothyroidism under Disease.Trait in the calculated risk table, and looked at the allele frequency for the transcripts related to hypothyroidism in general. Within the filtered data for hypothyroidism (13 samples), there weren't any studies specific to Chinese or other Asian populations.
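The merge, risk-allele check, and trait filter described above can be sketched in base R on made-up data. This is a minimal illustration, not the script in this repository: the rsids, genotypes, and traits are invented, the `has_risk_allele` helper is hypothetical, and the "rsid-allele" format of Strongest.SNP.Risk.Allele (including "?" for an unknown allele, returned here as NA) is an assumption about how the GWAS metadata is laid out.

```r
# Toy stand-in for a 23andMe raw-data export. All values are made up.
genotypes <- data.frame(
  rsid     = c("rs0000001", "rs0000002", "rs0000003", "rs0000004"),
  chrom    = c("4", "9", "1", "4"),
  position = c(1000L, 2000L, 3000L, 4000L),
  genotype = c("AG", "CC", "TT", "GG"),
  stringsAsFactors = FALSE
)

# Toy stand-in for the GWAS metadata. Column names mirror the ones
# mentioned in the text; the "rsid-allele" format is an assumption.
gwas <- data.frame(
  SNPs                      = c("rs0000001", "rs0000002", "rs0000003"),
  Disease.Trait             = c("Hypothyroidism", "Height", "Hypothyroidism"),
  Strongest.SNP.Risk.Allele = c("rs0000001-A", "rs0000002-T", "rs0000003-?"),
  stringsAsFactors = FALSE
)

# Keep only SNPs present in both tables (inner join on the rsid).
merged <- merge(genotypes, gwas, by.x = "rsid", by.y = "SNPs")

# The risk allele is taken to be whatever follows the last hyphen.
merged$risk.allele <- sub("^.*-", "", merged$Strongest.SNP.Risk.Allele)

# Hypothetical helper: TRUE if the risk allele is one of the bases in the
# genotype call, NA if the risk allele is not a plain A/C/G/T.
has_risk_allele <- function(genotype, allele) {
  if (!allele %in% c("A", "C", "G", "T")) return(NA)
  allele %in% strsplit(genotype, "")[[1]]
}

# Apply the check row by row, pairing each genotype with its risk allele.
merged$my.risk <- mapply(has_risk_allele, merged$genotype,
                         merged$risk.allele, USE.NAMES = FALSE)

# Case-insensitive filter on the trait of interest.
hypo <- merged[grepl("hypothyroidism", merged$Disease.Trait,
                     ignore.case = TRUE), ]
print(hypo[, c("rsid", "genotype", "risk.allele", "my.risk")])
```

Returning NA for an unknown risk allele is a design choice in this sketch; the approach described in the text yields a strictly FALSE/TRUE column.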

Here are some interesting entries under "Disease.Trait" that showed up in my calculated risk table:

  • "Economic and political preferences (feminism/equality)"
  • "Economic and political preferences (environmentalism)"

Hmmm, why are they listed under "Disease.Trait"? I'm trying to make sense of the idea that DNA testing can predict economic and political preferences. 😂😂😂

Just for kicks, I also plotted a karyogram of my "risk factor" loci to get a broad view of how the strong risk alleles are distributed across the chromosomes.

Note: As quick background on 23andMe genetic testing, I received my results before the FDA shutdown and the regulation changes that followed. Prior to the FDA shutdown in November 2013, the company had offered consumers genetic testing to estimate their risk for 240 health conditions (that original report is no longer posted in my account). In October 2015, however, the company received FDA approval to test for carrier status for 36 diseases that can be passed on to offspring.
