These are the scripts used to create the MySQL database that is used by PopTargs.essex.ac.uk. The pipeline can be altered to create similar databases with different species, it may need to be adjusted to fit your file names.
- Get the 3’ UTRs with: "collect_and_clean_3utrs.R chr#"
- Download the mature human miRNA from miRBase.org
- Download the SNPs with: "download_SNPs_BiomaRt.R chr#"
- Run SeedVicious to get targets, near-targets, and reverse complement targets (available at http://seedvicious.essex.ac.uk/)
- Find the SNP/Target interactions: "Find_SNPs_in_Targets.R chr#"
- Put each chromosome into the MySQL database: "setup_mySQL_DB_new.R chr#"
- Combine the chromosomes into one big table for the database
- Download all the populations data from 1,000 Genomes: ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20130502/
- Extract the populations data: "ExtractPopData.R chr#"
- Take the relevant populations data and set it up for the database: "PopData2ndRound.R chr#"
- Combine the chromosomes into one big table for the database