This repository includes an approach to identify drug-response associated SNPs from clinical patients’ follow-up data by integrating cox proportional hazards model and Kaplan-Meier survival analysis. By using those patients didn’t receive any drug treatment as control, those prognosis correlated SNPs even without drug treatment were filtered out, and only focus on SNPs related to drug response.
The approach were conducted by using Python 2.7.16. Following Python packages should be installed:
pandas
lifelines
python DDRS.DRS.calculation.py -clincal_file ./example_clinical_infor.txt -snp_file ./example_snp.inf -min_g 20 -min_e 5
-clincal_file : Patients clinical file, contain patients ID, follow-up information (time and event), with drug treatment(1) or not(0)
-snp_file : Genotype data
-min_g(optional) : (Default is 20) Minimum patients size for each subgroup of pairwise KM survival analysis
-min_e(optional) : (Default is 5) Minimum patients with events for each subgroup
./output/snp_drug_interaction_cox.txt
/snp_drug_interaction_cox_filtered.txt
snp, snp.inf : SNP id, reference allele(0), alternative allele(1), patients number of Homozygous(reference allele),Heterozygosity and wild type(alternative allele)
drug.coef,exp_coef,se_coef,z,pval,lower95,upper95 : CoxPHFitter[1] result of Drug term.
snp.coef,exp_coef,se_coef,z,pval,lower95,upper95 : CoxPHFitter result of SNP term.
snp×drug.coef,exp_coef,se_coef,z,pval,lower95,upper95 : CoxPHFitter result of SNP×Drug term.
/KM.subgroup.txt
/KM.subgroup.fdr.txt
/KM.subgroup.fdr.filter.txt
-snp, snp.inf : SNP id, reference allele(0), alternative allele(1), patients number of Homozygous(reference allele),Heterozygosity and wild type(alternative allele)
1.WT.drug.used,2.HET.drug.used,3.HOM.drug.used,4.WT.drug.used.not,5.HET.drug.used.not,6.HOM.drug.used.not: patients numbers of Homozygous,Heterozygosity and wild type in 6 six subgroups
1&2.sta,1&2.pval,1&2.fdr.pval: logrank_test[2] result between Homozygous and Heterozygosity patients with drug treatment
1&3.sta,1&3.pval,1&3.fdr.pval: logrank_test result between Homozygous and wild type patients with drug treatment
2&3.sta,2&3.pval,2&3.fdr.pval: logrank_test result between Heterozygosity and wild type patients with drug treatment
4&5.sta,4&5.pval,4&5.fdr.pval: logrank_test result between Homozygous and Heterozygosity patients without drug treatment
4&6.sta,4&6.pval,4&6.fdr.pval: logrank_test result between Homozygous and wild type patients without drug treatment
5&6.sta,5&6.pval,5&6.fdr.pval: logrank_test result between Heterozygosity and wild type patients without drug treatment
snp.coef,exp_coef,se_coef,z,pval,lower95,upper95 : CoxPHFitter result of univariate Cox proportional hazards analysis about SNP in patients with drug treatment.
python DDRS.SNP.identify.py -snp_file ./example_snp.inf -filter_SNP ./KM.subgroup.fdr.filter.txt
-snp_file : Genotype data
-filter_SNP : identified drug associated SNPs and cox coefficients of SNPs
./output/DRS.score
[1]https://lifelines.readthedocs.io/en/latest/Survival%20Regression.html#cox-s-proportional-hazard-model [2]https://lifelines.readthedocs.io/en/latest/Examples.html?highlight=logrank#logrank-test
Yu Rong
