An app that can give you relative risk estimates for multiple common infections by looking at your personal genome from different Direct To Consumer genetic testing companies
Check out our website application! http://team2.bioapp208803.biosci.gatech.edu/login/
Conda enviroment:
VCftools, Bcftools, Plink, Eagle=v.. Python >3.5 pip libraries: pandas, Numpy, Scipy openssl=v1.0.2
Merging subject vcf file to the 1000 Genomes Reference Dataset (Before Imputation for PCA and Population Stratification, After Imputation for PRS Score Calculations)
merge.py -
Phasing and imputing the subject vcf file from 23andMe, Ancestry, or FTDNA
imputation_pipeline.py -
Using 1000 Genomes Reference DNA and input subject vcf file to create a PCA for determining user ancestry
PCA.sh - KNN.py -
Chicken pox, Shingles, Cold sores, Mononucleosis, Mumps, Hepatitis B, HIV, TB, Strep throat, Yeast Infections, Childhood ear infections
Extracting SNPs from the imputed subject vcf file pertaining to the Infections we are interested in merge.py -
Generating PRS Score has follwoing two steps:
- Getting Valid SNPs for every Infection by Clumping and Threshold using Plink
- Getting PRS score for every Infection using the above Valid SNPs
PRS.py -
percentile.py -