Skip to content

olorin15/genetic-genealogy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

genetic-genealogy

Data Science techniques applied to human DNA data

Using G25 coordinates for both ancestral and modern DNA samples publicly available from scientific research

Global 25 coordinates from: https://vahaduo.github.io/g25download/

(Originally from Eurogenes — see https://eurogenes.blogspot.com/2019/07/getting-most-out-of-global25_12.html)

Datasets manually filtered by epoch / age (Bronze, Iron, etc)

Additional data comes from

R code using Jupyter notebooks includes

  • Principal Component Analysis
  • Hierarchical Clustering
  • K-means Clustering
  • Nearest Neighbors
  • Graph Analysis
  • XGBoost

Also see my blog at https://timpiatenko.substack.com/s/genetic-genealogy

Specifically this post: https://open.substack.com/pub/timpiatenko/p/genetic-analysis-tools-for-the-hobbyists

About

Data Science techniques applied to human DNA data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published