Skip to content
Convert imputed DNA.Land VCF to 23andMe raw data format
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
LICENSE Improve license info & readme Jul 21, 2016
README.md Add notice May 31, 2018
blank_v3.txt Add conversion scripts and blank v3 file Dec 18, 2015
data_to_db.py Add func to allow compressed file input Jul 29, 2016
db_to_23.py Improve license info & readme Jul 21, 2016

README.md

VCF-to-23andMe

These scripts convert an imputed DNA.Land VCF into the 23andMe V3 raw data format.

data_to_db.py is used to convert the VCF file and any additional 23andMe raw data file (adds 23andMe indentifiers) into an indexed SQLite3 database for quick searching. db_to_23.py then inserts genotypes into the blank V3 file retrieved from the database by chromosome, position and identifier.

The data_to_db.py script accepts both compressed and uncompressed data files.

Notice

Please use responsibly! If you intend to upload imputed data to online DNA genealogy tools, please also delete it again so as not to pollute these websites with potentially false data. I have thought of deleting these files but they are already mirrored in other places now.

Requirements

  • Python3

Usage

python data_to_db.py <INPUT_VCF> vcf <OUTPUT_DB>
python data_to_db.py <INPUT_23_RAW> 23andme <OUTPUT_DB>
python db_to_23.py <INPUT_DB> <INPUT_BLANK_23_RAW> <OUTPUT_23_RAW>

Usage Example

python data_to_db.py imputed.vcf.gz vcf genome.db
python data_to_db.py genome_Fred_Bloggs_v4_Full_20160423012345.zip 23andme genome.db
python db_to_23.py genome.db blank_v3.txt genome_Fred_Bloggs_Full_20140628012345.txt
You can’t perform that action at this time.