From d2de90d086b78f5753c122385b549433d8001f38 Mon Sep 17 00:00:00 2001 From: Martin Maiers Date: Mon, 28 Nov 2022 09:34:58 -0600 Subject: [PATCH 1/2] minimal config --- README.md | 56 +++++++++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 56 insertions(+) diff --git a/README.md b/README.md index 2b576ae..ad47e06 100644 --- a/README.md +++ b/README.md @@ -82,3 +82,59 @@ How to develop on the project locally. 10. Use `python app.py` to run the Flask service app in debug mode. Service will be available at http://localhost:8080/ 11. Use `make docker-build` to build a docker image using the current `Dockerfile`. 12. `make docker` will build and run the docker image with the service. Service will be available at http://localhost:8080/ + + +# Runing a minimal configuration example + +From the main directory of the repo run: +``` +scripts//build-imputation-validation.sh +``` + +This will pepare and load frequency data into the graph and run imputation on a sample set of subjects. + +The execution is driven by the configuration file: +`conf/minimal-configuration.json` + +It takes input from this file: +``` +data/subjects/donor.csv +``` + + +And genrates an `output` directory with these contents: + +``` +output +├── don.miss +├── don.pmug +├── don.pmug.pops +├── don.problem +├── don.umug +└── don.umug.pops +``` + +The .miss and .problem files are cases contain cases that failed due to errors. + +The .pmug file contains the Phased Multi-locus Unambiguous Genotypes. + +The .umug file contains the Unphased Multi-locus Unambiguous Genotypes. + + +The format of both files is: + +* id +* genotype - in glstring format +* frequency +* rank + + +The .pmug.pops and .umug.pops contain the corresponding population assignments. + +The format of the .pops files is: + +* id +* pop1 +* pop2 +* frequency +* rank From 125ff5c24ac889fe0c84df0d9f76c64693c430d4 Mon Sep 17 00:00:00 2001 From: Martin Maiers Date: Mon, 28 Nov 2022 09:35:40 -0600 Subject: [PATCH 2/2] minimal config --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index ad47e06..68c5daf 100644 --- a/README.md +++ b/README.md @@ -121,7 +121,7 @@ The .pmug file contains the Phased Multi-locus Unambiguous Genotypes. The .umug file contains the Unphased Multi-locus Unambiguous Genotypes. -The format of both files is: +The format of both files is (csv): * id * genotype - in glstring format @@ -131,7 +131,7 @@ The format of both files is: The .pmug.pops and .umug.pops contain the corresponding population assignments. -The format of the .pops files is: +The format of the .pops files is (csv): * id * pop1