Skip to content

QingliGuo/FFPEsig

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FFPEsig

FFPEsig uses FFPE signature as a noise profile to correct the observed mutation counts from a given FFPE WGS sample.

  1. To run FFPEsig:
  • Download FFPEsig.py

  • Install python 3 (3.7.6) and import the packages required in FFPEsig.py, including pandas (1.0.1), numpy (1.18.1), matplotlib (3.1.3), seaborn (0.10.1).

  • Run the command line:

python FFPEsig.py [--input|-i] <Path-to-the-DataFrame> [--sample|-s] <Sample_id> [--label|-l] <Unrepaired|Repaired> [--output_dir|-o] <Path-of-output-folder>
  1. Example
python FFPEsig.py --input ./Data/simulated_PCAWG_FFPE_unrepaired.csv --sample ColoRect-AdenoCA::SP21528 --label Unrepaired --output_dir FFPEsig_OUTPUT

Or

python FFPEsig.py -i ./Data/simulated_PCAWG_FFPE_unrepaired.csv -s ColoRect-AdenoCA::SP21528 -l Unrepaired -o FFPEsig_OUTPUT

Note

  • Input file, [--input|-i], must be a standard CSV format dataframe which column names are the sample IDs;
  • Sample ID, [--sample|-s], must be one of the sample IDs in Input file [--input|-i];
  • Label option, [--label|-l], must be either of them <Unrepaired|Repaired>.
  • The total running time of FFPEsig for one sample is around 1-3 mins on a local desk computer (3,1 GHz Intel Core i5).

Analysis code

Here we include analysis codes and data used in our manuscript entitled "The mutational signatures of formalin fixation on the human genome".

Citation

Guo, Q., Lakatos, E., Bakir, I.A. et al. The mutational signatures of formalin fixation on the human genome. Nat Commun 13, 4487 (2022). https://doi.org/10.1038/s41467-022-32041-5

About

Codes and Data for FFPEsig manuscript

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published