Scripts to replicate the safe and active site search
COSMIC.xlsx: Cancer Gene Census, GRCh38, COSMIC v92 database https://cancer.sanger.ac.uk/census
UCR.txt: ultra-conserved regions (Lomonaco et al., 2014; Taccioli et al., 2009); coordinates were lifted over from the hg19 to the hg38 assembly
ENCFF503GCK.tsv: https://www.encodeproject.org/files/ENCFF503GCK/
HiC active compartments.xlsx: Hi-C chromatin organization data (Schmitt et al., 2016). We shortlisted the genomic regions consistently located in active (open chromatin) compartments, i.e. in at least 20 of the 21 interrogated tissue types. Compartments overlapped with identified low-variance housekeeping genes.
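
The "at least 20 of 21 tissue types" shortlisting rule can be illustrated with a minimal Python sketch. It is not the script used to produce HiC active compartments.xlsx. The function name shortlist_active_regions, the (chromosome, start, end) region tuples, the per-tissue boolean flags, and the toy data are all hypothetical and chosen only for illustration; how the original analysis encoded active compartments is not stated here.

```python
from typing import Dict, List, Tuple

# Hypothetical region key: (chromosome, start, end)
Region = Tuple[str, int, int]


def shortlist_active_regions(
    calls: Dict[Region, List[bool]], min_active: int = 20
) -> List[Region]:
    """Return regions flagged as active in at least `min_active` tissues.

    `calls` maps each region to one boolean per tissue type
    (True = region lies in an active compartment in that tissue).
    """
    return sorted(
        region for region, flags in calls.items() if sum(flags) >= min_active
    )


# Toy input (assumed, not real data): 21 tissue flags per region
toy_calls = {
    ("chr1", 0, 1_000_000): [True] * 21,                  # active in 21/21
    ("chr1", 1_000_000, 2_000_000): [True] * 20 + [False],  # active in 20/21
    ("chr2", 0, 1_000_000): [True] * 19 + [False] * 2,    # active in 19/21
}

shortlisted = shortlist_active_regions(toy_calls)
print(shortlisted)
```

With the toy input, the two chr1 regions pass the threshold and the chr2 region (active in only 19 tissues) is dropped. Raising min_active to 21 would keep only regions active in every tissue.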