We compared the perceived similarity of patterns in sequential data at 3 different sizes. The sequential data is a DNase-seq experiment from the ENCODE project. We extracted 9 different patterns at increasing sizes of 3 kilobase pairs (kb), 12 kb, and 120 kb.
conda env create --file environment.yml
The staistical analysis and the code for generating the figures is available in the following three notebooks: