You need to install Biopython.

The main tutorial notebook is mainRCCtuto.ipynb.

Datasets

- CATHFINAL.txt contains 235,858 domains as reported in CATHv4.0.
- SCOPVectors.txt contains 203,025 SCOPv2.5 domains.
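
Before opening the notebook, it may help to confirm that Biopython is importable and that the dataset files are where you expect them. The following is a minimal sketch, not part of the repository. The helper names `biopython_available` and `count_records` are hypothetical, and the record count assumes one domain per non-empty line, which is not stated here and may not match the actual file layout. Biopython is typically installed with `pip install biopython`.

```python
import importlib.util
from pathlib import Path


def biopython_available() -> bool:
    """Return True if the Biopython package (imported as `Bio`) can be found."""
    return importlib.util.find_spec("Bio") is not None


def count_records(path: str) -> int:
    """Count non-empty lines in a text file.

    Assumption: each domain occupies exactly one line. The real layout of
    CATHFINAL.txt and SCOPVectors.txt is not documented here.
    """
    with open(path, "r", encoding="utf-8", errors="replace") as handle:
        return sum(1 for line in handle if line.strip())


if __name__ == "__main__":
    if not biopython_available():
        print("Biopython not found; install it with: pip install biopython")
    # File names are taken from the dataset list; they are looked up in the
    # current working directory, which is an assumption.
    for name in ("CATHFINAL.txt", "SCOPVectors.txt"):
        if Path(name).exists():
            print(name, count_records(name))
        else:
            print(name, "not found in the current directory")
```

If the counts differ from the figures quoted for each dataset, the one-line-per-domain assumption probably does not hold (for example, the file may contain a header line).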