Contains Python and R scripts with analyses of the copula in Biblical Hebrew
copula_data_perpare_bib.ipynb is a script for preprocessing the MT biblical data. It uses the Python library Text- Fabric. It extracts data from the ETCBC database in TF format, version BHSA c.
copula_data_prepare_xbib.ipynb preprocesses the extrabiblical data. It needs the extrabiblical TF data. This dataset presently contains some Dead Sea Scrolls (1QM and 1QS) and a number of Hebrew inscriptions.
The two Python files are nearly identical, but there are slight differences in the data format of the biblical and extrabiblical data, so I have decided to keep the scripts separate.
The files hyh_nom_biblical.csv and hyh_nom_extrabiblical.csv are the output produced by the Python scripts.
The file copula_analysis_R.txt imports the output of the two Python files into R. There some further preprocessing is done, after which the resulting dataset is analyzed.