*best viewed in [nbviewer](https://nbviewer.jupyter.org/github/CambridgeSemiticsLab/BH_time_collocations/blob/master/results/notebooks/tense_associations.ipynb)*

# Tense Associations with Time Adverbials
## Grammaticalization, Statistical Association, and the State of the Hebrew Verbal System
### Cody Kingham
<a href="../../docs/sponsors.md"><img height=200px width=200px align="left" src="../../docs/images/CambridgeU_BW.png"></a>

In [1]:
! echo "last updated:"; date

last updated:
Wed  8 Apr 2020 12:58:09 BST


## Introduction

In this notebook I test various theories of Hebrew tense against the 
collocational data of time adverbials. Across the world's languages, 
certain combinations of verbs and adverbials are predicted by the 
semantic value of the verb morphology. This value is hypothesized to 
derive from the morphology's location along a generic grammaticalization 
path, following Bybee, Perkins, and Pagliuca (*The Evolution of Grammar*, 1994).

The question this notebook seeks to answer is relatively straightforward:
    
    How far along the path of development is a given verb type in Biblical
    Hebrew based on its collocational profile as compared with the profiles
    of verbs in other world languages?
<hr>

## Notes on Adverbial Collocation Tendencies in Other Languages

**Anteriors / Perfects**

* Languages without grammatical perfects, such as Russian, may make more extensive use of morphemes meaning "already" to compensate for the lack of a perfect (Bybee, Dahl 1989: 68).
* Anteriors do not readily combine with adverbials like "still", while resultatives do so "very easily" (idem 1989: 69).
* Anteriors are often accompanied by "already" or "just" (Bybee, Perkins, Pagliuca 1994: 54).
* In British and American English, the most common adverbial collocation with the present perfect ("has *verbed*") is "(ever) since" (+ temporal noun phrase or clause) (Schlüter 2002).

**Perfectives**

* Given Bybee and Dahl's claim that languages without anteriors may make more extensive use of words like "already" (1989: 68), it is reasonable to expect that the perfective would be the verb of choice in such languages: i.e. perfective + "already" *might* be more common than anterior + "already". But this is complicated by the fact that the anterior itself favors "already" (see above).
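
Expectations of the kind noted above ("form X + *already* is more common than chance would predict") can be checked with an exact test on a 2x2 table of collocation counts. The following is a minimal sketch of a one-sided Fisher's exact test using only the standard library. The function name `fisher_one_sided` and all of the counts are invented for illustration; it is not the project's own `apply_fishers`, whose implementation is not shown in this notebook.

```python
from math import comb

def fisher_one_sided(a, b, c, d):
    """One-sided (greater) Fisher's exact p-value for the table [[a, b], [c, d]].

    a: target verb form with the adverbial     b: target verb form without it
    c: other verb forms with the adverbial     d: other verb forms without it
    """
    row1 = a + b            # all clauses with the target verb form
    col1 = a + c            # all clauses with the adverbial
    total = a + b + c + d
    denom = comb(total, row1)
    upper = min(row1, col1)
    # probability of seeing `a` or more co-occurrences under independence
    return sum(
        comb(col1, k) * comb(total - col1, row1 - k)
        for k in range(a, upper + 1)
    ) / denom

# hypothetical counts, NOT drawn from the corpus
p_value = fisher_one_sided(8, 2, 1, 5)
print(round(p_value, 4))
```

A small p-value would suggest that the verb form and the adverbial co-occur more often than independence predicts; with real data the counts would come from the project dataset rather than from hand-entered numbers.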

# Python

Now we import the modules and data needed for the analysis.

In [None]:
# standard & data science packages
import collections
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from matplotlib import rcParams
rcParams['font.serif'] = ['SBL Biblit']
import seaborn as sns
from bidi.algorithm import get_display # bi-directional text support for plotting
from paths import main_table, figs

# custom packages (see /tools)
from tf_tools.load import load_tf
from stats.significance import contingency_table, apply_fishers

# launch Text-Fabric with custom data
TF, API, A = load_tf(silent='deep')
A.displaySetup(condenseType='phrase')
F, E, T, L = A.api.F, A.api.E, A.api.T, A.api.L # corpus analysis methods

# load and set up project dataset
times_full = pd.read_csv(main_table, sep='\t')
times_full.set_index(['node'], inplace=True)
times = times_full[~times_full.classi.str.contains('component')] # keep singles: drop rows whose classi contains 'component'
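
The final filter in the cell above can be illustrated on a toy table. This is only a sketch: the node numbers and `classi` values below are invented, and the real labels in the project dataset may look different.

```python
import pandas as pd

# hypothetical rows standing in for the project table
toy = pd.DataFrame(
    {
        "node": [101, 102, 103],
        "classi": ["single.prep", "component.prep", "single.bare"],
    }
).set_index("node")

# ~ negates the boolean mask, so rows containing 'component' are dropped
singles = toy[~toy["classi"].str.contains("component")]
print(singles.index.tolist())
```

Note that `str.contains` returns missing values for missing strings, so a column with gaps would need `na=False` before the mask can be negated.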