![pipeline](pictures/pictures.002.png)

# Text-Fabric from ETCBC

This notebook assembles the data from the ETCBC that is needed
to compile its datasets in Text-Fabric format on GitHub.
Ultimately, the data for the website [SHEBANQ](https://shebanq.ancient-data.org) will be
derived from these TF sources.

## Pipeline
This is **pipe 1** of the pipeline from ETCBC data to the website SHEBANQ.

A run of this pipe produces a data *version*.
It should be run whenever new or updated data sources are present that affect the output data.
Since all input data is delivered in GitHub repos, we have excellent machinery to
work with versioning.

The pipe works by executing a series of programs contained in GitHub repositories.
For each repository in the pipe, a series of notebooks will be executed.
See [script mode](https://github.com/ETCBC/pipeline/blob/master/README.md#operation) for 
details on how we call notebooks.

All this is specified in the configuration below.
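
The order of execution described above can be sketched as follows. This is only an illustration under assumptions: `plan_tasks` is a hypothetical helper, not part of the `pipeline` module, and it assumes that a task is skipped for a version when that version is listed in the task's `omit` set (the convention used in the configuration further down).

```python
def plan_tasks(repo_order, repo_config, version):
    """Return (repo, task) pairs in execution order for one version.

    Hypothetical helper: assumes a task is skipped when `version`
    occurs in its `omit` set.
    """
    plan = []
    # Repositories are visited in the order in which they are listed.
    for repo in repo_order.split():
        # Within a repository, tasks run in the order of the tuple.
        for task_spec in repo_config.get(repo, ()):
            if version in task_spec.get('omit', set()):
                continue
            plan.append((repo, task_spec['task']))
    return plan


# A small excerpt in the shape of the configuration used in this notebook.
example_order = '''
    bhsa
    phono
'''
example_config = dict(
    bhsa=(
        dict(task='coreData'),
        dict(task='lexicon', omit={'3'}),
    ),
    phono=(
        dict(task='phono', omit={'3', '4', '4b'}),
    ),
)

for version in ('3', 'c'):
    print(version, plan_tasks(example_order, example_config, version))
```

With this excerpt, version `3` only gets the `coreData` task, while version `c` gets all three tasks in repository order.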

### Core data

The core data is delivered by the ETCBC as `bhsa.mql.bz2` in
the GitHub repo [bhsa](https://github.com/ETCBC/bhsa) in directory `source`.

This data will be converted by `tfFromMQL` in the `programs` directory.

The result of this action will be an updated TF resource in its 
`tf/core` directory.

### Additional data

Researchers have contributed additional data,
but not all of it is in the core.
Such data typically lives in the repository where the research was
carried out, and where the data is documented.

Before the pipe starts, these repos must be pulled.
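
Pulling the repos can be done by hand, or with a few lines of Python. The sketch below is not part of the pipeline: `pull_commands` and `pull_all` are hypothetical helpers, and the base directory `~/github/etcbc` is an assumption about where the clones live on your machine. The repo names are the ones listed in the configuration of this notebook.

```python
import os
import subprocess


def pull_commands(base_dir, repos):
    """Build one `git pull` command per repository clone."""
    return [
        ['git', '-C', os.path.join(base_dir, repo), 'pull', '--ff-only']
        for repo in repos
    ]


def pull_all(base_dir, repos):
    """Run the pulls; raises CalledProcessError if one of them fails."""
    for cmd in pull_commands(base_dir, repos):
        subprocess.run(cmd, check=True)


# Assumed location of the clones; adjust to your own machine.
BASE_DIR = os.path.expanduser('~/github/etcbc')
REPOS = ['bhsa', 'phono', 'valence', 'parallels']

# Show what would be executed, without touching any repository.
for cmd in pull_commands(BASE_DIR, REPOS):
    print(' '.join(cmd))
# Call pull_all(BASE_DIR, REPOS) to actually update the clones.
```

The `--ff-only` flag makes a pull fail rather than create a merge commit, which seems a safe choice when the clones are only used as pipeline input.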

In [1]:
import os, sys, collections
from pipeline import runPipeline
from tf.fabric import Fabric

# Config

In [2]:
CORE_NAME = 'bhsa'

if 'SCRIPT' not in locals(): 
    SCRIPT = False
    DEFAULT_CORE_NAME = CORE_NAME
    DEFAULT_VERSION = 'c'

In [3]:
pipeline = dict(
    defaults = dict(
        CORE_NAME=CORE_NAME,
        VERSION=DEFAULT_VERSION,
        LANG_FEATURE='language',
        OCC_FEATURE='g_cons',
        LEX_FEATURE='lex',
        TEXT_FEATURE='g_word_utf8',
        TRAILER_FEATURE='trailer_utf8',
    ),
    versions={
        '3': dict(
                LANG_FEATURE='language',
                OCC_FEATURE='surface_consonants',
                LEX_FEATURE='lexeme',
                TEXT_FEATURE='text',
                TRAILER_FEATURE='suffix',
            ),
        '4': dict(),
        '4b': dict(),
        'c': dict(),
        '2016': dict(),
        '2017': dict(),
    },
    repoOrder = '''
        bhsa
        phono
        valence
        parallels
    ''',
    repoConfig = dict(
        bhsa=(
            dict(
                task='coreData',
            ),
            dict(
                task='bookNames',
                omit={},
            ),
            dict(
                task='lexicon',
                omit={'3'},
            ),
            dict(
                task='paragraphs',
                omit={'3', '4', '4b'},
            ),
            dict(
                task='ketivQere',
                omit={'3', '4', '4b'},
            ),
            dict(
                task='stats',
                omit={'4', '4b'},
            ),
        ),
        phono=(
            dict(
                task='phono',
                omit={'3', '4', '4b'},
            ),
        ),
        valence=(
            dict(
                task='enrich',
                omit={'3'},
            ),
            dict(
                task='flowchart',
                omit={'3'},
            ),
        ),
        parallels=(
            dict(
                task='parallels',
                omit={},
                params=dict(
                    FORCE_MATRIX=False,
                ),
            ),
        ),
    ),
)
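
The configuration has a `defaults` part and a `versions` part, where a version may redefine some of the feature names (version `3` does so, the other versions have empty entries). How `runPipeline` combines the two is not shown in this notebook. The sketch below assumes the obvious reading: version entries override the defaults, and `VERSION` is set to the version being made. `settings_for` is a hypothetical helper, and `example` is a shortened copy of the configuration above.

```python
def settings_for(pipeline, version):
    """Merge the defaults with the overrides of one version.

    Hypothetical helper: assumes that version entries take precedence
    over the defaults and that VERSION becomes the requested version.
    """
    if version not in pipeline['versions']:
        raise KeyError(f'unknown version: {version}')
    settings = dict(pipeline['defaults'])  # copy, leave the config intact
    settings.update(pipeline['versions'][version])
    settings['VERSION'] = version
    return settings


# Shortened copy of the configuration, for illustration only.
example = dict(
    defaults=dict(
        VERSION='c',
        OCC_FEATURE='g_cons',
        LEX_FEATURE='lex',
    ),
    versions={
        '3': dict(OCC_FEATURE='surface_consonants', LEX_FEATURE='lexeme'),
        '4': dict(),
    },
)

print(settings_for(example, '3'))
print(settings_for(example, '4'))
```

Under these assumptions, version `3` ends up with the older feature names `surface_consonants` and `lexeme`, while version `4` keeps the default names `g_cons` and `lex`.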

# Run the pipeline

In [4]:
good = runPipeline(pipeline, versions=['3', '4', '4b', '2016', 'c'], force=False)


##############################################################################################
#                                                                                            #
#       0.00s Make version [3]                                                               #
#                                                                                            #
##############################################################################################


**********************************************************************************************
*                                                                                            *
*       0.00s Make repo [bhsa]                                                               *
*                                                                                            *
**********************************************************************************************


---------------------------------------------

  0.26s 			feature phrase_function (str) =def= none : node
  0.26s 			feature determination (str) =def= NA : node
  0.26s 			feature is_apposition (str) =def= false : node
  0.27s 			feature phrase_type (str) =def= VP : node
  0.27s 			feature number_within_clause (int) =def= 0 : node
  0.27s 		otype subphrase
  0.27s 			feature parents (str) =def= id_d : node
  0.27s 			feature mother (str) =def= 0 : edge
  0.27s 			feature subphrase_kind (str) =def= mother : node
  0.27s 			feature subphrase_type (str) =def= ADJ : node
  0.28s 		otype chapter
  0.28s 			feature book (str) =def= Genesis : node
  0.28s 			feature chapter (int) =def= 0 : node
  0.28s 		otype book
  0.29s 			feature book (str) =def= Genesis : node
  0.29s 		otype sentence_atom
  0.29s 			feature parents (str) =def= id_d : node
  0.29s 			feature sentence_atom_number (int) =def= 0 : node
  0.29s 		otype clause
  0.29s 			feature parents (str) =def= id_d : node
  0.30s 			feature clause_type (str) =def= none : node
  0.30s

   |     0.16s T indentation          to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.89s T is_apposition        to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.77s T language             to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.18s T levels_of_embedding  to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.77s T lexeme               to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.91s T lexeme_utf8          to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.80s T lexical_set          to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.85s T locative             to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.76s T noun_type            to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.78s T number               to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.12s T number_within_chapter to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.44s T number_within_clause to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.15s 

88 features found and 0 ignored
  0.00s loading features ...
   |     1.51s T otype                from /Users/dirk/github/etcbc/bhsa/tf/3
   |     9.79s T oslots               from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.08s T book                 from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.05s T chapter              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.05s T verse                from /Users/dirk/github/etcbc/bhsa/tf/3
   |     1.60s T graphical_lexeme     from /Users/dirk/github/etcbc/bhsa/tf/3
   |     1.72s T graphical_lexeme_utf8 from /Users/dirk/github/etcbc/bhsa/tf/3
   |     1.60s T graphical_word       from /Users/dirk/github/etcbc/bhsa/tf/3
   |     1.42s T lexeme               from /Users/dirk/github/etcbc/bhsa/tf/3
   |     1.58s T lexeme_utf8          from /Users/dirk/github/etcbc/bhsa/tf/3
   |     1.19s T suffix               from /Users/dirk/github/etcbc/bhsa/tf/3
   |     1.43s T surface_consonants   from /Users/dirk/github/etcbc/bhsa/tf/3
  

   |     0.00s Feature overview: 85 for nodes; 2 for edges; 1 configs; 7 computed
 1m 10s All features loaded/computed - for details use loadLog()
..............................................................................................
.      7m 09s Basic test                                                                     .
..............................................................................................
..............................................................................................
.      7m 09s First verse in all formats                                                     .
..............................................................................................
text-trans-plain
	B R>CJT BR> >LHJM >T H CMJM W >T H >RY 
text-orig-full
	בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃
lex-trans-plain
	B R>CJT/ BR>[ >LHJM/ >T H CMJM/ W >T H >RY/ 
text-trans-full
	B.:- R;>CI73JT B.@R@74> >:ELOHI92Jm >;71T HA- C.@MA73JIm W:- >;7

114 features found and 0 ignored
  0.00s loading features ...
   |     0.00s T book@am              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@ar              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@bn              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@da              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@de              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@el              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@en              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@es              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@fa              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@fr              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@he              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s T book@hi              from /Users/dirk/github/etcbc/bhsa/tf/3
  

   |     0.71s T freq_lex             to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.71s T freq_occ             to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.72s T rank_lex             to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
   |     0.74s T rank_occ             to /Users/dirk/github/etcbc/bhsa/_temp/3/tf
..............................................................................................
.      7m 31s Check differences with previous version                                        .
..............................................................................................
|      7m 31s 	4 features to add
|      7m 31s 		freq_lex
|      7m 31s 		freq_occ
|      7m 31s 		rank_lex
|      7m 31s 		rank_occ
|      7m 31s 	no features to delete
|      7m 31s 	0 features in common
|      7m 31s Done
..............................................................................................
.      7m 31s Deliver features to /Users/dirk/github/etcbc/bhsa/tf/

118 features found and 0 ignored
  0.00s loading features ...
   |     0.03s B otype                from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.01s B book                 from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.01s B chapter              from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.01s B verse                from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.13s B lexeme               from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.13s B suffix               from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.23s B text                 from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.22s B number               from /Users/dirk/github/etcbc/bhsa/tf/3
   |     0.00s Feature overview: 115 for nodes; 2 for edges; 1 configs; 7 computed
  5.14s All features loaded/computed - for details use loadLog()
..............................................................................................
.      7m 59s CROSSREFS: Fetching crossrefs                            

|      8m 06s 		         ----------> 1_Chronicles 1:9     confidende 100%
		Genesis 10:8
|      8m 06s 		         ----------> 1_Chronicles 1:10    confidende 100%
		Genesis 10:13
|      8m 06s 		         ----------> 1_Chronicles 1:11    confidende 100%
		Genesis 10:14
|      8m 06s 		         ----------> 1_Chronicles 1:12    confidende 100%
		Genesis 10:15
|      8m 06s 		         ----------> 1_Chronicles 1:13    confidende 100%
		Genesis 10:16
|      8m 06s 		         ----------> 1_Chronicles 1:14    confidende 100%
		Genesis 10:17
|      8m 06s 		         ----------> Genesis 15:20        confidende  76%
|      8m 06s 		         ----------> 1_Chronicles 1:15    confidende 100%
		Genesis 10:20
|      8m 06s 		         ----------> Genesis 10:31        confidende  94%
		Genesis 10:22
|      8m 06s 		         ----------> 1_Chronicles 1:17    confidende  77%
		Genesis 10:24
|      8m 06s 		         ----------> 1_Chronicles 1:18    confidende 100%
		Genesis 10:25
|      8m 06s 		         --

  0.11s 			feature sp (str) =def= art : node
  0.11s 			feature pdp (str) =def= art : node
  0.11s 			feature freq_lex (int) =def= 0 : node
  0.11s 			feature freq_occ (int) =def= 0 : node
  0.11s 			feature g_entry (str) =def=  : node
  0.12s 			feature g_entry_heb (str) =def=  : node
  0.12s 			feature g_qere_utf8 (str) =def=  : node
  0.12s 			feature gloss (str) =def=  : node
  0.12s 			feature nametype (str) =def=  : node
  0.12s 			feature phono (str) =def=  : node
  0.12s 			feature phono_sep (str) =def=  : node
  0.12s 			feature qtrailer_utf8 (str) =def=  : node
  0.12s 			feature rank_lex (int) =def= 0 : node
  0.13s 			feature rank_occ (int) =def= 0 : node
  0.13s 		otype clause_atom
  0.13s 			feature number (int) =def= 0 : node
  0.13s 			feature tab (int) =def= 0 : node
  0.13s 			feature code (int) =def= 0 : node
  0.13s 			feature distributional_parent (str) =def= 0 : edge
  0.14s 			feature mother (str) =def= 0 : edge
  0.14s 			feature functional_parent (str) =def= 0 

   |     0.86s T g_entry              to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.80s T g_entry_heb          to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.77s T g_lex                to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.79s T g_lex_utf8           to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.71s T g_nme                to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.77s T g_nme_utf8           to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.72s T g_pfm                to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.71s T g_pfm_utf8           to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.72s T g_prs                to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.75s T g_prs_utf8           to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.71s T g_qere_utf8          to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.75s T g_uvf                to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.75s T

   |     9.77s T oslots               from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.09s T book                 from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.05s T chapter              from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.05s T verse                from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.49s T g_cons               from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.69s T g_cons_utf8          from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.56s T g_lex                from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.69s T g_lex_utf8           from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.68s T g_qere_utf8          from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.63s T g_word               from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.72s T g_word_utf8          from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.48s T lex                  from /Users/dirk/github/etcbc/bhsa/tf/4
   |     1.64s T lex_utf8             from /Users/dirk/github/et

75 features found and 0 ignored
  0.00s loading features ...
   |     0.01s B book                 from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.00s Feature overview: 70 for nodes; 4 for edges; 1 configs; 7 computed
  4.22s All features loaded/computed - for details use loadLog()
|     15m 23s 26 book name features created
..............................................................................................
.     15m 23s Write book name features as TF                                                 .
..............................................................................................
   |     0.00s T book@am              to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.00s T book@ar              to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.00s T book@bn              to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.00s T book@da              to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.00s T book@de              to /Users/dirk/gith

|     15m 28s la = latin                Genesis is Genesis              in Latina              
|     15m 28s nl = dutch                Genesis is Genesis              in Nederlands          
|     15m 28s pa = punjabi              Genesis is ਉਤਪਤ                 in ਪੰਜਾਬੀ              
|     15m 28s pt = portuguese           Genesis is Gênesis              in Português           
|     15m 28s ru = russian              Genesis is Бытия                in Русский             
|     15m 29s sw = swahili              Genesis is Mwanzo               in Kiswahili           
|     15m 29s syc = syriac               Genesis is ܒܪܝܬܐ                in ܠܫܢܐ ܣܘܪܝܝܐ         
|     15m 29s tr = turkish              Genesis is Yaratılış            in Türkçe              
|     15m 29s ur = urdu                 Genesis is پیدائش               in اُردُو              
|     15m 29s yo = yoruba               Genesis is Genesisi             in èdè Yorùbá          
|     15m 29s zh = chinese             

   |     0.78s T sp                   to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     4.31s T oslots               to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
   |     0.00s M otext                to /Users/dirk/github/etcbc/bhsa/_temp/4/tf
..............................................................................................
.     16m 12s Check differences with previous version                                        .
..............................................................................................
|     16m 12s 	2 features to add
|     16m 12s 		lex0
|     16m 12s 		root
|     16m 12s 	no features to delete
|     16m 12s 	10 features in common
|     16m 12s gloss                     ... differences after the metadata
|     16m 12s 	line 426557 OLD --><empty><--
|     16m 12s 	line 426557 NEW -->1441145	in<--
|     16m 12s 	line 426558 OLD --><empty><--
|     16m 12s 	line 426558 NEW -->beginning<--
|     16m 12s 	line 426559 OLD --><empty><--
|     16m 12s 	

|     17m 53s 	Destination /Users/dirk/github/etcbc/valence/tf/4/.tf/valence.tfx does not exist
True True
..............................................................................................
.     17m 53s Load the existing TF dataset                                                   .
..............................................................................................
This is Text-Fabric 3.0.6
Api reference : https://github.com/Dans-labs/text-fabric/wiki/Api
Tutorial      : https://github.com/Dans-labs/text-fabric/blob/master/docs/tutorial.ipynb
Example data  : https://github.com/Dans-labs/text-fabric-data

103 features found and 0 ignored
  0.00s loading features ...
   |     0.13s B lex                  from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.19s B lex_utf8             from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.14s B gloss                from /Users/dirk/github/etcbc/bhsa/tf/4
   |     0.12s B sp                   from /Users/dirk/github/etcbc/b

|     18m 11s 	Done
|     18m 11s 	Phrases of kind C :  16081
|     18m 11s 	Phrases of kind L :  11467
|     18m 11s 	Phrases of kind I :   7308
|     18m 11s 	Total complements :  34856
|     18m 11s 	Total phrases     : 215708
..............................................................................................
.     18m 11s Checking enrichment logic                                                      .
..............................................................................................
|     18m 11s 	All 6 rules OK
..............................................................................................
.     18m 11s Generating enrichments                                                         .
..............................................................................................
|     18m 17s 	Generated enrichment values for 1381 verbs:
|     18m 17s 	Enriched values for 222039 nodes
|     18m 17s 	Overview of rule applications:
|     18m 17s gen

   |     0.78s T predication          from /Users/dirk/github/etcbc/valence/tf/4
   |     0.77s T grammatical          from /Users/dirk/github/etcbc/valence/tf/4
   |     0.38s T original             from /Users/dirk/github/etcbc/valence/tf/4
   |     0.52s T lexical              from /Users/dirk/github/etcbc/valence/tf/4
   |     0.52s T semantic             from /Users/dirk/github/etcbc/valence/tf/4
   |     0.36s T f_correction         from /Users/dirk/github/etcbc/valence/tf/4
   |     0.37s T s_manual             from /Users/dirk/github/etcbc/valence/tf/4
   |     0.47s T cfunction            from /Users/dirk/github/etcbc/valence/tf/4
   |     0.00s Feature overview: 107 for nodes; 4 for edges; 1 configs; 7 computed
    11s All features loaded/computed - for details use loadLog()
Time - Time - True
Pred - Pred - True
Subj - Subj - True
Objc - Objc - True
Conj -  - True
Subj -  - True
Pred -  - True
PreC -  - True
Conj - None - False
Subj - None - False
|     18m 38s SUCCESS enrich

|     18m 59s 	10000 clauses
|     19m 02s 	20000 clauses
|     19m 05s 	30000 clauses
|     19m 08s 	40000 clauses
|     19m 10s 	47316 clauses
..............................................................................................
.     19m 10s Writing sense feature to TF                                                    .
..............................................................................................
   |     0.11s T sense                to /Users/dirk/github/etcbc/valence/_temp/4/tf
..............................................................................................
.     19m 11s Check differences with previous version                                        .
..............................................................................................
|     19m 11s 	1 features to add
|     19m 11s 		sense
|     19m 11s 	no features to delete
|     19m 11s 	0 features in common
|     19m 11s Done
.....................................................

106 features found and 0 ignored
  0.00s loading features ...
   |     0.08s T crossref             from /Users/dirk/github/etcbc/parallels/tf/4
   |     0.05s T crossrefSET          from /Users/dirk/github/etcbc/parallels/tf/4
   |     0.09s T crossrefLCS          from /Users/dirk/github/etcbc/parallels/tf/4
   |     0.00s Feature overview: 98 for nodes; 7 for edges; 1 configs; 7 computed
  5.51s All features loaded/computed - for details use loadLog()
..............................................................................................
.     19m 33s Test: crossrefs of Genesis 10                                                  .
..............................................................................................
|     19m 33s 	Method 
|     19m 33s 		20 start verses
		Genesis 10:2
|     19m 33s 		         ----------> 1_Chronicles 1:5     confidende 100%
		Genesis 10:3
|     19m 33s 		         ----------> 1_Chronicles 1:6     confidende  95%
		Genesis 10:4
|     19m

  0.00s 		enum boolean_t
  0.00s 		enum phrase_determination_t
  0.00s 		enum language_t
  0.00s 		enum book_name_t
  0.01s 		enum lexical_set_t
  0.01s 		enum verbal_stem_t
  0.01s 		enum verbal_tense_t
  0.01s 		enum person_t
  0.01s 		enum number_t
  0.01s 		enum gender_t
  0.01s 		enum state_t
  0.01s 		enum part_of_speech_t
  0.02s 		enum phrase_type_t
  0.02s 		enum phrase_atom_relation_t
  0.02s 		enum phrase_relation_t
  0.02s 		enum phrase_atom_unit_distance_to_mother_t
  0.02s 		enum subphrase_relation_t
  0.02s 		enum subphrase_mother_object_type_t
  0.02s 		enum phrase_function_t
  0.02s 		enum clause_atom_type_t
  0.03s 		enum clause_type_t
  0.03s 		enum clause_kind_t
  0.03s 		enum clause_constituent_relation_t
  0.03s 		enum clause_constituent_mother_object_type_t
  0.03s 		enum clause_constituent_unit_distance_to_mother_t
  0.03s 		otype word
  0.03s 			feature trailer_utf8 (str) =def=  : node
  0.03s 			feature number (int) =def= 0 : node
  0.03s 			feature g_vbe (str

 2m 41s 90554 objects of type clause_atom
 2m 41s 63586 objects of type sentence
 2m 41s 113764 objects of type subphrase
 2m 41s 253161 objects of type phrase
 2m 41s 929 objects of type chapter
 2m 41s 39 objects of type book
 2m 41s 88011 objects of type clause
 2m 41s 45180 objects of type half_verse
 2m 41s 23213 objects of type verse
 2m 41s 64354 objects of type sentence_atom
 2m 41s 267499 objects of type phrase_atom
 2m 41s Making TF data ...
 2m 41s Monad - idd mapping ...
 2m 41s Removing holes in the monad sequence
 2m 41s maxSlot=426568
 2m 41s Node mapping and otype ...
 2m 42s oslots ...
 2m 44s metadata ...
 2m 44s features ...
 2m 44s 	features from words
 2m 49s 	   100000 words
 2m 53s 	   200000 words
 2m 57s 	   300000 words
 3m 01s 	   400000 words
 3m 02s 	   426568 words
 3m 02s 	features from books
 3m 02s 	       39 books
 3m 02s 	features from chapters
 3m 02s 	      929 chapters
 3m 02s 	features from clauses
 3m 03s 	    88011 clauses
 3m 03s 	features from

..............................................................................................
.     24m 08s Load and compile standard TF features                                          .
..............................................................................................
This is Text-Fabric 3.0.6
Api reference : https://github.com/Dans-labs/text-fabric/wiki/Api
Tutorial      : https://github.com/Dans-labs/text-fabric/blob/master/docs/tutorial.ipynb
Example data  : https://github.com/Dans-labs/text-fabric-data

74 features found and 0 ignored
  0.00s loading features ...
   |     1.31s T otype                from /Users/dirk/github/etcbc/bhsa/tf/4b
   |       13s T oslots               from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.09s T book                 from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.05s T chapter              from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.05s T verse                from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     1.56s 

74 features found and 0 ignored
  0.00s loading features ...
   |     0.01s B book                 from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.00s Feature overview: 69 for nodes; 4 for edges; 1 configs; 7 computed
  4.09s All features loaded/computed - for details use loadLog()
|     26m 53s 26 book name features created
..............................................................................................
.     26m 53s Write book name features as TF                                                 .
..............................................................................................
   |     0.00s T book@am              to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     0.00s T book@ar              to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     0.00s T book@bn              to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     0.00s T book@da              to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     0.00s T book@de              to /Users/dirk

|     26m 58s id = indonesian           Genesis is Kejadian             in Bahasa Indonesia    
|     26m 58s ja = japanese             Genesis is 創世記                  in 日本語                 
|     26m 58s ko = korean               Genesis is 창세기                  in 한국어                 
|     26m 58s la = latin                Genesis is Genesis              in Latina              
|     26m 58s nl = dutch                Genesis is Genesis              in Nederlands          
|     26m 58s pa = punjabi              Genesis is ਉਤਪਤ                 in ਪੰਜਾਬੀ              
|     26m 58s pt = portuguese           Genesis is Gênesis              in Português           
|     26m 58s ru = russian              Genesis is Бытия                in Русский             
|     26m 58s sw = swahili              Genesis is Mwanzo               in Kiswahili           
|     26m 58s syc = syriac               Genesis is ܒܪܝܬܐ                in ܠܫܢܐ ܣܘܪܝܝܐ         
|     26m 58s tr = turkish             

   |     0.67s T otype                to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     0.00s T root                 to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     0.76s T sp                   to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     4.24s T oslots               to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
   |     0.00s M otext                to /Users/dirk/github/etcbc/bhsa/_temp/4b/tf
..............................................................................................
.     27m 40s Check differences with previous version                                        .
..............................................................................................
|     27m 40s 	2 features to add
|     27m 40s 		lex0
|     27m 40s 		root
|     27m 40s 	no features to delete
|     27m 40s 	10 features in common
|     27m 40s gloss                     ... differences after the metadata
|     27m 41s 	line 426570 OLD --><empty><--
|     27m 41s 	line 426570 NEW 

|     29m 15s 		ls              = None
|     29m 15s 		nametype        = None
|     29m 15s 		root            = None
|     29m 15s 		sp              = prep
|     29m 15s 	hbo - H - 30380x
|     29m 15s 		gloss           = the
|     29m 15s 		ls              = None
|     29m 15s 		nametype        = None
|     29m 15s 		root            = None
|     29m 15s 		sp              = art
|     29m 15s 	hbo - >RY/ - 2504x
|     29m 15s 		gloss           = earth
|     29m 15s 		ls              = None
|     29m 15s 		nametype        = None
|     29m 15s 		root            = None
|     29m 15s 		sp              = subs
|     29m 15s SUCCESS lexicon

----------------------------------------------------------------------------------------------
-     29m 15s SUCCES [bhsa/lexicon]                                                          -
----------------------------------------------------------------------------------------------


-----------------------------------------------------------------------

|     29m 31s 		for verb JYa
|     29m 32s 		for verb CWB
|     29m 32s 		for verb SWR
|     29m 32s 		for verb BRa
|     29m 32s 		for verb oFH
|     29m 33s 		for verb NTN
|     29m 33s 		for verb NFa
|     29m 33s 		for verb FJM
|     29m 33s 		for verb oBR
|     29m 33s 		for verb NWS
|     29m 33s 		for verb oLH
|     29m 33s 		for verb CJT
|     29m 33s 		for verb JRD
|     29m 33s 		for verb NPL
|     29m 33s 		for verb PQD
|     29m 34s 		for verb QRa
|     29m 34s 		for verb BWa
|     29m 34s 		for verb HLK
|     29m 34s 	52110  phrases seen 1  time(s)
|     29m 34s 	181    phrases seen 2  time(s)
|     29m 34s 	9      phrases seen 3  time(s)
|     29m 34s 	Total phrases seen: 52300
..............................................................................................
.     29m 34s Processing filled correction sheets ...                                        .
..............................................................................................
|     29m 34s 

|     29m 52s 	blank enrichment sheet for BWa
|     29m 52s 	blank enrichment sheet for CJT
|     29m 52s 	blank enrichment sheet for CWB
|     29m 52s 	blank enrichment sheet for FJM
|     29m 52s 	blank enrichment sheet for HLK
|     29m 52s 	blank enrichment sheet for JRD
|     29m 52s 	blank enrichment sheet for JYa
|     29m 52s 	blank enrichment sheet for NFa
|     29m 52s 	blank enrichment sheet for NPL
|     29m 52s 	blank enrichment sheet for NTN
|     29m 52s 	blank enrichment sheet for NWS
|     29m 52s 	blank enrichment sheet for PQD
|     29m 52s 	blank enrichment sheet for QRa
|     29m 52s 	blank enrichment sheet for SWR
|     29m 52s 	OK: The used blank enrichment sheets have legal values
|     29m 52s 	OK: The used blank enrichment sheets are consistent
|     29m 52s 	OK: The used filled enrichment sheets have legal values
|     29m 52s 	OK: The used filled enrichment sheets are consistent
|     29m 52s 	OK: all enriched nodes were phrase nodes
|     29m 52s 	OK: all 

   |     0.22s B rela                 from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.21s B typ                  from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.12s B prs                  from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.12s B uvf                  from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.13s B sp                   from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.12s B pdp                  from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.12s B ls                   from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.12s B vs                   from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.12s B vt                   from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.08s B nametype             from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.14s B gloss                from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.02s B label                from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.32s B number               from /Users/di

   |     0.12s B sp                   from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.12s B vs                   from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.06s B predication          from /Users/dirk/github/etcbc/valence/tf/4b
   |     0.14s B gloss                from /Users/dirk/github/etcbc/bhsa/tf/4b
   |     0.23s T sense                from /Users/dirk/github/etcbc/valence/tf/4b
   |     0.00s Feature overview: 107 for nodes; 4 for edges; 1 configs; 7 computed
  5.14s All features loaded/computed - for details use loadLog()
..............................................................................................
.     30m 50s Show sense counts                                                              .
..............................................................................................
|     30m 50s 	Sense labels = -- -b -c -i -p c. d- db dc di dp i. k. l. n.
|     30m 50s 	Counted 47349 senses
|     30m 50s 	All relevant verbs have been assigned a 

		Genesis 10:29
|     31m 08s 		         ----------> 1_Chronicles 1:23    confidence 100%
		Genesis 10:31
|     31m 08s 		         ----------> Genesis 10:20        confidence  87%
|     31m 08s 	Method SET
|     31m 08s 		20 start verses
		Genesis 10:2
|     31m 08s 		         ----------> 1_Chronicles 1:5     confidence 100%
		Genesis 10:3
|     31m 08s 		         ----------> 1_Chronicles 1:6     confidence  95%
		Genesis 10:4
|     31m 08s 		         ----------> 1_Chronicles 1:7     confidence  95%
		Genesis 10:6
|     31m 08s 		         ----------> 1_Chronicles 1:8     confidence 100%
		Genesis 10:7
|     31m 08s 		         ----------> 1_Chronicles 1:9     confidence 100%
		Genesis 10:8
|     31m 08s 		         ----------> 1_Chronicles 1:10    confidence 100%
		Genesis 10:13
|     31m 08s 		         ----------> 1_Chronicles 1:11    confidence 100%
		Genesis 10:14
|     31m 08s 		         ----------> 1_Chronicles 1:12    confidence 100%
		Genesis 10:15
|     31m 08s 		         -------

  0.02s 		enum gender_t
  0.02s 		enum state_t
  0.02s 		enum part_of_speech_t
  0.03s 		enum phrase_type_t
  0.03s 		enum phrase_atom_relation_t
  0.03s 		enum phrase_relation_t
  0.03s 		enum phrase_atom_unit_distance_to_mother_t
  0.04s 		enum subphrase_relation_t
  0.04s 		enum subphrase_mother_object_type_t
  0.04s 		enum phrase_function_t
  0.04s 		enum clause_atom_type_t
  0.04s 		enum clause_type_t
  0.05s 		enum clause_kind_t
  0.05s 		enum clause_constituent_relation_t
  0.05s 		enum clause_constituent_mother_object_type_t
  0.05s 		enum clause_constituent_unit_distance_to_mother_t
  0.06s 		otype word
  0.06s 			feature number (int) =def= 0 : node
  0.06s 			feature g_voc_lex (str) =def=  : node
  0.06s 			feature g_vbe_utf8 (str) =def=  : node
  0.06s 			feature g_voc_lex_utf8 (str) =def=  : node
  0.07s 			feature g_nme (str) =def=  : node
  0.07s 			feature nme (str) =def=  : node
  0.07s 			feature g_vbe (str) =def=  : node
  0.07s 			feature g_word (str) =def=  : node
 

 3m 06s maxSlot=426581
 3m 06s Node mapping and otype ...
 3m 07s oslots ...
 3m 10s metadata ...
 3m 10s features ...
 3m 10s 	features from words
 3m 14s 	   100000 words
 3m 19s 	   200000 words
 3m 22s 	   300000 words
 3m 26s 	   400000 words
 3m 27s 	   426581 words
 3m 27s 	features from books
 3m 27s 	       39 books
 3m 27s 	features from chapters
 3m 27s 	      929 chapters
 3m 27s 	features from clauses
 3m 28s 	    88000 clauses
 3m 28s 	features from clause_atoms
 3m 29s 	    90562 clause_atoms
 3m 29s 	features from half_verses
 3m 29s 	    45180 half_verses
 3m 29s 	features from phrases
 3m 30s 	   100000 phrases
 3m 31s 	   200000 phrases
 3m 33s 	   253174 phrases
 3m 33s 	features from phrase_atoms
 3m 34s 	   100000 phrase_atoms
 3m 35s 	   200000 phrase_atoms
 3m 35s 	   267515 phrase_atoms
 3m 35s 	features from sentences
 3m 36s 	    63570 sentences
 3m 36s 	features from sentence_atoms
 3m 36s 	    64339 sentence_atoms
 3m 36s 	features from subphrases
 3m 36s 	

..............................................................................................
.     35m 48s Load and compile standard TF features                                          .
..............................................................................................
This is Text-Fabric 3.0.6
Api reference : https://github.com/Dans-labs/text-fabric/wiki/Api
Tutorial      : https://github.com/Dans-labs/text-fabric/blob/master/docs/tutorial.ipynb
Example data  : https://github.com/Dans-labs/text-fabric-data

69 features found and 0 ignored
  0.00s loading features ...
   |     1.24s T otype                from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     9.70s T oslots               from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.09s T book                 from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.05s T chapter              from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.05s T verse                from /Users/dirk/github/etcbc/bhsa/tf/2016
   | 

69 features found and 0 ignored
  0.00s loading features ...
   |     0.01s B book                 from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.00s Feature overview: 64 for nodes; 4 for edges; 1 configs; 7 computed
  4.03s All features loaded/computed - for details use loadLog()
|     38m 21s 26 book name features created
..............................................................................................
.     38m 21s Write book name features as TF                                                 .
..............................................................................................
   |     0.00s T book@am              to /Users/dirk/github/etcbc/bhsa/_temp/2016/tf
   |     0.00s T book@ar              to /Users/dirk/github/etcbc/bhsa/_temp/2016/tf
   |     0.00s T book@bn              to /Users/dirk/github/etcbc/bhsa/_temp/2016/tf
   |     0.00s T book@da              to /Users/dirk/github/etcbc/bhsa/_temp/2016/tf
   |     0.00s T book@de              to /

|     38m 25s la = latin                Genesis is Genesis              in Latina              
|     38m 25s nl = dutch                Genesis is Genesis              in Nederlands          
|     38m 25s pa = punjabi              Genesis is ਉਤਪਤ                 in ਪੰਜਾਬੀ              
|     38m 25s pt = portuguese           Genesis is Gênesis              in Português           
|     38m 25s ru = russian              Genesis is Бытия                in Русский             
|     38m 25s sw = swahili              Genesis is Mwanzo               in Kiswahili           
|     38m 25s syc = syriac               Genesis is ܒܪܝܬܐ                in ܠܫܢܐ ܣܘܪܝܝܐ         
|     38m 25s tr = turkish              Genesis is Yaratılış            in Türkçe              
|     38m 25s ur = urdu                 Genesis is پیدائش               in اُردُو              
|     38m 25s yo = yoruba               Genesis is Genesisi             in èdè Yorùbá          
|     38m 25s zh = chinese             

..............................................................................................
.     38m 49s Update the otype, oslots and otext features                                    .
..............................................................................................
|     38m 51s Features that have new or modified data
|     38m 51s 	gloss
|     38m 51s 	language
|     38m 51s 	lex
|     38m 51s 	lex0
|     38m 51s 	lex_utf8
|     38m 51s 	ls
|     38m 51s 	nametype
|     38m 51s 	otype
|     38m 51s 	root
|     38m 51s 	sp
|     38m 51s 	voc_lex
|     38m 51s 	voc_lex_utf8
|     38m 51s 	oslots
|     38m 51s Check voc_lex_utf8: בְּ רֵאשִׁית ברא אֱלֹהִים אֵת הַ שָׁמַיִם וְ אֶרֶץ
|     38m 51s 	Features to remove
|     38m 51s 	g_voc_lex
|     38m 51s 	g_voc_lex_utf8
..............................................................................................
.     38m 51s write new/changed features to TF ...                                           .
...............

   |      |     1.24s C __levels__           from otype, oslots
   |      |       20s C __order__            from otype, oslots, __levels__
   |      |     0.89s C __rank__             from otype, __order__
   |      |       22s C __levUp__            from otype, oslots, __rank__
   |      |       12s C __levDown__          from otype, __levUp__, __rank__
   |      |     4.45s C __boundary__         from otype, oslots, __rank__
   |     0.00s M otext                from /Users/dirk/github/etcbc/bhsa/tf/2016
   |      |     0.12s C __sections__         from otype, oslots, otext, __levUp__, __levels__, book, chapter, verse
   |     0.04s T gloss                from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     1.53s T lex                  from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     1.54s T sp                   from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.01s T nametype             from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     1.45s T language             from /Use

   |     0.15s T pargr                to /Users/dirk/github/etcbc/bhsa/_temp/2016/tf
..............................................................................................
.     40m 42s Check differences with previous version                                        .
..............................................................................................
|     40m 42s 	2 features to add
|     40m 42s 		instruction
|     40m 42s 		pargr
|     40m 42s 	no features to delete
|     40m 42s 	0 features in common
|     40m 42s Done
..............................................................................................
.     40m 42s Deliver features to /Users/dirk/github/etcbc/bhsa/tf/2016                      .
..............................................................................................
|     40m 42s 	pargr
|     40m 42s 	instruction
..............................................................................................
.     40m 42s Load and comp

|     40m 53s qere                      ... differences after the metadata
|     40m 53s 	line      2 OLD --><--
|     40m 53s 	line      2 NEW -->3897	HAJ:Y;74><--
|     40m 53s 	line      3 OLD --><--
|     40m 53s 	line      3 NEW -->4420	>@H:@LO75W<--
|     40m 53s 	line      4 OLD --><--
|     40m 53s 	line      4 NEW -->5645	>@H:@LO92W<--
|     40m 53s 	line      5 OLD --><--
|     40m 53s 	line      5 NEW -->5912	>@95H:@LOW03<--

|     40m 53s qere_utf8                 ... differences after the metadata
|     40m 53s 	line      2 OLD --><--
|     40m 53s 	line      2 NEW -->3897	הַיְצֵ֣א<--
|     40m 53s 	line      3 OLD --><--
|     40m 53s 	line      3 NEW -->4420	אָהֳלֹֽו<--
|     40m 53s 	line      4 OLD --><--
|     40m 53s 	line      4 NEW -->5645	אָהֳלֹ֑ו<--
|     40m 53s 	line      5 OLD --><--
|     40m 53s 	line      5 NEW -->5912	אָֽהֳלֹו֙<--

|     40m 53s Done
..............................................................................................
.     40m 53

107 features found and 0 ignored
  0.00s loading features ...
   |     0.20s B lex                  from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     1.12s T freq_occ             from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     1.00s T freq_lex             from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.95s T rank_occ             from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     1.02s T rank_lex             from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.00s Feature overview: 102 for nodes; 4 for edges; 1 configs; 7 computed
    10s All features loaded/computed - for details use loadLog()
..............................................................................................
.     41m 23s Basic test                                                                     .
..............................................................................................
|     41m 23s Top 10 frequent lexemes (computed on otype=word)
|     41m 23s W           50272x
|    

|     42m 09s 	16000 verses 227941 1335 39 777
|     42m 10s 	17000 verses 235631 1378 48 827
|     42m 11s 	18000 verses 243254 1395 51 866
|     42m 12s 	19000 verses 250705 1428 59 906
|     42m 13s 	20000 verses 260114 1469 60 960
|     42m 15s 	21000 verses 275079 1532 63 979
|     42m 17s 	22000 verses 286437 1589 65 1007
|     42m 19s 	23000 verses 301295 1644 66 1075
|     42m 19s 	23213 verses done 304793 1649 66 1081
|     42m 19s 	  270184 accents
|     42m 19s 	    9015 cleanup
|     42m 19s 	   45235 dagesh_forte
|     42m 19s 	   21511 dagesh_forte_lene
|     42m 19s 	   59612 dagesh_lene
|     42m 19s 	   16321 default_accent
|     42m 19s 	     968 fixit
|     42m 19s 	    2658 furtive_patah
|     42m 19s 	   28195 last_ml
|     42m 19s 	    2201 mappiq_heh
|     42m 19s 	   93898 mobile_schwa1
|     42m 19s 	    2255 mobile_schwa2
|     42m 19s 	     179 mobile_schwa3
|     42m 19s 	    7702 mobile_schwa4
|     42m 19s 	   25498 punct
|     42m 19s 	   25498 punctuatio

|     42m 34s 	Destination /Users/dirk/github/etcbc/valence/tf/2016/.tf/valence.tfx does not exist
True True
..............................................................................................
.     42m 34s Load the existing TF dataset                                                   .
..............................................................................................
This is Text-Fabric 3.0.6
Api reference : https://github.com/Dans-labs/text-fabric/wiki/Api
Tutorial      : https://github.com/Dans-labs/text-fabric/blob/master/docs/tutorial.ipynb
Example data  : https://github.com/Dans-labs/text-fabric-data

107 features found and 0 ignored
  0.00s loading features ...
   |     0.23s B lex_utf8             from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.15s B lex                  from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.01s B gloss                from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.17s B sp                   from /Users/dirk/gi

|     43m 21s 	   18 clauses with  2 infinitive objects
|     43m 21s 	 1192 clauses with  1 infinitive object
|     43m 21s 	68948 clauses with  0 infinitive objects
|     43m 21s 	 1211 clauses with an infinitive object
..............................................................................................
.     43m 21s Determining kind of complements                                                .
..............................................................................................
|     43m 23s 	Done
|     43m 23s 	Phrases of kind C :  19300
|     43m 23s 	Phrases of kind L :  11681
|     43m 23s 	Phrases of kind I :   6016
|     43m 23s 	Total complements :  36997
|     43m 23s 	Total phrases     : 214665
..............................................................................................
.     43m 23s Checking enrichment logic                                                      .
.........................................................................

   |     0.00s B gloss                from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.13s B sp                   from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.12s B vs                   from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.23s B rela                 from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.22s B typ                  from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.07s B function             from /Users/dirk/github/etcbc/bhsa/tf/2016
   |     0.83s T valence              from /Users/dirk/github/etcbc/valence/tf/2016
   |     0.82s T predication          from /Users/dirk/github/etcbc/valence/tf/2016
   |     0.79s T grammatical          from /Users/dirk/github/etcbc/valence/tf/2016
   |     0.38s T original             from /Users/dirk/github/etcbc/valence/tf/2016
   |     0.53s T lexical              from /Users/dirk/github/etcbc/valence/tf/2016
   |     0.53s T semantic             from /Users/dirk/github/etcbc/valence/tf/2016
   |     0

|     44m 12s 	10000 clauses
|     44m 15s 	20000 clauses
|     44m 18s 	30000 clauses
|     44m 21s 	40000 clauses
|     44m 23s 	47383 clauses
..............................................................................................
.     44m 23s Writing sense feature to TF                                                    .
..............................................................................................
   |     0.10s T sense                to /Users/dirk/github/etcbc/valence/_temp/2016/tf
..............................................................................................
.     44m 23s Check differences with previous version                                        .
..............................................................................................
|     44m 23s 	1 features to add
|     44m 23s 		sense
|     44m 23s 	no features to delete
|     44m 23s 	0 features in common
|     44m 23s Done
..................................................

110 features found and 0 ignored
  0.00s loading features ...
   |     0.09s T crossref             from /Users/dirk/github/etcbc/parallels/tf/2016
   |     0.04s T crossrefSET          from /Users/dirk/github/etcbc/parallels/tf/2016
   |     0.07s T crossrefLCS          from /Users/dirk/github/etcbc/parallels/tf/2016
   |     0.00s Feature overview: 102 for nodes; 7 for edges; 1 configs; 7 computed
  4.58s All features loaded/computed - for details use loadLog()
..............................................................................................
.     44m 43s Test: crossrefs of Genesis 10                                                  .
..............................................................................................
|     44m 43s 	Method 
|     44m 43s 		20 start verses
		Genesis 10:2
|     44m 43s 		         ----------> 1_Chronicles 1:5     confidence 100%
		Genesis 10:3
|     44m 43s 		         ----------> 1_Chronicles 1:6     confidence  95%
		Genesis 10:4

  0.00s 		enum boolean_t
  0.00s 		enum phrase_determination_t
  0.00s 		enum language_t
  0.00s 		enum book_name_t
  0.01s 		enum lexical_set_t
  0.01s 		enum verbal_stem_t
  0.01s 		enum verbal_tense_t
  0.01s 		enum person_t
  0.01s 		enum number_t
  0.01s 		enum gender_t
  0.01s 		enum state_t
  0.02s 		enum part_of_speech_t
  0.02s 		enum phrase_type_t
  0.02s 		enum phrase_atom_relation_t
  0.02s 		enum phrase_relation_t
  0.02s 		enum phrase_atom_unit_distance_to_mother_t
  0.02s 		enum subphrase_relation_t
  0.02s 		enum subphrase_mother_object_type_t
  0.02s 		enum phrase_function_t
  0.03s 		enum clause_atom_type_t
  0.03s 		enum clause_type_t
  0.03s 		enum clause_kind_t
  0.03s 		enum clause_constituent_relation_t
  0.03s 		enum clause_constituent_mother_object_type_t
  0.03s 		enum clause_constituent_unit_distance_to_mother_t
  0.03s 		otype word
  0.04s 			feature number (int) =def= 0 : node
  0.04s 			feature g_voc_lex (str) =def=  : node
  0.04s 			feature g_vbe_utf8 (s

 2m 53s 88000 objects of type clause
 2m 53s 45180 objects of type half_verse
 2m 53s 23213 objects of type verse
 2m 53s 267515 objects of type phrase_atom
 2m 53s Making TF data ...
 2m 53s Monad - idd mapping ...
 2m 53s Removing holes in the monad sequence
 2m 54s maxSlot=426581
 2m 54s Node mapping and otype ...
 2m 54s oslots ...
 2m 58s metadata ...
 2m 58s features ...
 2m 58s 	features from words
 3m 02s 	   100000 words
 3m 06s 	   200000 words
 3m 11s 	   300000 words
 3m 17s 	   400000 words
 3m 18s 	   426581 words
 3m 18s 	features from books
 3m 18s 	       39 books
 3m 18s 	features from chapters
 3m 18s 	      929 chapters
 3m 18s 	features from clauses
 3m 22s 	    88000 clauses
 3m 22s 	features from clause_atoms
 3m 23s 	    90562 clause_atoms
 3m 23s 	features from half_verses
 3m 23s 	    45180 half_verses
 3m 23s 	features from phrases
 3m 24s 	   100000 phrases
 3m 25s 	   200000 phrases
 3m 26s 	   253174 phrases
 3m 26s 	features from phrase_atoms
 3m 27s 	   

..............................................................................................
.     49m 30s Load and compile standard TF features                                          .
..............................................................................................
This is Text-Fabric 3.0.6
Api reference : https://github.com/Dans-labs/text-fabric/wiki/Api
Tutorial      : https://github.com/Dans-labs/text-fabric/blob/master/docs/tutorial.ipynb
Example data  : https://github.com/Dans-labs/text-fabric-data

69 features found and 0 ignored
  0.00s loading features ...
   |     1.06s T otype                from /Users/dirk/github/etcbc/bhsa/tf/c
   |       10s T oslots               from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.09s T book                 from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.05s T chapter              from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.05s T verse                from /Users/dirk/github/etcbc/bhsa/tf/c
   |     1.59s T g_c

69 features found and 0 ignored
  0.00s loading features ...
   |     0.01s B book                 from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.00s Feature overview: 64 for nodes; 4 for edges; 1 configs; 7 computed
  4.71s All features loaded/computed - for details use loadLog()
|     52m 05s 26 book name features created
..............................................................................................
.     52m 05s Write book name features as TF                                                 .
..............................................................................................
   |     0.00s T book@am              to /Users/dirk/github/etcbc/bhsa/_temp/c/tf
   |     0.00s T book@ar              to /Users/dirk/github/etcbc/bhsa/_temp/c/tf
   |     0.00s T book@bn              to /Users/dirk/github/etcbc/bhsa/_temp/c/tf
   |     0.00s T book@da              to /Users/dirk/github/etcbc/bhsa/_temp/c/tf
   |     0.00s T book@de              to /Users/dirk/gith

|     52m 10s hi = hindi                Genesis is उत्पाति              in हिन्दी              
|     52m 10s id = indonesian           Genesis is Kejadian             in Bahasa Indonesia    
|     52m 10s ja = japanese             Genesis is 創世記                  in 日本語                 
|     52m 10s ko = korean               Genesis is 창세기                  in 한국어                 
|     52m 10s la = latin                Genesis is Genesis              in Latina              
|     52m 10s nl = dutch                Genesis is Genesis              in Nederlands          
|     52m 10s pa = punjabi              Genesis is ਉਤਪਤ                 in ਪੰਜਾਬੀ              
|     52m 10s pt = portuguese           Genesis is Gênesis              in Português           
|     52m 10s ru = russian              Genesis is Бытия                in Русский             
|     52m 10s sw = swahili              Genesis is Mwanzo               in Kiswahili           
|     52m 10s syc = syriac              

..............................................................................................
.     52m 33s Various tweaks in features                                                     .
..............................................................................................
..............................................................................................
.     52m 34s Update the otype, oslots and otext features                                    .
..............................................................................................
|     52m 37s Features that have new or modified data
|     52m 37s 	gloss
|     52m 37s 	language
|     52m 37s 	lex
|     52m 37s 	lex0
|     52m 37s 	lex_utf8
|     52m 37s 	ls
|     52m 37s 	nametype
|     52m 37s 	otype
|     52m 37s 	root
|     52m 37s 	sp
|     52m 37s 	voc_lex
|     52m 37s 	voc_lex_utf8
|     52m 37s 	oslots
|     52m 37s Check voc_lex_utf8: בְּ רֵאשִׁית ברא אֱלֹהִים אֵת הַ שָׁמַיִם וְ אֶרֶץ
|     52m

   |       17s T oslots               from /Users/dirk/github/etcbc/bhsa/tf/c
   |     1.43s T lex0                 from /Users/dirk/github/etcbc/bhsa/tf/c
   |     1.60s T lex_utf8             from /Users/dirk/github/etcbc/bhsa/tf/c
   |      |     1.23s C __levels__           from otype, oslots
   |      |       18s C __order__            from otype, oslots, __levels__
   |      |     0.91s C __rank__             from otype, __order__
   |      |       22s C __levUp__            from otype, oslots, __rank__
   |      |       13s C __levDown__          from otype, __levUp__, __rank__
   |      |     4.12s C __boundary__         from otype, oslots, __rank__
   |     0.00s M otext                from /Users/dirk/github/etcbc/bhsa/tf/c
   |      |     0.12s C __sections__         from otype, oslots, otext, __levUp__, __levels__, book, chapter, verse
   |     0.03s T gloss                from /Users/dirk/github/etcbc/bhsa/tf/c
   |     1.96s T lex                  from /Users/dirk/github/

   |     0.16s T instruction          to /Users/dirk/github/etcbc/bhsa/_temp/c/tf
   |     0.17s T pargr                to /Users/dirk/github/etcbc/bhsa/_temp/c/tf
..............................................................................................
.     54m 26s Check differences with previous version                                        .
..............................................................................................
|     54m 26s 	2 features to add
|     54m 26s 		instruction
|     54m 26s 		pargr
|     54m 26s 	no features to delete
|     54m 26s 	0 features in common
|     54m 26s Done
..............................................................................................
.     54m 26s Deliver features to /Users/dirk/github/etcbc/bhsa/tf/c                         .
..............................................................................................
|     54m 26s 	pargr
|     54m 26s 	instruction
...........................................

|     54m 38s 	line      5 NEW -->5912	>@95H:@LOW03<--

|     54m 38s qere_utf8                 ... differences after the metadata
|     54m 38s 	line      2 OLD --><--
|     54m 38s 	line      2 NEW -->3897	הַיְצֵ֣א<--
|     54m 38s 	line      3 OLD --><--
|     54m 38s 	line      3 NEW -->4420	אָהֳלֹֽו<--
|     54m 38s 	line      4 OLD --><--
|     54m 38s 	line      4 NEW -->5645	אָהֳלֹ֑ו<--
|     54m 38s 	line      5 OLD --><--
|     54m 38s 	line      5 NEW -->5912	אָֽהֳלֹו֙<--

|     54m 38s Done
..............................................................................................
.     54m 38s Deliver features to /Users/dirk/github/etcbc/bhsa/tf/c                         .
..............................................................................................
|     54m 38s 	qere_utf8
|     54m 38s 	qere_trailer_utf8
|     54m 38s 	qere
|     54m 38s 	otext
|     54m 38s 	qere_trailer
................................................................................

   |     1.07s T freq_occ             from /Users/dirk/github/etcbc/bhsa/tf/c
   |     1.02s T freq_lex             from /Users/dirk/github/etcbc/bhsa/tf/c
   |     1.02s T rank_occ             from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.92s T rank_lex             from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.00s Feature overview: 102 for nodes; 4 for edges; 1 configs; 7 computed
    11s All features loaded/computed - for details use loadLog()
..............................................................................................
.     55m 10s Basic test                                                                     .
..............................................................................................
|     55m 10s Top 10 frequent lexemes (computed on otype=word)
|     55m 10s W           50272x
|     55m 10s H           30384x
|     55m 10s L           20069x
|     55m 10s B           15542x
|     55m 10s >T          11002x
|     55m 10s MN           7

|     56m 05s 	21000 verses 275079 1532 63 979
|     56m 07s 	22000 verses 286437 1589 65 1007
|     56m 09s 	23000 verses 301295 1644 66 1075
|     56m 09s 	23213 verses done 304793 1649 66 1081
|     56m 09s 	  270184 accents
|     56m 09s 	    9015 cleanup
|     56m 09s 	   45235 dagesh_forte
|     56m 09s 	   21511 dagesh_forte_lene
|     56m 09s 	   59612 dagesh_lene
|     56m 09s 	   16321 default_accent
|     56m 09s 	     968 fixit
|     56m 09s 	    2658 furtive_patah
|     56m 09s 	   28195 last_ml
|     56m 09s 	    2201 mappiq_heh
|     56m 09s 	   93898 mobile_schwa1
|     56m 09s 	    2255 mobile_schwa2
|     56m 09s 	     179 mobile_schwa3
|     56m 09s 	    7702 mobile_schwa4
|     56m 09s 	   25498 punct
|     56m 09s 	   25498 punctuation
|     56m 09s 	      66 qamets_prs_suppress_qatan
|     56m 09s 	    5257 qamets_qatan1
|     56m 09s 	     243 qamets_qatan2
|     56m 09s 	    1791 qamets_qatan3
|     56m 09s 	      28 qamets_qatan4a
|     56m 09s 	     256 qamets

|     56m 25s 	Destination /Users/dirk/github/etcbc/valence/tf/c/.tf/valence.tfx does not exist
True True
..............................................................................................
.     56m 25s Load the existing TF dataset                                                   .
..............................................................................................
This is Text-Fabric 3.0.6
Api reference : https://github.com/Dans-labs/text-fabric/wiki/Api
Tutorial      : https://github.com/Dans-labs/text-fabric/blob/master/docs/tutorial.ipynb
Example data  : https://github.com/Dans-labs/text-fabric-data

107 features found and 0 ignored
  0.00s loading features ...
   |     0.23s B lex_utf8             from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.17s B lex                  from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.01s B gloss                from /Users/dirk/github/etcbc/bhsa/tf/c
   |     0.18s B sp                   from /Users/dirk/github/etcbc/b

|     57m 18s 	Done
|     57m 18s 	Phrases of kind C :  19300
|     57m 18s 	Phrases of kind L :  11681
|     57m 18s 	Phrases of kind I :   6016
|     57m 18s 	Total complements :  36997
|     57m 18s 	Total phrases     : 214665
..............................................................................................
.     57m 18s Checking enrichment logic                                                      .
..............................................................................................
|     57m 18s 	All 6 rules OK
..............................................................................................
.     57m 18s Generating enrichments                                                         .
..............................................................................................
|     57m 24s 	Generated enrichment values for 1380 verbs:
|     57m 24s 	Enriched values for 221480 nodes
|     57m 24s 	Overview of rule applications:
|     57m 24s gen

   |     0.81s T predication          from /Users/dirk/github/etcbc/valence/tf/c
   |     0.80s T grammatical          from /Users/dirk/github/etcbc/valence/tf/c
   |     0.41s T original             from /Users/dirk/github/etcbc/valence/tf/c
   |     0.61s T lexical              from /Users/dirk/github/etcbc/valence/tf/c
   |     0.61s T semantic             from /Users/dirk/github/etcbc/valence/tf/c
   |     0.43s T f_correction         from /Users/dirk/github/etcbc/valence/tf/c
   |     0.42s T s_manual             from /Users/dirk/github/etcbc/valence/tf/c
   |     0.55s T cfunction            from /Users/dirk/github/etcbc/valence/tf/c
   |     0.00s Feature overview: 111 for nodes; 4 for edges; 1 configs; 7 computed
    11s All features loaded/computed - for details use loadLog()
Time - Time - True
Pred - Pred - True
Subj - Subj - True
Objc - Objc - True
Conj -  - True
Subj -  - True
Pred -  - True
PreC -  - True
Conj - None - False
Subj - None - False
|     57m 46s SUCCESS enrich

|     58m 04s 	47395 clauses with  0 bens       constituents
|     58m 04s 	  173 clauses with  a bens       constituent
|     58m 04s 	69448 clauses
..............................................................................................
.     58m 04s Checking the flowcharts                                                        .
..............................................................................................
|     58m 04s 	No flowchart for 1543 verbs, e.g. <BC, <BD, <BH, <BR, <BR=, <BT, <BV, <BV=, <CC, <CN
|     58m 04s 	All flowcharts belong to a verb in the corpus
..............................................................................................
.     58m 04s Applying the flowcharts                                                        .
..............................................................................................
|     58m 07s 	10000 clauses
|     58m 10s 	20000 clauses
|     58m 13s 	30000 clauses
|     58m 16s 	40000 clauses
|  

   |     0.08s T crossrefLCS          to /Users/dirk/github/etcbc/parallels/_temp/c/tf
   |     0.05s T crossrefSET          to /Users/dirk/github/etcbc/parallels/_temp/c/tf
..............................................................................................
.     58m 36s Check differences with previous version                                        .
..............................................................................................
|     58m 36s 	3 features to add
|     58m 36s 		crossref
|     58m 36s 		crossrefLCS
|     58m 36s 		crossrefSET
|     58m 36s 	no features to delete
|     58m 36s 	0 features in common
|     58m 36s Done
..............................................................................................
.     58m 36s Deliver data set to /Users/dirk/github/etcbc/parallels/tf/c                    .
..............................................................................................
..................................................