-
Notifications
You must be signed in to change notification settings - Fork 8
/
eval.log
108 lines (108 loc) · 10.5 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
Running the following version of UD tools:
commit 13e6b709a8bc643c3f902800321a7beda46feb8d
Author: Dan Zeman <zeman@ufal.mff.cuni.cz>
Date: Sun Nov 13 22:03:41 2022 +0100
Evaluating the following revision of UD_English-ESL:
commit 6f3b547866202071e3c37432d637713e3fb24e9e
Author: Dan Zeman <zeman@ufal.mff.cuni.cz>
Date: Sat May 14 14:00:02 2022 +0200
Size: counted 97681 of 97681 words (nodes).
Size: min(0, log((N/1000)**2)) = 9.16341413452057.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Found more than 10000 training words.
Split: Did not find at least 10000 development words.
Split: Did not find at least 10000 test words.
Lemmas: '_' is the most frequent lemma.
Universal POS tags: 17 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 1.
Features: 1177 out of 97681 total words have one or more features.
Features: source of annotation (from README) factor is 0.4.
Universal relations: 35 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 1.
Udapi:
TOTAL 34045
Udapi: found 34045 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 97681 words.
Genres: found 1 out of 17 known.
Availability: README does not say Includes text: yes
Availability: '_' is the most frequent form.
validate.py --lang en --max-err=10 UD_English-ESL/en_esl-ud-dev.conllu
[Line 861 Sent 0100_2000_6-doc403.xml-27 Node 6]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [7, 14]. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 985 Sent 0102_2000_6-doc614.xml-3 Node 15]: [L3 Syntax rel-upos-case] 'case' should not be 'PROPN'
[Line 1033 Sent 0100_2000_12-doc579.xml-6 Node 18]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 1437 Sent 0102_2000_6-doc2677.xml-30 Node 16]: [L3 Syntax rel-upos-det] 'det' should be 'DET' or 'PRON' but it is 'ADJ'
[Line 1623 Sent 0102_2000_6-doc1541.xml-21a Node 1]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 1624 Sent 0102_2000_6-doc1541.xml-21a Node 2]: [L3 Morpho goeswith-upos] The UPOS tag of a 'goeswith'-connected word must be annotated only at the first part; the other parts must be tagged 'X'.
[Line 1716 Sent 0100_2000_6-doc1902.xml-31 Node 3]: [L3 Syntax leaf-aux-cop] 'cop' not expected to have children (3:_:cop --> 4:_:advmod)
[Line 2767 Sent 0100_2000_6-doc2039.xml-20 Node 13]: [L3 Syntax punct-causes-nonproj] Punctuation must not cause non-projectivity of nodes [12]
[Line 2797 Sent 0102_2000_6-doc2514.xml-6 Node 17]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'VERB'
[Line 6381 Sent 0100_2000_6-doc280.xml-16 Node 6]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 6382 Sent 0100_2000_6-doc280.xml-16 Node 7]: [L3 Morpho goeswith-upos] The UPOS tag of a 'goeswith'-connected word must be annotated only at the first part; the other parts must be tagged 'X'.
[Line 6565 Sent 0100_2000_12-doc1306.xml-17 Node 16]: [L3 Syntax punct-is-nonproj] Punctuation must not be attached non-projectively over nodes [17]
[Line 7376 Sent 0100_2000_6-doc1759.xml-6 Node 5]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (5:_:aux --> 4:_:nsubj)
[Line 8969 Sent 0102_2000_6-doc278.xml-11 Node 4]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (4:_:aux --> 3:_:nsubj)
...suppressing further errors regarding Syntax
Morpho errors: 5
Syntax errors: 11
*** FAILED *** with 16 errors
Exit code: 1
validate.py --lang en --max-err=10 UD_English-ESL/en_esl-ud-test.conllu
[Line 32 Sent 0102_2000_6-doc874.xml-21 Node 13]: [L3 Syntax punct-is-nonproj] Punctuation must not be attached non-projectively over nodes [12]
[Line 62 Sent 0100_2000_6-doc2542.xml-25 Node 20]: [L3 Syntax punct-is-nonproj] Punctuation must not be attached non-projectively over nodes [22, 23, 24, 25, 26, 27]
[Line 757 Sent 0100_2000_6-doc818.xml-25 Node 1]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 758 Sent 0100_2000_6-doc818.xml-25 Node 2]: [L3 Morpho goeswith-upos] The UPOS tag of a 'goeswith'-connected word must be annotated only at the first part; the other parts must be tagged 'X'.
[Line 822 Sent 0100_2000_6-doc2494.xml-3 Node 27]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [25, 32]. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 942 Sent 0100_2000_6-doc2359.xml-30 Node 3]: [L3 Syntax leaf-mark-case] 'case' not expected to have children (3:_:case --> 2:_:mark)
[Line 2304 Sent 0100_2000_6-doc166.xml-6 Node 20]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 2305 Sent 0100_2000_6-doc166.xml-6 Node 21]: [L3 Morpho goeswith-upos] The UPOS tag of a 'goeswith'-connected word must be annotated only at the first part; the other parts must be tagged 'X'.
[Line 2380 Sent 0102_2000_6-doc1288.xml-10 Node 20]: [L3 Syntax rel-upos-det] 'det' should be 'DET' or 'PRON' but it is 'ADP'
[Line 2732 Sent 0100_2000_6-doc584.xml-27 Node 5]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (5:_:aux --> 4:_:nsubj)
[Line 3047 Sent 0102_2000_6-doc513.xml-10 Node 8]: [L3 Syntax leaf-aux-cop] 'cop' not expected to have children (8:_:cop --> 5:_:mark)
[Line 5792 Sent 0102_2000_12-doc558.xml-29 Node 2]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (2:_:aux --> 1:_:nsubj)
[Line 6376 Sent 0100_2000_12-doc1029.xml-4 Node 2]: [L3 Syntax leaf-aux-cop] 'cop' not expected to have children (2:_:cop --> 1:_:nsubj)
[Line 6489 Sent 0100_2000_12-doc1840.xml-4 Node 46]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
...suppressing further errors regarding Syntax
[Line 6818 Sent 0100_2000_12-doc548.xml-9 Node 13]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 10702 Sent 0100_2000_12-doc1293.xml-4 Node 14]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 11000 Sent 0100_2000_12-doc1357.xml-7 Node 13]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
Morpho errors: 8
Syntax errors: 19
*** FAILED *** with 27 errors
Exit code: 1
validate.py --lang en --max-err=10 UD_English-ESL/en_esl-ud-train.conllu
[Line 1399 Sent 0100_2000_6-doc2740.xml-4 Node 13]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (13:_:aux --> 12:_:nsubj)
[Line 3643 Sent 0100_2000_6-doc2484.xml-29 Node 9]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [7, 11]. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 4074 Sent 0100_2001_6-doc3158.xml-20 Node 19]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (19:_:aux --> 18:_:nsubj)
[Line 4732 Sent 0102_2000_6-doc515.xml-27 Node 2]: [L3 Syntax leaf-aux-cop] 'cop' not expected to have children (2:_:cop --> 1:_:det)
[Line 5333 Sent 0102_2000_6-doc2075.xml-7 Node 2]: [L3 Syntax rel-upos-case] 'case' should not be 'DET'
[Line 5727 Sent 0102_2000_6-doc1198.xml-22 Node 4]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 5728 Sent 0102_2000_6-doc1198.xml-22 Node 5]: [L3 Morpho goeswith-upos] The UPOS tag of a 'goeswith'-connected word must be annotated only at the first part; the other parts must be tagged 'X'.
[Line 6326 Sent 0102_2000_6-doc2341.xml-6 Node 16]: [L3 Syntax rel-upos-mark] 'mark' should not be 'DET'
[Line 6431 Sent 0100_2000_6-doc2049.xml-19 Node 2]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (2:_:aux --> 1:_:mark)
[Line 6600 Sent 0100_2000_6-doc66.xml-10 Node 9]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'VERB'
[Line 7470 Sent 0100_2000_12-doc241.xml-24 Node 32]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'VERB'
...suppressing further errors regarding Syntax
[Line 13692 Sent 0100_2000_12-doc241.xml-7 Node 17]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 14154 Sent 0100_2000_12-doc904.xml-5 Node 16]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 14982 Sent 0100_2000_6-doc2070.xml-44 Node 1]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 14983 Sent 0100_2000_6-doc2070.xml-44 Node 2]: [L3 Morpho goeswith-upos] The UPOS tag of a 'goeswith'-connected word must be annotated only at the first part; the other parts must be tagged 'X'.
[Line 14995 Sent 0100_2000_6-doc2070.xml-44 Node 14]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 14996 Sent 0100_2000_6-doc2070.xml-44 Node 15]: [L3 Morpho goeswith-upos] The UPOS tag of a 'goeswith'-connected word must be annotated only at the first part; the other parts must be tagged 'X'.
[Line 16544 Sent 0100_2000_12-doc198.xml-7 Node 24]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
...suppressing further errors regarding Morpho
Morpho errors: 70
Syntax errors: 148
*** FAILED *** with 218 errors
Exit code: 1
Validity: 0.01
(weight=0.0769230769230769) * (score{features}=0.12) = 0.00923076923076923
(weight=0.0769230769230769) * (score{genres}=0.0588235294117647) = 0.00452488687782805
(weight=0.0769230769230769) * (score{lemmas}=0.01) = 0.000769230769230769
(weight=0.256410256410256) * (score{size}=0.663270032336091) = 0.170069239060536
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974
(weight=0.0769230769230769) * (score{tags}=1) = 0.0769230769230769
(weight=0.307692307692308) * (score{udapi}=0.01) = 0.00307692307692308
(weight=0.0769230769230769) * (score{udeprels}=0.945945945945946) = 0.0727650727650728
(TOTAL score=0.354795096139334) * (availability=0.1) * (validity=0.01) = 0.000354795096139334
STARS = 0
UD_English-ESL 0.000354795096139334 0