Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
43 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,43 @@ | ||
Sun Feb 11 20:38:10 CET 2018 | ||
perl -I perllib/lib/perl5 -I tools tools/evaluate_treebank.pl --verbose UD_English-ESL | ||
Running the following version of tools/evaluate_treebank.pl: | ||
commit c1cb17aec6a5505b56b885f578dc6b1093bb4b30 | ||
Author: Dan Zeman <zeman@ufal.mff.cuni.cz> | ||
Date: Sun Feb 11 20:27:50 2018 +0100 | ||
Switched to branch 'master' | ||
Your branch is up-to-date with 'origin/master'. | ||
Evaluating the following revision of UD_English-ESL: | ||
commit 58f953248052639289fc7f423eef84dcffc02787 | ||
Author: Dan Zeman <zeman@ufal.mff.cuni.cz> | ||
Date: Sat Nov 11 13:54:23 2017 +0100 | ||
Size: counted 88090 of 88090 words (nodes). | ||
Size: min(0, log((N/1000)**2)) = 8.95671803824341. | ||
Size: maximum value 13.815511 is for 1000000 words or more. | ||
Split: Found more than 10000 training words. | ||
Split: Did not find at least 10000 development words. | ||
Split: Did not find at least 10000 test words. | ||
Lemmas: '_' is the most frequent lemma. | ||
Universal POS tags: 17 out of 17 found in the corpus. | ||
Universal POS tags: source of annotation (from README) factor is 1. | ||
Features: 0 out of 88090 total words have one or more features. | ||
Features: source of annotation (from README) factor is 0.4. | ||
Universal relations: 39 out of 37 found in the corpus. | ||
Universal relations: source of annotation (from README) factor is 1. | ||
Udapi: found 38763 bugs. | ||
Udapi: worst expected case (threshold) is one bug per 10 words. There are 88090 words. | ||
Genres: found 1 out of 14 known. | ||
Availability: README does not say Includes text: yes | ||
Availability: '_' is the most frequent form. | ||
(weight=0.0769230769230769) * (score{features}=0.01) = 0.000769230769230769 | ||
(weight=0.0769230769230769) * (score{genres}=0.0714285714285714) = 0.00549450549450549 | ||
(weight=0.0769230769230769) * (score{lemmas}=0.01) = 0.000769230769230769 | ||
(weight=0.256410256410256) * (score{size}=0.648308869995405) = 0.166233043588565 | ||
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974 | ||
(weight=0.0769230769230769) * (score{tags}=1) = 0.0769230769230769 | ||
(weight=0.307692307692308) * (score{udapi}=0.01) = 0.00307692307692308 | ||
(weight=0.0769230769230769) * (score{udeprels}=1.05405405405405) = 0.0810810810810811 | ||
(TOTAL score=0.351782989138511) * (availability=0.1) * (validity=0.01) = 0.000351782989138511 | ||
STARS = 0 | ||
UD_English-ESL 0.000351782989138511 0 | ||
Switched to branch 'dev' | ||
Your branch is up-to-date with 'origin/dev'. |