This repository has been archived by the owner on Nov 9, 2021. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 2
/
README.dic
105 lines (85 loc) · 3.46 KB
/
README.dic
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
--------------------------------------------
Maintenance History
--------------------------------------------
2011/11/21 keyword files (Aizawa)
2011/11/23 add some statistics for Japanese terms (Aizawa)
--------------------------------------------
Keyword files
-------------------------------------------------------------------
-rw-r--r--+ 1 akiko staff 237234750 11·î 21 19:56 kw_cinii_sv.tsv
-rw-r--r--+ 1 akiko staff 5462352978 11·î 21 19:58 kw_cinii_jst.tsv
-rw-r--r--+ 1 akiko staff 28421718 11·î 21 20:09 kw_kaken_je.tsv
-rw-r--r--+ 1 akiko staff 23689526 11·î 21 20:57 kw_ieee.tsv
-rw-r--r--+ 1 akiko staff 31352755 11·î 21 20:57 kw_kluwer.tsv
-rw-r--r--+ 1 akiko staff 128318180 11·î 21 21:00 kw_springer.tsv
-rw-r--r--+ 1 akiko staff 14115268 11·î 21 21:01 kw_oup.tsv
-rw-r--r--+ 1 akiko staff 16545001 11·î 21 21:14 kw_sciterm.tsv
-rw-r--r--+ 1 akiko staff 401538 11·î 21 21:16 kw_ipsjterm.tsv
-------------------------------------------------------------------
=== Dictionaries Edited by Human Experts ===
kw_ipsjterm.tsv
(Terms extracted from Handbook of Information Processing Society in Japan)
(Scanned and Manually corrected. We tend to think it's safe to
use indexes at the end of published text/handbooks.)
kw_sciterm.tsv (http://sciterm.nii.ac.jp/)
(The copyright is very complex. Only for the internal use.)
kw_cinii_jst.tsv
(!-!- Special attention: No disclosure of this resource !-!-)
(These keywords are assigned by JST to each individual paper
manyally. They are originated from JST bilingual terms dictionary
which is not publicly available.)
=== Keywords by the Authors of the Papers ===
kw_{ieee,kluwer,oup,springer}.tsv
comes from NII-REO (http://reo.nii.ac.jp/), electronic journal
archive service at NII.
kw_cinii_sv.tsv
comes from NII-CiNii (http://ci.nii.ac.jp), Scholarly and Academic I
nformation Navigator
kw_kaken_sv.tsv
comes from NII-Kaken (http://kaken.nii.ac.jp), a database of
Grants-in-Aid for Scientific Research
****Format***
dbname<tab>url<tab>jterm_num<tab>eterm_num<tab>jterm1<tab>..<tab>jtermn<tab>eterm1<tab>...<tab>etermm<ret>
=== Extracted files ===
kw_jst.je
extracted from kw_cinii_jst.tsv
dictionary with frequency information
kw_nii_sv.jpn
extracted from kw_cinii_sv.tsv
kw_nii_sv.eng
extracted from kw_cinii_sv.tsv
kw_nii_sv.je
extracted from kw_cinii_sv.tsv
kw_reo.eng
extracted from kw_{ieee,kluwer,oup,springer}.tsv
kw_trans
J-E translation pair
extracted from kw_nii_sv.je, kw_jst.je, kw_sciterm.tsv, kw_ipsjterm.tsv
--------------------------------------
Statistics for Japanese terms
kw_trans.simpair.j
Japanese keywords with common English translation and EditDistance=1
kw_trans.simrule.jword
word substitution rules obtained from the above
kw_trans.simrule.jchar
character substitution rules obtained from the above
kw_nii_sv.jpn.w0.stat
kw_nii_sv.jpn.m0.stat
{word,POS} sequence<tab>f,f0,fb,fi,fe<ret>
f: total frequency
f0: the number of times <term> appears
as independt keywords
fb: the number of times <term> appears
at the beginning of longer keywords
fm: the number of times <term> appears
in the middle of longer keywords
fe: the number of times <term> appears
at the end of longer keywords
f = f0 + fb + fm + fe
--------------------------------------
Related URLs
--------------------------------------
http://157.1.128.237/~akiko/dict
http://157.1.128.237/~akiko/termext
http://157.1.128.237/~akiko/adict
(guest/i2nic)