A Dictionary of Kagome Japanese Morphological Analyzer v2

A dictionary package of kagome v2. This software includes a binary and/or source version of data from

mecab-ko-dic-2.1.1-20180720

which can be obtained from

https://bitbucket.org/eunjeon/mecab-ko-dic/downloads/mecab-ko-dic-2.1.1-20180720.tar.gz

Feature Fields

Information about the dictionary format and part-of-speech tags used by mecab-ko-dic id documented in this Google Spreadsheet, linked to from mecab-ko-dic's repository readme.

Note how ko-dic has one less feature column than NAIST JDIC, and has an altogether different set of information (e.g. doesn't provide the "original form" of the word).

The tags are a slight modification of those specified by 세종 (Sejong), whatever that is. The mappings from Sejong to mecab-ko-dic's tag names are given in tab 태그 v2.0 on the above-linked spreadsheet.

The dictionary format is specified fully (in Korean) in tab 사전 형식 v2.0 of the spreadsheet. Any blank values default to *.

Index	Name (Korean)	Name (English)	Notes
0	품사 태그	part-of-speech tag	See `태그 v2.0` tab on spreadsheet
1	의미 부류	meaning	(too few examples for me to be sure)
2	종성 유무	presence or absence	`T` for true; `F` for false; else `*`
3	읽기	reading	usually matches surface, but may differ for foreign words e.g. Chinese character words
4	타입	type	One of: `Inflect` (활용); `Compound` (복합명사); or `Preanalysis` (기분석)
5	첫번째 품사	first part-of-speech	e.g. given a part-of-speech tag of "VV+EM+VX+EP", would return `VV`
6	마지막 품사	last part-of-speech	e.g. given a part-of-speech tag of "VV+EM+VX+EP", would return `EP`
7	표현	expression	`활용, 복합명사, 기분석이 어떻게 구성되는지 알려주는 필드` – Fields that tell how usage, compound nouns, and key analysis are organized

Licence

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.github		.github
internal/data		internal/data
testdata		testdata
tool		tool
LICENSE		LICENSE
NOTICE.txt		NOTICE.txt
README.md		README.md
dict.go		dict.go
dict_test.go		dict_test.go
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github

.github

internal/data

internal/data

testdata

testdata

tool

tool

LICENSE

LICENSE

NOTICE.txt

NOTICE.txt

README.md

README.md

dict.go

dict.go

dict_test.go

dict_test.go

go.mod

go.mod

go.sum

go.sum

Repository files navigation

A Dictionary of Kagome Japanese Morphological Analyzer v2

Feature Fields

Licence

About

Releases 4

Packages

Contributors 2

Languages

License

ikawaha/kagome-dict-ko

Folders and files

Latest commit

History

Repository files navigation

A Dictionary of Kagome Japanese Morphological Analyzer v2

Feature Fields

Licence

About

Resources

License

Stars

Watchers

Forks

Languages