Multiple roots occurred in ヤニス・スマラグディス監督 #42

Closed
KoichiYasuoka opened this issue Aug 31, 2019 · 19 comments

@KoichiYasuoka
Contributor

I just tried the sentence "ヤニス・スマラグディス監督の『エル・グレコ』であった。" and got a curious result.

% echo 'ヤニス・スマラグディス監督の『エル・グレコ』であった。' | ginza
mode is C
disabling sentence separator
# text = ヤニス・スマラグディス監督の『エル・グレコ』であった。
1	ヤニス	ヤニス	PROPN	名詞-固有名詞-人名-一般	_	3	compound	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B
2	・	・	PUNCT	補助記号-一般	_	3	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No
3	スマラグ	スマラグ	NOUN	名詞-普通名詞-一般	_	0	root	_	BunsetuBILabel=I|BunsetuPositionType=ROOT|SpaceAfter=No|NP_B
4	ディス	ディス	NOUN	名詞-普通名詞-一般	_	5	compound	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No|NP_I
5	監督	監督	NOUN	名詞-普通名詞-サ変可能	_	8	nmod	_	BunsetuBILabel=I|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_I
6	の	の	ADP	助詞-格助詞	_	5	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
7	『	『	PUNCT	補助記号-括弧開	_	8	punct	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No
8	エル・グレコ	エルグレコ	PROPN	名詞-固有名詞-人名-一般	_	0	root	_	BunsetuBILabel=I|BunsetuPositionType=ROOT|SpaceAfter=No|NP_B
9	』	』	PUNCT	補助記号-括弧閉	_	8	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No
10	で	だ	AUX	助動詞	_	8	aux	_	BunsetuBILabel=I|BunsetuPositionType=FUNC|SpaceAfter=No
11	あっ	有る	AUX	動詞-非自立可能	_	8	aux	_	BunsetuBILabel=I|BunsetuPositionType=FUNC|SpaceAfter=No
12	た	た	AUX	助動詞	_	8	aux	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
13	。	。	PUNCT	補助記号-句点	_	8	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No

Multiple roots appear, at tokens 3 and 8. It looks like some kind of bug in compound handling, but I'm not sure of the cause.
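For checking other sentences, a minimal sketch of how the roots could be listed from CoNLL-U text follows. It assumes the standard ten tab-separated columns with HEAD in the seventh; the function name find_roots and the abbreviated sample rows (only ID, FORM, HEAD and DEPREL filled in, everything else set to "_") are illustrative and not part of GiNZA.

```python
def find_roots(conllu_text):
    """Return (id, form) pairs for tokens whose HEAD column is 0."""
    roots = []
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comments such as "# text = ..."
        cols = line.split("\t")
        if len(cols) < 8 or not cols[0].isdigit():
            continue  # skip malformed rows, ranges (1-2) and empty nodes (1.1)
        if cols[6] == "0":
            roots.append((int(cols[0]), cols[1]))
    return roots


# Abbreviated rows (id, form, head, deprel) taken from the
# "ライトの兄弟オスカーは..." output reported later in this thread.
rows = [
    (1, "ライト", 3, "nmod"), (2, "の", 1, "case"), (3, "兄弟", 0, "root"),
    (4, "オスカー", 7, "nsubj"), (5, "は", 4, "case"),
    (6, "コミックブック", 7, "compound"), (7, "作家", 0, "root"),
    (8, "だ", 7, "cop"), (9, "。", 7, "punct"),
]
sample = "\n".join(
    f"{i}\t{form}\t_\t_\t_\t_\t{head}\t{dep}\t_\t_" for i, form, head, dep in rows
)

print(find_roots(sample))  # two entries mean two roots in one sentence block
```

A sentence block with more than one entry in the returned list shows the problem described here.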

@KoichiYasuoka
Contributor Author

A similar result for "ドミニコ会修道士のハインリッヒ・クレイマーはザルツブルク大司教の助手を務めた。" (roots at tokens 2 and 14):

# text = ドミニコ会修道士のハインリッヒ・クレイマーはザルツブルク大司教の助手を務めた。 
1	ドミニコ	ドミニコ	PROPN	名詞-固有名詞-人名-一般	_	2	compound	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No|NP_B|NE=ORG_B
2	会	会	NOUN	名詞-普通名詞-一般	_	0	root	_	BunsetuBILabel=I|BunsetuPositionType=ROOT|SpaceAfter=No|NP_I|NE=ORG_I
3	修道士	修道士	NOUN	名詞-普通名詞-一般	_	7	nmod	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_I
4	の	の	ADP	助詞-格助詞	_	3	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
5	ハインリッヒ	ハインリヒ	PROPN	名詞-固有名詞-人名-一般	_	7	compound	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B|NE=PERSON_B
6	・	・	PUNCT	補助記号-一般	_	7	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No|NE=PERSON_I
7	クレイマー	クレーマー	PROPN	名詞-固有名詞-人名-一般	_	14	nsubj	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B|NE=PERSON_I
8	は	は	ADP	助詞-係助詞	_	7	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
9	ザルツブルク	ザルツブルク	PROPN	名詞-固有名詞-地名-一般	_	10	compound	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No|NP_B|NE=PERSON_B
10	大司教	大司教	NOUN	名詞-普通名詞-一般	_	12	nmod	_	BunsetuBILabel=I|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_I
11	の	の	ADP	助詞-格助詞	_	10	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
12	助手	助手	NOUN	名詞-普通名詞-一般	_	14	obj	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B
13	を	を	ADP	助詞-格助詞	_	12	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
14	務め	勤める	VERB	動詞-一般	_	0	root	_	BunsetuBILabel=B|BunsetuPositionType=ROOT|SpaceAfter=No
15	た	た	AUX	助動詞	_	14	aux	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
16	。	。	PUNCT	補助記号-句点	_	14	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT

@KoichiYasuoka
Contributor Author

Another example, "ライトの兄弟オスカーはコミックブック作家だ。" (roots at tokens 3 and 7):

# text = ライトの兄弟オスカーはコミックブック作家だ。
1	ライト	ライト	NOUN	名詞-普通名詞-一般	_	3	nmod	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B|NE=PERSON_B
2	の	の	ADP	助詞-格助詞	_	1	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
3	兄弟	兄弟	NOUN	名詞-普通名詞-一般	_	0	root	_	BunsetuBILabel=B|BunsetuPositionType=ROOT|SpaceAfter=No|NP_B
4	オスカー	オスカー	PROPN	名詞-固有名詞-人名-一般	_	7	nsubj	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_I|NE=PERSON_B
5	は	は	ADP	助詞-係助詞	_	4	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
6	コミックブック	コミックブック	NOUN	名詞-普通名詞-一般	_	7	compound	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No|NP_B|NE=PERSON_B
7	作家	作家	NOUN	名詞-普通名詞-一般	_	0	root	_	BunsetuBILabel=I|BunsetuPositionType=ROOT|SpaceAfter=No|NP_I
8	だ	だ	AUX	助動詞	_	7	cop	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
9	。	。	PUNCT	補助記号-句点	_	7	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No

@TomokiMatsuno

TomokiMatsuno commented Sep 11, 2019

It seems that GiNZA treated the first input, "ヤニス・スマラグディス監督の『エル・グレコ』であった。", as two separate sentences: "ヤニス・スマラグ" and "ディス監督の『エル・グレコ』であった。". That is why there are two roots in the result.

I think this is essentially the same behavior as the example below which also has two roots.

カレーは辛い。アイスは甘い。
# text = カレーは辛い。アイスは甘い。
1	カレー	カレー	NOUN	名詞-普通名詞-一般	_	3	iobj	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B
2	は	は	ADP	助詞-係助詞	_	1	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
3	辛い	辛い	ADJ	形容詞-一般	_	0	root	_	BunsetuBILabel=B|BunsetuPositionType=ROOT|SpaceAfter=No
4	。	。	PUNCT	補助記号-句点	_	3	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No
5	アイス	アイス	NOUN	名詞-普通名詞-一般	_	7	iobj	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B
6	は	は	ADP	助詞-係助詞	_	5	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
7	甘い	甘い	ADJ	形容詞-一般	_	0	root	_	BunsetuBILabel=B|BunsetuPositionType=ROOT|SpaceAfter=No
8	。	。	PUNCT	補助記号-句点	_	7	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No
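One way to check this reading is to follow each token's HEAD up to its root and group the tokens by the root they reach. The sketch below does that for the HEAD column of the first example in this thread; the helper name group_by_root is hypothetical, and the code only inspects the reported output, not GiNZA or spaCy internals.

```python
def group_by_root(heads):
    """Map each root id to the sorted ids of the tokens in its tree.

    `heads` maps token id -> head id, where 0 marks a root.
    """
    def root_of(tok):
        seen = set()
        while heads[tok] != 0:
            if tok in seen:
                raise ValueError(f"cycle detected at token {tok}")
            seen.add(tok)
            tok = heads[tok]
        return tok

    groups = {}
    for tok in sorted(heads):
        groups.setdefault(root_of(tok), []).append(tok)
    return groups


# HEAD column of "ヤニス・スマラグディス監督の『エル・グレコ』であった。" as reported above.
heads = {1: 3, 2: 3, 3: 0, 4: 5, 5: 8, 6: 5, 7: 8,
         8: 0, 9: 8, 10: 8, 11: 8, 12: 8, 13: 8}

for root, members in group_by_root(heads).items():
    print(root, members)
```

Tokens 1 to 3 ("ヤニス・スマラグ") hang from root 3 and tokens 4 to 13 hang from root 8, which matches the two-sentence split described above.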

@hiroshi-matsuda-rit
Contributor

hiroshi-matsuda-rit commented Sep 29, 2019

@KoichiYasuoka Thank you for reporting these noteworthy examples. I analyzed the sentences and found that spaCy's parsing logic returns multiple roots in these situations. I suppose this is a kind of unexpected behavior in spaCy v2.1.x.

I've pushed a patch that revises the "root_as_xxx" type POS disambiguation, but it is unrelated to the multiple-root issue above.

I'd like to keep this issue open for future work.

@TomokiMatsuno Thanks for your analysis. I think your observation is correct.

hiroshi-matsuda-rit added a commit that referenced this issue Sep 29, 2019
#42 Use 'dep' label when root_as_xxx arises for non-root tokens
@KoichiYasuoka
Contributor Author

GiNZA v2.2.0 fixes this issue for two of the three examples I showed above, but the result for "ライトの兄弟オスカーはコミックブック作家だ。" is still not fixed:

# text = ライトの兄弟オスカーはコミックブック作家だ。
1	ライト	ライト	NOUN	名詞-普通名詞-一般	_	3	nmod	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B|NE=ORG_B
2	の	の	ADP	助詞-格助詞	_	1	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
3	兄弟	兄弟	NOUN	名詞-普通名詞-一般	_	0	root	_	BunsetuBILabel=B|BunsetuPositionType=ROOT|SpaceAfter=No|NP_B
4	オスカー	オスカー	PROPN	名詞-固有名詞-人名-一般	_	7	nsubj	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_I|NE=PERSON_B
5	は	は	ADP	助詞-係助詞	_	4	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
6	コミックブック	コミックブック	NOUN	名詞-普通名詞-一般	_	7	compound	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No|NP_B
7	作家	作家	NOUN	名詞-普通名詞-一般	_	0	root	_	BunsetuBILabel=I|BunsetuPositionType=ROOT|SpaceAfter=No|NP_I
8	だ	だ	AUX	助動詞	_	7	cop	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
9	。	。	PUNCT	補助記号-句点	_	7	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No

@hiroshi-matsuda-rit
Contributor

Could you please test the new ja_ginza-3.0.0 model with some sentences and report any analysis errors you find? @KoichiYasuoka @TomokiMatsuno

@KoichiYasuoka
Contributor Author

I just tried pip3 install -U ginza. The upgrade itself appears to succeed (ginza and ja-ginza to 3.0.0, spacy to 2.2.3), but ginza then fails to run on Debian, Mac OS X (High Sierra), and Cygwin64. The error output on Debian is:

  File "/home/yasuoka/.local/lib/python3.7/site-packages/sudachipy/dictionarylib/binarydictionary.py", line 34, in _read_dictionary
    bytes_ = mmap.mmap(system_dic.fileno(), 0, access=access)
ValueError: cannot mmap an empty file

Umm...

@hiroshi-matsuda-rit
Contributor

Are you using pipenv?
#81 (comment)

@KoichiYasuoka
Contributor Author

KoichiYasuoka commented Jan 15, 2020

No, I don't use pipenv in any of these environments: Debian, Mac OS X (High Sierra), and Cygwin64. The error output on Mac OS X (High Sierra) is as follows:

$ echo ライトの兄弟オスカーはコミックブック作家だ。 | ginza
Traceback (most recent call last):
  File "/usr/local/bin/ginza", line 11, in <module>
    sys.exit(main_ginza())
  File "/usr/local/lib/python3.7/site-packages/ginza/command_line.py", line 380, in main_ginza
    plac.call(run_ginza)
  File "/usr/local/lib/python3.7/site-packages/plac_core.py", line 367, in call
    cmd, result = parser.consume(arglist)
  File "/usr/local/lib/python3.7/site-packages/plac_core.py", line 232, in consume
    return cmd, self.func(*(args + varargs + extraopts), **kwargs)
  File "/usr/local/lib/python3.7/site-packages/ginza/command_line.py", line 375, in run_ginza
    files=files,
  File "/usr/local/lib/python3.7/site-packages/ginza/command_line.py", line 69, in run
    analyzer.set_nlp()
  File "/usr/local/lib/python3.7/site-packages/ginza/command_line.py", line 159, in set_nlp
    nlp = spacy.load('ja_ginza')
  File "/usr/local/lib/python3.7/site-packages/spacy/__init__.py", line 30, in load
    return util.load_model(name, **overrides)
  File "/usr/local/lib/python3.7/site-packages/spacy/util.py", line 164, in load_model
    return load_model_from_package(name, **overrides)
  File "/usr/local/lib/python3.7/site-packages/spacy/util.py", line 185, in load_model_from_package
    return cls.load(**overrides)
  File "/usr/local/lib/python3.7/site-packages/ja_ginza/__init__.py", line 12, in load
    return load_model_from_init_py(__file__, **overrides)
  File "/usr/local/lib/python3.7/site-packages/spacy/util.py", line 228, in load_model_from_init_py
    return load_model_from_path(data_path, meta, **overrides)
  File "/usr/local/lib/python3.7/site-packages/spacy/util.py", line 197, in load_model_from_path
    nlp = cls(meta=meta, **overrides)
  File "/usr/local/lib/python3.7/site-packages/spacy/language.py", line 173, in __init__
    make_doc = factory(self, **meta.get("tokenizer", {}))
  File "/usr/local/lib/python3.7/site-packages/ginza/__init__.py", line 50, in create_tokenizer
    return SudachipyTokenizer(nlp)
  File "/usr/local/lib/python3.7/site-packages/ginza/sudachipy_tokenizer.py", line 81, in __init__
    dict_ = dictionary.Dictionary(config_path=config_path)
  File "/usr/local/lib/python3.7/site-packages/sudachipy/dictionary.py", line 37, in __init__
    self._read_system_dictionary(config.settings.system_dict_path())
  File "/usr/local/lib/python3.7/site-packages/sudachipy/dictionary.py", line 66, in _read_system_dictionary
    dict_ = BinaryDictionary.from_system_dictionary(filename)
  File "/usr/local/lib/python3.7/site-packages/sudachipy/dictionarylib/binarydictionary.py", line 50, in from_system_dictionary
    args = cls._read_dictionary(filename)
  File "/usr/local/lib/python3.7/site-packages/sudachipy/dictionarylib/binarydictionary.py", line 34, in _read_dictionary
    bytes_ = mmap.mmap(system_dic.fileno(), 0, access=access)
ValueError: cannot mmap an empty file

@fortharrow

Incidentally, I unpacked ja-ginza-3.0.0.tar.gz and found that the system.dic in it is an empty file.

@hiroshi-matsuda-rit
Contributor

@fortharrow Right. The system.dic file is empty as shipped in the ja_ginza package; it should be overwritten when ja_ginza/setup.py runs during pip install. I'm going to investigate why that is not happening.
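To confirm the symptom on an installed copy, a sketch like the following could look for zero-byte dictionary files and check whether one can be memory-mapped. The helper names find_empty_files and can_mmap and the default "*.dic" pattern are assumptions for illustration; only the standard library is used, and the exact location of system.dic inside an installation is not assumed.

```python
import mmap
from pathlib import Path


def find_empty_files(root, pattern="*.dic"):
    """Return paths under `root` matching `pattern` whose size is 0 bytes."""
    return sorted(
        p for p in Path(root).rglob(pattern)
        if p.is_file() and p.stat().st_size == 0
    )


def can_mmap(path):
    """Return True if `path` can be mapped read-only, False if mmap rejects it."""
    with open(path, "rb") as f:
        try:
            # Length 0 means "map the whole file"; an empty file raises
            # ValueError, which is the error shown in the tracebacks above.
            with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ):
                return True
        except ValueError:
            return False
```

Calling find_empty_files on the site-packages directory that appears in the traceback, for example, would list any dictionary file that was never overwritten after installation.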

@hiroshi-matsuda-rit
Contributor

@KoichiYasuoka Thanks for testing. Could you please paste the version number of pip and the log of pip install -U ja_ginza?

@KoichiYasuoka
Contributor Author

For Cygwin64:

$ pip3.7 --version
pip 19.3.1 from /usr/lib/python3.7/site-packages/pip (python 3.7)
$ pip3.7 --no-cache-dir install -U ja_ginza
Collecting ja_ginza
  Downloading https://files.pythonhosted.org/packages/71/f1/7e5da20f57a60b4e0a7d9eee3b120ecb0600dc0eaa08350d290580bb4bc4/ja_ginza-3.0.0.tar.gz (54.9MB)
     |████████████████████████████████| 54.9MB 259kB/s
Requirement already satisfied, skipping upgrade: spacy>=2.2.3 in /usr/lib/python3.7/site-packages (from ja_ginza) (2.2.3)
Requirement already satisfied, skipping upgrade: catalogue<1.1.0,>=0.0.7 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (1.0.0)
Requirement already satisfied, skipping upgrade: plac<1.2.0,>=0.9.6 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (1.1.3)
Requirement already satisfied, skipping upgrade: cymem<2.1.0,>=2.0.2 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (2.0.2)
Requirement already satisfied, skipping upgrade: thinc<7.4.0,>=7.3.0 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (7.3.1)
Requirement already satisfied, skipping upgrade: murmurhash<1.1.0,>=0.28.0 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (1.0.2)
Requirement already satisfied, skipping upgrade: numpy>=1.15.0 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (1.16.2)
Requirement already satisfied, skipping upgrade: requests<3.0.0,>=2.13.0 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (2.22.0)
Requirement already satisfied, skipping upgrade: preshed<3.1.0,>=3.0.2 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (3.0.2)
Requirement already satisfied, skipping upgrade: blis<0.5.0,>=0.4.0 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (0.4.1)
Requirement already satisfied, skipping upgrade: setuptools in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (41.2.0)
Requirement already satisfied, skipping upgrade: wasabi<1.1.0,>=0.4.0 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (0.4.0)
Requirement already satisfied, skipping upgrade: srsly<1.1.0,>=0.1.0 in /usr/lib/python3.7/site-packages (from spacy>=2.2.3->ja_ginza) (0.2.0)
Requirement already satisfied, skipping upgrade: importlib-metadata>=0.20; python_version < "3.8" in /usr/lib/python3.7/site-packages (from catalogue<1.1.0,>=0.0.7->spacy>=2.2.3->ja_ginza) (0.23)
Requirement already satisfied, skipping upgrade: tqdm<5.0.0,>=4.10.0 in /usr/lib/python3.7/site-packages (from thinc<7.4.0,>=7.3.0->spacy>=2.2.3->ja_ginza) (4.37.0)
Requirement already satisfied, skipping upgrade: idna<2.9,>=2.5 in /usr/lib/python3.7/site-packages (from requests<3.0.0,>=2.13.0->spacy>=2.2.3->ja_ginza) (2.8)
Requirement already satisfied, skipping upgrade: urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1 in /usr/lib/python3.7/site-packages (from requests<3.0.0,>=2.13.0->spacy>=2.2.3->ja_ginza) (1.25.6)
Requirement already satisfied, skipping upgrade: chardet<3.1.0,>=3.0.2 in /usr/lib/python3.7/site-packages (from requests<3.0.0,>=2.13.0->spacy>=2.2.3->ja_ginza) (3.0.4)
Requirement already satisfied, skipping upgrade: certifi>=2017.4.17 in /usr/lib/python3.7/site-packages (from requests<3.0.0,>=2.13.0->spacy>=2.2.3->ja_ginza) (2019.9.11)
Requirement already satisfied, skipping upgrade: zipp>=0.5 in /usr/lib/python3.7/site-packages (from importlib-metadata>=0.20; python_version < "3.8"->catalogue<1.1.0,>=0.0.7->spacy>=2.2.3->ja_ginza) (0.6.0)
Requirement already satisfied, skipping upgrade: more-itertools in /usr/lib/python3.7/site-packages (from zipp>=0.5->importlib-metadata>=0.20; python_version < "3.8"->catalogue<1.1.0,>=0.0.7->spacy>=2.2.3->ja_ginza) (7.2.0)
Building wheels for collected packages: ja-ginza
  Building wheel for ja-ginza (setup.py) ... done
  Created wheel for ja-ginza: filename=ja_ginza-3.0.0-cp37-none-any.whl size=54969542 sha256=3948aa47e6e30310f8163eca1b4db32a5f0150bc00de328fab817c2e33fca849
  Stored in directory: /tmp/pip-ephem-wheel-cache-mnbr3hqx/wheels/02/2f/85/df776c49d7532d013a52d0e481e2cdbee080069e1cdd9554a8
Successfully built ja-ginza
Installing collected packages: ja-ginza
Successfully installed ja-ginza-3.0.0

@KoichiYasuoka
Contributor Author

Umm... It seems the install never accessed https://github.com/megagonlabs/ginza/releases/download/v3.0.0/SudachiDict_core-20191224.zip

@hiroshi-matsuda-rit
Contributor

@KoichiYasuoka Right. The download of SudachiDict_core*.zip does not appear in the pip log.
I'm still testing several environment variations.
Thank you so much for your kind contributions.

@KoichiYasuoka
Contributor Author

The step "Building wheel for ja-ginza (setup.py) ... done" takes only 10 seconds. That seems too fast to include a download, judging from my experience with the setup.py of UD-Kundoku.

@hiroshi-matsuda-rit
Contributor

@KoichiYasuoka I just released GiNZA v3.1.0 with some implementation improvements around sudachidict distribution. Please try it and report any errors you find.

$ pip install -U ginza

@KoichiYasuoka
Contributor Author

On Debian, Mac OS X (High Sierra), and Cygwin64 (with python37-pip python37-devel python37-numpy python37-cython python37-wheel gcc-g++ git and swig), GiNZA v3.1.0 works well. Here is the result on Cygwin64:

$ pip3.7 install 'spacy>=2.2.3' --no-build-isolation
$ pip3.7 install 'ginza>=3.1.0'
$ echo ライトの兄弟オスカーはコミックブック作家だ。 | ginza
# text = ライトの兄弟オスカーはコミックブック作家だ。
1	ライト	ライト	NOUN	名詞-普通名詞-一般	_	3	nmod	_	BunsetuBILabel=B|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_B
2	の	の	ADP	助詞-格助詞	_	1	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
3	兄弟	兄弟	NOUN	名詞-普通名詞-一般	_	4	compound	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No|NP_B
4	オスカー	オスカー	PROPN	名詞-固有名詞-人名-一般	_	7	nsubj	_	BunsetuBILabel=I|BunsetuPositionType=SEM_HEAD|SpaceAfter=No|NP_I
5	は	は	ADP	助詞-係助詞	_	4	case	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
6	コミックブック	コミックブック	NOUN	名詞-普通名詞-一般	_	7	compound	_	BunsetuBILabel=B|BunsetuPositionType=CONT|SpaceAfter=No|NP_B
7	作家	作家	NOUN	名詞-普通名詞-一般	_	0	root	_	BunsetuBILabel=I|BunsetuPositionType=ROOT|SpaceAfter=No|NP_I
8	だ	だ	AUX	助動詞	_	7	cop	_	BunsetuBILabel=I|BunsetuPositionType=SYN_HEAD|SpaceAfter=No
9	。	。	PUNCT	補助記号-句点	_	7	punct	_	BunsetuBILabel=I|BunsetuPositionType=CONT|SpaceAfter=No

@KoichiYasuoka
Contributor Author

I've just written this up on my blog as GiNZA v3.1.0で読む「ライトの兄弟オスカーはコミックブック作家だ。」 (Reading "ライトの兄弟オスカーはコミックブック作家だ。" with GiNZA v3.1.0). Thank you, @hiroshi-matsuda-rit. I'm closing this issue now.
