documentation of the parsed results? #91

YameiW · 2023-02-28T18:12:38Z

Hello there,

I am using MeCab to parse Japanese sentences, but I am confused by the results. Is there any documentation I can read to understand the parsing output?

For instance, what does each column mean, and what do the numbers in the last column mean? Does MeCab provide dependency information that we can use to extract nominal phrases?

Any help would be appreciated!

polm · 2023-03-06T05:47:29Z

The output format depends on your config file and your dictionary.

You seem to be using the full-sized UniDic with accent information (the last column in your output), so you'll need to check your config file against the dictionary format. Alternatively, you could use fugashi, which parses all UniDic fields into a namedtuple for easy use. See here for an overview of fields.

MeCab cannot annotate the field names because they are not stored in the config or dictionary itself anywhere.
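Because the names have to come from outside MeCab, one way to work with raw output is to attach them yourself. The following is only a minimal sketch, assuming MeCab's default output layout (surface form, a tab, then comma-separated features, with `EOS` closing each sentence). `FIELD_NAMES` and `parse_mecab_output` are hypothetical names introduced for illustration; the real field list must be taken from the documentation of the dictionary you are using.

```python
import csv

# Hypothetical placeholder names. MeCab does not store field names, so
# replace these with the ones documented for your dictionary.
FIELD_NAMES = ["pos1", "pos2", "pos3", "pos4", "c_type", "c_form", "lemma"]


def parse_mecab_line(line, field_names=FIELD_NAMES):
    """Split one 'surface<TAB>feat1,feat2,...' line into (surface, features)."""
    surface, _, feature_str = line.rstrip("\n").partition("\t")
    # The csv module copes with quoted values that contain commas,
    # which can occur in some dictionaries' feature strings.
    values = next(csv.reader([feature_str]), [])
    # zip() stops at the shorter sequence, so extra unnamed columns are dropped.
    return surface, dict(zip(field_names, values))


def parse_mecab_output(text, field_names=FIELD_NAMES):
    """Parse multi-line MeCab output, skipping blank lines and EOS markers."""
    tokens = []
    for line in text.splitlines():
        if not line or line == "EOS":
            continue
        tokens.append(parse_mecab_line(line, field_names))
    return tokens
```

Passing a different `field_names` list is all that is needed to adapt this to another dictionary or config.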

Also MeCab does not generate any kind of dependency information.
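Since only per-token part-of-speech tags are available, a rough approximation of nominal phrases is to join runs of consecutive noun tokens. This is just a heuristic sketch, not dependency parsing: `noun_runs` is a hypothetical helper, the input is assumed to be `(surface, top_level_pos)` pairs you have already extracted, and the noun tag `名詞` is an assumption that depends on your dictionary.

```python
def noun_runs(tokens, noun_tag="名詞"):
    """Join consecutive tokens whose top-level POS equals noun_tag.

    tokens: iterable of (surface, pos) pairs in sentence order.
    Returns a list of concatenated surface strings, one per run of nouns.
    """
    runs, current = [], []
    for surface, pos in tokens:
        if pos == noun_tag:
            current.append(surface)
        else:
            # A non-noun token ends the current run, if there is one.
            if current:
                runs.append("".join(current))
            current = []
    if current:
        runs.append("".join(current))
    return runs
```

This ignores modifiers, particles such as の, and anything else a real phrase structure would capture, so it will miss or split many genuine noun phrases.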

In general, the official MeCab docs may be helpful.

polm · 2023-03-06T06:12:38Z

Closing this because I believe that answers your question, but if anything is unclear please feel free to follow up.

polm closed this as completed Mar 6, 2023

polm mentioned this issue Mar 10, 2023

Output Format taku910/mecab#69

Open
