Skip to content

Integrate pt Wikidata into Unicode Inflection #52

@grhoten

Description

@grhoten

The revised dictionary-parser can parse Wikidata, but some issues need to be resolved.

The initial issues include:

  • The dictionary-parser output needs to be addressed
  • The unit tests need to be fixed.

Tool output that needs to be addressed:

Line 59422: Q29864688 is not a known grammeme for L488745(ele)
Line 168408: Q82799 is not a known part of speech grammeme for L1378462(ouriço)
Line 177613: Q79377486 is not a known grammeme for L43388(ela)
Line 226265: Q23663136 is not a known grammeme for L446983(gritar)
Line 349090: Q23663136 is not a known grammeme for L39470(ser)
Line 350667: Q23663136 is not a known grammeme for L52324(vender)
Line 723470: Q79377486 is not a known grammeme for L307268(elu)
Line 740810: Q23663136 is not a known grammeme for L446980(aparecer)
Line 740811: Q23663136 is not a known grammeme for L446982(gostar)
Line 800782: Q3062294 is not a known part of speech grammeme for L942478(et al.)
Line 911819: Q23663136 is not a known grammeme for L447086(cantar)
Line 1083558: Q23663136 is not a known grammeme for L446981(almofadar)
Line 1083741: Q23663136 is not a known grammeme for L448629(estudar)

Here is the current generated lexical dictionary files to debug the test failures.

pt.zip

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions