Add some notes on JA roberta
AngledLuffa committed Sep 5, 2023
1 parent 25e0f58 commit 45b3875
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions stanza/resources/default_packages.py
Expand Up @@ -548,6 +548,16 @@
# Musixmatch/umberto-commoncrawl-cased-v1: 98.56 98.45 98.13 97.62
"it": "dbmdz/electra-base-italian-xxl-cased-discriminator",

# for Japanese
# there are others that would also work,
# but they require different tokenizers instead of being
# plug & play
#
# Constituency scores on ALT (in-order)
# no bert: 90.68 dev, 91.40 test
# rinna: 91.54 dev, 91.89 test
"ja": "rinna/japanese-roberta-base",

# could also try:
# l3cube-pune/marathi-bert-v2
# or
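As a rough illustration of what this change means in practice, here is a minimal sketch of a language-to-transformer lookup, along with the score gain quoted in the comment. The names `DEFAULT_TRANSFORMERS`, `pick_transformer`, `ALT_SCORES`, and `gain` are hypothetical and are not taken from `default_packages.py`; only the two model names and the four scores come from the diff.

```python
from typing import Optional, Tuple

# Hypothetical excerpt: only the two entries visible in this diff.
# The real mapping in default_packages.py may be named and shaped differently.
DEFAULT_TRANSFORMERS = {
    "it": "dbmdz/electra-base-italian-xxl-cased-discriminator",
    "ja": "rinna/japanese-roberta-base",
}

# (dev, test) constituency scores on ALT, copied from the comment in the diff.
ALT_SCORES = {
    "no bert": (90.68, 91.40),
    "rinna": (91.54, 91.89),
}


def pick_transformer(lang: str) -> Optional[str]:
    """Return the default transformer for a language code, or None if unset."""
    return DEFAULT_TRANSFORMERS.get(lang)


def gain(baseline: str, candidate: str) -> Tuple[float, float]:
    """Return the (dev, test) improvement of candidate over baseline."""
    base_dev, base_test = ALT_SCORES[baseline]
    cand_dev, cand_test = ALT_SCORES[candidate]
    return (round(cand_dev - base_dev, 2), round(cand_test - base_test, 2))


if __name__ == "__main__":
    print(pick_transformer("ja"))
    print(gain("no bert", "rinna"))
```

Returning `None` for a language with no entry is an assumption of this sketch; it keeps the lookup separate from whatever fallback the caller prefers.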
