This repository has been archived by the owner on Feb 25, 2023. It is now read-only.

Importing rikaichamp's dictionary? #1599

Open
rivke41levp656 opened this issue Apr 8, 2021 · 7 comments

Comments

@rivke41levp656

I want to import the dictionary that rikaichamp uses but am having trouble. It seems to use data from https://hikibiki.app, but there is no obvious way to download dictionary files from there. That site seems to store its data in SQLite files located in your Firefox profile, but attempting to import these with yomichan-import results in:
Conversion process failed: no such table: dict

@toasted-nutbread
Collaborator

I haven't looked deeply into this, but check out https://github.com/birchill/hikibiki-data and see if there's anything meaningful in there.

@Thermospore
Contributor

At a glance, both rikaichamp and https://hikibiki.app just use JMdict. You can either:

  1. download a prebuilt copy of JMdict from the yomichan info page
  2. use yomichan-import to build your own copy of JMdict at your leisure (which is what I like to do; it's a very active project)

There are a bunch of derivatives of that rikai yaddayadda extension, so I haven't kept track, but rikaichamp might also include the KireiCake/RikaiCake dictionary. To be brutally honest, it's kinda trash. Even if it has something not in your other dictionaries, you are probably better off googling the word or dropping it.

@rivke41levp656
Author

I don't think that is the case. It is mostly JMdict, but there are differences. Rikaichamp generally has more detailed entries and is much more comprehensive, particularly w.r.t. slang terms. For example,
だんとつ : yomichan with my compiled JMDict does not read it, instead picks up only だんと
rikaichamp's entry: 断トツ [だんトツ, ダントツ] (n,adj-no) (uk,abbr,sl) far and away the best; the very best; cream of the crop; crème de la crème; decisive lead; commanding lead (from 断然 + トップ)
I don't know where the difference is coming from, but I don't think it is KireiCake, as yomichan with KireiCake produces the same result as JMdict in the above example. Both my KireiCake and JMdict versions are probably old, but searching for だんとつ on https://www.edrdg.org/jmdictdb/cgi-bin/srchform.py?svc=jmdict&sid= (which I assume is the most recent version) produces no results, as does searching for it on https://kireicake.com/rikaicake-online/.

@Thermospore
Contributor

> For example,
> だんとつ : yomichan with my compiled JMDict does not read it, instead picks up only だんと
> rikaichamp's entry: 断トツ [だんトツ, ダントツ] (n,adj-no) (uk,abbr,sl) far and away the best; the very best; cream of the crop; crème de la crème; decisive lead; commanding lead (from 断然 + トップ)

image

That's JMdict. Technically speaking, JMdict doesn't list だんとつ as a reading or form of that word. If you search 断トツ, だんトツ, or ダントツ in your yomichan, it should come up. You can enable some options to have yomichan check other formats automatically (highly recommended). Here are my settings. The option I marked with a red dot converts だんとつ to ダントツ, which successfully pulls up the entry. This is probably what Rikaichamp is doing.

image
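For anyone curious what that kind of conversion amounts to, here is a rough sketch. It is not Yomichan's or Rikaichamp's actual code; it only relies on the fact that the hiragana and katakana Unicode blocks are laid out in parallel, 0x60 code points apart. The function names and the toy `headwords` set are made up for illustration.

```python
from typing import Optional, Set

HIRAGANA_START = 0x3041  # ぁ
HIRAGANA_END = 0x3096    # ゖ
KANA_OFFSET = 0x60       # distance from a hiragana code point to its katakana twin


def hiragana_to_katakana(text: str) -> str:
    """Replace every hiragana character with its katakana counterpart."""
    return "".join(
        chr(ord(ch) + KANA_OFFSET)
        if HIRAGANA_START <= ord(ch) <= HIRAGANA_END
        else ch
        for ch in text
    )


def lookup_with_variants(term: str, headwords: Set[str]) -> Optional[str]:
    """Try the term as typed, then its katakana form; return the first hit."""
    for candidate in (term, hiragana_to_katakana(term)):
        if candidate in headwords:
            return candidate
    return None


# Toy stand-in for the forms JMdict lists for this entry (hypothetical index).
headwords = {"断トツ", "だんトツ", "ダントツ"}
print(lookup_with_variants("だんとつ", headwords))  # prints ダントツ
```

The plain だんとつ lookup misses because that exact string is not among the listed forms; only the converted ダントツ hits.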

> yomichan with KireiCake produces the same result as JMdict in the above example

The "KireiCake" on the yomichan info page is actually a frankenstein of the upper RikaiCake table and a very old copy of JMdict. I think it was ripped from rikaichan? IMO it should either be removed entirely or replaced by a standalone version of RikaiCake that doesn't contain the old JMdict.

@Thermospore
Contributor

> (from 断然 + トップ)

You are right, those types of notes are missing from yomichan-import's JMdict builds. I made an issue suggesting that they be included a while back: #1165

Glad to see other people have interest in this as well!

@rivke41levp656
Author

OK. You're right about the hiragana-katakana conversion; I didn't consider that. So I guess the differences between dictionaries come from either discrepancies in JMdict version (Rikaichamp autoupdates as it is web-based) or the fact that yomichan excludes those parenthetical notes. I agree that they ought to be included.

@FooSoft
Owner

FooSoft commented Apr 12, 2021

This must be a different SQLite schema. yomichan-import currently only supports SQLite imports for Rikai-style databases, which probably differ in some insignificant way from what you are trying to load.
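One quick way to confirm a schema mismatch is to list the tables the file actually contains. The sketch below uses Python's standard `sqlite3` module. The only name taken from this thread is the `dict` table from the error message; the `words` table and the helper names are hypothetical, and this says nothing about how yomichan-import itself reads the file.

```python
import sqlite3
from typing import List


def list_tables(conn: sqlite3.Connection) -> List[str]:
    """Return the names of all tables in the database, sorted alphabetically."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [row[0] for row in rows]


def has_dict_table(conn: sqlite3.Connection) -> bool:
    """Check for the table named in the 'no such table: dict' error."""
    return "dict" in list_tables(conn)


# Demo on an in-memory database with a made-up schema (hypothetical table name).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (id INTEGER PRIMARY KEY, data TEXT)")
print(list_tables(conn))     # prints ['words']
print(has_dict_table(conn))  # prints False
conn.close()
```

To inspect a real file, open it with `sqlite3.connect(path)` instead of `":memory:"`. Note that `connect` silently creates an empty database if the path does not exist, so double-check the path first.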
