New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some parts have no content #591

Closed
andrejrenard opened this Issue Nov 15, 2015 · 8 comments

Comments

Projects
None yet
3 participants
@andrejrenard

andrejrenard commented Nov 15, 2015

I found 2 (right part of 残 and upper part of 甬) but there can be more.
They are dead ends we cannot add to a notepad and to which we cannot add a tag

@mvysny

This comment has been minimized.

Show comment
Hide comment
@mvysny

mvysny Nov 24, 2015

Owner

Hmm, this is interesting. Kanji 戋 0x620b is not present in the Kanjidic, neither is マ (it is katakana letter MA). Therefore, I cannot obtain proper entries for those two characters and therefore they cannot be further worked with, unfortunately.
There is however something you can do: can you please ask Jim Breen to add this specific kanji 戋 0x620b to the kanjidic?

Owner

mvysny commented Nov 24, 2015

Hmm, this is interesting. Kanji 戋 0x620b is not present in the Kanjidic, neither is マ (it is katakana letter MA). Therefore, I cannot obtain proper entries for those two characters and therefore they cannot be further worked with, unfortunately.
There is however something you can do: can you please ask Jim Breen to add this specific kanji 戋 0x620b to the kanjidic?

@andrejrenard

This comment has been minimized.

Show comment
Hide comment
@andrejrenard

andrejrenard Nov 24, 2015

But there are lots of them. The best would be to develop a routine to find them and ask Jim to adapt his database. I am not sure about Jim's db. Is there only one version?

andrejrenard commented Nov 24, 2015

But there are lots of them. The best would be to develop a routine to find them and ask Jim to adapt his database. I am not sure about Jim's db. Is there only one version?

@mvysny

This comment has been minimized.

Show comment
Hide comment
@mvysny

mvysny Nov 24, 2015

Owner

I'm not sure if I understand correctly. AFAIK there is exactly one free Kanji database, the Jim Breen's Kanjidic2: http://www.edrdg.org/kanjidic/kanjd2index.html
Jim have put a lot of effort and energy into creating that database, I believe he deserves to be contacted in person. I have dropped Jim a mail, I hope he will find some time to discuss this issue.

Owner

mvysny commented Nov 24, 2015

I'm not sure if I understand correctly. AFAIK there is exactly one free Kanji database, the Jim Breen's Kanjidic2: http://www.edrdg.org/kanjidic/kanjd2index.html
Jim have put a lot of effort and energy into creating that database, I believe he deserves to be contacted in person. I have dropped Jim a mail, I hope he will find some time to discuss this issue.

@mvysny

This comment has been minimized.

Show comment
Hide comment
@mvysny

mvysny Nov 24, 2015

Owner

Andrej, could you please paste some more missing characters here? Thanks!

Owner

mvysny commented Nov 24, 2015

Andrej, could you please paste some more missing characters here? Thanks!

@jimmubreen

This comment has been minimized.

Show comment
Hide comment
@jimmubreen

jimmubreen Nov 25, 2015

Kanji 戋 0x620b is not in the Kanjidic2 database because it's not in any of the 3 JIS standards that identify the kanji used in Japan (JIS X 0208/0212/0213). See: http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=%E6%88%8B
It may seem an arbitrary limit to draw, especially as with Unicode people in Japan can, and do, use characters outside that set, but it does make the task of database management much more manageable.

jimmubreen commented Nov 25, 2015

Kanji 戋 0x620b is not in the Kanjidic2 database because it's not in any of the 3 JIS standards that identify the kanji used in Japan (JIS X 0208/0212/0213). See: http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=%E6%88%8B
It may seem an arbitrary limit to draw, especially as with Unicode people in Japan can, and do, use characters outside that set, but it does make the task of database management much more manageable.

@mvysny

This comment has been minimized.

Show comment
Hide comment
@mvysny

mvysny Nov 25, 2015

Owner

Thank you very much Jim. It now makes sense for 戋 to never appear in Kanjidic2.
So, Andrej, what now? If you wish to attach tags to that particular character, I would need to create a dummy Kanjidic2 entry, which will only contain given kanji and no other information - no onyomi, kunyomi, readings,SKIP nor any other information. I can imagine this for users to be very confusing...

Owner

mvysny commented Nov 25, 2015

Thank you very much Jim. It now makes sense for 戋 to never appear in Kanjidic2.
So, Andrej, what now? If you wish to attach tags to that particular character, I would need to create a dummy Kanjidic2 entry, which will only contain given kanji and no other information - no onyomi, kunyomi, readings,SKIP nor any other information. I can imagine this for users to be very confusing...

@andrejrenard

This comment has been minimized.

Show comment
Hide comment
@andrejrenard

andrejrenard Nov 25, 2015

Hi Martin
I think it's still worth to create the "dummy" kanjis to be able to add them to notepads, to attach them a tag and to search for their junctures.
Andre

andrejrenard commented Nov 25, 2015

Hi Martin
I think it's still worth to create the "dummy" kanjis to be able to add them to notepads, to attach them a tag and to search for their junctures.
Andre

@mvysny

This comment has been minimized.

Show comment
Hide comment
@mvysny

mvysny Nov 26, 2015

Owner

I tried to do just that, and it seems to be working. I cannot do this for the マ character though, because it is actually a katakana letter, not a kanji. However, this should work for all other parts.
Fixed in 3.39.1

Owner

mvysny commented Nov 26, 2015

I tried to do just that, and it seems to be working. I cannot do this for the マ character though, because it is actually a katakana letter, not a kanji. However, this should work for all other parts.
Fixed in 3.39.1

@mvysny mvysny closed this Nov 26, 2015

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment