-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Future feature requests #10
Comments
(This is my third post in case you are reading them in different order.) #1 is simple enough. The program is not very command line friendly right now, but a simple help flag is understandable. Just in case, currently only -i or -e (-ie for combined) are accepted flags for importing the dictionary or examples, as I wrote in the install instructions. #2: The annotation descriptions from JMdict wouldn't be very useful, because I renamed them to something I found much more sensible. Many were joined too. I don't know if they were added like this for linguists or for algorithms to handle them better, but changing them was the only option to make them a bit user friendly. The program help is completely missing for now, but it's sensible to have at least the annotations listed somewhere. #3 Adjustable font sizes by pixel might be the best, but it's faster and cheaper to add the "very large" option. If someone needs an even bigger size after that, they probably need interface scaling as well. #4 I always wanted searchable examples, but in the end I'm not sure if that isn't an overkill. The ability to list the examples for a given word in a separate window, and allowing to see them all in a list might be more useful. Adding a name dictionary is among my plans too. #5 When I first designed zkanji I started out with a regexp search, but abandoned the idea after not finding much use for it. It's not difficult to make, since I keep a map of every single word for each separate kana character. I use that for the ?+text-in-middle+? searches. I need real life examples where this might be useful though. |
For #1, aware of those being the only ones listed at the instructions - but still, appriciated you concider adding it so it's allways at hand. Thank you. For #2, thank you for concidering adding it - it's a helpful feature which saves users quite some headache. For #3, seems we're on same page - thank you a lot. Go with whatever's easy to implement. For #4 & #5, this gets a little bit long and more personal than objective - please bear with me. A good way to learn when to use what wording is per example. Being able to browse and search examples is very helpful - and I can speak for myself too as a fairly advanced foreign speaker who didn't learn Japanese through glossary - those examples are very useful. But the issue is finding the right examples as there are too many. WWWJDIC has a feature for this, but very basic without regex and it's in the smartphone app as well. Can get hundreds of examples where none is what one was looking for due to the non-regex style. (Side note: Personally, for the time being, I open a terminal and run grep -iE PATTERN on the raw examples-file and set up an alias just the other day for acquaintances who use Linux.) This is where the #4 and #5 connects. Having this feature on a cross-platform desktop dictionary which can be used offline would be a true killer feature for those learning Japanese. Would greatly improve their chances of finding how to express themselves properly - as well as it's great for Japanese natives who study other English. Mentioned in post above, 御馳走様「ごちそうさま」, came up while dining togather the other day - as I learned my Japanese in 関西 (Kansai), there's a bit of an issue with me adapting their infamous accent... so, when they heard me they intrepeted it as 「ごくそうさま」 where the 「う」 in 「く」 had been dropped. So they opened zKanji on the computer to look it up and got really frustrated as they couldn't find it. Words ending with 「そ」 or 「そう」 are plenty... and they also used the English part to try find but you get a lot when you type "meal", and as they (unlike me) aren't native English speakers (although very good at English) they typed things like "Thank you for the food" etc. but not "What a wonderful meal"/"That was a delicious meal"/"said after meals" which ment they got irritated and eventaully gave up. To actually look things up oneself is a great way of learning, as that effort triggers the brain to actually remember - whereas asking and just getting an answer may be forgotten in a flash. This also connects to the final item at #4 - as the source support other languages as well it'd be very neat to be able to import these as well upon running the -ie flag. Another occation when I personally would like a regex feature is when translating texts, to find appropriate synonyms and expressions without doing 10+ individual searches for a single purpose. It's also great way to narrow down when unsure about spelling or wording. |
Would also like to add an additional request, which could ease up learning for people as it's overlooked - yet super useful. -- Future feature request #6 --Radical dictionary/browser, new and old system. Some form of easy browsable mode dedicated to radicals would be a great help for many - and especially for those learning Kanji as one no longer need to remember stroke-by-stroke-by-stroke, but simply 人(偏)土寸 to write 持 - which also helps with remembering meaning and (some) readings. Another benefit of knowing the radicals and using them as base for learning Kanji is when on-site talking with the natives and asking like "Oh, that word - how do you write it?" and if there's an unknown Kanji involved it'll be easy for the native to explain how it's written. |
The example sentences search means adding an entirely new feature too which would be lengthy to make. No data is generated for this currently. I won't say no for certain, but right now I have no plans to do this. If you want to contribute I can help to integrate it with zkanji. The regexp search is possible to make reasonably fast, but only for kana readings and not in English at all. This is limited to the dictionary however. Is the radical search already in the program not adequate for what you suggested in #6? It misses the meanings for each radicals, but that could be included as well. The third method includes the radical names and those can be used for searching. I plan a new type of kanji search, that would allow people to select the components and even their placement in the kanji. The stroke order data is made up of parts already, which you can check in the currently crashing kanji information window. (You can see this in the old version.) I don't know if this would be adequate but I might have misunderstood the feature request. |
Indeed, examples and radicals would both require effort and therefore be left for later. Regex for kanji, English and others I can take a look at how to implement. I may make an outline which you can optimize and implement for these after I've had the time to go through the complete source, but for now I will focus on helping with debugging and solve install and the auto-update. Simple regex for kana readings would be plenty helpful, and a good start. As for #6, how much effort would it be to just add a pop-up description at "Parts" & "Radicals" as well containing reading and meaning for now? And, if there's data, also include first occurance for JLPT and 常用 respectively? If there's no data available don't bother for now, I'll look at it when I take a look at examples, radicals and general regex if so. As for the radical search mode, there seem to be some irregularities for me at "Parts" take a look at the screenshot: ...and you should notice there are more strokes than there should be here and there - 10, 15 & 22 among others . -- Addition --I know this is being a bit picky with details, but one small detail caught my attention: |
I have added a Very large dictionary font option. This might still not be enough for big monitors, but it already looks huge compared to the rest of the interface. #6 There's a "names" toggle-button for writing them under the "Radicals and parts" tab, but tooltips were planned too, I just didn't do it yet. Adding meanings apart from the names requires additional data I don't have. If you can provide data, it could be displayed in tooltips or even used for searching between the radicals. The irregularities you mention in the radicals window is due to default fonts not having the required glyphs on most systems. Even if there exists a UTF-8 code point for them doesn't mean the font can display them. The characters displayed are the closest match that likely to appear in fonts. Instead of figuring out which fonts to use here, it would be best to either use the stroke order diagrams I already have to draw these radicals, or make SVG pictures for each of them. My vote goes for the SVG but it might take a little work to do it. I will answer the other issues and solutions you brought up too maybe tomorrow or after that. |
Answer things as you got the time to. Got some questions/requests/issues from my acquaintances I'll simply be forwarding, only forwarding those which I've confirmed for now - haven't done debugging yet. Questions came after the "zkanji.zks" and "similar.txt" fix for kanji-dialog had been applied. NOTE: The "kanjipen.svg" is icon for the button at the "Japanese-to-English" which cause the initial crash, and will cause crash every time it's clicked. I was able to reproduce this with the GIT-build from 2017-12-28, however when trying a fresh GIT install a few hours after your response on the 2017-12-31 the window will simply crash on click - but the Radical-search will be unaffected afterwards. Got some more feedback from them, will post here when I checked things out. Ain't got the time now. |
I'll move the crash/hang issue to the bug-thread I made earlier. |
Regarding data for radicals, I will double-check the data when digitilizing before providing as it's a bit old and contains referenses to 常用 and JLPT levels. Also, I think I got the radicals as either SVG or PNG somewhere... not sure which format, but I'll check on it. |
|
For #3, The addition of "Very Large" as font option is great. Looks good on screens ranging from 15" laptop to 27" stationary at 1920x1080. Small 4k screens got interface-scaling option so should be fine. I think this can be crossed of the list now. Regarding retaining saved size, you probably tried but just in case you haven't: (Wishing I've had the time to go through all the code... but probably gonna be a few more weeks before I've done that.) -- Future feature requst #7 --I made a mod-proposal yesterday in accordance to the requests I forwarded to you. Saw now you had tried, I'll see what I can do to try solve this layout thing. As I'm unfamiliar with GUI-programming in general, is it plausible to fully adjust layout using IF-clauses in QT or just to a limited extent? -- Future feature request #8 --"Filter"-buttons @ the Kanji Widget, as well as "S.O.D", "Words", "Stroke shadow" etc buttons @ Kanji Information Dialog (floating) - would it be reasonable to add an "active" frame or "hover" effect on these when they are active? To contrast them from inactive. Nice touch that when "Words" is active, some buttons goes inactive. |
Everything can be changed programmatically. Some changes are simple, others are not. In Qt we have "layouts" that try to automatically place things depending on their default size. It sometimes wants to be too smart for its own good, or the default size is wrong, this is causing trouble restoring sizes at start-up too. There is no way around this layout system, that's why it can be so difficult sometimes to force on it what we want. What makes things more difficult is that Qt calculates positions and sizes lazily, only making final calculations when a window is shown. There are workarounds for this but again it's not always easy. What you see in the designer is often not how the final result will look like, but at least it helps us to know what result we want to achieve. Can you please show what you mean by horizontal and vertical use of the kanji search? For #8, buttons do have a different appearance depending on whether they are active/pressed or not. I wonder if there's an issue with the theme? Can you experiment in the designer what makes buttons look different? Mainly by changing the "auto-raise" and "checked" properties. |
Lets just ignore the docking retain-size thing then, seems too much effort for small detail. Maybe one day as "finishing touch", if at all. I will take a look at #8, using QT Designer or QT Creator later, gonna fully digitalize the radical-info and see if I can find those radical images. I probably got the time for the radicals during the weekend, as I do it I'll make them as easily parsable tables in text files with the new radical index as unique key (252-system), the old radical index (214-system) will be present as well ofcource. Readings separated into preferred and optional. First line will be a header defining columns. Btw, do you prefer TAB, COLON or SEMICOLON as separator? -- Kanji Widget: Horizontal-/Vertical-mode --People have different preferances for layout, and some uses vertical screens (i.e. 1200x1920) while others horizontal (i.e. 1920x1200), or even flipable - and some "split" a single screen into several pieces keeping multiple applications visible at once - leaving people who wants several features docked at once wanting. This is why the topic was brought to my attention in the first place. Proposal:
Attaching various screenshots, and drafts in an archive: As for Vertical-mode, if the "From: " is above or below the shown Kanji is no big deal according to the feedback, but parhaps logical to keep all filters near eachother, thus placing it above? I don't know. |
I made the kanji search widget slimmer, I hope this will be enough. I removed the "From" label completely, added a toggle button to show and hide all filters, and moved the "Reset" button next to the filter controls, to make more space at the top. The combo-box for the "From" field is now moved next to the "Order" box in case there's enough space. I don't want to go slimmer than this but I hope it'll work for everyone. Moving the filter controls next to the toggle buttons is a bit more work, so I'll pass with that for now. |
Got some feedback on the "study"-mode... but will get back to that later as I haven't gotten a clear idea yet of what they actually ment, was a bit confusing... Once I've cleared things out and tested I'll let you know, though - prioritizing the radical data atm. -- Update on #4: Radical dictionary/browser --The data file is more time-consuming than I had thought, digitalized and verified roughly 50% (maybe a bit more, but just to be on the safe side...). Gonna add a documentation header for the file too before providing. -- Update on Kanji Widget --I've checked out the new layout and it looks good. Gotten some feed back, and it's all possitive - better than before. If, on day, the filters can hop up next to the "toggle"-buttons it'd be perfect they say. Good job! -- Pre-release request: #1 "Help"-flag --Would you mind including it in this release? It's useful not only to see optional flags, but also for generating generic start options etc - i.e. info, dictionary re-build/update etc. I'm aware there aren't much options at the moment, but still... |
I'll add the help flags, and make them shown for either of these flags: --help, -h, -?, /help, /h, /? |
I have added #1 help flags. Please test it. It doesn't work on Windows because there it's more complicated to have a GUI application that prints to a console, but the average user there rather reads text files anyway. |
Great, I will make sure to check it out during the weekend. Rather than a Windows-thing, the dash is a legacy from DOS. Remember using that back in the 90's. On a side note: Well, it's expected - Microsoft does all they can to restrict their users and mold them into the "Microsoft mold". NT just enchance the limitations compared to the old DOS days. While, on the other hand, Linux is freedom - to a point which easily overwhelms people who are used to restricted environments. Many Linux distribution do apply sort-of limitations too, primarily to not scare off people who aren't allready super users - making it very beginner-friendly. |
Thank you for implementing the "help"-feature. Paste below into a text-editor supporting syntax highlightning, such as kate or gedit, to see the difference more clearly. (Set highlight to Script -> Bash, or SHell-script.) Also, help section is per default limited to max width: 80 chars
NOTE(s): (Please excuse the somewhat "random" order.)
|
I received your mail of the radical example and I have questions if you don't mind. Which part of this data should be displayed in your opinion? Is it important what level of JLPT or Jouyou kanji they first show up in? I would like to display as little data of radicals in zkanji as possible that's still useful, since zkanji is just a study aid. If more details are necessary (I can imagine people in official Japanese education might be forced to use them,) it will only come at a much later version of zkanji. Do you have reliable JLPT data using the new 5 ranked system? I had data for the system where there were only 4 ranks, since these were officially published, but I haven't heard of an updated one. The data in zkanji was compiled by me personally, from several textbooks made a year after they introduces the system. They just gave examples of what goes to the N3 rank, and topics around them. My list is as good a guess as any, but it's probably very different from your data. Since there already is a list of radicals that allow filtering kanji to be displayed by them, isn't it the logical step to use that to show which radical is used in which Jouyou level? Last one for now! Is the listed names in the third method for radical search incorrect or missing any data? They didn't have translations for sure, which is a welcome addition though. |
At the radical search, displayed info should be:
Optionally include: Also concider a: As for the search-by-radicals, for simplicity, set your algorithm to ignore LIST-item beyond first. Else you'll end up with some 400 parts - where most are just a slimmer version - instead of the basic 214/252. Those additional ones and the other fields in the data are primarily ment for, what I will refer to as the "Radical Dictionary"-widget, addition which I mentioned earlier I'd like to make and include for those who wish to be able to browse and dig into radicals. As my my file doesn't cover JLPT - I cannot guarrantie reliability of the N-standard data. For the time being, I line it up with your data - digging into this later as it's gonna take plenty time. Time which I don't have atm - and this data is sort-of irrelevant until the "Radical Dictionary"-widget is comming around. Mentioned this earlier among requests. In regards to the file which contains the list of radicals for filtering, it's what I'd like your point of view on if to use generic data - as mentioned in the mail. The problem with N-standard is that it's subject for changes on a year-to-year basis if I'm not misstaken. There are loads of smartphone apps which has data on this, and I speculate they got the same issue - although not confirmed. The WWWJDIC-app is likely to use the same data as the site, and it has data on the five N-level system. However, the data on N-levels is for GUIDENCE in order to give the users a so-called priority-order of sort for which radicals are important to focus on - and roughly when. It's kind of pointless learning the flute-radical, 214/252, allready when being introduced to elementary 1 or N5 - it's this sort things the data on J- & N-appearance is to be helpful with. I will check on the names in the 3rd method, but you may have to be patient until next weekend. Gonna have a tough week, so I may not have time to answer or help out the upcomming few days. I noted scaling on the radicals dissapeared when that bug was fixed... More difficult to see them now, suppose this played a role in resolving the bug. Would it be possible re-implemented if there's a full set of SVG's for radicals later? Depending on licence terms, there's a project which could be helpful for radical images & strokes - the Kanji-Alive-project has made a dedicated radical font (Apache v2 licence), not sure about the terms but worth taking a look at parhaps? They also got 247 radical SVG's and radical animations (730 files) which I think is under the Creative Commons v4 licence. Take a look and see if conditions allows for a time-saver? Regarding the RadicalData-file structure, licensing etc. Anything you'd like to change, add or remove? Any data-field missing? Also, may I request your view on the LOCALE-thing? Random side note: I miss the counters at "Kanji Search"-widget which were present in the old v0.731. |
Showing both the translations and names for radicals at the same time doesn't seem too bad to me. It could be similar to the kanji popups, just with less data. I agree that this should be a setting though, but I'd rather put that in the settings window instead of a button on the radical selector for consistency with common interface designs.
I don't want to limit what zkanji can do, I just want to bring up my point of view in the matter. To me the program is mainly a study aid for those not in formal education of Japanese, with access to digital resources, who "just want to learn the language." I never had any use for radicals, which doesn't mean others won't find this data useful.
The USAGE_IMPORTANCE data can be generated from the dictionary, since I have data for what radical appears in which kanji. Please consider that this data only refers to the 6000+ kanji found in KANJIDIC. I think it's enough, but my take on what the program is for is what I wrote above.
The stroke order data is completely my work, though the classification of elements (something only visible in the stroke order editor in previous zkanji versions) was taken from another project at the early days. I would like the data to stay non-commercial, but that's all my restriction on it. I didn't really publish a license for it so legally it's still private I think, but I guess this doesn't matter regarding zkanji.
I just done a fast search again on this matter. There is no official N list for ANY of the kanji. They said at the 2010 change of the system that the old JLPT1-4 match N1, N2, N4 and N5. Everyone uses these lists, and the N3 is an estimate. If they use the same data that only means they all took the same list compiled by a single site. When I passed JLPT2 in 2009 (still the old system, but with official lists) they said at the time that on any level, there are 20% kanji/vocabulary they just randomly take from the more difficult levels. I would guess they still do that, so the changing requirements might be nothing more but the same thing.
I use 0.75*[square height] point size for radical fonts, scaling them down until they fit. My "fix" just lowers the point size until the fonts fit. If they don't fit ever, the last known valid size (or the initial size) is used. I still don't know what caused the error, apart from that the measurement probably gave incorrect results. This is in Qt so I can't do anything with that. If we switch to a different font that's proven to work or to SVG drawings, this problem will be fixed.
I would like to use as few outsider resources as possible. The Apache v2 licence (according to their official website) is in theory compatible with GPLv3, but there are restrictions and the wording doesn't make it clear whether this allows or doesn't allow in my specific case the use of their data, and I'm not a lawyer.
The only data needed to be able to show radical names and translations is radical number and the text data for them. The first line which lists "ID_UNIQUE;ID_NEW..." etc. is unnecessary, unless you plan to reorder the data or insert new fields in the middle. But in that case you might want to fix the missing semicolon between "RADICAL_ORIENTATION REF_KANA" for example. The rest depends on what features you need for the radical dictionary. This line contains a character that I don't have the font to display:
With the current data format, as you write in the description, if they provide a separate file, it can be used later fine. Although it might be worth changing the format to be extendable in case we want to distribute a single file with all translations. For example extra translations could come after the last standard data with the format [LANGUAGE CODE]:translation (i.e. "EN:straight line") There should be a restriction on the translations, as they can't contain some characters, like the semicolon.
This is planned, but wasn't priority with all the bugs and changes. Adding a "status bar" with Qt is a bit complicated, which made me postpone this feature. |
@ Radical-popups: @ Limitations: @ Generic data for radicals: @ Fields in radical data: @ Locale: @ zkanji.zks & radical-SVG's and font: Side notes @ Use of radicals: Off-topic: |
The radical data is ready for final check tomorrow. |
Please excuse me for not answering sooner but I couldn't find the time. I'll check the radicals you sent me.
When they introduced the 5 level system, they said the new middle level will just be stuff from the previous JLPT2. To translate this to the levels:
I see that you use the ID_UNIQUE in RADICAL_REFERENCE, which answers my previous doubts. It doesn't need to be used as the SVG file names. Please see my idea regarding this below. A question about duplicates, for example:
or
What should the program display as radical name and meaning in such cases? Also should we display the optional meaning in the tooltips, or you only mean it to be used in the radical dictionary? The RADICAL_OPTIONAL almost always contains a character that my system cannot display, just the placeholder box character is shown. As you mention in the TO-DO, if we use SVG images to display them, will there be a separate file that tells us which unicode code-point is which SVG? Maybe after the SVG files are done, we could just name each SVG file after a unicode character as hexadecimal, like
For each stroke order in the data, it's possible to specify a unicode character, and a name. The data potentially already contains all the radical stroke orders, and I have stroke order data there for the kana, You can see that by checking the kana tests in the study menu. The stroke order diagrams are made up of parts, which are other stroke orders, all that's left to do is to specify which unicode character they are, and then it'll be possible to show an animation for them. What is more, my original idea was to replace the current display of radicals with the S.O.D. but it doesn't look nearly as good as a font or SVG. Or bluntly put, it can be really ugly.
If you mean the stroke number displayed on the animation, this data is generated on the fly when displaying the stroke order diagrams. It tries to pick a fixed position near the starting point, and if all pre-defined positions are already taken, the number might be moved by a small distance. If this is not possible, that number won't be displayed. |
Do you mean you need to study readings and meanings of radicals to be able to talk with Japanese people about your language study? Or maybe you're in formal education where it's one of the requirements? When I took the JLPT it was, and to my knowledge still is only a multi-option test. Fill in the circle and you're done. Wherever you take it, the test paper is sent to Japan to a central location, and the answers are checked automatically. They don't do it by hand. |
It's not mandatory for what I know, but as previously mentioned - when talking with locals and unsure how to spell something, it's really helps if one know the radicals as they can explain in a flash in an a way which is easy to remember - no need for pen & paper, digital devices etc. See "For example..." below. As I believe I mentioned earlier, it also helps figuring out potential meanings/readings of kanji one has yet to learn. Radicals also reduce the amount of things one need to remember when studying, and one will also know the fail-safe way to allways draw correctly - even if it's the very first time encountering the specific kanji in question. For example, most kanji consist of 2-5 radicals - but often 7-20 strokes. so learning a high-radical low-stroke like "前" (which is early stage kanji) if going per radicals (253) it's: It's very likely this kind of logic which, supposably, created the characters back in the old China. Off-topic: |
Thank you for further clarification on the JLPT. Lets drop the 4-level system info and base things on your data for the 5-level system in combination with 常用 - and with a notice for the users regarding N3 & N2. @ Radical Data
They look near identical at first glance if using small font, however they are drawn differently and have different meanings. It's similar with "川" and "巛", they have different usages - I got comments on this in my data which I decided to not put in the file for the time being as that data refers to general guidance and train of thought as to when which is used. I will include this later in a separate file as it may be useful for those who wish to study, where I use the ID_UNIQUE as KEY for linking. Another reason to not include this in the same file is that some have extensive info while others got none at all as it's not needed.
RADICAL_OPTIONAL field got the correct glypf in 99~100% of the cases, and majority of fonts does NOT contain these even though they are in the UTF-8 definition. Most of them are squeezed versions of the primary appearance, some are completely different - I will make SVG's for all. @ SVG & Stroke order PS: If you're busy then you are, no worries. |
Thank you for your explanation regarding radicals, though I'm aware of it. My personal experience with learning kanji without consciously studying radicals, is that it's enough to know what radicals generally are, and to recognize which parts might be radicals. I can guess the reading of many unknown kanji, because the brain is built to recognize patterns, and radicals are exactly that. If a student is told that radicals can have the property of holding the reading to a kanji, they can figure out the rest when they expand their vocabulary. The same is true for stroke order. There are some special cases when the order is not easy to find, like the well known 左 and 右, because they evolved from different sources, but in general stroke order is the easiest part of kanji learning. It is true though, that during my years of reading/writing Japanese, many radicals with their meanings and stroke order stuck in my head. I myself enjoy studying kanji, and reading is fun too, so I'm always disappointed when an author uses hiragana too much, but an alphabet has its own advantages. It might just be random symbols stuck together without meaning, but if you learn a language with an alphabet from native speakers via listening, you don't have to learn to specifically read that language to be able to read. (Apart from strange languages like English, because the spelling doesn't make any sense...) In many languages (which are not English,) letters can be read without knowledge of the word, to find out the pronunciation. When it comes to Japanese and kanji, it is undeniably daunting for a beginner. It's only the first step to get over the thought that it's "impossible" (many people I knew who were learning Japanese felt that way.) It takes some time to read about the writing system, and to understand what that information actually means. I heard conflicting opinions from people who have on-site experience in Japan. Someone told me that even the Japanese he met have no idea how to write, many can only write a few hundred kanji at best and read maybe a thousand (those must be the uneducated people I guess.) It is true though, that even intelligent and educated people need to look up how to write an obscure kanji sometimes, like the word 薔薇. I'm not sure our essays are great to write in "GIT issues." Is there a forum feature here somewhere?
By duplicate I meant either the "old radical number" is the same in two separate lines or the "new radical number" is. When I hover on an "old radical" in the radicals window, what should be displayed? My guess is "both," but I asked just to make sure.
I see. If you can make the SVG images, we won't need the current "wrong" characters, and only show the optional (or rather real?) ones. I think optional should be renamed as "real" or "correct" radical, and instead of including the character itself, there could be just a code-point in hexadecimal if available. If the editing is done by hand, I can fix it with a script.
It's very likely that this specific radical is not in the unicode standard, because the non-classical radicals used in zkanji are not an official radicals listing. They were created by someone for his Japanese text editor. The page for radkfile gives some details about this. It is useful because it is more detailed than the classical radicals, but it is most probably never used in Japanese-published dictionaries.
The only support in Qt for SVG is displaying them by the standards. It doesn't even have common implemented features not in the strict standard. I don't know if it even supports layers, but hopefully it does. I will test this. I use Inkscape for the icons too, and none of the fancy effects show up in Qt, so I was forced to not use anything else but normal fill and outline, or the gradients (with transparency.) Outline or stroke width works at least. Keeping text in an SVG is in general a bad idea since the target OS might not have the font installed. They must be converted to path first. This is unrelated to radicals, but can be good to know for future reference. I forgot to answer about the document size for the SVG. The size doesn't matter, but please make sure the radical touches or at least is close to the edges of the document, because if I specify which rectangle the SVG should be painted in, Qt will use this rectangle as the document edges. It also distorts the sides if they are not the same ratio in the editor as the destination rectangle. |
With experience, developing this type of recognitive ability is inevidible I believe - and my guess is it's part of the reason learning radicals at early stage isn't mandatory for natives. Well, apart from enough to use dictionaries.
I can confirm this, beginner-level people (understanding less than 100 kanji) in my surroundings told me - after I informed them about radicals and how to use them per example - they saw the light at the end of the tunnel, no longer overwhelmed. And they asked why I didn't tell them this earlier. I told them it's best to know, or at least be able to recognize/read, some 50+ kanji before starting with radicals. Why? Learning the radicals can be boring, or feel unrewarding unless knowing some kanji in-before, thus - to see the use and when learning them - focus on the important ones for current lingual level.
It's the sad truth for most languages using latin letters. At least the whole Germanic family (Danish, Norse, Swedish etc), German-related ones (German, Dutch etc.), French- & Finnish-families and so on... included. Only plausible exception that comes to mind is the Spanish-family (Spanish, Pourtoguise etc), second only to the Chinese-family (Mandarin, Cantonese etc) in terms of practitioners.
Good point. If carrying on these things, lets move to mail conversation. @ Radical display info
I think you've missunderstood this one... NOTE: Exception for @ SVG <--> QT-relation The reason for the layers - and their naming - is for later reference as well as for anyone wishing to re-use the source, split for stroke order diagram which (maybe) is missing or for whomever who wish to verify something to have an additional pointer for strokes, although direction is not shown here. May possibly enable using parser to extract stroke-by-stroke data for creating stroke order diagram which fit same pattern as current data, e.g. extend to cover missing - unless licence prevents this. |
I will definitely add a setting for the popup contents.
There are multiple radicals in the radical search represented by the "wrong" kanji (or kanji containing more than just the radical.) My understanding was that the RADICAL_PRIMARY meant those when there was no closer match in fonts. In my understanding those are not correct, just placeholders. Although your list might hold closer representations.
Unicode is a character set, while UTF-8 is an encoding of Unicode. This means that when there's a code-point, which is a number if we simplify things, it's the same number in any encoding of Unicode. Be it UTF-8, UTF-16 etc. If a document is labelled as simply "Unicode," it's an error. (Or comes from a tradition of the operating system, but actually means a specific encoding with a different name.)
Qt only distorts the SVG if the document was a square for example, while the rectangle drawn into is not. In this case the image will be stretched. The empty areas are left alone, they will be empty in the painted image. |
@ Radical Data & Unicode/Encoding That touchscreen proved too defect, so gonna use mouse to draw the SVG's. Had that screen for design once up on a time... but it had an accident and now its responsive area is a rectangle of 50% of the height and 67% of the width starting from top-left corner. It only responds to movement and doesn't recognize "click", "hold", "erase" etc. of any sort, not even using the stylus buttons. Opened it, but couldn't fix it - nor improve sadly enough. Not a single dead pixel though... so is still nice for colour-representation in-before print of digital art. The @ SVG & QT
|
I've started adding |
It should be "Unicode code point" to be precise, but if you just call it UNICODE, it'll be clear. It doesn't matter how it's prefixed. 0x is a notation specific to C and derived languages for hexadecimal, but programmers generally understand it. You could also just replace any Japanese character with the Unicode code point. Providing both the code point separately and the characters in the file don't make sense, because just by reading the file (which is already in UTF-8) you get the code points anyway.
There are many file name restrictions in Windows, and those were never clearly stated, so if you look for a list of forbidden characters you are out of luck. It truly is junk but I'm not a fan of Linux either, so I don't want to stand on either side. Let's leave this outside of GIT please. Some people who gets triggered by it might come to join in.
The input method editor for Japanese in Windows allows you to draw the characters like the recognition in zkanji. If there's some obscure kanji or radical present in Unicode, you can draw it too, but drawing 038 doesn't give anything matching. It just doesn't exist in that form. There's no deformation of the SVG. It's fine if you place the radical at the position and size where it actually should be. |
I renamed the column to
I'll change the prefix to
I agree with this. However, I still decided add a column instead in order to keep the file "human-friendly". Added notation in the column description, and for a parser it's easy to simply skip a column or two. Sure not the most space-efficient solution, but - I wonder who cares about a few KB nowadays? Also, for the same reason, I will keep
Only reason I brought it up was to ensure cross-platform compability, don't want to use "illigal" characters. Also, I was thinking of using symlink for some references as it'd save space and make things more convinient... but pointless without cross-platform support.
Great, saves me effort. |
@ Radical Data / Browser: |
A brief overview - INDEX:
--> Future feature request #1 - Flags.
--> Future feature request #2 - Annotation descriptions.
--> Future feature request #3 - Font size.
--> Future feature request #4 - Examples, names & multi-lingual.
--> Future feature request #5 - Regex.
-- Future feature request #1 --
In terminal, and plausibly at Help-menu:
--> Flag/switch information, e.g.: --help or -h brings up what options there are and short info.
-- Future feature request #2 --
Annotation descriptions @ Help-menu or Dictionary-menu.
These can be extracted from JMdict:
--> Format:
<!ENTITY term "Explanation">
--> Inbetween:
-- Future feature request #3 --
Font size adjustable "by px" or a "Very Large", as so-called "Large" is fairly small - even using a 27" monitor @ 1920x1080. Indeed, there's interface scaling - great, but for those who only wish to increase size of dictionary entries a "Very Large" or freely adjustable "by px" would be perfect.
-- Future feature request #4 --
Browsable & searchable examples dictionary.
Browsable & searchable names dictionary. (Possible to import along with the others? Using like: zkanji -ien PATH)
Build multi-dictionaries (one for each language in JMdict) at initial import (zkanji -ie PATH)
-- Future feature request #5 --
Simple regex (or import existing regex library/module parhaps?) @ search, e.g.:
--> "*" (none-or-more-characters),
--> "." (single-character),
--> "|" (this-or-that, plausibly brackets enclosure, "[" and "]", for multichar-this-or-that)
--> "&" (this-and-that, plausibly brackets enclosure, "[" and "]", for multichar-this-and-that)
Example:
--> ご*そうさま,
--> ご.そうさま,
--> ごち|くそうさま,
--> ご[ちそう|くそう]さま
...would all find 御馳走様「ごちそうさま」, and anything else matching the given pattern.
Notes:
--> "|" and "&" for latin letters could plausibly be used inbetween SPACE(s) instead of brackets.
--> "&" is primarily useful in latin-style lookup and examples.
The text was updated successfully, but these errors were encountered: