New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compoundding problem #419

Closed
GoogleCodeExporter opened this Issue Mar 14, 2015 · 2 comments

Comments

Projects
None yet
1 participant
@GoogleCodeExporter
Collaborator

GoogleCodeExporter commented Mar 14, 2015

Searching for the word 番組紹介 gives me a translation of each kanji, 4 in 
total, instead of dividing it into just 2 compounds, as shown in separate 
searches (images 2+3).

Is it me who is somehow at fault here, if so then what do I do wrong?
This only happens occasionally.

Original issue reported on code.google.com by dezorian...@gmail.com on 27 Jan 2015 at 6:14

Attachments:

@GoogleCodeExporter

This comment has been minimized.

Show comment
Hide comment
@GoogleCodeExporter

GoogleCodeExporter Mar 14, 2015

Collaborator
The algorithm works so that it searches for a word which must contain all of 
the abovementioned four kanjis, in any order. If this is not possible, it is 
not smart enough to try all sub-permutations of these kanjis. Perhaps I can 
check all permutations, including:

番組紹介
番組紹 and 介
番組介 and 紹
番紹介 and 組
組紹介 and 番
番組 and 紹介
番紹 and 組介
番介 and 紹組
組紹 and 番介

Perhaps I can walk the list in the abovementioned order, and I will stop as 
soon as a match is found.
However, imagine that a match is found at 番組 and 紹介, both kanjis 
produce a list of words containing them. How should I merge the word lists, to 
show them in the single list? Perhaps by length... but this would seem 
randomingly to the user. Perhaps I should show tabs, one tab containing 番組 
and the second tab containing 紹介...

Original comment by martin.v...@gmail.com on 29 Jan 2015 at 11:14

Collaborator

GoogleCodeExporter commented Mar 14, 2015

The algorithm works so that it searches for a word which must contain all of 
the abovementioned four kanjis, in any order. If this is not possible, it is 
not smart enough to try all sub-permutations of these kanjis. Perhaps I can 
check all permutations, including:

番組紹介
番組紹 and 介
番組介 and 紹
番紹介 and 組
組紹介 and 番
番組 and 紹介
番紹 and 組介
番介 and 紹組
組紹 and 番介

Perhaps I can walk the list in the abovementioned order, and I will stop as 
soon as a match is found.
However, imagine that a match is found at 番組 and 紹介, both kanjis 
produce a list of words containing them. How should I merge the word lists, to 
show them in the single list? Perhaps by length... but this would seem 
randomingly to the user. Perhaps I should show tabs, one tab containing 番組 
and the second tab containing 紹介...

Original comment by martin.v...@gmail.com on 29 Jan 2015 at 11:14

@GoogleCodeExporter

This comment has been minimized.

Show comment
Hide comment
@GoogleCodeExporter

GoogleCodeExporter Mar 14, 2015

Collaborator
Implemented basic word break-up - fixed.

Original comment by martin.v...@gmail.com on 9 Feb 2015 at 4:09

  • Changed state: Fixed
Collaborator

GoogleCodeExporter commented Mar 14, 2015

Implemented basic word break-up - fixed.

Original comment by martin.v...@gmail.com on 9 Feb 2015 at 4:09

  • Changed state: Fixed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment