Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions #76

Open
hrstoyanov opened this issue Apr 17, 2024 · 1 comment
Open

Questions #76

hrstoyanov opened this issue Apr 17, 2024 · 1 comment

Comments

@hrstoyanov
Copy link

Thanks for creating this library! I have not looked into the code in detail, but would like to ask a few questions?

  • it looks like the "soundex" functionality from Apache is heavily biased to the English language only?

  • Do you have any estimates of the performance/complexity of the algorithms involved? I am considering this library for searching/iterating through in-memory data objects that have no other indexing.

@manishobhatia
Copy link
Contributor

Hi

  • You are right the use of soundex is limited to english. But it is relatively easy to plugin new functions that support other languages.
  • The complexity measured in our tests is almost linear (see here for details). This would work well for in-memory data

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants