Skip to content
This repository has been archived by the owner on Nov 9, 2022. It is now read-only.

Fuzzy matching (e.g. double-metaphone) should dynamically generate dictionaries #321

Closed
anthony-gray opened this issue May 22, 2019 · 4 comments
Milestone

Comments

@anthony-gray
Copy link

Fuzzy matching (e.g. double-metaphone) should dynamically generate dictionaries because the domain for a given match element is the values of the element in the system at the start of matching. This currently does not seem to be working.

@ryanjdew
Copy link
Contributor

@anthony-gray Are you using this library independently or as an incorporated part of Data Hub 5? There was a method that allowed would create a dictionary if it didn't exist and there was a range index on the field, as part of storing options.

This logic isn't called as part of Data Hub 5, as the options are stored differently. A dictionary can be generated using cts.values on a range index and passing those values into spell.makeDictionary. (https://docs.marklogic.com/spell.makeDictionary)

@anthony-gray
Copy link
Author

I’m using it independently. I do not intend to use DHF. Matching should do this automatically by default because this should be the most common use case in the context of matching documents in a particular database using fields of documents in that database. Passing/supplying a dictionary should be optional. I should not have to call a method to generate a dictionary. All I should have to do is specify in the match configuration the arguments for fuzzy matching and it should just work. If there is a configuration option that toggles dictionary generation please let me know.

@ryanjdew
Copy link
Contributor

In version 1.3.1. dictionaries that don't already exist will be created from indexes that are associated with a property if a cts:reference is provided.

@anthony-gray
Copy link
Author

Okay. I will have to test this.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants