similarity as alternative way of searching #267

Open
dfa1 opened this issue Jun 3, 2020 · 1 comment
Labels
command related to a command

Comments

@dfa1
Collaborator

dfa1 commented Jun 3, 2020

Problem
Provide a command to search by similarity, using well-known algorithms such as
Levenshtein, cosine, and Jaccard.

Solution
In practice, create a tiny wrapper around this library:
https://github.com/tdebatty/java-string-similarity

Normalized values would also require a new value type (decimal or float).
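
To make the normalized-value point concrete, here is a minimal sketch of what a similarity search could compute. It does not use the library linked above; it is a self-contained Levenshtein implementation, and the class name `SimilaritySearch`, its methods, and the threshold parameter are all hypothetical, not part of this project. The score is a `double` in the range 0.0 to 1.0, which is the kind of result that would need the new decimal or float value type.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of this project
// and not the API of java-string-similarity.
final class SimilaritySearch {

    // Classic Levenshtein edit distance using two rolling rows.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insertOrDelete = Math.min(curr[j - 1] + 1, prev[j] + 1);
                curr[j] = Math.min(insertOrDelete, prev[j - 1] + cost);
            }
            int[] tmp = prev; // swap rows for the next iteration
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Normalized similarity in [0.0, 1.0]; 1.0 means the strings are identical.
    static double similarity(String a, String b) {
        int longest = Math.max(a.length(), b.length());
        if (longest == 0) {
            return 1.0; // two empty strings are treated as identical
        }
        return 1.0 - (double) levenshtein(a, b) / longest;
    }

    // Keeps the candidates whose similarity to the query reaches the threshold.
    static List<String> search(List<String> candidates, String query, double threshold) {
        List<String> matches = new ArrayList<>();
        for (String candidate : candidates) {
            if (similarity(candidate, query) >= threshold) {
                matches.add(candidate);
            }
        }
        return matches;
    }
}
```

A command built on something like this would take the query and a threshold as arguments and filter its input the way `search` does. Cosine or Jaccard could replace `similarity` without changing the rest.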

Benefit
Being able to search not only by exact match or regexp but also by similarity.

dfa1 added the "command" label (related to a command) Jun 3, 2020
@kosiakk

kosiakk commented Jun 3, 2020

Smart synonym search would also be magnificent, using an embedding like Word2Vec or a more modern one from GPT-2. I'd love to play around with them.
Look, lots of people play with them for fun, pre-training those enormous models on a specific domain. Ask them to train a model on bash scripts and comments, and it will then write code for you from the comments :-)
