Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

String kernels as a feature for names #174

Closed
marfox opened this issue Feb 6, 2019 · 4 comments
Assignees
Labels
Projects
Milestone

Comments

@marfox
Copy link
Member

@marfox marfox commented Feb 6, 2019

String kernels enable the comparison of strings with character windows and character jumps: this might be a great feature for names.

@marfox marfox added the discussion label Feb 6, 2019
@marfox marfox added this to the ML linker milestone Feb 6, 2019
@marfox marfox added this to To do in soweego via automation Feb 6, 2019
@marfox marfox self-assigned this Feb 7, 2019
@marfox marfox moved this from To do to Doing in soweego Feb 7, 2019
@marfox

This comment has been minimized.

Copy link
Member Author

@marfox marfox commented Feb 12, 2019

scikit's built-in text analyzers may be relevant because they build character n-grams, see https://scikit-learn.org/stable/modules/feature_extraction.html#limitations-of-the-bag-of-words-representation

@marfox

This comment has been minimized.

Copy link
Member Author

@marfox marfox commented Feb 12, 2019

scikit's built-in text analyzers may be relevant because they build character n-grams, see https://scikit-learn.org/stable/modules/feature_extraction.html#limitations-of-the-bag-of-words-representation

These are already considered in the cosine similarity as per linker/feature_extraction.StringList#cosine_similarity

@marfox marfox moved this from Doing to Upcoming in soweego Feb 14, 2019
@marfox

This comment has been minimized.

Copy link
Member Author

@marfox marfox commented Mar 8, 2019

Subtask of #214

@marfox marfox referenced this issue Mar 8, 2019
5 of 10 tasks complete
@marfox marfox moved this from Upcoming to To do in soweego Mar 8, 2019
@marfox marfox moved this from To do to Upcoming in soweego Mar 29, 2019
@marfox marfox moved this from Upcoming to To do in soweego Apr 3, 2019
@marfox marfox moved this from To do to Upcoming in soweego May 6, 2019
@marfox marfox added task and removed discussion labels May 6, 2019
marfox added a commit that referenced this issue May 8, 2019
…scriptions anymore, it's now done at feature extraction
@marfox marfox assigned marfox and unassigned marfox May 8, 2019
@marfox marfox moved this from Upcoming to Doing in soweego May 8, 2019
@marfox

This comment has been minimized.

Copy link
Member Author

@marfox marfox commented May 9, 2019

@marfox marfox closed this May 9, 2019
soweego automation moved this from Doing to Done May 9, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
soweego
  
Done
2 participants
You can’t perform that action at this time.