Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUG Remove special symbols from Ken embeddings #601

Merged
merged 27 commits into from
Jun 15, 2023

Conversation

jovan-stojanovic
Copy link
Member

Fix #599

Copy link
Member

@GaelVaroquaux GaelVaroquaux left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I really think that slicing the strings is faster than the replace. Please correct me if I am wrong.

skrub/datasets/_ken_embeddings.py Outdated Show resolved Hide resolved
skrub/datasets/_ken_embeddings.py Outdated Show resolved Hide resolved
@jovan-stojanovic
Copy link
Member Author

Thanks, I think this is ready to be merged then. Not to confuse with #602 which maybe needs some additional work on :)

@GaelVaroquaux
Copy link
Member

Thanks. Maybe a small changelog entry, so as not to surprise users

@jovan-stojanovic
Copy link
Member Author

Actually now the main changes were made with #602, merging this just to update the changelog..

@jovan-stojanovic jovan-stojanovic merged commit ddfa1a1 into skrub-data:main Jun 15, 2023
18 of 19 checks passed
@jovan-stojanovic jovan-stojanovic deleted the fix_ken_output branch July 21, 2023 14:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Ken entity names should not have the preceding and trailing "<" and ">"
2 participants