Version 0.1.1 #19

Merged
merged 4 commits into from
Nov 4, 2021

koaning
Owner

@koaning koaning commented Nov 4, 2021

Upgrade the version of the project and the testing suite.

@koaning koaning merged commit b9afe1d into main Nov 4, 2021
@koaning koaning deleted the version-0.1.1 branch November 4, 2021 12:31
@koaning
Owner Author

koaning commented Nov 5, 2021

@gclen I've added your GitHub name to the v0.1.1 release. Just to check, do you also have a twitter account? I usually like to give shoutouts to folks who submit PRs.

Also, mainly out of curiosity, are you also using simsity for something?

@gclen
Contributor

gclen commented Nov 5, 2021

Thanks! My twitter handle is @hotapplepi.

My original goal was to combine simsity with a weak supervision framework such as snorkel for text embeddings. That way you could see the nearest points to the weakly labelled points and hopefully fine-tune your labelling functions. Unfortunately snorkel depends on scikit-learn < 1.0 and simsity requires > 1.0. I'm still going to attempt making a notebook using simsity with text embeddings and see where the pain points are. I'll make a new issue with what I find!
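
To make the "nearest points to the weakly labelled points" idea concrete, here is a rough brute-force sketch in plain Python. It does not use simsity's or snorkel's API. The `ABSTAIN = -1` marker, the function name `neighbours_of_labelled`, and the toy 2-D "embeddings" are all assumptions made up for illustration. A real setup would use an approximate nearest-neighbour index rather than sorting every pair.

```python
import math

ABSTAIN = -1  # assumed marker meaning "no labelling function fired"


def neighbours_of_labelled(embeddings, weak_labels, n_neighbors=3):
    """Map each weakly labelled row index to the indices of its nearest rows.

    Brute force: every labelled point is compared against all other points
    using Euclidean distance.
    """
    result = {}
    for i, label in enumerate(weak_labels):
        if label == ABSTAIN:
            continue  # only inspect points that received a weak label
        # Candidate neighbours are all rows except the query row itself.
        others = [j for j in range(len(embeddings)) if j != i]
        others.sort(key=lambda j: math.dist(embeddings[i], embeddings[j]))
        result[i] = others[:n_neighbors]
    return result


# Toy data: five 2-D points standing in for text embeddings (hypothetical).
embeddings = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0), (10.0, 10.0)]
weak_labels = [1, ABSTAIN, ABSTAIN, 0, ABSTAIN]

print(neighbours_of_labelled(embeddings, weak_labels, n_neighbors=1))
# {0: [1], 3: [2]}
```

Looking at the unlabelled rows returned for each labelled row is the manual check described above: if the neighbours clearly belong to a different class, the labelling function that fired probably needs adjusting.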

@koaning
Owner Author

koaning commented Nov 6, 2021

I think if you were to manually install snorkel after simsity everything would work.

That said ... I am working on a tool, meant to be used in combination with this library, that should help make embeddings better by putting a labelling human in the loop. The idea is similar to this idea on the docs but the goal will be to end up with better embeddings instead of a better label. The library is to be called (pun intended) embetter and there's a repo in the works. It's very much alpha software though, it'll likely take a week or two to have a proper demo ready.

@koaning
Owner Author

koaning commented Nov 6, 2021

Also, just to check, have you seen the bulk labelling demo?

@gclen
Contributor

gclen commented Nov 10, 2021

Cool! I'll take a look. I think the bulk labelling demo does something similar-ish to what I want, so I might be able to modify it.
