Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Upload personally created embeddings #24

Open
jhoetter opened this issue Jul 22, 2022 · 0 comments
Open

Upload personally created embeddings #24

jhoetter opened this issue Jul 22, 2022 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@jhoetter
Copy link
Member

Is your feature request related to a problem? Please describe.
Embeddings are only creatable through the app & with huggingface models. There is no way to upload/integrate already existing/precomputed embeddings

Describe the solution you’d like
I want to upload personally created embeddings (e.g. an upload option similar to the record upload). They should be usable in the app as well as a visualization for them (e.g. with koaning/bulk - link see Additional Context).

Describe alternatives you’ve considered
-

Additional context
Requested by GeorgePearse on Discord

One last thought before I call it a day. I know the variation in dimensionality is what you stated was the problem with an upload embeddings functionality, but I actually only want to upload 2d 'embeddings' e.g. the output of UMAP such that it can actually be usefully visualized, in the same way, that koaning/bulk and https://github.com/phurwicz/hover allow you to. This covers quite a lot of use cases (admittedly 2D would not be so good for 'get similar' with QDrant, but great for a quick summary, they may just be two completely different features)

In this space (super quick visualization and labelling) there are a few tools, but none are set up neatly enough to actually manage a project. And as for the production-grade tools (yourselves, rubrix, and a few others), none of you seem to have this feature, so it might be a nice way to distinguish yourselves a little.

demoability of a 2d scatter plot (with meaningful embeddings) to senior management is 10/10 when you're trying to argue that your team should adopt a tool Or in my case arguing that you should use NLP at all https://projector.tensorflow.org/

Actually had some interesting ideas, if you go to custom on the bottom left you can create axes of similarity to different examples. They just got some of the levels of abstraction wrong which makes it a real pain to work with. Also doesn't work for any meaningfully sized text

@jhoetter jhoetter added the enhancement New feature or request label Jul 22, 2022
@jhoetter jhoetter self-assigned this Jul 22, 2022
@jhoetter jhoetter moved this from Backlog to 2022 in Roadmap Kern AI refinery Aug 3, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Development

No branches or pull requests

1 participant