Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trying the search for user-defined sets #2

Closed
Ljohn89 opened this issue Dec 29, 2023 · 1 comment
Closed

Trying the search for user-defined sets #2

Ljohn89 opened this issue Dec 29, 2023 · 1 comment

Comments

@Ljohn89
Copy link

Ljohn89 commented Dec 29, 2023

Hello @ashvardanian,
I'm working on implementing the Usearch algorithm for around 300 million molecules, following the steps outlined in your post. However, the search program is currently taking too long to identify and display results. I'm seeking guidance on approaches to optimize the program for faster execution, aiming for results in milliseconds. Any pointers or suggestions would be appreciated.

The snippet of the search code is mentioned below,
#################################################################
from usearch_molecules.dataset import FingerprintedDataset, shape_mixed

data = FingerprintedDataset.open("data/example", shape=shape_mixed)

No inspiration? Pick a random molecule with data.random_smiles()

results = data.search('CC(O)C(CN)=NNCC(C)(C)C', 100)

results_keys = [r[0] for r in results]
results_smiles = [r[1] for r in results]
results_scores = [r[2] for r in results]
##################################################################

Regards,
Lijo

@ashvardanian
Copy link
Owner

Hi @Ljohn89! Sorry, I didn't see the issue earlier. How slow is the search? What hardware are you running on right now?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants