Fingerprint similarity in MongoDB

This repository accompanies a post in which I describe fingerprint-based similarity searches in MongoDB. The post and code were inspired by a Datablend post that described this approach, and my code is pretty much what they described (just packaged in a Python wrapper).
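As a rough illustration of the idea, here is a minimal in-memory sketch of a Tanimoto similarity search with bit-count pruning. It is not the code in this repository: the function names and the representation of a fingerprint as a set of on-bit positions are assumptions made for the example. The pruning step relies on the fact that a target can only reach a Tanimoto threshold t against a query with n on-bits if its own on-bit count lies between n·t and n/t. In a database, that range check could become a filter on a stored bit-count field.

```python
from math import ceil, floor


def tanimoto(a, b):
    """Tanimoto coefficient of two fingerprints given as sets of on-bit positions."""
    a, b = set(a), set(b)
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def bit_count_bounds(query_count, threshold):
    """Smallest and largest on-bit counts a target may have and still reach `threshold`."""
    return ceil(query_count * threshold), floor(query_count / threshold)


def search(query_bits, targets, threshold):
    """Return (name, similarity) pairs at or above `threshold`, best first.

    `targets` is a hypothetical dict mapping a molecule name to its set of on-bits.
    """
    lo, hi = bit_count_bounds(len(query_bits), threshold)
    hits = []
    for name, bits in targets.items():
        # Cheap pre-filter: skip targets whose bit count rules them out.
        if not lo <= len(bits) <= hi:
            continue
        score = tanimoto(query_bits, bits)
        if score >= threshold:
            hits.append((name, score))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)
```

For example, `search({1, 2, 3}, {"a": {1, 2, 3}, "c": {9}}, 0.7)` never computes the similarity for `"c"`, because a single on-bit falls outside the allowed count range.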

To load the fingerprints and run the benchmark, unzip and then run the scripts:


On completion, the time for each query, along with the bit length of the query structure, will be in times.txt. This assumes you have a MongoDB instance running on the local machine at the default port (27017).
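The timing step can be pictured along these lines. This is only a sketch under stated assumptions: `time_queries` and `run_query` are hypothetical names, "bit length" is taken to mean the number of on-bits in the query fingerprint, and the tab-separated layout is a guess rather than the actual format of times.txt.

```python
import time


def time_queries(queries, run_query, out):
    """Time each query and write '<on-bit count>\t<seconds>' lines to `out`.

    `queries` is an iterable of fingerprints (sets of on-bit positions),
    `run_query` is any callable that performs the search for one fingerprint,
    and `out` is a writable text file object.
    """
    for bits in queries:
        start = time.perf_counter()
        run_query(bits)
        elapsed = time.perf_counter() - start
        out.write(f"{len(bits)}\t{elapsed:.6f}\n")
```

In practice `out` would be an open file such as `open("times.txt", "w")`, and `run_query` would wrap the database call.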

The fingerprints were generated using the CDK and are Signature fingerprints. If you prefer another type, the original SMILES from ChEMBL are available in and can be used to generate a different set of fingerprints.