FPSim2: Simple package for fast molecular similarity searches

FPSim2 is a small NumPy centric Python/C++ RDKit based package to run fast compound similarity searches. FPSim2 performs better with high search thresholds (>=0.7). Currently used in the ChEMBL interface.

Highlights:

Using CPU POPCNT instruction
Bounds for sublinear speedups from 10.1021/ci600358f
A compressed file format with optimised read speed based in PyTables and BLOSC
Fast multicore CPU and GPU similarity searches
In memory and on disk search modes
Distance matrix calculation

Installation

pip install fpsim2

or

conda install -c conda-forge fpsim2

Documentation

Documentation is available at https://chembl.github.io/FPSim2/

Trying it online

To try out FPSim2 interactively in your web browser, just click on the binder icon

Name		Name	Last commit message	Last commit date
Latest commit History 180 Commits
.github/workflows		.github/workflows
FPSim2		FPSim2
docs		docs
docsrc		docsrc
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FPSim2: Simple package for fast molecular similarity searches

Installation

Documentation

Trying it online

About

Releases 32

Packages

Contributors 4

Languages

License

chembl/FPSim2

Folders and files

Latest commit

History

Repository files navigation

FPSim2: Simple package for fast molecular similarity searches

Installation

Documentation

Trying it online

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 32

Packages 0

Contributors 4

Languages

Packages