Releases: cthoyt/chembl-downloader
Releases · cthoyt/chembl-downloader
v0.4.5
v0.4.4
Full Changelog: v0.4.3...v0.4.4
v0.4.3
What's Changed
- Refresh datasets from Cortés-Ciriano by @cthoyt in #9
- Update README.md by @YojanaGadiya in #5
- Update cortes-ciriano-refresh.ipynb by @rflameiro in #12
New Contributors
- @YojanaGadiya made their first contribution in #5
- @rflameiro made their first contribution in #12
Full Changelog: v0.4.2...v0.4.3
v0.4.2
Full Changelog: v0.4.1...v0.4.2
New notebooks and uniprot mapping function
🧪⚗️🧪 Improve project skeleton and other tasty things
This PR does several things:
- Removes dependency on
bioversions
and just implements the code locally - Adds a CLI for generating a statistics table for all versions of ChEMBL
- Add proper project skeleton (documentation, unit tests, code quality assurance, CI)
- Improve SQLite loading in case you delete the compressed data
Notebooks
- Adds notebook about drug indications
- Adds notebook about mapping to ChEBI
Add monomer library functionality 🧪🧪🧪
This release adds two new functions:
chembl_downloader.download_monomer_library
which gets this file https://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/latest/chembl_30_monomer_library.xml for whatever version you specifychembl_downloader.get_monomer_library_root
which does the same as the downloader but also parses the XML for you
Thanks to @iwatobipen and his recent blog post for inspiring this.
Get fingerprints and chemical representations 🧪⚗️🧪
New Functions
chembl_downloader.download_fps
downloads the pre-computed Morgan fingerprint filechembl_downloader.download_chemreps
downloads the chembl-smiles-inchi-inchikey mapchembl_downloader.get_chemreps_df
builds onchembl_downloader.download_chemreps
and loads them in a pandas dataframe
Misc
- Add
isort
to code quality checking - Enable many functions with
return_version
to make a tuple with the version, which is useful if you're having it infer the latest version.
v0.1.3
This release adds the get_substructure_library()
for automating the generation of an RDKit substructure library as described in Greg Landrum's RDKit blog post, Some new features in the SubstructLibrary. The following example shows how it can be used to accomplish some of the first tasks presented in the post:
from rdkit import Chem
import chembl_downloader
library = chembl_downloader.get_substructure_library()
query = Chem.MolFromSmarts('[O,N]=C-c:1:c:c:n:c:c:1')
matches = library.GetMatches(query)
Full Changelog: v0.1.2...v0.1.3
v0.1.2
- Add
get_assay_sql()
function
Full Changelog: v0.1.1...v0.1.2