In [2]:
from pyCheckmol import CheckMol
import numpy as np

- The `functionalGroupASbitvector` function generates a bit vector of functional groups (FG) that can be used in QSPR/QSAR modeling.
- Each position indicates the presence or absence of a given FG. The positions follow the ordering of the checkmol table, i.e., the first position is the presence/absence of a cation, and the last position (204) is the alpha-hydroxyacid.


In [9]:
smi = 'CC1=CC(=CC=C1)C'
cm = CheckMol()

# Bit vector of functional groups for the molecule
bitvector = np.asarray(cm.functionalGroupASbitvector(smi))
print(len(bitvector))

# Report the first set position, if there is one
ones = np.flatnonzero(bitvector == 1)
print(f"Index of 1 element: {ones[0] if ones.size else 'No 1 element found'}")


205
Index of 1 element: 201
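
- A molecule can set more than one position, so it can be useful to list every set position rather than only the first. The sketch below is a minimal illustration: `set_positions` is a hypothetical helper (not part of pyCheckmol), and `example` is a stand-in array built by hand to mimic the 205-position vector from the run above, assuming the bit vector is array-like and holds 0/1 values.

```python
import numpy as np


def set_positions(bitvector):
    """Return the indices of every position equal to 1 (hypothetical helper)."""
    bits = np.asarray(bitvector)
    return np.flatnonzero(bits == 1).tolist()


# Stand-in for the output of cm.functionalGroupASbitvector(smi):
# 205 positions with only position 201 set, mimicking the run above.
example = np.zeros(205, dtype=int)
example[201] = 1

positions = set_positions(example)
print(positions)
```

With a real molecule, `example` would be replaced by the vector returned by `cm.functionalGroupASbitvector(smi)`.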


- The `cm.functionalGroupSmiles` function returns the functional group information as a pandas.DataFrame, a dictionary, or just a list of functional group codes, depending on the parameters `justFGcode` and `returnDataframe`.

In [24]:
smi = 'CC1=CC(=CC=C1)C'
cm = CheckMol()
res = cm.functionalGroupSmiles(smiles=smi, isString=True, generate3D=False, justFGcode=False, returnDataframe=False, deleteTMP=False)
res

{'Functional Group': ['alkyne '],
 'Frequency': ['1'],
 'Atom Position': ['2-3-4-5-6-7-'],
 'Functional Group Number': ['201'],
 'Code': ['000C3C00 ']}
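
- The dictionary output stores every value as a string, some with trailing spaces, and the atom positions as a dash-separated string. The following sketch shows one possible way to turn it into cleaned per-group records. `rows_from_result` is a hypothetical helper (not part of pyCheckmol), the `res` literal is copied from the output above, and the parsing assumes the atom-position format seen in this single example.

```python
def rows_from_result(res):
    """Convert the column-oriented dict into a list of cleaned row dicts."""
    keys = list(res)
    rows = []
    for values in zip(*(res[k] for k in keys)):
        # Strip the trailing whitespace seen in names and codes
        row = {k: v.strip() for k, v in zip(keys, values)}
        row["Frequency"] = int(row["Frequency"])
        row["Functional Group Number"] = int(row["Functional Group Number"])
        # '2-3-4-5-6-7-' -> [2, 3, 4, 5, 6, 7]; empty pieces are skipped
        row["Atom Position"] = [
            int(a) for a in row["Atom Position"].split("-") if a
        ]
        rows.append(row)
    return rows


# Copied from the dictionary output shown above
res = {
    "Functional Group": ["alkyne "],
    "Frequency": ["1"],
    "Atom Position": ["2-3-4-5-6-7-"],
    "Functional Group Number": ["201"],
    "Code": ["000C3C00 "],
}

rows = rows_from_result(res)
print(rows[0])
```

Molecules whose groups occur several times may report atom positions in a different layout, so the position parsing should be checked against real output before relying on it.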

- The full output that checkmol provides about the molecule and its functional groups can be viewed with the following command:

In [10]:
print(cm.information_)

Molecule name: 
atoms: 8  bonds: 8  rings: 1
   1 C  C3    -0.8660   -1.5000    0.0000  (1 heavy-atom neighbors, Hexp: 0 Htot: 3)
   2 C  CAR   -0.8660   -0.5000    0.0000  (3 heavy-atom neighbors, Hexp: 0 Htot: 0) aromatic
   3 C  CAR   -1.7321   -0.0000    0.0000  (2 heavy-atom neighbors, Hexp: 0 Htot: 1) aromatic
   4 C  CAR   -1.7321    1.0000    0.0000  (3 heavy-atom neighbors, Hexp: 0 Htot: 0) aromatic
   5 C  CAR   -0.8660    1.5000    0.0000  (2 heavy-atom neighbors, Hexp: 0 Htot: 1) aromatic
   6 C  CAR   -0.0000    1.0000    0.0000  (2 heavy-atom neighbors, Hexp: 0 Htot: 1) aromatic
   7 C  CAR    0.0000    0.0000    0.0000  (2 heavy-atom neighbors, Hexp: 0 Htot: 1) aromatic
   8 C  C3    -2.5981    1.5000    0.0000  (1 heavy-atom neighbors, Hexp: 0 Htot: 3)
   1 1 2 S
   2 2 7 S, contained in 1 ring(s) (aromatic) 
   3 2 3 D, contained in 1 ring(s) (aromatic) 
   4 3 4 S, contained in 1 ring(s) (aromatic) 
   5 4 5 D, contained in 1 ring(s) (aromatic) 
   6 4 8 S
   7 5 6 S,