Updating RDKitDescriptors featurizer to support custom descriptors #3135

Merged 3 commits on Dec 27, 2022

Conversation

arunppsg
Contributor

This PR updates RDKitDescriptors to support user-defined descriptors. To do this, it adds a new argument, descriptors, which takes a List[str]. The argument is a list of custom, user-defined descriptors, and the featurizer produces the corresponding output properties.

Another change is the addition of a labels_only argument. When it is set to True, the featurizer returns only the presence or absence of a group.

New tests were also added to cover the new features.
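
To illustrate the behaviour described above, here is a minimal, library-free sketch. It is not the DeepChem implementation: `ToyDescriptorFeaturizer`, `TOY_DESCRIPTORS`, and the three character-counting descriptor functions are hypothetical stand-ins, and only the argument names `descriptors` and `labels_only` come from this PR. The sketch assumes the selected descriptor names are sorted before computing values and that `labels_only=True` maps each value to 1 (present) or 0 (absent).

```python
from typing import Callable, Dict, List, Optional

# Hypothetical stand-in descriptors that work directly on a SMILES string.
# The real featurizer computes RDKit descriptors on molecule objects.
TOY_DESCRIPTORS: Dict[str, Callable[[str], float]] = {
    "num_carbon": lambda smiles: float(smiles.count("C")),
    "num_nitrogen": lambda smiles: float(smiles.count("N")),
    "num_oxygen": lambda smiles: float(smiles.count("O")),
}


class ToyDescriptorFeaturizer:
    """Toy featurizer mirroring the `descriptors` and `labels_only` arguments."""

    def __init__(self,
                 descriptors: Optional[List[str]] = None,
                 labels_only: bool = False):
        # An empty or missing list means "use every known descriptor".
        names = descriptors if descriptors else list(TOY_DESCRIPTORS)
        unknown = [name for name in names if name not in TOY_DESCRIPTORS]
        if unknown:
            raise ValueError(f"Unknown descriptors: {unknown}")
        # Assumption: output columns follow the sorted descriptor names.
        self.descriptors = sorted(names)
        self.labels_only = labels_only

    def featurize(self, smiles: str) -> List[float]:
        values = [TOY_DESCRIPTORS[name](smiles) for name in self.descriptors]
        if self.labels_only:
            # Keep only presence (1.0) or absence (0.0) of each group.
            values = [1.0 if value != 0 else 0.0 for value in values]
        return values


# Only the requested descriptors are computed, in sorted-name order.
custom = ToyDescriptorFeaturizer(descriptors=["num_oxygen", "num_carbon"])
custom_features = custom.featurize("CCO")

# With labels_only=True, counts collapse to presence/absence flags.
labels = ToyDescriptorFeaturizer(labels_only=True)
label_features = labels.featurize("CCO")
```

For the input `"CCO"`, `custom_features` holds one value per requested descriptor, and `label_features` holds a 0/1 flag for each of the three toy descriptors.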

@arunppsg
Contributor Author

In 6364020, I removed the test assertions that used a fixed array index for molecular weight. The index of molecular weight depends on the sorted descriptor list. A descriptor list with only two descriptors, say [ExactMolWt, BCUT2D_MWHI], will have ExactMolWt at index 1 (after sorting), while for the descriptor list [ExactMolWt], ExactMolWt will be at index 0. Since the array index is not fixed across descriptor lists, I removed it from the test assertions.
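
The index shift in the two lists above can be reproduced with a short sketch. It assumes plain lexicographic ordering via Python's built-in `sorted`, and the helper name `descriptor_index` is hypothetical, not part of DeepChem. Looking the column up by name is one way a test can avoid hard-coding an index.

```python
from typing import List


def descriptor_index(descriptors: List[str], name: str) -> int:
    """Return the column of `name` once the descriptor names are sorted."""
    return sorted(descriptors).index(name)


# Two descriptors: "BCUT2D_MWHI" sorts before "ExactMolWt".
two_descriptor_index = descriptor_index(["ExactMolWt", "BCUT2D_MWHI"], "ExactMolWt")

# One descriptor: "ExactMolWt" is the only column.
one_descriptor_index = descriptor_index(["ExactMolWt"], "ExactMolWt")

print(two_descriptor_index, one_descriptor_index)  # prints "1 0"
```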

@arunppsg
Contributor Author

Reviewed by @tonydavis629 offline.

@arunppsg arunppsg merged commit f53f9c9 into deepchem:master Dec 27, 2022
@arunppsg arunppsg deleted the grover branch June 30, 2023 03:15