Extracting SMILE Structure and IUPAC Name of Drugs from PubChem using Drug Names

Overview This Python script is designed to extract the SMILE structure of drugs from PubChem using drug names provided by the user. The user must submit a text file containing drug names separated by a new line. The code will automatically generate the SMILE structures of the respective drugs and save the output in a CSV file.

Inputs A text file containing drug names separated by new line.

Outputs A CSV file containing drug names and their respective SMILE structures and IUPAC names.

Procedure

The script starts by importing the necessary libraries, such as pandas, requests, and os.
The user is prompted to enter the name of the input text file containing drug names.
The script reads the input file and stores the drug names in a list.
For each drug in the list, the script searches for the compound in PubChem and retrieves its SMILE structure and IUPAC name.
The drug name and its corresponding SMILE structure and IUPAC name are exported as a CSV file.
The output CSV file is saved in the resultant directory as the input text file.

Assumptions

The user has installed the necessary libraries (such as pandas, requests, and os).
The user has a stable internet connection to access PubChem.
The input text file only contains one drug name per line.
The drug names in the input file are spelled correctly and match the names in PubChem.

Potential Improvements

Add error handling to handle cases where a drug name is misspelled or not found in PubChem.
Add the option to output the SMILE structures in other file formats, such as SDF or Mol.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
DrugName_to_SMILES.ipynb		DrugName_to_SMILES.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Extracting SMILE Structure and IUPAC Name of Drugs from PubChem using Drug Names

About

Releases

Packages

Languages

swarsatkn/Extracting-SMILE-Structure-and-IUPAC-Name-of-Drugs-from-PubChem-using-Drug-Names

Folders and files

Latest commit

History

Repository files navigation

Extracting SMILE Structure and IUPAC Name of Drugs from PubChem using Drug Names

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages