# **Computational Drug**

In this Jupyter notebook, we will build a real-life **computational biology project**. The end goal is to build a machine learning model using the ChEMBL bioactivity data. Our target dataset contains the bioactivity data for Hepatitis C virus NS3 protease/helicase.

Francisco Garcia

---

## **1- Download Bioactivity Data**

#### **ChEMBL Database**

The [*ChEMBL Database*](https://www.ebi.ac.uk/chembl/) contains curated bioactivity data for more than 2 million compounds. It is compiled from more than 76,000 documents and 1.2 million assays, and the data span 13,000 targets, 1,800 cells and 33,000 indications.
[Data as of October 25, 2020; ChEMBL version 26].

## **Installing libraries**

Install the ChEMBL web service package so that we can retrieve bioactivity data from the ChEMBL Database.

In [47]:
! pip install chembl_webresource_client



## **Importing libraries**

In [48]:
# Import necessary libraries
import pandas as pd
from chembl_webresource_client.new_client import new_client

## **Search for Target protein**

### **Target search for hepatitis**

In [49]:
# Target search for hepatitis
target = new_client.target
target_query = target.search('hepatitis')
targets = pd.DataFrame.from_dict(target_query)
targets

Unnamed: 0,cross_references,organism,pref_name,score,species_group_flag,target_chembl_id,target_components,target_type,tax_id
0,[],Hepatitis B virus,Hepatitis B virus,11.0,False,CHEMBL613497,[],ORGANISM,10407
1,[],Hepatitis C virus,Hepatitis C virus,11.0,False,CHEMBL379,[],ORGANISM,11103
2,[],Murine hepatitis virus,Murine hepatitis virus,11.0,False,CHEMBL613733,[],ORGANISM,11138
3,[],Hepatitis A virus,Hepatitis A virus,11.0,False,CHEMBL613753,[],ORGANISM,12092
4,[],Woodchuck hepatitis virus,Woodchuck hepatitis virus,11.0,False,CHEMBL613179,[],ORGANISM,35269
5,"[{'xref_id': 'P26664', 'xref_name': None, 'xre...",Hepatitis C virus genotype 1a (isolate 1) (HCV),Hepatitis C virus polyprotein,10.0,False,CHEMBL4620,"[{'accession': 'P26664', 'component_descriptio...",SINGLE PROTEIN,11104
6,[],Duck hepatitis B virus,Duck hepatitis B virus,10.0,False,CHEMBL613761,[],ORGANISM,12639
7,"[{'xref_id': 'Q15004', 'xref_name': None, 'xre...",Homo sapiens,PCNA-associated factor,10.0,False,CHEMBL5574,"[{'accession': 'Q15004', 'component_descriptio...",SINGLE PROTEIN,9606
8,"[{'xref_id': 'D2K2A8', 'xref_name': None, 'xre...",Hepatitis C virus,Hepatitis C virus NS4A protein,9.0,False,CHEMBL2364,"[{'accession': 'D2K2A8', 'component_descriptio...",SINGLE PROTEIN,11103
9,[],Murine hepatitis virus strain A59,Murine hepatitis virus (strain A59),9.0,False,CHEMBL613734,[],ORGANISM,11142


### **Select and retrieve bioactivity data for *Hepatitis C virus NS3 protease/helicase* (index 11)**

We will assign the entry at index 11 (which corresponds to the target protein, Hepatitis C virus NS3 protease/helicase) to the ***selected_target*** variable. The table above lists only rows 0 to 9, so this entry is not visible there.

In [50]:
selected_target = targets.target_chembl_id[11]
selected_target

'CHEMBL4893'
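A positional index such as `[11]` depends on the order in which the search returns its results. As a hedged alternative, the sketch below looks the target up by its `pref_name` instead. The small `targets_demo` table and the helper `select_target_id` are illustrative assumptions, not part of the ChEMBL client; the names and IDs are the ones that appear in this notebook.

```python
import pandas as pd

# Toy stand-in for the `targets` table (names and IDs taken from this notebook).
targets_demo = pd.DataFrame({
    "pref_name": [
        "Hepatitis C virus",
        "Hepatitis C virus polyprotein",
        "Hepatitis C virus NS3 protease/helicase",
    ],
    "target_chembl_id": ["CHEMBL379", "CHEMBL4620", "CHEMBL4893"],
})

def select_target_id(table, name):
    """Hypothetical helper: return the ChEMBL ID of the one row whose pref_name equals `name`."""
    matches = table.loc[table["pref_name"] == name, "target_chembl_id"]
    if len(matches) != 1:
        raise ValueError(f"expected exactly one match for {name!r}, found {len(matches)}")
    return matches.iloc[0]

selected_demo = select_target_id(targets_demo, "Hepatitis C virus NS3 protease/helicase")
print(selected_demo)
```

Raising an error on zero or multiple matches makes a changed search result visible immediately instead of silently selecting a different target.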

Here, we will retrieve only the bioactivity data for *Hepatitis C virus NS3 protease/helicase* (CHEMBL4893) that are reported as IC$_{50}$ values in nM (nanomolar) units.

In [53]:
activity = new_client.activity
res = activity.filter(target_chembl_id=selected_target).filter(standard_type="IC50")

In [54]:
df = pd.DataFrame.from_dict(res)
df

Unnamed: 0,activity_comment,activity_id,activity_properties,assay_chembl_id,assay_description,assay_type,bao_endpoint,bao_format,bao_label,canonical_smiles,data_validity_comment,data_validity_description,document_chembl_id,document_journal,document_year,ligand_efficiency,molecule_chembl_id,molecule_pref_name,parent_molecule_chembl_id,pchembl_value,potential_duplicate,qudt_units,record_id,relation,src_id,standard_flag,standard_relation,standard_text_value,standard_type,standard_units,standard_upper_value,standard_value,target_chembl_id,target_organism,target_pref_name,target_tax_id,text_value,toid,type,units,uo_units,upper_value,value
0,,115491,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCNC(=O)C(=O)[C@H](CC)NC(=O)[C@@H]1CCCN1C(=O...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '8.98', 'le': '0.17', 'lle': '8.20', '...",CHEMBL13443,,CHEMBL13443,6.38,False,http://www.openphacts.org/units/Nanomolar,317605,=,1,True,=,,IC50,nM,,420.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,0.42
1,,129448,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCC(NC(=O)[C@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '9.10', 'le': '0.18', 'lle': '8.28', '...",CHEMBL346460,,CHEMBL346460,6.46,False,http://www.openphacts.org/units/Nanomolar,317604,=,1,True,=,,IC50,nM,,350.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,0.35
2,,130845,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCC(NC(=O)[C@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '7.70', 'le': '0.15', 'lle': '7.15', '...",CHEMBL348896,,CHEMBL348896,5.36,False,http://www.openphacts.org/units/Nanomolar,317607,=,1,True,=,,IC50,nM,,4340.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,4.34
3,,133287,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,CC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '7.97', 'le': '0.16', 'lle': '7.51', '...",CHEMBL351702,,CHEMBL351702,5.60,False,http://www.openphacts.org/units/Nanomolar,317598,=,1,True,=,,IC50,nM,,2500.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,2.5
4,,134500,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCNC(=O)C(=O)C(CC=C)NC(=O)[C@H]1CCCN1C(=O)[C...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '6.68', 'le': '0.13', 'lle': '3.82', '...",CHEMBL351389,,CHEMBL351389,6.24,False,http://www.openphacts.org/units/Nanomolar,317599,=,1,True,=,,IC50,nM,,570.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,0.57
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
306,,1840273,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '7.31', 'le': '0.14', 'lle': '1.42', '...",CHEMBL390344,,CHEMBL390344,4.14,False,http://www.openphacts.org/units/Nanomolar,630083,=,1,True,=,,IC50,nM,,73000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,73.0
307,,1840274,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '7.82', 'le': '0.15', 'lle': '1.70', '...",CHEMBL390344,,CHEMBL390344,4.42,False,http://www.openphacts.org/units/Nanomolar,630082,=,1,True,=,,IC50,nM,,38000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,38.0
308,,1840275,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '8.08', 'le': '0.16', 'lle': '1.85', '...",CHEMBL390344,,CHEMBL390344,4.57,False,http://www.openphacts.org/units/Nanomolar,630081,=,1,True,=,,IC50,nM,,27000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,27.0
309,,1840276,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1Cc2ccc(OCC(=O)O)cc2N1)C(=O...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '10.01', 'le': '0.20', 'lle': '4.19', ...",CHEMBL439547,,CHEMBL439547,5.00,False,http://www.openphacts.org/units/Nanomolar,630084,=,1,True,=,,IC50,nM,,10000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,10.0


In [56]:
df.standard_type.unique()

array(['IC50'], dtype=object)
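The check above confirms the activity type. A similar check could be applied to the units, since the query filters on `standard_type` only. The following is a minimal sketch on a toy table; `activities_demo` and its third row (with a different unit) are made up for illustration and do not come from ChEMBL.

```python
import pandas as pd

# Toy activity table; the last row is a hypothetical record reported in another unit.
activities_demo = pd.DataFrame({
    "molecule_chembl_id": ["CHEMBL13443", "CHEMBL346460", "DEMO_OTHER_UNIT"],
    "standard_type": ["IC50", "IC50", "IC50"],
    "standard_units": ["nM", "nM", "ug.mL-1"],
    "standard_value": ["420.0", "350.0", "1.5"],
})

# Keep only rows that are IC50 measurements expressed in nM.
is_ic50_nm = (activities_demo["standard_type"] == "IC50") & (
    activities_demo["standard_units"] == "nM"
)
ic50_nm = activities_demo[is_ic50_nm]

print(ic50_nm["standard_units"].unique())
```

On the real dataframe, `df.standard_units.unique()` would show whether such a filter is needed at all.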

Finally, we will save the resulting bioactivity data to a CSV file, **bioactivity_data_hepatitis0.csv**.

In [57]:
df.to_csv('bioactivity_data_hepatitis0.csv', index=False)

## **Copying files to Google Drive**

First, we need to mount Google Drive in Colab so that we can access our Google Drive from within Colab.

In [58]:
from google.colab import drive
drive.mount('/content/gdrive/', force_remount=True)


Mounted at /content/gdrive/


Next, we create a **Data_hepatitis** folder in our **Colab Notebooks** folder on Google Drive.

In [59]:
! mkdir "/content/gdrive/My Drive/Colab Notebooks/Data_hepatitis"

In [60]:
! cp bioactivity_data_hepatitis0.csv "/content/gdrive/My Drive/Colab Notebooks/Data_hepatitis"

In [61]:
! ls -l "/content/gdrive/My Drive/Colab Notebooks/Data_hepatitis"

total 176
-rw------- 1 root root 179392 Dec 14 14:40 bioactivity_data_hepatitis0.csv
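The `mkdir` cell above reports an error if the folder already exists, for example when the notebook is re-run. A possible Python equivalent of the create-and-copy steps is sketched below. The helper `copy_to_folder` is a hypothetical name, and a temporary directory stands in for the mounted Drive so the example runs anywhere.

```python
import os
import shutil
import tempfile

def copy_to_folder(src_path, dest_dir):
    """Create dest_dir if needed, copy src_path into it, and return the new file path."""
    os.makedirs(dest_dir, exist_ok=True)  # no error when the folder already exists
    return shutil.copy(src_path, dest_dir)

# Temporary directory used here in place of "/content/gdrive/My Drive/Colab Notebooks".
workdir = tempfile.mkdtemp()
src = os.path.join(workdir, "bioactivity_data_hepatitis0.csv")
with open(src, "w") as handle:
    handle.write("molecule_chembl_id,standard_value\nCHEMBL13443,420.0\n")

dest_dir = os.path.join(workdir, "Data_hepatitis")
copied = copy_to_folder(src, dest_dir)
copied_again = copy_to_folder(src, dest_dir)  # repeating the call is safe

print(os.listdir(dest_dir))
```

In Colab, `dest_dir` would be the Drive path used in the shell cells above.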


Let's see the CSV files that we have so far.

In [63]:
! ls

bioactivity_data.csv  bioactivity_data_hepatitis0.csv  gdrive  sample_data


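Taking a glimpse of the **bioactivity_data.csv** file. Note that this is an older file already present in the working directory; the file we have just created is **bioactivity_data_hepatitis0.csv**, which can be inspected the same way.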

In [64]:
! head bioactivity_data.csv

activity_comment,activity_id,activity_properties,assay_chembl_id,assay_description,assay_type,bao_endpoint,bao_format,bao_label,canonical_smiles,data_validity_comment,data_validity_description,document_chembl_id,document_journal,document_year,ligand_efficiency,molecule_chembl_id,molecule_pref_name,parent_molecule_chembl_id,pchembl_value,potential_duplicate,qudt_units,record_id,relation,src_id,standard_flag,standard_relation,standard_text_value,standard_type,standard_units,standard_upper_value,standard_value,target_chembl_id,target_organism,target_pref_name,target_tax_id,text_value,toid,type,units,uo_units,upper_value,value
,1411512,[],CHEMBL831669,Inhibitory concentration against hepatitis C NS4A protease,B,BAO_0000190,BAO_0000357,single protein format,CC[C@H](C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CS)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)C(C)C,,,CH

## **Handling missing data**
If any compound has a missing value in the **standard_value** column, drop it.

In [65]:
df2 = df[df.standard_value.notna()]
df2

Unnamed: 0,activity_comment,activity_id,activity_properties,assay_chembl_id,assay_description,assay_type,bao_endpoint,bao_format,bao_label,canonical_smiles,data_validity_comment,data_validity_description,document_chembl_id,document_journal,document_year,ligand_efficiency,molecule_chembl_id,molecule_pref_name,parent_molecule_chembl_id,pchembl_value,potential_duplicate,qudt_units,record_id,relation,src_id,standard_flag,standard_relation,standard_text_value,standard_type,standard_units,standard_upper_value,standard_value,target_chembl_id,target_organism,target_pref_name,target_tax_id,text_value,toid,type,units,uo_units,upper_value,value
0,,115491,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCNC(=O)C(=O)[C@H](CC)NC(=O)[C@@H]1CCCN1C(=O...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '8.98', 'le': '0.17', 'lle': '8.20', '...",CHEMBL13443,,CHEMBL13443,6.38,False,http://www.openphacts.org/units/Nanomolar,317605,=,1,True,=,,IC50,nM,,420.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,0.42
1,,129448,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCC(NC(=O)[C@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '9.10', 'le': '0.18', 'lle': '8.28', '...",CHEMBL346460,,CHEMBL346460,6.46,False,http://www.openphacts.org/units/Nanomolar,317604,=,1,True,=,,IC50,nM,,350.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,0.35
2,,130845,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCC(NC(=O)[C@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '7.70', 'le': '0.15', 'lle': '7.15', '...",CHEMBL348896,,CHEMBL348896,5.36,False,http://www.openphacts.org/units/Nanomolar,317607,=,1,True,=,,IC50,nM,,4340.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,4.34
3,,133287,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,CC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '7.97', 'le': '0.16', 'lle': '7.51', '...",CHEMBL351702,,CHEMBL351702,5.60,False,http://www.openphacts.org/units/Nanomolar,317598,=,1,True,=,,IC50,nM,,2500.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,2.5
4,,134500,[],CHEMBL763432,Inhibition against hepatitis C virus protease ...,B,BAO_0000190,BAO_0000357,single protein format,C=CCNC(=O)C(=O)C(CC=C)NC(=O)[C@H]1CCCN1C(=O)[C...,,,CHEMBL1133326,Bioorg. Med. Chem. Lett.,2000,"{'bei': '6.68', 'le': '0.13', 'lle': '3.82', '...",CHEMBL351389,,CHEMBL351389,6.24,False,http://www.openphacts.org/units/Nanomolar,317599,=,1,True,=,,IC50,nM,,570.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,0.57
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
306,,1840273,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '7.31', 'le': '0.14', 'lle': '1.42', '...",CHEMBL390344,,CHEMBL390344,4.14,False,http://www.openphacts.org/units/Nanomolar,630083,=,1,True,=,,IC50,nM,,73000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,73.0
307,,1840274,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '7.82', 'le': '0.15', 'lle': '1.70', '...",CHEMBL390344,,CHEMBL390344,4.42,False,http://www.openphacts.org/units/Nanomolar,630082,=,1,True,=,,IC50,nM,,38000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,38.0
308,,1840275,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '8.08', 'le': '0.16', 'lle': '1.85', '...",CHEMBL390344,,CHEMBL390344,4.57,False,http://www.openphacts.org/units/Nanomolar,630081,=,1,True,=,,IC50,nM,,27000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,27.0
309,,1840276,[],CHEMBL917717,Inhibition of HCV NS3 protease,B,BAO_0000190,BAO_0000357,single protein format,CC(C)C[C@H](NC(=O)C1Cc2ccc(OCC(=O)O)cc2N1)C(=O...,,,CHEMBL1149267,J. Med. Chem.,2004,"{'bei': '10.01', 'le': '0.20', 'lle': '4.19', ...",CHEMBL439547,,CHEMBL439547,5.00,False,http://www.openphacts.org/units/Nanomolar,630084,=,1,True,=,,IC50,nM,,10000.0,CHEMBL4893,Hepatitis C virus,Hepatitis C virus NS3 protease/helicase,11103,,,IC50,uM,UO_0000065,,10.0


The rows with a missing **standard_value** have now been removed.
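To make the effect of this step easier to verify, the sketch below drops missing values with `dropna` and reports how many rows were removed. The table `raw_demo` is a toy example, and its `DEMO_MISSING` row is made up for illustration.

```python
import pandas as pd

# Toy table with one missing standard_value (hypothetical row).
raw_demo = pd.DataFrame({
    "molecule_chembl_id": ["CHEMBL13443", "DEMO_MISSING", "CHEMBL346460"],
    "standard_value": ["420.0", None, "350.0"],
})

# Drop only the rows where standard_value is missing; other columns are not checked.
clean_demo = raw_demo.dropna(subset=["standard_value"])
n_dropped = len(raw_demo) - len(clean_demo)

print(f"dropped {n_dropped} of {len(raw_demo)} rows")
print(clean_demo.index.tolist())
```

The original row labels are kept after filtering, which is why the index of the filtered table can contain gaps.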

## **Data pre-processing of the bioactivity data**

### **Labeling compounds as either being active, inactive or intermediate**
The bioactivity data are IC50 values in nM. Compounds with values of 1,000 nM or less are considered **active**, while those with values of 10,000 nM or more are considered **inactive**. Values between 1,000 and 10,000 nM are referred to as **intermediate**.

In [66]:
bioactivity_class = []
for i in df2.standard_value:
  if float(i) >= 10000:
    bioactivity_class.append("inactive")
  elif float(i) <= 1000:
    bioactivity_class.append("active")
  else:
    bioactivity_class.append("intermediate")
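The same thresholds can also be applied without an explicit loop. The sketch below is one possible vectorized version using `numpy.select`; the function name `label_bioactivity` and its parameter names are illustrative assumptions, while the cut-offs match the loop above (1,000 nM or less is active, 10,000 nM or more is inactive).

```python
import numpy as np
import pandas as pd

def label_bioactivity(values, active_max=1000.0, inactive_min=10000.0):
    """Hypothetical helper: map IC50 values in nM to activity classes."""
    # Values may arrive as strings, so convert them to numbers first.
    numeric = pd.to_numeric(pd.Series(values), errors="raise").to_numpy()
    labels = np.select(
        [numeric <= active_max, numeric >= inactive_min],
        ["active", "inactive"],
        default="intermediate",
    )
    return labels.tolist()

print(label_bioactivity(["420.0", "1000", "4340.0", "10000.0", "73000.0"]))
```

Keeping the thresholds as parameters makes the boundary behaviour explicit: a value of exactly 1,000 is active and a value of exactly 10,000 is inactive.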

### **Collect the *molecule_chembl_id* values into a list**

In [67]:
mol_cid = []
for i in df2.molecule_chembl_id:
  mol_cid.append(i)

### **Collect the *canonical_smiles* values into a list**

In [68]:
canonical_smiles = []
for i in df2.canonical_smiles:
  canonical_smiles.append(i)

### **Collect the *standard_value* values into a list**

In [69]:
standard_value = []
for i in df2.standard_value:
  standard_value.append(i)

### **Combine the 4 lists into a dataframe**

In [74]:
data_tuples = list(zip(mol_cid, canonical_smiles, bioactivity_class, standard_value))
df3 = pd.DataFrame( data_tuples,  columns=['molecule_chembl_id', 'canonical_smiles', 'bioactivity_class', 'standard_value'])

In [75]:
df3

Unnamed: 0,molecule_chembl_id,canonical_smiles,bioactivity_class,standard_value
0,CHEMBL13443,C=CCNC(=O)C(=O)[C@H](CC)NC(=O)[C@@H]1CCCN1C(=O...,active,420.0
1,CHEMBL346460,C=CCC(NC(=O)[C@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H...,active,350.0
2,CHEMBL348896,C=CCC(NC(=O)[C@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H...,intermediate,4340.0
3,CHEMBL351702,CC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(...,intermediate,2500.0
4,CHEMBL351389,C=CCNC(=O)C(=O)C(CC=C)NC(=O)[C@H]1CCCN1C(=O)[C...,active,570.0
...,...,...,...,...
302,CHEMBL390344,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,inactive,73000.0
303,CHEMBL390344,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,inactive,38000.0
304,CHEMBL390344,CC(C)C[C@H](NC(=O)C1(Cc2ccsc2C(=O)O)Cc2ccccc2N...,inactive,27000.0
305,CHEMBL439547,CC(C)C[C@H](NC(=O)C1Cc2ccc(OCC(=O)O)cc2N1)C(=O...,inactive,10000.0
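The combined table could also be built by selecting the columns directly rather than going through intermediate lists. The following is a minimal sketch under stated assumptions: `df2_demo` and `demo_classes` are toy stand-ins, and the SMILES strings are placeholders rather than the real structures for these IDs.

```python
import pandas as pd

# Toy stand-in for df2, with gaps in the index as left behind by row filtering.
df2_demo = pd.DataFrame(
    {
        "molecule_chembl_id": ["CHEMBL13443", "CHEMBL348896", "CHEMBL439547"],
        "canonical_smiles": ["C=CC", "CCO", "CCN"],  # placeholders, not the real structures
        "standard_value": ["420.0", "4340.0", "10000.0"],
        "assay_type": ["B", "B", "B"],
    },
    index=[0, 2, 309],
)
demo_classes = ["active", "intermediate", "inactive"]  # one label per row, in row order

df3_demo = (
    df2_demo[["molecule_chembl_id", "canonical_smiles", "standard_value"]]
    .reset_index(drop=True)  # renumber rows from 0, as zip-based construction does
    .assign(bioactivity_class=demo_classes)
)
# Reorder the columns to match the layout used in this notebook.
df3_demo = df3_demo[
    ["molecule_chembl_id", "canonical_smiles", "bioactivity_class", "standard_value"]
]
print(df3_demo)
```

Resetting the index before assigning the labels keeps the result aligned by position, which matches how the list-based version pairs each label with its row.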


Save the dataframe to a CSV file.

In [76]:
df3.to_csv('bioactivity_hepatitis_preprocessed_data.csv', index=False)

In [77]:
! ls -l

total 236
-rw-r--r-- 1 root root   2593 Dec 14 14:14 bioactivity_data.csv
-rw-r--r-- 1 root root 179392 Dec 14 14:37 bioactivity_data_hepatitis0.csv
-rw-r--r-- 1 root root  45146 Dec 14 14:48 bioactivity_hepatitis_preprocessed_data.csv
drwx------ 5 root root   4096 Dec 14 14:39 gdrive
drwxr-xr-x 1 root root   4096 Dec  2 22:04 sample_data


Copying the file to Google Drive.

In [79]:
! cp bioactivity_hepatitis_preprocessed_data.csv "/content/gdrive/My Drive/Colab Notebooks/Data_hepatitis"

In [80]:
! ls "/content/gdrive/My Drive/Colab Notebooks/Data_hepatitis"

bioactivity_data_hepatitis0.csv  bioactivity_hepatitis_preprocessed_data.csv


---