<a href="https://colab.research.google.com/github/sarahrdk/EscapeEarth/blob/main/Interns/Sarah/How_to_access_our_shared_data_folder.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# __How to Mount Our Google Drive__
#### So we can open the data in our shared EscapeEarthData folder.

In [1]:
# This cell mounts your drive, but you must also follow these
# STEPS:
# 1) run this cell; a URL will display that you should click on
# 2) select your Google account if applicable
# 3) click 'Allow' when asked by 'Google Drive File Stream wants to access your Google Account'
# 4) click on the copy icon next to the code displayed
# 5) paste that code into the prompt below this cell, 'Enter your authorization code:'
# 6) hit the Enter key and wait
# 7) successful mounting is indicated by the statement 'Mounted at /content/gdrive'

from google.colab import drive

drive.mount('/content/gdrive')

Mounted at /content/gdrive


In [2]:
# this cell lists what's in your google drive 
# you should see the shared 'EscapeEarthData' folder
!ls /content/gdrive/My\ Drive

amnh  EscapeEarthData  Files  imovis  other


#### _IF_ you get the error __"ls: cannot access '/content/gdrive/My Drive': Transport endpoint is not connected"__, you need to remount your drive. Copy and paste the code below into a new cell to do so:



```
from google.colab import drive
drive.mount('/content/gdrive', force_remount=True)
```

#### _Otherwise_, you may proceed, but be aware that this issue can arise at any time.
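
Because the remount issue can show up at any time, one option is to check whether the Drive folder can be listed before reading data. The sketch below is only an illustration: `drive_is_accessible` and `ensure_drive` are hypothetical helper names (not part of `google.colab`), and the check assumes that an unreachable mount raises an `OSError` when it is listed.

```python
import os

def drive_is_accessible(folder='/content/gdrive/My Drive'):
    """Return True if `folder` can be listed, False otherwise."""
    try:
        os.listdir(folder)
        return True
    except OSError:
        # Covers a missing folder as well as
        # 'Transport endpoint is not connected'.
        return False

def ensure_drive(mount_point='/content/gdrive'):
    """Remount the drive only when its folder cannot be listed (Colab only)."""
    if not drive_is_accessible(os.path.join(mount_point, 'My Drive')):
        # google.colab can only be imported inside a Colab runtime
        from google.colab import drive
        drive.mount(mount_point, force_remount=True)
```

Calling `ensure_drive()` at the top of a cell that reads data would then remount only when needed; a remount may still ask you to authorize again.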

In [3]:
# we can also list the contents of our shared folder
# (once I add more data, this won't be a practical thing to do, though)
!ls /content/gdrive/My\ Drive/EscapeEarthData

1161345_lc.csv	2161623_lc.csv	bls_powers.npy	df8.csv
1573836_lc.csv	Activity-2	bls_rps.npy
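
Once the shared folder holds many files, listing everything gets unwieldy. A possible alternative, sketched here with the standard `glob` module, is to list only the files that match a pattern. `list_matching` is a hypothetical helper name, and the `*_lc.csv` pattern is simply taken from the filenames shown above.

```python
import glob
import os

def list_matching(folder, pattern='*_lc.csv'):
    """Return the sorted names of the files in `folder` that match `pattern`."""
    paths = glob.glob(os.path.join(folder, pattern))
    return sorted(os.path.basename(p) for p in paths)

# Example (inside Colab, after mounting):
# list_matching('/content/gdrive/My Drive/EscapeEarthData')
# list_matching('/content/gdrive/My Drive/EscapeEarthData', pattern='*.npy')
```

Note that the path passed to Python is a plain string, so the space in "My Drive" needs no backslash here.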


# __How to Open Data Files__ 
#### Before we can open these files, we need to import (or install and then import) the packages and their dependencies

In [4]:
# import dependencies
import numpy as np
import pandas as pd
# example of an installation then import (we'll need this later)
!pip install lightkurve
import lightkurve as lk

Collecting lightkurve
  Downloading https://files.pythonhosted.org/packages/6b/cb/a2917205537f6bad53b109365e09abe946afbf5d8a4e1f46c3f75abcb398/lightkurve-1.11.3-py3-none-any.whl (515kB)
Collecting astroquery>=0.3.9
  Downloading https://files.pythonhosted.org/packages/1b/f8/4690523783691ed816b3469c3ec611af3798594d37ade510dd918d59f57e/astroquery-0.4.1.tar.gz (6.5MB)
Collecting uncertainties
  Downloading https://files.pythonhosted.org/packages/b0/e0/fc200da8190729dcb685ae4877ed6936d31d64aeccb8cc355d9ec982681d/uncertainties-3.1.4-py2.py3-none-any.whl (246kB)
Collecting fbpca>=1.0
  Downloading https://files.pythonhosted.org/packages/a7/a5/2085d0645a4bb4f0b606251b0b7466c61326e4a471d445c1c3761a2d07bc/fbpca-1.0.tar.gz
Collecting scipy!=1.4.0,!=1.4.1,>=0.19.0
  Downloading https://files.pyt

In [5]:
# now we can open each data file; here are some examples:
## NOTICE: in a Python path string I do NOT use the "\" in "My\ Drive" like we did for the shell commands above

# for .csv files
data_1 = pd.read_csv('/content/gdrive/My Drive/EscapeEarthData/2161623_lc.csv') 
# for .npy files
data_2 = np.load('/content/gdrive/My Drive/EscapeEarthData/bls_powers.npy')
# let's see the data
print('Data-1 Example:',data_1)
print('Data-2 Example:',data_2)

Data-1 Example:      # bjd\tphase\traw_flux\traw_err\tcorr_flux\tcorr_err\tdtr_flux\tdtr_err
0     56107.16069599\t0.03416000\t0.593396\t0.000242...                     
1     56107.18113035\t0.04310882\t0.745326\t0.000245...                     
2     56107.20156461\t0.05205760\t0.871438\t0.000247...                     
3     56107.22199908\t0.06100646\t0.960180\t0.000249...                     
4     56107.24243344\t0.06995528\t0.994236\t0.000252...                     
...                                                 ...                     
8631  56304.05439858\t0.25982505\t1.050795\t0.000264...                     
8632  56304.07483160\t0.26877329\t1.050991\t0.000264...                     
8633  56304.09526452\t0.27772148\t1.051052\t0.000264...                     
8634  56304.11569744\t0.28666967\t1.050454\t0.000264...                     
8635  56304.13613036\t0.29561785\t1.050061\t0.000264...                     

[8636 rows x 1 columns]
Data-2 Example: [34615   294    12 

In [6]:
# Data-1 Example's format isn't a typical dataframe so let's try this
data_1

| | # bjd\tphase\traw_flux\traw_err\tcorr_flux\tcorr_err\tdtr_flux\tdtr_err |
|---|---|
| 0 | 56107.16069599\t0.03416000\t0.593396\t0.000242... |
| 1 | 56107.18113035\t0.04310882\t0.745326\t0.000245... |
| 2 | 56107.20156461\t0.05205760\t0.871438\t0.000247... |
| 3 | 56107.22199908\t0.06100646\t0.960180\t0.000249... |
| 4 | 56107.24243344\t0.06995528\t0.994236\t0.000252... |
| ... | ... |
| 8631 | 56304.05439858\t0.25982505\t1.050795\t0.000264... |
| 8632 | 56304.07483160\t0.26877329\t1.050991\t0.000264... |
| 8633 | 56304.09526452\t0.27772148\t1.051052\t0.000264... |
| 8634 | 56304.11569744\t0.28666967\t1.050454\t0.000264... |


In [7]:
# that looks better but we still only have one column 
# - let's use input arguments to fix the formatting

# notice I'm overwriting the variable too
# the file is tab-separated, so we pass the tab character '\t' as the delimiter
data_1 = pd.read_csv('/content/gdrive/My Drive/EscapeEarthData/2161623_lc.csv', header=0, delimiter='\t')
data_1

| | # bjd | phase | raw_flux | raw_err | corr_flux | corr_err | dtr_flux | dtr_err |
|---|---|---|---|---|---|---|---|---|
| 0 | 56107.160696 | 0.034160 | 0.593396 | 0.000242 | 0.741012 | 0.000343 | 0.581487 | 0.000237 |
| 1 | 56107.181130 | 0.043109 | 0.745326 | 0.000245 | 0.956920 | 0.000347 | 0.730368 | 0.000240 |
| 2 | 56107.201565 | 0.052058 | 0.871438 | 0.000247 | 1.135839 | 0.000350 | 0.853949 | 0.000242 |
| 3 | 56107.221999 | 0.061006 | 0.960180 | 0.000249 | 1.261791 | 0.000353 | 0.940910 | 0.000244 |
| 4 | 56107.242433 | 0.069955 | 0.994236 | 0.000252 | 1.309991 | 0.000357 | 0.974283 | 0.000247 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 8631 | 56304.054399 | 0.259825 | 1.050795 | 0.000264 | 1.196939 | 0.000332 | 1.033424 | 0.000259 |
| 8632 | 56304.074832 | 0.268773 | 1.050991 | 0.000264 | 1.196989 | 0.000332 | 1.033611 | 0.000259 |
| 8633 | 56304.095265 | 0.277721 | 1.051052 | 0.000264 | 1.197222 | 0.000333 | 1.033666 | 0.000259 |
| 8634 | 56304.115697 | 0.286670 | 1.050454 | 0.000264 | 1.196450 | 0.000333 | 1.033072 | 0.000259 |
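
When a file loads as a single column like the earlier attempt, it can help to check which separator the file uses before choosing the `delimiter` argument. The following is a minimal sketch, not part of pandas: `guess_delimiter` is a hypothetical helper that reads only the first line and picks whichever candidate character occurs most often in it.

```python
def guess_delimiter(path, candidates=('\t', ',', ';')):
    """Guess a text file's delimiter by counting candidates in its first line."""
    with open(path) as f:
        header = f.readline()
    # pick the candidate that appears most often in the header line
    return max(candidates, key=header.count)

# Example (inside Colab, after mounting):
# sep = guess_delimiter('/content/gdrive/My Drive/EscapeEarthData/2161623_lc.csv')
# data_1 = pd.read_csv('/content/gdrive/My Drive/EscapeEarthData/2161623_lc.csv', header=0, delimiter=sep)
```

This simple count can be fooled by unusual headers, so it is worth looking at the first line yourself if the result seems off.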


In [8]:
# Here's another trick:
# our path to the shared folder is long and we may not want to type it every time,
# so we can save the path and insert it as shown below

mypath = '/content/gdrive/My Drive/EscapeEarthData'

# let's open a data file using the saved path (the same file we loaded as data_2)
data_3 = np.load('{}/bls_powers.npy'.format(mypath))
print('Data-3 Example:',data_3)

Data-3 Example: [34615   294    12 ... 43420 32493  4189]
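
The same idea extends to files inside subdirectories. As a rough sketch using the standard `os.path.join`, the hypothetical helper `shared_path` below builds a full path from any number of pieces; treating `Activity-2` as a subfolder and `example.csv` as a file inside it are assumptions made only for illustration.

```python
import os

mypath = '/content/gdrive/My Drive/EscapeEarthData'

def shared_path(*parts, root=mypath):
    """Join `root` with subdirectory and file names into one path string."""
    return os.path.join(root, *parts)

print(shared_path('bls_powers.npy'))
# a file in a subdirectory (both names are hypothetical examples)
print(shared_path('Activity-2', 'example.csv'))
```

The result can be passed straight to `np.load` or `pd.read_csv` in place of the formatted string.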


# __How to Save NEW data to our shared folder__
#### _Important Note_: When saving new data, always give the file a clear name followed by '-YourName'. As when opening data, you MUST enter the full path (stored in the variable mypath in the cell directly above) and include any necessary subdirectories. I will then resave the new file within my own Google Drive, which has unlimited storage, whereas yours does not.
#### FOR EXAMPLE: Using data_3's filename, you would save it as "/content/gdrive/My Drive/EscapeEarthData/bls_powers-Danielle.npy".  
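
The naming rule can be applied automatically. This is a minimal sketch, assuming you want '-YourName' placed just before the file extension; `named_save_path` is a hypothetical helper name, not an existing function.

```python
import os

def named_save_path(original_path, your_name):
    """Insert '-YourName' between the file's base name and its extension."""
    base, ext = os.path.splitext(original_path)
    return '{}-{}{}'.format(base, your_name, ext)

new_path = named_save_path('/content/gdrive/My Drive/EscapeEarthData/bls_powers.npy', 'Danielle')
print(new_path)
# prints: /content/gdrive/My Drive/EscapeEarthData/bls_powers-Danielle.npy
```

In the notebook, `np.save(new_path, data_3)` would then write the array to that path.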