# Download Data Files for Use in the Notebooks
Required packages:
 - vos

## The following files are available through VOSpace to the public. 
### To access these files: 
 1. create a CADC account
 2. install vos from the command line with:
 ```
 pip install vos
 ```
 3. in your command line type: 
 ```
 getCert 
 ```
 and input your account information

Depending on how you intend to use these notebooks, not all of these files are necessary. Below is a description of each file and the notebooks that need it, followed by the code to download the file.

In [1]:
import vos
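
As a rough aid for deciding what to download, the file descriptions in this notebook can be collected into a lookup table. The sketch below is only a summary of those descriptions: the names `FILES_BY_NOTEBOOK` and `files_needed` are hypothetical, file names follow the download cells, and the pretrained models are left out because they are optional alternatives to training your own model.

```python
# Files each notebook needs, summarized from the descriptions in this notebook.
# Pretrained models (Model_*.h5) are optional and therefore not listed.
FILES_BY_NOTEBOOK = {
    '2_Preprocessing_of_Training_Data.ipynb': ['apStar_visits_main.h5'],
    '3_Preprocessing_of_Test_Data.ipynb': ['apStar_combined_main.h5',
                                           'high_snr_test_apids.npy'],
    '4_Train_Model_Keras_1.ipynb': ['training_data.h5', 'mean_and_std.npy'],
    '4_Train_Model_Keras_2.ipynb': ['training_data.h5', 'mean_and_std.npy'],
    '5_Test_Model.ipynb': ['mean_and_std.npy', 'high_snr_test_data.h5',
                           'low_snr_test_data.h5'],
    '6_Error_Propagation.ipynb': ['apStar_combined_main.h5'],
}


def files_needed(notebooks):
    """Return the files required by the given notebooks, without duplicates."""
    needed = []
    for notebook in notebooks:
        for filename in FILES_BY_NOTEBOOK[notebook]:
            if filename not in needed:
                needed.append(filename)
    return needed


# Example: training with Keras 1 and then testing the model.
print(files_needed(['4_Train_Model_Keras_1.ipynb', '5_Test_Model.ipynb']))
```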


## 1. apStar_visits_main.h5
### description: 
This file contains apStar individual visit spectra along with APOGEE data associated with each star
### size: 
38.5GB
### used in: 
This file is necessary for $2\_Preprocessing\_of\_Training\_Data.ipynb$ where it is used to create a training set
### Notes:
This file was created by pulling apStar visits from the APOGEE v603.fits file. Instructions on dealing with APOGEE data can be found here: https://github.com/jobovy/apogee but this is not necessary for the use of these notebooks, as the files have already been created.

In [None]:
filename='apStar_visits_main.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')
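
Because this file is 38.5GB, it may be worth checking the free disk space before running the cell above. The following is a minimal sketch using only the standard library; the helper name `has_room_for` is hypothetical, and treating 1 GB as 10^9 bytes is an assumption.

```python
import shutil


def has_room_for(size_gb, path='.'):
    """Return True if the filesystem holding `path` has at least size_gb free."""
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= size_gb * 1e9  # assumes 1 GB = 10**9 bytes


if not has_room_for(38.5):
    print('Less than 38.5 GB free in this directory; '
          'consider downloading somewhere else')
```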

## 2. apStar_combined_main.h5
### description: 
This file contains apStar combined spectra along with APOGEE data associated with each star
### size: 
10.3GB
### used in: 
This file is necessary for $3\_Preprocessing\_of\_Test\_Data.ipynb$ where it is used to create two test sets. It is also used in $6\_Error\_Propagation.ipynb$ to calculate the scatter between the model predictions and ASPCAP labels in different regions of the label-space.
### Notes:
This file was created by pulling apStar combined spectra from the APOGEE v603.fits file. Instructions on dealing with APOGEE data can be found here: https://github.com/jobovy/apogee but this is not necessary for the use of these notebooks, as the files have already been created.

In [None]:
filename='apStar_combined_main.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')
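
Re-running a download cell fetches the file again, which is costly for the larger files. One possible way to avoid that is a small wrapper that skips files already on disk and reuses a single client. This is a sketch, not part of the original notebooks: `download_if_missing` is a hypothetical helper, and it relies only on the `vos.Client().copy(source, destination)` call used in the cells of this notebook.

```python
import os


def download_if_missing(filename, client=None, remote_dir='vos:starnet/public/'):
    """Copy `filename` from VOSpace unless a local copy already exists.

    Returns True if a download was performed and False if it was skipped.
    """
    if os.path.exists(filename):
        print(filename + ' already present, skipping')
        return False
    if client is None:
        import vos  # imported lazily so the helper can be defined without vos
        client = vos.Client()
    client.copy(remote_dir + filename, filename)
    print(filename + ' downloaded')
    return True
```

Note that the check only looks at whether the file exists, so an interrupted download would be treated as complete; delete any partial file before calling the helper again.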

## 3. high_snr_test_apids.npy
### description: 
This file contains APOGEE IDs for a High S/N test set.
### size: 
89 kB
### used in: 
This file is necessary for $3\_Preprocessing\_of\_Test\_Data.ipynb$ where it is used to separate a test set. 
### Notes: 
This file can also be created in $2\_Preprocessing\_of\_Training\_Data.ipynb$ but is made available so that users do not have to download apStar_visits_main.h5 (a much larger file) if they are interested in skipping $2\_Preprocessing\_of\_Training\_Data.ipynb$.

In [2]:
filename='high_snr_test_apids.npy'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

high_snr_test_apids.npy downloaded


## 4. training_data.h5
### description: 
This file contains apStar individual visit spectra along with APOGEE data associated with each star that is used for the training of your model
### size: 
1.4GB
### used in: 
This file is necessary for $4\_Train\_Model\_Keras\_1.ipynb$ and/or $4\_Train\_Model\_Keras\_2.ipynb$ where it is used to train the model. 
### Notes: 
This file can also be created in $2\_Preprocessing\_of\_Training\_Data.ipynb$ but is made available so that users do not have to download apStar_visits_main.h5 (a much larger file) if they are interested in skipping $2\_Preprocessing\_of\_Training\_Data.ipynb$. 
### IMPORTANT: 
If you choose to skip $2\_Preprocessing\_of\_Training\_Data.ipynb$, I advise at least reading through it in order to understand the steps taken to create the training set. Mining the data for the training set is perhaps the most crucial part of creating a quality neural network model.

In [3]:
filename='training_data.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

training_data.h5 downloaded


## 5. mean_and_std.npy
### description: 
This file contains the means and standard deviations of the stellar labels, used to normalize the labels before training and denormalize the labels after testing.
### size: 
104 bytes
### used in: 
This file is necessary for $4\_Train\_Model\_Keras\_1.ipynb$, $4\_Train\_Model\_Keras\_2.ipynb$, and/or $5\_Test\_Model.ipynb$, where it is used to normalize the labels before training and to denormalize the labels after testing.
### Notes: 
This file can also be created in $3\_Preprocessing\_of\_Test\_Data.ipynb$ but is made available so that users do not have to download apStar_combined_main.h5 (a much larger file) if they are interested in skipping $3\_Preprocessing\_of\_Test\_Data.ipynb$.

In [4]:
filename='mean_and_std.npy'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

mean_and_std.npy downloaded
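
To illustrate what normalizing and denormalizing the labels means, here is a minimal sketch with NumPy. The array values, the three-label setup, and the `(2, n_labels)` layout of `[means, standard deviations]` are all assumptions made for illustration; the actual layout of mean_and_std.npy is defined in the preprocessing notebooks and may differ.

```python
import numpy as np

# Hypothetical stand-in for the contents of mean_and_std.npy.
# Assumed layout: row 0 holds the means, row 1 the standard deviations.
mean_and_std = np.array([[4800.0, 2.5, -0.2],
                         [600.0, 1.0, 0.3]])
mean_labels, std_labels = mean_and_std[0], mean_and_std[1]


def normalize(labels):
    """Scale labels to zero mean and unit variance before training."""
    return (labels - mean_labels) / std_labels


def denormalize(normalized):
    """Map model outputs back to the original label units."""
    return normalized * std_labels + mean_labels


labels = np.array([[5400.0, 3.5, 0.1]])  # one made-up star, three labels
round_trip = denormalize(normalize(labels))
print(np.allclose(round_trip, labels))
```

The same means and standard deviations have to be used in both directions, which is why the file is needed by the training and the testing notebooks alike.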


## 6. high_snr_test_data.h5
### description: 
This file contains High S/N apStar combined spectra along with APOGEE data associated with each star that is used for the testing of your model
### size: 
0.2GB
### used in: 
This file is necessary for $5\_Test\_Model.ipynb$ where it is used to test the model. 
### Notes: 
This file can also be created in $3\_Preprocessing\_of\_Test\_Data.ipynb$ but is made available so that users do not have to download apStar_combined_main.h5 (a much larger file) if they are interested in skipping $3\_Preprocessing\_of\_Test\_Data.ipynb$.

In [5]:
filename='high_snr_test_data.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

high_snr_test_data.h5 downloaded


## 7. low_snr_test_data.h5
### description: 
This file contains Low S/N apStar combined spectra along with APOGEE data associated with each star that is used for the testing of your model
### size: 
1.0GB
### used in: 
This file is necessary for $5\_Test\_Model.ipynb$ where it is used to test the model. 
### Notes: 
This file can also be created in $3\_Preprocessing\_of\_Test\_Data.ipynb$ but is made available so that users do not have to download apStar_combined_main.h5 (a much larger file) if they are interested in skipping $3\_Preprocessing\_of\_Test\_Data.ipynb$.

In [6]:
filename='low_snr_test_data.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

low_snr_test_data.h5 downloaded


## 8. Model_Tensorflow_Keras_1.h5
### description: 
This file is a pretrained model. The model was trained with Keras 1, using Tensorflow as the backend
### size: 
90MB
### used in: 
This file can be used for $5\_Test\_Model.ipynb$ where it is used to predict stellar labels for the test sets. 
### Notes: 
A similar file can also be created in $4\_Train\_Model\_Keras\_1.ipynb$.

In [7]:
filename='Model_Tensorflow_Keras_1.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

Model_Tensorflow_Keras_1.h5 downloaded


## 9. Model_Theano_Keras_1.h5
### description: 
This file is a pretrained model. The model was trained with Keras 1, using Theano as the backend
### size: 
90MB
### used in: 
This file can be used for $5\_Test\_Model.ipynb$ where it is used to predict stellar labels for the test sets and in $6\_Error\_Propagation.ipynb$ to produce statistical errors for the model predictions. 
### Notes: 
A similar file can also be created in $4\_Train\_Model\_Keras\_1.ipynb$.

In [8]:
filename='Model_Theano_Keras_1.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

Model_Theano_Keras_1.h5 downloaded


## 10. Model_Tensorflow_Keras_2.h5
### description: 
This file is a pretrained model. The model was trained with Keras 2, using Tensorflow as the backend
### size: 
90MB
### used in: 
This file can be used for $5\_Test\_Model.ipynb$ where it is used to predict stellar labels for the test sets. 
### Notes: 
A similar file can also be created in $4\_Train\_Model\_Keras\_2.ipynb$.

In [9]:
filename='Model_Tensorflow_Keras_2.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

Model_Tensorflow_Keras_2.h5 downloaded


## 11. Model_Theano_Keras_2.h5
### description: 
This file is a pretrained model. The model was trained with Keras 2, using Theano as the backend
### size: 
90MB
### used in: 
This file can be used for $5\_Test\_Model.ipynb$ where it is used to predict stellar labels for the test sets and in $6\_Error\_Propagation.ipynb$ to produce statistical errors for the model predictions. 
### Notes: 
A similar file can also be created in $4\_Train\_Model\_Keras\_2.ipynb$.

In [10]:
filename='Model_Theano_Keras_2.h5'
vclient = vos.Client()
vclient.copy('vos:starnet/public/'+filename, filename)
print(filename+' downloaded')

Model_Theano_Keras_2.h5 downloaded
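
After running the cells you need, a quick check of what is actually on disk can help catch a skipped or failed download. The sketch below uses only the standard library; `ALL_FILES` and `report_local_files` are hypothetical names, and the file names are taken from the download cells in this notebook.

```python
import os

# File names as used in the download cells of this notebook.
ALL_FILES = [
    'apStar_visits_main.h5', 'apStar_combined_main.h5',
    'high_snr_test_apids.npy', 'training_data.h5', 'mean_and_std.npy',
    'high_snr_test_data.h5', 'low_snr_test_data.h5',
    'Model_Tensorflow_Keras_1.h5', 'Model_Theano_Keras_1.h5',
    'Model_Tensorflow_Keras_2.h5', 'Model_Theano_Keras_2.h5',
]


def report_local_files(filenames, directory='.'):
    """Map each file name to its size in bytes, or None if it is absent."""
    report = {}
    for name in filenames:
        path = os.path.join(directory, name)
        report[name] = os.path.getsize(path) if os.path.isfile(path) else None
    return report


for name, size in report_local_files(ALL_FILES).items():
    status = 'missing' if size is None else '{:.1f} MB'.format(size / 1e6)
    print('{:32s} {}'.format(name, status))
```

Comparing the reported sizes with the sizes listed in each section gives a rough indication of whether a download completed.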
