# Setup and Installation

The purpose of this tutorial is to demonstrate the proper setup of your Python environment so that `hs_process` runs properly. In this tutorial, we will:

1. [Install Python](installation.html#Install-Python)
2. [Create a new environment and install the required dependencies](installation.html#Python-environment-setup-and-dependencies) (several options for doing so).
3. [Confirm you are using the correct Python installation](installation.html#Confirming-your-environment) (i.e., the one in your new environment).
4. Verify that the installation was successful by [executing a test script](installation.html#Testing-hs_process-installation).
5. Use [Anaconda Navigator](https://docs.anaconda.com/anaconda/navigator/) to [install Spyder and/or Jupyter Notebook](installation.html#Install-Spyder-and/or-Jupyter-Notebook) in the new environment.

***

## Install Python
There are many ways to download and install Python. The [Anaconda distribution](https://www.anaconda.com/distribution/#download-section) is recommended because it simplifies the installation of package dependencies and streamlines the process of setting up the virtual environment. Anaconda also comes with [Spyder](https://www.spyder-ide.org/) and [Jupyter notebooks](https://jupyter.org/), each of which makes working with Python quite easy.

Download and install **Python 3.7** via [Anaconda](https://www.anaconda.com/distribution/#download-section). When installing, choose _**install for only me**_ instead of *install for all users*, as this simplifies installation of package dependencies.

***

## Python environment setup and dependencies
A *Python environment* refers to the ecosystem consisting of a particular installed version of [Python](https://www.python.org/), as well as any third-party packages it has access to. In Python, every time a package is installed (e.g., `pip install some_third_party_package`, `conda install -c conda-forge some_third_party_package`, etc.), your Python environment is gaining functionality that builds upon the [Python Standard Library](https://docs.python.org/3/library/).

This is great because the possibilities of what can be accomplished using Python are virtually limitless. Ideally, the list of packages in a given environment should include only those required for the project at hand, not every package that has ever been installed for any project you've worked on (this can be dozens or even hundreds of packages). A downside of having too many packages is that it is inefficient to share your environment with someone else who would like to use the program. As time goes on, there may also be compatibility issues among package dependencies and your code, which can be avoided by creating a tidy little Python environment for your project.
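To make the idea of an environment concrete, the following minimal sketch lists the packages that the currently running Python can see. It assumes `importlib.metadata` is available (standard library from Python 3.8 onward); on Python 3.7 it falls back to the `importlib_metadata` backport, which is an assumption here and may need to be installed separately. The helper name `installed_packages` is purely illustrative.

```python
try:
    from importlib import metadata  # standard library on Python 3.8+
except ImportError:
    # Assumption: on Python 3.7 the importlib_metadata backport is installed
    import importlib_metadata as metadata


def installed_packages():
    """Return a sorted list of (name, version) for every package in this environment."""
    found = set()
    for dist in metadata.distributions():
        name = dist.metadata["Name"]
        if name:  # skip broken installs that lack a name
            found.add((name.lower(), dist.version or "unknown"))
    return sorted(found)


if __name__ == "__main__":
    packages = installed_packages()
    print(f"{len(packages)} packages are installed in this environment")
    for name, version in packages[:10]:  # show only the first few
        print(f"  {name}=={version}")
```

Running this in a freshly created environment and again in a long-lived one gives a quick sense of how much a tidy project environment differs from an accumulated one.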

There are a few ways to go about setting up a new environment and getting the dependencies installed. This tutorial provides instructions for three installation options. Please choose one of them.

***

### [Option 1] Installing dependencies manually
The `hs_process` dependencies can be installed manually on your own. This is perhaps the most straightforward option because it does not require you to download either the *environment.yml* [Option 2](installation.html#[Option-2]-Setting-up-the-Python-environment-using-the-cloned-.yml-file) or *spec-file.txt* [Option 3](installation.html#[Option-3]-Setting-up-the-Python-environment-using-spec-file) from Github. It does, however, require more steps/commands.

**A) Create the environment (optional)**

With [Anaconda](https://www.anaconda.com/distribution/#download-section) installed, it is first recommended (but not required) to create a new **Python 3.7** environment before installing package dependencies. Open an Anaconda console from the start menu as an administrator by clicking `Start->Anaconda`, right-click `Anaconda Console`, and choose to `Run as Administrator`. In the `Anaconda Console`, execute the following commands:

1. `conda create -n spec python=3.7` to create an Anaconda **Python 3.7** environment named **spec** (you can name this whatever you'd like).
2. `conda activate spec` to activate the newly configured environment. **IMPORTANT**: Anytime you want to use `hs_process`, be sure to run `conda activate spec` prior to opening your Python IDE via the command line (e.g. *Spyder*, *Jupyter Notebook*, etc.). Otherwise, open your IDE via Anaconda Navigator, being sure the correct environment is active.

**B) Install package dependencies**

Although `pip install hs_process` will try to install package dependencies available on [PyPI](https://pypi.org/), it is not generally recommended to install a package via `pip` if that package exists in the [Anaconda Package Repository](https://anaconda.org/anaconda/repo). Therefore, all the required dependencies available in the [Anaconda Package Repository](https://anaconda.org/anaconda/repo) should be installed via the `conda install` command from the *conda-forge* channel. In the `Anaconda Console`, execute each of the following commands:

3. `conda install -c conda-forge geopandas -y` [(*link*)](https://anaconda.org/conda-forge/geopandas)
4. `conda install -c conda-forge seaborn -y` [(*link*)](https://anaconda.org/conda-forge/seaborn)
5. `conda install -c conda-forge spectral -y` [(*link*)](https://anaconda.org/conda-forge/spectral)

The following packages are also dependencies of `hs_process`, but because they are dependencies of `GeoPandas`, `Seaborn`, and `Spectral` as well, they should already have been installed via the above `conda install` commands. If you'd like to check, you can try to install them manually anyway (you should get a message saying *# All requested packages already installed.*).

- `conda install -c conda-forge gdal -y` [(*link*)](https://anaconda.org/conda-forge/gdal) *`GeoPandas` has a `GDAL` dependency
- `conda install -c conda-forge numpy -y` [(*link*)](https://anaconda.org/conda-forge/numpy) *`GeoPandas` has a `numpy` dependency
- `conda install -c conda-forge pandas -y` [(*link*)](https://anaconda.org/conda-forge/pandas) *`GeoPandas` has a `Pandas` dependency
- `conda install -c conda-forge matplotlib -y` [(*link*)](https://anaconda.org/conda-forge/matplotlib) *`Seaborn` has a `Matplotlib` dependency
- `conda install -c conda-forge shapely -y` [(*link*)](https://anaconda.org/conda-forge/shapely) *`GeoPandas` has a `Shapely` dependency
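As an alternative to re-running the `conda install` commands, a short Python check can report whether each dependency can be located in the active environment. This is only a sketch: the helper name `find_missing` is illustrative, and the import names are the usual ones for these packages (note that the GDAL Python bindings are imported as `osgeo`).

```python
import importlib.util

# Import names for the hs_process dependencies listed above.
# The GDAL Python bindings are imported under the name "osgeo".
DEPENDENCIES = [
    "geopandas", "seaborn", "spectral",
    "osgeo", "numpy", "pandas", "matplotlib", "shapely",
]


def find_missing(module_names):
    """Return the module names that cannot be located in the current environment."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(DEPENDENCIES)
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All dependencies were found.")
```

The check only locates each module without importing it, so it is fast, but it does not prove that a package loads without errors.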

**C) Install** `hs_process` **via** `pip`

Now that all of the dependencies are installed via `conda`, `hs_process` can finally be installed via `pip`:

6. `pip install hs_process` [(*link*)](https://pypi.org/project/hs_process/) will install the remaining dependencies via `pip`. Note that `pip` should only be used *after* as many dependencies as possible are installed via `conda install` - [see here for more information](https://docs.conda.io/projects/conda/en/latest/user-guide/tasks/manage-environments.html#using-pip-in-an-environment).
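To see which tool actually installed a given package, the sketch below reads the `INSTALLER` record from the package metadata. This file typically contains `pip` or `conda`, but it can be absent for older packaging formats, so treat the result as a hint rather than a guarantee. The helper name `installer_of` is illustrative, and the Python 3.7 fallback to the `importlib_metadata` backport is an assumption.

```python
try:
    from importlib import metadata  # standard library on Python 3.8+
except ImportError:
    # Assumption: on Python 3.7 the importlib_metadata backport is installed
    import importlib_metadata as metadata


def installer_of(dist_name):
    """Return the recorded installer of a package, or None if it is not installed."""
    try:
        dist = metadata.distribution(dist_name)
    except metadata.PackageNotFoundError:
        return None
    recorded = dist.read_text("INSTALLER")  # may be missing for some packages
    return recorded.strip() if recorded else "unknown"


if __name__ == "__main__":
    for name in ["geopandas", "seaborn", "spectral", "hs_process"]:
        print(f"{name}: {installer_of(name) or 'not installed'}")
```

If the steps above were followed in order, the three `conda` packages would be expected to report `conda` and `hs_process` to report `pip`.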


* [Confirming your environment](installation.html#Confirming-your-environment)
* [Testing your installation](installation.html#Testing-hs_process-installation)
* [Install Spyder and/or Jupyter Notebook](installation.html#Install-Spyder-and/or-Jupyter-Notebook)

***

### [Option 2] Installing via the cloned *.yml* file
With [Anaconda](https://www.anaconda.com/distribution/#download-section) installed, the most streamlined approach is to create a new virtual environment via a clone of the complete working environment to run `hs_process`. This requires that you download the .yml file from the `hs_process` [Github repository](https://github.com/tnigon/hs_process/blob/master/env/environment.yml).

**Download** *environment.yml* **from Github** *(choose one of a few options)*

1. You can simply copy the contents of "environment.yml" from the `hs_process` [Github repository](https://github.com/tnigon/hs_process/blob/master/env/environment.yml), paste them to a new file, and save this file in your local directory (be sure to name the file "environment.yml"). 

2. If [Git](https://git-scm.com/downloads) is already installed on your system, you can clone the `hs_process` repository into your local directory using the command line:
`git clone https://github.com/tnigon/hs_process.git`

3. Otherwise, you can always download the `hs_process` repository as a .zip file and unpack it into your local directory.

**Create the environment *and* install dependencies simultaneously via the** *environment.yml* **file**

Open an Anaconda console from the start menu as an administrator by clicking `Start->Anaconda`, right-click `Anaconda Console`, and choose to `Run as Administrator`. In the `Anaconda Console`, execute the following commands:

1. `cd` to the local directory that contains the *environment.yml* file (e.g., the location you saved the "environment.yml" file, or the location of your `hs_process` Github clone - `Github\hs_process\env`).
2. `conda env create --name spec --file environment.yml`. This will configure an Anaconda environment named _**spec**_ with all of the required dependencies. This is the recommended approach because it installs as many dependencies as possible via `conda` before resorting to `pip` for the remaining packages. Note that this command also installs `hs_process`. This process will take some time, depending on your computer and internet speeds (1-5 minutes is common).
3. `conda activate spec` to activate the newly configured environment. **Important**: Anytime you want to use `hs_process`, be sure to run `conda activate spec` prior to opening your Python IDE via the command line (e.g. *Spyder*, *Jupyter Notebook*, etc.). Otherwise, open your IDE via Anaconda Navigator, being sure the correct environment is active.
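Once the environment has been created, one way to confirm that it exists is to ask `conda` for its list of environments in JSON form. The sketch below wraps `conda env list --json` with the standard library; the helper name `env_names` is illustrative, and the script assumes it is run somewhere `conda` is on the `PATH` (e.g., the Anaconda console).

```python
import json
import re
import shutil
import subprocess


def env_names(conda_json):
    """Extract environment folder names from the output of `conda env list --json`."""
    paths = json.loads(conda_json).get("envs", [])
    # Split on either slash type so Windows and POSIX paths are both handled
    return [re.split(r"[\\/]", path.rstrip("\\/"))[-1] for path in paths]


if __name__ == "__main__":
    conda = shutil.which("conda")
    if conda is None:
        print("conda was not found on PATH; try running this from an Anaconda console.")
    else:
        result = subprocess.run(
            [conda, "env", "list", "--json"], capture_output=True, text=True
        )
        if result.returncode == 0:
            print("'spec' environment exists:", "spec" in env_names(result.stdout))
        else:
            print("conda reported an error:", result.stderr.strip())
```

If you chose a different environment name in step 2, substitute it for `spec` in the final check.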


* [Confirming your environment](installation.html#Confirming-your-environment)
* [Testing your installation](installation.html#Testing-hs_process-installation)
* [Install Spyder and/or Jupyter Notebook](installation.html#Install-Spyder-and/or-Jupyter-Notebook)

***

### [Option 3] Installing via the *spec-file*
The required packages can also be installed via the [*spec-file.txt*](https://github.com/tnigon/hs_process/blob/master/env/spec-file.txt) file. Similar to the *.yml approach* ([Option 2](installation.html#[Option-2]-Setting-up-the-Python-environment-using-the-cloned-.yml-file) above), this approach requires that you download the [*spec-file.txt*](https://github.com/tnigon/hs_process/blob/master/env/spec-file.txt) file from the `hs_process` [Github repository](https://github.com/tnigon/hs_process).

**Download** *spec-file.txt* **from Github** *(choose one of a few options)*

1. You can simply copy the contents of "spec-file.txt" from the `hs_process` [Github repository](https://github.com/tnigon/hs_process/blob/master/env/spec-file.txt), paste them to a new file, and save this file in your local directory (be sure to name the file "spec-file.txt"). 

2. If [Git](https://git-scm.com/downloads) is already installed on your system, you can clone the `hs_process` repository into your local directory using the command line:
`git clone https://github.com/tnigon/hs_process.git`

3. Otherwise, you can always download the `hs_process` repository as a .zip file and unpack it into your local directory.

**Create the environment while installing all dependencies**

With [Anaconda](https://www.anaconda.com/distribution/#download-section) installed, a new environment will be created based on the contents of *spec-file.txt*. Open an Anaconda console from the start menu as an administrator by clicking `Start->Anaconda`, right-click `Anaconda Console`, and choose to `Run as Administrator`. In the `Anaconda Console`, execute the following commands:

1. `cd` to the local directory that contains the *spec-file.txt* file (e.g., the location you saved the "spec-file.txt" file, or the location of your `hs_process` Github clone - `Github\hs_process\env`).
2. `conda create --name spec --file spec-file.txt` to create an Anaconda **Python 3.7** environment named _**spec**_ with the dependencies listed in *spec-file.txt*. Unlike Option 2, this command does not install `hs_process` itself, which is why it is installed via `pip` in step 4. This process will take some time, depending on your computer and internet speeds (1-5 minutes is common).
3. `conda activate spec` to activate the newly configured environment. **Important**: Anytime you want to use `hs_process`, be sure to run `conda activate spec` prior to opening your Python IDE via the command line (e.g. *Spyder*, *Jupyter Notebook*, etc.). Otherwise, open your IDE via Anaconda Navigator, being sure the correct environment is active.
4. `pip install hs_process` [(*link*)](https://pypi.org/project/hs_process/) will install the remaining dependencies via `pip`. Note that `pip` should only be used *after* as many dependencies as possible are installed via `conda install` - [see here for more information](https://docs.conda.io/projects/conda/en/latest/user-guide/tasks/manage-environments.html#using-pip-in-an-environment).
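Before creating the environment, it can be useful to glance at what the spec file asks for. The sketch below summarizes a conda spec file by counting its non-comment entries and noting whether it carries the `@EXPLICIT` marker that conda uses for explicit spec files. The exact contents of the repository's *spec-file.txt* are not shown in this tutorial, so the helper `summarize_spec` is an illustration under that assumption rather than a description of the actual file.

```python
from pathlib import Path


def summarize_spec(text):
    """Return (is_explicit, entries) for the text of a conda spec file."""
    lines = [line.strip() for line in text.splitlines()]
    is_explicit = "@EXPLICIT" in lines
    # Entries are all lines that are not blank, not comments, and not the marker
    entries = [
        line for line in lines
        if line and not line.startswith("#") and line != "@EXPLICIT"
    ]
    return is_explicit, entries


if __name__ == "__main__":
    spec_path = Path("spec-file.txt")  # assumes the current directory holds the file
    if spec_path.exists():
        explicit, entries = summarize_spec(spec_path.read_text())
        print(f"explicit spec file: {explicit}; {len(entries)} package entries")
    else:
        print("spec-file.txt was not found in the current directory.")
```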


* [Confirming your environment](installation.html#Confirming-your-environment)
* [Testing your installation](installation.html#Testing-hs_process-installation)
* [Install Spyder and/or Jupyter Notebook](installation.html#Install-Spyder-and/or-Jupyter-Notebook)

***

## Confirming your environment
It can be easy to accidentally open a Python IDE (e.g., [Spyder](https://www.spyder-ide.org/) or [Jupyter Notebook](https://jupyter.org/)) for a Python environment different from the one where `hs_process` and its dependencies were installed. The following Python command can be used to confirm you are using the Python executable in your new environment:

```python
import sys

sys.executable
```

```
'C:\\Users\\nigo0024\\AppData\\Local\\Continuum\\anaconda3\\envs\\spec\\python.exe'
```
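The same check can be made a little more explicit. The sketch below compares the folder name of the running interpreter's prefix against the expected environment name and also prints `CONDA_DEFAULT_ENV`, a variable that `conda activate` sets (it may be unset when an IDE is launched some other way). The helper name `prefix_matches` is illustrative, and `spec` is simply the environment name used throughout this tutorial.

```python
import os
import sys
from pathlib import Path


def prefix_matches(prefix, env_name):
    """Return True if an environment prefix path ends with the given environment name."""
    return Path(prefix).name.lower() == env_name.lower()


if __name__ == "__main__":
    print("Python executable:", sys.executable)
    print("Environment prefix:", sys.prefix)
    # CONDA_DEFAULT_ENV is set by `conda activate`; it is absent outside conda
    print("Active conda environment:", os.environ.get("CONDA_DEFAULT_ENV", "(not set)"))
    print("Running inside 'spec':", prefix_matches(sys.prefix, "spec"))
```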

## Testing `hs_process` installation
To verify that `hs_process` and its dependencies are properly installed, run the `test.py` script. First, get local access to `test.py`:

1. Fork, clone, or download the [hs_process repository](https://github.com/tnigon/hs_process).

2. Download [sample files](https://drive.google.com/drive/folders/1KpOBB4-qghedVFd8ukQngXNwUit8PFy_?usp=sharing) and place into the newly forked/cloned/downloaded `hs_process\test\testdata` directory (do not change the file names).

__*Note*__: There is a hyperspectral datacube that is ~750 MB, and this is why this data is not placed in the `testdata` folder by default (Github doesn't allow large files). The small files should be included from Github, but the *.spec* and *.bip* files must be copied over. After copying the sample files to your `testdata` directory, run `test.py` from your command prompt:

3. `python <path to hs_process\test\test.py>` to run through some basic tests to be sure all dependencies are installed properly.
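Because the tests depend on the sample files being present, a quick look at the `testdata` folder before running `test.py` can save some confusion. The following sketch counts the *.bip* and *.spec* files found there. The helper name `find_sample_files` and the relative path in the example are hypothetical, so adjust the path to wherever the repository was placed.

```python
from pathlib import Path


def find_sample_files(testdata_dir, extensions=(".bip", ".spec")):
    """Map each extension to the sorted file names found directly in testdata_dir."""
    folder = Path(testdata_dir)
    return {ext: sorted(p.name for p in folder.glob(f"*{ext}")) for ext in extensions}


if __name__ == "__main__":
    # Hypothetical location; adjust to wherever the repository was cloned
    testdata = Path("hs_process") / "test" / "testdata"
    for ext, names in find_sample_files(testdata).items():
        status = f"{len(names)} file(s)" if names else "none found"
        print(f"{ext}: {status}")
```

Header files such as *.bip.hdr* are deliberately not counted under *.bip*, since the pattern only matches names that end with the extension.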

***

```python
# Optionally, a script can be run from Jupyter with the %run magic
%run F:\nigo0024\Documents\GitHub\hs_process\test\test.py
```

```
test_bip_extensions (test_hsio.Test_hsio_get_fname_hdr) ... ok
test_spec_extensions (test_hsio.Test_hsio_get_fname_hdr) ... ok
test_failure (test_hsio.Test_hsio_read_cube) ... expected failure
test_names (test_hsio.Test_hsio_read_cube) ... ok
test_readability (test_hsio.Test_hsio_read_cube) ... ok
test_split (test_hsio.Test_hsio_read_cube) ... skipped 'demonstrating skipping'
test_names (test_hsio.Test_hsio_read_spec) ... ok
test_readability (test_hsio.Test_hsio_read_spec) ... ok
test_byteorder (test_hsio.Test_hsio_set_io_defaults) ... ok
test_dtype (test_hsio.Test_hsio_set_io_defaults) ... ok
test_ext (test_hsio.Test_hsio_set_io_defaults) ... ok
test_force (test_hsio.Test_hsio_set_io_defaults) ... ok
test_instance_independence (test_hsio.Test_hsio_set_io_defaults) ... ok
test_interleave (test_hsio.Test_hsio_set_io_defaults) ... ok
test_return_values (test_hsio.Test_hsio_set_io_defaults) ... ok
test_write_cube (test_hsio.Test_hsio_write_cube) ... ok
test_write_spec (test_hsio.Test_hsio

C:\Users\nigo0024\AppData\Local\Temp\tmpl3m_8qps\Wells_rep2_20180628_16h56m_pika_gige_7.bip.hdr
Saving C:\Users\nigo0024\AppData\Local\Temp\tmpl3m_8qps\Wells_rep2_20180628_16h56m_pika_gige_7.bip

ok
test_write_tif_multi (test_hsio.Test_hsio_write_tif) ... ok
test_write_tif_single (test_hsio.Test_hsio_write_tif) ...

Saving C:\Users\nigo0024\AppData\Local\Temp\tmpvpeb_d9y\Wells_rep2_20180628_16h56m_pika_gige_7-mean.spec
Either `projection_out` is `None` or `geotransform_out` is `None` (or both are). Retrieving projection and geotransform information by loading `hsio.fname_in` via GDAL. Be sure this is appropriate for the data you are trying to write.

Either `projection_out` is `None` or `geotransform_out` is `None` (or both are). Retrieving projection and geotransform information by loading `hsio.fname_in` via GDAL. Be sure this is appropriate for the data you are trying to write.

ok

----------------------------------------------------------------------
Ran 19 tests in 1.248s

OK (skipped=1, expected failures=1)
```


## Install Spyder and/or Jupyter Notebook

Open *Anaconda Navigator* from the start menu by clicking `Start -> Anaconda -> Anaconda Navigator`. If [Spyder](https://www.spyder-ide.org/) or [Jupyter Notebook](https://jupyter.org/) is not installed in your new "spec" environment (if you created a new environment), select the "Install" button, then "Launch" to open the application and run Python in your selected environment.

***

## Other information

### System Requirements

`hs_process` was developed using Python 3 (on Windows 10), so it is recommended to install and use Python 3. This tutorial has only been tested on Windows. It will likely work on other major platforms (e.g., Linux, macOS), but you will have to get the required dependencies installed and working on your own.

***

### Software and Libraries

The following software and libraries are required to leverage the full API of `hs_process`:

**Software**

* [Python 3](https://www.python.org/downloads/)

**Libraries**

1. [GeoPandas](https://anaconda.org/conda-forge/geopandas)
2. [Seaborn](https://anaconda.org/conda-forge/seaborn) (if only for visualizing data in the API examples)
3. [Spectral Python](https://anaconda.org/conda-forge/spectral)

The following libraries are also dependencies of `hs_process`, but they are also dependencies of `GeoPandas`, `Seaborn`, and `Spectral`. Therefore they will be installed during the install procedures of the above libraries.

- [GDAL](https://anaconda.org/conda-forge/gdal) *`GeoPandas` has a `GDAL` dependency
- [Numpy](https://anaconda.org/conda-forge/numpy) *`GDAL` has a `numpy` dependency
- [Pandas](https://anaconda.org/conda-forge/pandas) *`GeoPandas` has a `Pandas` dependency
- [Matplotlib](https://anaconda.org/conda-forge/matplotlib) *`Seaborn` has a `Matplotlib` dependency
- [Shapely](https://anaconda.org/conda-forge/shapely) *`GeoPandas` has a `Shapely` dependency

***

### Other methods for installing `hs_process`
**IMPORTANT**: Before installing `hs_process` via any of the following methods, be sure the [GeoPandas](http://geopandas.org/install.html) and [Fiona](https://fiona.readthedocs.io/en/latest/index.html) dependencies are installed via `conda` (see [GeoPandas and Fiona dependencies](installation.html#GeoPandas-and-Fiona-dependencies) below).

* `pip install git+https://github.com/tnigon/hs_process` [*(link)*](https://github.com/tnigon/hs_process) will install the latest development version directly from Github.

OR *another* option is to clone the Github repository and install from your local copy. After navigating to the directory of your cloned local copy:

* `pip install .`

The recommended location for the `hs_process` package is the *site-packages* folder on your Python path (alongside all other Python packages).
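To find out where the *site-packages* folder is for the active environment, and whether `hs_process` ended up there, a small sketch like the following can help. It uses only the standard library, and the helper name `package_location` is illustrative.

```python
import importlib.util
import sysconfig


def package_location(module_name):
    """Return the file path a module would be imported from, or None if it is not found."""
    spec = importlib.util.find_spec(module_name)
    return spec.origin if spec is not None else None


if __name__ == "__main__":
    # "purelib" is the directory where pure-Python packages are installed
    print("site-packages folder:", sysconfig.get_paths()["purelib"])
    print("hs_process location:", package_location("hs_process") or "not installed")
```

If the reported `hs_process` location is outside the *site-packages* folder of your environment, the package was probably installed into a different Python installation.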

***

### [GeoPandas](http://geopandas.org/install.html) and [Fiona](https://fiona.readthedocs.io/en/latest/index.html) dependencies
**IMPORTANT**: The [GeoPandas](http://geopandas.org/install.html) and [Fiona](https://fiona.readthedocs.io/en/latest/index.html) dependencies **MUST** be installed via `conda`. [GeoPandas](http://geopandas.org/install.html) has a dependency on [Fiona](https://fiona.readthedocs.io/en/latest/index.html), and because of some nuances with the [pip version of Fiona*](https://pypi.org/project/Fiona/), [GeoPandas](https://anaconda.org/conda-forge/geopandas) **MUST** be installed via Anaconda (at least on Windows OS).

* `conda install -c conda-forge geopandas` [*(link)*](https://anaconda.org/conda-forge/geopandas) will install both *GeoPandas* and *fiona* because *GeoPandas* has a [fiona dependency](http://geopandas.org/install.html#dependencies).

*During installation, the pip version of [Fiona](https://pypi.org/project/Fiona/) must be able to discover the locations of the header files and libraries needed to compile its C extensions, and these must be provided by the user when installing on Windows. For more information on this behavior, please refer to the [Fiona documentation](https://fiona.readthedocs.io/en/latest/README.html#windows).