In [None]:
# !pip install papermill nbconvert jupytext pandoc

In [None]:
import pandas as pd

# Managing Notebooks and Scripts using Command-Line Tools in Jupyter

In this notebook we learn how to use command-line tools inside Jupyter notebooks to make our work easier and more efficient.
We start by running command-line commands, such as managing files or installing software, directly from the notebook.
Then we explore how to run entire notebooks from the command line, which helps when we need to automate tasks.
We also see how to turn regular Python scripts into notebooks, so that the work can live in plain-text scripts that are easier to version control with Git.
Lastly, we see how to convert notebooks into other formats, such as HTML or PDF, using command-line tools so they can be shared with others.

## Running Command-Line Commands in Jupyter

A command line is a text-based interface that allows users to interact with their computer's operating system by typing commands, rather than using graphical interfaces.
In this interface, users can navigate directories, manage files, run programs, and perform a wide range of tasks by typing specific commands.
Popular command-line environments include Bash (common in Linux and macOS) and the Windows Command Prompt or PowerShell.

As researchers, we may need the command line to manage files (move, rename, delete, or organize datasets), automate repetitive tasks that involve external tools, install software, and so on.

Incorporating command-line commands into our analysis notebooks allows us to integrate external tools, automate repetitive tasks, and manage data, all within the same environment.
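The `!` prefix used in this notebook is an IPython feature. In plain Python, the same idea is available through the standard-library `subprocess` module. The following is a minimal sketch, not part of the original exercises; the command it runs (a one-line Python program) is only a stand-in chosen because it behaves the same on every operating system.

```python
import subprocess
import sys

# Run a command much like `!command` would, but capture its output as a string.
# `sys.executable` is the Python interpreter running this kernel, so the
# command works the same way on Linux, macOS, and Windows.
result = subprocess.run(
    [sys.executable, "-c", "print('hello from the command line')"],
    capture_output=True,
    text=True,
    check=True,  # raise CalledProcessError if the command fails
)

output = result.stdout.strip()
print(output)
print("exit code:", result.returncode)
```

Capturing the output this way is handy when a later cell needs to work with what the command printed.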

**Run the code below to download data for this section. You do not have to know the contents of the files, as we are only learning how to manage files.**

In [None]:
import sys
sys.path.append('src')
import sciebo

sciebo.download_file('https://uni-bonn.sciebo.de/s/yDiGZT44SXLvK5r', 'magic_commands/text_config.txt')
sciebo.download_file('https://uni-bonn.sciebo.de/s/apw9RMXjgfhQaK5', 'magic_commands/python_config.py')
sciebo.download_file('https://uni-bonn.sciebo.de/s/lwVMGbzKQXFuIax', 'magic_commands/notebook_config.ipynb')

**Example** Install `pandas`

In [None]:
# !pip install pandas

Install `numpy`

In [None]:
# !pip install numpy

Install seaborn

In [None]:
# !pip install seaborn

Any option that the command accepts on the command line can be used here as well.

**Example** Upgrade matplotlib

In [None]:
# !pip install --upgrade matplotlib

Upgrade seaborn

In [None]:
# !pip install --upgrade seaborn

Upgrade nbformat

In [None]:
# !pip install --upgrade nbformat
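When several Python environments are installed, `!pip` may point to a different environment than the one the notebook kernel uses. One common way around this is to call pip through the kernel's own interpreter. The sketch below only builds the command; `pip_command` is a hypothetical helper written for this illustration, not part of pip.

```python
import sys


def pip_command(package, upgrade=False):
    """Build a pip command that targets the interpreter running this kernel."""
    command = [sys.executable, "-m", "pip", "install"]
    if upgrade:
        command.append("--upgrade")  # same option as in the cells above
    command.append(package)
    return command


print(pip_command("seaborn", upgrade=True))
# To actually install, pass the list to subprocess.run(..., check=True).
```

Building the command as a list keeps options such as `--upgrade` explicit and avoids quoting problems.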

**Example** Create a new directory called `data_1`

In [None]:
!mkdir data_1

Create a new directory `data_2`

In [None]:
!mkdir data_2

Create a new directory `data_1/data_1_sub`

(On Windows machines, use `data_1\data_1_sub` instead.)

In [None]:
!mkdir data_1/data_1_sub
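Because the path separator differs between Windows and Linux/macOS, a portable alternative is Python's `pathlib`, which picks the right separator automatically. This is a minimal sketch that works in a temporary directory (an assumption made here so it does not touch the folders used in the exercises).

```python
import tempfile
from pathlib import Path

# Work inside a throwaway directory so the sketch does not touch real data.
workspace = Path(tempfile.mkdtemp())

# Path objects join parts with the correct separator for the current OS.
sub_dir = workspace / "data_1" / "data_1_sub"

# parents=True also creates data_1; exist_ok=True makes re-running the cell safe.
sub_dir.mkdir(parents=True, exist_ok=True)

print(sub_dir.is_dir())
```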

We can run a whole cell of Bash commands by starting the cell with the `%%bash` cell magic.

**Example** Copy `magic_commands/python_config.py` to the `data_1` directory

In [None]:
%%bash
cp magic_commands/python_config.py data_1/python_config.py

Copy `magic_commands/text_config.txt` to `data_1`

In [None]:
%%bash
cp magic_commands/text_config.txt data_1/text_config.txt

Copy `magic_commands/notebook_config.ipynb` to `data_1/data_1_sub` with a name `nb_config.ipynb`

In [None]:
%%bash
cp magic_commands/notebook_config.ipynb data_1/data_1_sub/nb_config.ipynb
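`%%bash` requires a Bash installation, which is not always present on Windows. As a hedged alternative, the same copy-and-rename step can be sketched with the standard-library `shutil` module. The file contents and the name `renamed_config.txt` below are made up for the illustration.

```python
import shutil
import tempfile
from pathlib import Path

# Throwaway folders that mimic the layout used in the exercises.
workspace = Path(tempfile.mkdtemp())
source_dir = workspace / "magic_commands"
target_dir = workspace / "data_1" / "data_1_sub"
source_dir.mkdir()
target_dir.mkdir(parents=True)

# Stand-in for a downloaded file (its contents are invented for this sketch).
source_file = source_dir / "text_config.txt"
source_file.write_text("placeholder contents\n")

# Copy and rename in one step, like `cp source target`.
copied = Path(shutil.copy(source_file, target_dir / "renamed_config.txt"))
print(copied.name, copied.read_text() == source_file.read_text())
```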

Let's practice deleting files and folders. **Always be cautious when deleting any file**

**Example** Delete the file `data_1/text_config.txt` (file only)

In [None]:
%%bash
rm data_1/text_config.txt

Delete the file `data_1/python_config.py` (file only)

In [None]:
%%bash
rm data_1/python_config.py

Delete `data_2` directory

In [None]:
%%bash
rmdir data_2

Delete `data_1` including sub-directories
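In Bash, `rmdir` only removes empty directories, while `rm -r` removes a directory together with everything inside it. The sketch below shows the same distinction in Python on a throwaway directory tree (created here only for the illustration), so nothing from the exercises is deleted.

```python
import shutil
import tempfile
from pathlib import Path

# Build a small throwaway tree: data_1/data_1_sub/nb_config.ipynb and an empty data_2.
workspace = Path(tempfile.mkdtemp())
data_1 = workspace / "data_1"
(data_1 / "data_1_sub").mkdir(parents=True)
(data_1 / "data_1_sub" / "nb_config.ipynb").write_text("{}")
empty_dir = workspace / "data_2"
empty_dir.mkdir()

empty_dir.rmdir()  # like `rmdir`: only succeeds on an empty directory
try:
    data_1.rmdir()  # fails because data_1 still contains a sub-directory
except OSError as error:
    print("rmdir refused:", type(error).__name__)

shutil.rmtree(data_1)  # like `rm -r`: removes the directory and its contents
print(data_1.exists(), empty_dir.exists())
```

Recursive deletion cannot be undone, so it is worth double-checking the path before running it.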

---

## Executing Notebooks from Command Line

Running a notebook from the command line is useful for automating its execution as part of a workflow or pipeline.
It lets us hook the notebook into task-scheduling tools, so routine tasks run without anyone opening the notebook manually.
It also helps when dealing with multiple notebooks: batch processing from the command line executes several notebooks in sequence without manual interaction with each one.

Here we look at `papermill`, a tool that executes notebooks from the command line. We will also see how to execute notebooks sequentially and in parallel. For this, we use three notebooks:

1. `execute_notebooks/1_data_access.ipynb`: Prepares the dataset `steinmetz_active.csv`
2. `execute_notebooks/2_contrast_level.ipynb`: Uses `steinmetz_active.csv` for contrast level analysis
3. `execute_notebooks/3_mouse_analysis.ipynb`: Uses `steinmetz_active.csv` for mouse analysis


Notebooks 2 and 3 are not dependent on each other.
Both use the output from notebook 1 for their analysis.

**Run the code below to download data for this section. You do not have to know the contents of the files, as we are only learning how to execute notebooks from the command line.**

In [None]:
import sys
sys.path.append('src')
import sciebo

sciebo.download_file('https://uni-bonn.sciebo.de/s/sqOWyThRHtrwDNH', 'execute_notebooks/1_data_access.ipynb')
sciebo.download_file('https://uni-bonn.sciebo.de/s/iHXsgis2cTJ52rj', 'execute_notebooks/2_contrast_level.ipynb')
sciebo.download_file('https://uni-bonn.sciebo.de/s/IC2NHoU7EOcsoMK', 'execute_notebooks/3_mouse_analysis.ipynb')

**Example** Execute `execute_notebooks/1_data_access.ipynb` as `output.ipynb` and examine it. Was any other file generated from this?

In [None]:
!papermill execute_notebooks/1_data_access.ipynb output.ipynb

Running the notebook created the `steinmetz_active.csv` file. `output.ipynb` contains the same cells as the input notebook, together with the outputs from this run.

Execute `execute_notebooks/2_contrast_level.ipynb` as `output.ipynb` and examine the output.

In [None]:
!papermill execute_notebooks/2_contrast_level.ipynb output.ipynb

Execute `execute_notebooks/3_mouse_analysis.ipynb` as `output.ipynb` and examine the output.

In [None]:
!papermill execute_notebooks/3_mouse_analysis.ipynb output.ipynb

Delete the `data_analysis/steinmetz_active.csv` file.

Execute `execute_notebooks/3_mouse_analysis.ipynb` as `output.ipynb` and examine `output.ipynb`. What do you see?

In [None]:
!papermill execute_notebooks/3_mouse_analysis.ipynb output.ipynb

The cell output shows an error, because the notebook can no longer find the dataset it needs.
In `data_analysis/output.ipynb`, you will see a large error message in red at the top of the notebook and a second red message just before the cell where the error occurred.
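Since notebooks 2 and 3 depend on the file written by notebook 1, an automated pipeline should notice such a failure before starting later steps. Command-line programs report failure through a non-zero exit code; papermill's command-line tool is assumed to do the same when a notebook raises an error, which is worth confirming with your installed version. The sketch below uses small Python one-liners as stand-ins for notebook runs, and `run_steps` is a hypothetical helper written for this illustration.

```python
import subprocess
import sys


def run_steps(commands):
    """Run commands one after the other and stop at the first failure.

    Returns the list of commands that completed successfully.
    """
    completed = []
    for command in commands:
        result = subprocess.run(command)
        if result.returncode != 0:
            print("stopped, command failed:", " ".join(command))
            break
        completed.append(command)
    return completed


# Stand-ins for notebook runs: the second command exits with an error code.
# With papermill installed, an entry could instead look like
# ["papermill", "execute_notebooks/1_data_access.ipynb", "output_1.ipynb"].
steps = [
    [sys.executable, "-c", "print('step 1 ok')"],
    [sys.executable, "-c", "import sys; sys.exit(1)"],
    [sys.executable, "-c", "print('step 3 never runs')"],
]
finished = run_steps(steps)
print(len(finished))
```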

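If you do not need a separate output file, pass the same path as both input and output so that papermill overwrites the notebook with its executed version.

**Example** Execute `execute_notebooks/1_data_access.ipynb` in place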

In [None]:
!papermill execute_notebooks/1_data_access.ipynb execute_notebooks/1_data_access.ipynb

Execute `execute_notebooks/2_contrast_level.ipynb` in place

In [None]:
!papermill execute_notebooks/2_contrast_level.ipynb execute_notebooks/2_contrast_level.ipynb

Execute `execute_notebooks/3_mouse_analysis.ipynb` as `execute_notebooks/3_mouse_analysis.ipynb`

In [None]:
!papermill execute_notebooks/3_mouse_analysis.ipynb execute_notebooks/3_mouse_analysis.ipynb

**Example** Execute `1_data_access.ipynb` and `2_contrast_level.ipynb` sequentially

In [None]:
!papermill execute_notebooks/1_data_access.ipynb output_1.ipynb
!papermill execute_notebooks/2_contrast_level.ipynb output_2.ipynb

Execute `1_data_access.ipynb` and `3_mouse_analysis.ipynb` sequentially

In [None]:
!papermill execute_notebooks/1_data_access.ipynb output_1.ipynb
!papermill execute_notebooks/3_mouse_analysis.ipynb output_3.ipynb

Execute all three notebooks one after the other

In [None]:
!papermill execute_notebooks/1_data_access.ipynb output_1.ipynb
!papermill execute_notebooks/2_contrast_level.ipynb output_2.ipynb
!papermill execute_notebooks/3_mouse_analysis.ipynb output_3.ipynb
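Notebooks 2 and 3 do not depend on each other, so once notebook 1 has finished they could also run at the same time. The following is a minimal sketch of that idea using Python's standard library; the commands are stand-ins (simple Python one-liners), and `run_command` and `run_parallel` are hypothetical helpers, not part of papermill.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_command(command):
    """Run one command and return its exit code (0 means success)."""
    return subprocess.run(command).returncode


def run_parallel(commands):
    """Start all commands at once and return their exit codes in input order."""
    with ThreadPoolExecutor(max_workers=max(1, len(commands))) as pool:
        return list(pool.map(run_command, commands))


# Stand-ins for the two independent notebooks. With papermill installed,
# an entry could instead look like
# ["papermill", "execute_notebooks/2_contrast_level.ipynb", "output_2.ipynb"].
independent_steps = [
    [sys.executable, "-c", "print('contrast level analysis')"],
    [sys.executable, "-c", "print('mouse analysis')"],
]
exit_codes = run_parallel(independent_steps)
print(exit_codes)
```

Threads are sufficient here because each thread only waits for an external process; the actual work happens outside the notebook kernel.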

---

## Turning Scripts into Notebooks

Converting a script into a Jupyter notebook can be valuable for enhancing code readability, facilitating interactive analysis, and improving collaboration. 
Notebooks provide an environment where code, explanations, and results are combined in a clear, organized format. 
This allows users to document their thought process alongside the code, include visualizations directly within the workflow, and run individual code cells for step-by-step debugging or exploration.

**Example** Create `script.py` with the code below and convert it to a notebook. How does the resulting notebook look?

```python
num_mouse = 10
num_contrast_left = 4
num_contrast_right = 4
```

In [None]:
# !jupytext --to notebook script.py

Create `script.py` with the code below and convert it to a notebook. How does the resulting notebook look?

```python
num_mouse = 10
num_contrast_left = 4
num_contrast_right = 4

print(num_mouse)
```

In [None]:
# !jupytext --to notebook script.py

Create `script.py` with the code below and convert it to a notebook. How does the resulting notebook look?

```python
num_mouse = 10
num_contrast_left = 4
num_contrast_right = 4

num_mouse
```

In [None]:
# !jupytext --to notebook script.py

Let's practice with markdown

**Example** Create a Python file `script.py` with the markdown text "This is a markdown cell"

```python
# %% [markdown]
# This is a markdown cell
```

In [None]:
# !jupytext --to notebook script.py

Create a Python file `script.py` with multiple lines of markdown text.
Convert it to a notebook and examine the resulting notebook.

In [None]:
# !jupytext --to notebook script.py

**Example** Create `script.py` with a title "Data Analysis" and `a=10`.
Convert it to a notebook.
How does the resulting notebook look?

```python
# %% [markdown]
# # Data Analysis

# %%
a = 10
```

In [None]:
# !jupytext --to notebook script.py
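The `# %%` lines in these scripts are cell markers: each marker starts a new cell, and `[markdown]` marks the cell as markdown instead of code. To make that rule concrete, here is a simplified illustration in plain Python. It is not jupytext's actual parser, which handles many more cases; `split_percent_cells` is a hypothetical helper, and lines before the first marker are simply ignored in this sketch.

```python
def split_percent_cells(script_text):
    """Split percent-format text into (cell_type, source) pairs.

    Simplified illustration only; jupytext's real parser handles more cases.
    """
    cells = []
    for line in script_text.splitlines():
        if line.startswith("# %%"):
            # A marker line opens a new cell of the given type.
            cell_type = "markdown" if "[markdown]" in line else "code"
            cells.append((cell_type, []))
        elif cells:
            # Any other line belongs to the most recently opened cell.
            cells[-1][1].append(line)
    return [(kind, "\n".join(lines).strip()) for kind, lines in cells]


script_text = """# %% [markdown]
# # Data Analysis

# %%
a = 10
"""

for kind, source in split_percent_cells(script_text):
    print(kind, repr(source))
```

In a markdown cell every line carries a leading `# ` comment prefix, which is why a markdown heading appears in the script as `# # Data Analysis`.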

Create `script.py` with a title "Data Analysis" and `a=10`, `b=100`.
Convert it to a notebook.
How does the resulting notebook look?

In [None]:
# !jupytext --to notebook script.py

---

## Turning Notebooks into Other Formats

Sometimes we want to convert Jupyter notebooks to other formats, most commonly Python scripts or HTML.
Converting to Python scripts helps with version control and makes the code easier to reuse in larger libraries.

Converting to HTML lets us embed the notebook in websites or presentations, which makes it easier to communicate data and findings.

**Example** Convert `execute_notebooks/1_data_access.ipynb` to a Python script

In [None]:
# !jupyter nbconvert --to script execute_notebooks/1_data_access.ipynb

Convert `execute_notebooks/2_contrast_level.ipynb` to a Python script

In [None]:
# !jupyter nbconvert --to script execute_notebooks/2_contrast_level.ipynb

Convert `execute_notebooks/3_mouse_analysis.ipynb` to a Python script

In [None]:
# !jupyter nbconvert --to script execute_notebooks/3_mouse_analysis.ipynb
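It helps to know that an `.ipynb` file is a JSON document containing a list of cells. Conceptually, converting to a script means pulling the code cells out of that list. The sketch below shows this with a tiny hand-written notebook (its contents are made up); it is a simplification, and the real `nbconvert` exporter does more, for example it also handles markdown cells.

```python
import json

# A minimal notebook written by hand (made-up contents for this sketch).
notebook = {
    "nbformat": 4,
    "nbformat_minor": 4,
    "metadata": {},
    "cells": [
        {"cell_type": "markdown", "metadata": {}, "source": ["# Data Access"]},
        {
            "cell_type": "code",
            "metadata": {},
            "outputs": [],
            "execution_count": None,
            "source": ["a = 10\n", "print(a)"],
        },
    ],
}
notebook_json = json.dumps(notebook)  # the kind of text an .ipynb file stores

# Keep only the code cells, joining each cell's source lines.
loaded = json.loads(notebook_json)
code_cells = [
    "".join(cell["source"])
    for cell in loaded["cells"]
    if cell["cell_type"] == "code"
]
script_text = "\n\n".join(code_cells)
print(script_text)
```

This JSON structure, which also stores outputs and metadata, is the reason notebook diffs in Git are harder to read than script diffs.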

**Example** Convert `execute_notebooks/1_data_access.ipynb` to HTML and open it in a new browser tab to examine it.

In [None]:
# !jupyter nbconvert --to html execute_notebooks/1_data_access.ipynb

Convert `execute_notebooks/2_contrast_level.ipynb` to HTML and open it in a new browser tab to examine it.

In [None]:
# !jupyter nbconvert --to html execute_notebooks/2_contrast_level.ipynb

Convert `execute_notebooks/3_mouse_analysis.ipynb` to HTML and open it in a new browser tab to examine it.

In [None]:
# !jupyter nbconvert --to html execute_notebooks/3_mouse_analysis.ipynb

**Example** Execute `1_data_access.ipynb` and convert output to HTML

In [None]:
# !papermill execute_notebooks/1_data_access.ipynb output_1.ipynb
# !jupyter nbconvert --to html output_1.ipynb

Execute `2_contrast_level.ipynb` and convert output to HTML

In [None]:
# !papermill execute_notebooks/2_contrast_level.ipynb output_2.ipynb
# !jupyter nbconvert --to html output_2.ipynb

Execute `3_mouse_analysis.ipynb` and convert output to HTML

In [None]:
# !papermill execute_notebooks/3_mouse_analysis.ipynb output_3.ipynb
# !jupyter nbconvert --to html output_3.ipynb
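The three execute-then-convert cells above follow the same pattern, so the commands can be generated in a loop instead of being typed out. The sketch below only builds and prints the commands; `build_pipeline` is a hypothetical helper written for this illustration, and the `output_N.ipynb` naming simply follows the cells above.

```python
notebooks = [
    "execute_notebooks/1_data_access.ipynb",
    "execute_notebooks/2_contrast_level.ipynb",
    "execute_notebooks/3_mouse_analysis.ipynb",
]


def build_pipeline(notebook_paths):
    """Return (execute, convert) command pairs, one pair per notebook."""
    pipeline = []
    for index, path in enumerate(notebook_paths, start=1):
        output = f"output_{index}.ipynb"
        execute = ["papermill", path, output]
        convert = ["jupyter", "nbconvert", "--to", "html", output]
        pipeline.append((execute, convert))
    return pipeline


pipeline = build_pipeline(notebooks)
for execute, convert in pipeline:
    print(" ".join(execute))
    print(" ".join(convert))
```

Each command list could then be passed to `subprocess.run(..., check=True)` to run the full pipeline from a single cell or script.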