# SUMMARY notebook

This notebook scans the directory in which it lives to find all Jupyter notebooks (other than itself) in that directory. For every notebook it finds, it prints (1) a hyperlink to the notebook, and (2) the first cell of the notebook, which is assumed to always be markdown. This gives you an automatically generated summary of all the notebooks without having to open each one. If you find a notebook that you want to explore further, click on its link to open it.
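
The summary depends on the first cell of each notebook being markdown. The helper below is a minimal sketch of how that assumption could be checked before the text is used. It assumes the nbformat 4 JSON layout (a top-level `cells` list whose items carry `cell_type` and `source`), and the function name `first_markdown_cell` is hypothetical, not part of this notebook.

```python
import json


def first_markdown_cell(nb_json_text):
    """Return the first cell's text if it is markdown, else None.

    Assumes the nbformat 4 layout: a top-level "cells" list whose items
    carry "cell_type" and "source" keys.
    """
    nb = json.loads(nb_json_text)
    cells = nb.get("cells", [])
    if not cells or cells[0].get("cell_type") != "markdown":
        return None
    source = cells[0].get("source", "")
    # "source" may be stored as a list of lines or as a single string.
    return "".join(source) if isinstance(source, list) else source
```

A notebook whose first cell is code then yields `None` and can be skipped or flagged, rather than having source code rendered as markdown.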

In [1]:
# Version: 2
import os
import json
from IPython.display import display, Markdown

# the name of this file
this_fname = 'SUMMARY.ipynb'
fname_to_md = {}
for fname in sorted(os.listdir('./')):
    if fname.endswith('.ipynb') and fname != this_fname:
        # print('------------', fname)
        with open(fname, 'r', encoding="utf-8") as f:
            fdata = json.load(f)
            fname_to_md[fname] = ''.join(fdata['cells'][0]['source'])
# print(fname_to_md)
pre_sep = '\n\n<hr style="height:10px; background-color: blue;">\n\n'
full_md = ''
k = 1
num_nb = len(fname_to_md)
project_name = "SentenceAx"
who = "rrtucci"
where = "jupyter_notebooks"
for fname, md in fname_to_md.items():
    sep = pre_sep
    local_link = f' [<a href="{fname}" target= "_blank">local link</a>] '
    github_link = f' [<a href="https://github.com/{who}/{project_name}/blob/master/{where}/' +\
        f'{fname}">github link</a>] '
    sep += fname + local_link + github_link + str(k) + '/' + str(num_nb) + '\n\n'
    full_md += sep + md
    k += 1
display(Markdown(full_md))



<hr style="height:10px; background-color: blue;">

cc-train_test(pid=5).ipynb [<a href="cc-train_test(pid=5).ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/cc-train_test(pid=5).ipynb">github link</a>] 1/9

# cc-train_test(pid=5)

SentenceAx uses 2 NNs, one for task="ex" and another for task="cc". This notebook trains the fully fledged (not a warmup) NN for task="cc".

After running this notebook, append the suffix ".best" to the checkpoint file that it outputs.
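
The renaming step can also be scripted. This is a minimal sketch, not part of SentenceAx: the helper name `mark_best` is hypothetical, and the checkpoint path you pass in depends on where your run wrote its output.

```python
from pathlib import Path


def mark_best(ckpt_path):
    """Rename a checkpoint file by appending ".best" to its full name."""
    src = Path(ckpt_path)
    # Append to the whole file name so any existing extension is kept.
    dst = src.with_name(src.name + ".best")
    src.rename(dst)
    return dst
```

Call it with the path of the checkpoint file the training run produced. The same helper applies to the other training notebooks in this summary.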

<hr style="height:10px; background-color: blue;">

ex-train_test(pid=1).ipynb [<a href="ex-train_test(pid=1).ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/ex-train_test(pid=1).ipynb">github link</a>] 2/9

# ex-train_test(pid=1)

SentenceAx uses 2 NNs, one for task="ex" and another for task="cc". This notebook trains the fully fledged (not a warmup) NN for task="ex".

After running this notebook, append the suffix ".best" to the checkpoint file that it outputs.

<hr style="height:10px; background-color: blue;">

global_variables.ipynb [<a href="global_variables.ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/global_variables.ipynb">github link</a>] 3/9

# Global Variables

This notebook prints the global variables in the file `sax_globals.py`.
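
For illustration, one way to list a module's global variables is sketched below. It is not taken from the notebook itself: `public_globals` is a hypothetical helper, and the stand-in module with its `EXAMPLE_*` names is made up so the snippet runs without `sax_globals.py`.

```python
import types


def public_globals(module):
    """Return {name: value} for a module's plain variables.

    Skips underscore-prefixed names, callables, and imported modules.
    """
    return {
        name: value
        for name, value in vars(module).items()
        if not name.startswith("_")
        and not callable(value)
        and not isinstance(value, types.ModuleType)
    }


# Stand-in module with made-up names; the real file is `sax_globals.py`.
demo = types.ModuleType("demo_globals")
demo.EXAMPLE_FLAG = True
demo.EXAMPLE_SIZE = 3

for name, value in sorted(public_globals(demo).items()):
    print(f"{name} = {value!r}")
```

With the real module, you would import it and pass it to `public_globals` in place of `demo`.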

<hr style="height:10px; background-color: blue;">

tensorboard_tips.ipynb [<a href="tensorboard_tips.ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/tensorboard_tips.ipynb">github link</a>] 4/9

# Tensorboard tips

Make sure you have run

    pip install tensorboard

To view Tensorboard in a Jupyter cell after a run has finished, run


    %reload_ext tensorboard
    %tensorboard --logdir=logs/ex

To view Tensorboard in a browser after a run has finished, open a terminal and run

    tensorboard --logdir=logs/ex
    
or 

    tensorboard --logdir=logs/cc

I prefer viewing in a browser because there is a bug on Windows when you try to display Tensorboard logs in a Jupyter cell. If a folder named ".tensorboard-info" exists, it must be deleted, or the logs won't display.
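
Deleting the stale folder can be scripted. The sketch below is hedged: it assumes that ".tensorboard-info" sits in the system temp directory, which may not hold on every setup, so the parent directory can be passed explicitly. The function name `clear_tensorboard_info` is hypothetical.

```python
import shutil
import tempfile
from pathlib import Path


def clear_tensorboard_info(parent_dir=None):
    """Delete a ".tensorboard-info" folder under parent_dir, if present.

    Assumption: the folder lives in the system temp directory; pass
    parent_dir explicitly if yours is elsewhere.
    Returns True if a folder was removed, False otherwise.
    """
    if parent_dir is not None:
        parent = Path(parent_dir)
    else:
        parent = Path(tempfile.gettempdir())
    info_dir = parent / ".tensorboard-info"
    if info_dir.is_dir():
        shutil.rmtree(info_dir)
        return True
    return False
```

Run it before `%tensorboard`, and only when no other Tensorboard instance is in use, since the folder is removed along with everything in it.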

<hr style="height:10px; background-color: blue;">

warmup-cc-train_test(pid=5).ipynb [<a href="warmup-cc-train_test(pid=5).ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/warmup-cc-train_test(pid=5).ipynb">github link</a>] 5/9

# warmup cc-train_test(pid=5)

SentenceAx uses 2 NNs, one for task="ex" and another for task="cc". This warmup notebook trains the NN for the task="cc". 

The warmup NN uses small sizes for everything, so it can be trained quickly, though not accurately, without a GPU.

After running this notebook, append the suffix ".best" to the checkpoint file that it outputs.

<hr style="height:10px; background-color: blue;">

warmup-ex-extract(pid=3).ipynb [<a href="warmup-ex-extract(pid=3).ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/warmup-ex-extract(pid=3).ipynb">github link</a>] 6/9

# warmup ex-extract(pid=3)

This warmup notebook performs action="extract". For this action, the computer does no cc splitting, only ex extraction. If you want the computer to do both, split and extract, set the action to "splitextract".

The notebook reads the file:

`predicting/small_pred.txt`

with 6 sentences we want to extract from, and it writes the file

`predicting/small_pred_extract_ssents.txt`

with the predictions (i.e., ssents = simple sentences extracted from the original sentences).

This notebook requires that you first derive the ex and cc weights by running the notebooks `warmup-ex-train_test(pid=1)`
and `warmup-cc-train_test(pid=5)`.

<hr style="height:10px; background-color: blue;">

warmup-ex-splitextract(pid=6).ipynb [<a href="warmup-ex-splitextract(pid=6).ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/warmup-ex-splitextract(pid=6).ipynb">github link</a>] 7/9

# warmup ex-splitextract(pid=6)

This warmup notebook performs action="splitextract". By setting split_only=False, we ask it to do both the cc splitting and the ex extraction.

The notebook reads the file:

`predicting/small_pred.txt`

with 6 sentences we want to splitextract, and it writes the file

`predicting/small_pred_splitextract_ssents.txt`

with the predictions (i.e., ssents = simple sentences extracted from the original sentences).

This notebook requires that you first derive the ex and cc weights by running the notebooks `warmup-ex-train_test(pid=1)`
and `warmup-cc-train_test(pid=5)`.

<hr style="height:10px; background-color: blue;">

warmup-ex-splitextract(pid=6, split_only=True).ipynb [<a href="warmup-ex-splitextract(pid=6, split_only=True).ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/warmup-ex-splitextract(pid=6, split_only=True).ipynb">github link</a>] 8/9

# warmup ex-splitextract(pid=6, split_only=True)

This warmup notebook performs action="splitextract". By setting split_only=True, we ask it to do the cc splitting but not the ex extraction.

The notebook reads the file:

`predicting/small_pred.txt`

with 6 sentences we want to split, and it writes the file

`predicting/small_pred_split_ssents.txt`

with the predictions (i.e., ssents = simple sentences extracted from the original sentences).

The warmup NN uses small sizes for everything, so it can be trained quickly, though not accurately, without a GPU.

This notebook requires that you first derive the ex and cc weights by running the notebooks `warmup-ex-train_test(pid=1)`
and `warmup-cc-train_test(pid=5)`.
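
The three prediction notebooks above write output files that follow a visible pattern: the input stem, then a tag, then `_ssents`. The sketch below reproduces that pattern as inferred from the three file names listed here. It is not SentenceAx's own code, and `ssents_path` is a hypothetical helper.

```python
from pathlib import Path


def ssents_path(in_path, tag):
    """Build "<stem>_<tag>_ssents<ext>" next to the input file."""
    p = Path(in_path)
    return p.with_name(f"{p.stem}_{tag}_ssents{p.suffix}")


# Tags taken from the three output file names in this summary.
for tag in ("extract", "splitextract", "split"):
    print(ssents_path("predicting/small_pred.txt", tag).as_posix())
```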

<hr style="height:10px; background-color: blue;">

warmup-ex-train_test(pid=1).ipynb [<a href="warmup-ex-train_test(pid=1).ipynb" target= "_blank">local link</a>]  [<a href="https://github.com/rrtucci/SentenceAx/blob/master/jupyter_notebooks/warmup-ex-train_test(pid=1).ipynb">github link</a>] 9/9

# warmup ex-train_test(pid=1)

SentenceAx uses 2 NNs, one for task="ex" and another for task="cc". This warmup notebook trains the NN for the task="ex".

The warmup NN uses small sizes for everything, so it can be trained quickly, though not accurately, without a GPU.

After running this notebook, append the suffix ".best" to the checkpoint file that it outputs.