SouthernMethodistUniversity/DHRI_python

This repo is no longer being updated. Please see the SMU@DHRI website for the most current information.

Text Mining With Python

The associated "Introduction to HPC and M2" slides can be found here.

Running the Notebook via JupyterLab on M2

  1. Go to hpc.smu.edu
  2. Sign in using your SMU ID and SMU password
  3. Select "ManeFrame II Shell Access" from the "Clusters" drop-down menu. (Note: If you are a new user, you might be prompted for a password. The prompt captures your keystrokes even though nothing appears as you type.)
  4. In the "Shell" tab type git clone https://github.com/SouthernMethodistUniversity/DHRI_python.git to clone the GitHub repository.
  5. In the same "Shell" tab type cd DHRI_python to change to the repository's directory and verify that it has been cloned properly. (Typing ls should show the files that have been downloaded.)
  6. In the "Dashboard" tab select "JupyterLab" from the "Interactive Apps" drop-down menu.
  7. Set the "Additional environment modules to load" field to environment.
  8. Set the "Custom module paths" field to module use $HOME/DHRI_python.
  9. Select other options required for your Jupyter instance. These options are the same as those requested via a standard Slurm script on M2. For this tutorial:
    • Partition: htc
    • Number of hours: 3
    • Number of nodes: 1
    • Cores per node: 1
    • GPUs per node: 0
    • Memory: 6
  10. Select "Launch"
  11. Wait for the job to start on M2. When the job starts, a new "Connect to JupyterLab" button will appear.
  12. Select "Connect to JupyterLab"
  13. The JupyterLab graphical interface will open, running on the M2 resources you requested.
  14. Double click the appropriate notebook to open it.
  15. When finished using the JupyterLab instance, return to the "My Interactive Sessions" tab in your browser and select "Delete", then "Confirm" when prompted, to cancel the job on M2.
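
For reference, the resource options in step 9 correspond roughly to the directives below in a Slurm batch script. This is a minimal sketch, not the script the JupyterLab app actually generates: the directive names are standard sbatch options, but mapping "Cores per node" to --ntasks-per-node and reading "Memory: 6" as 6 GB are assumptions. "GPUs per node: 0" is expressed by simply not requesting a GPU.

```shell
#!/bin/bash
# Partition: htc
#SBATCH --partition=htc
# Number of hours: 3
#SBATCH --time=03:00:00
# Number of nodes: 1
#SBATCH --nodes=1
# Cores per node: 1 (assumed to map to tasks per node)
#SBATCH --ntasks-per-node=1
# Memory: 6 (assumed to mean 6 GB)
#SBATCH --mem=6G

# Placeholder job body: report which node the job landed on.
node_name="$(uname -n)"
echo "Job running on ${node_name}"
```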

NOTE: To be sure you have the latest version of the environment and code, every time you log in, go to "ManeFrame II Shell Access" from the "Clusters" drop-down menu, paste the command cd DHRI_python && git pull, then press "Enter".
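
The clone and update steps can also be combined into one small shell function. The sketch below assumes a hypothetical helper named sync_repo (it is not part of this repository): it clones the repository if the target directory is not yet a Git checkout and pulls otherwise.

```shell
# sync_repo URL DIR (hypothetical helper, not part of the repository):
# clone URL into DIR if DIR is not yet a Git checkout, otherwise pull updates.
sync_repo() {
  repo_url="$1"
  repo_dir="$2"
  if [ -d "$repo_dir/.git" ]; then
    # Existing checkout: fetch and merge the latest changes.
    git -C "$repo_dir" pull
  else
    # No checkout yet: clone the repository into the target directory.
    git clone "$repo_url" "$repo_dir"
  fi
}
```

For this tutorial it would be called as sync_repo https://github.com/SouthernMethodistUniversity/DHRI_python.git "$HOME/DHRI_python".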

Running the Notebook via JupyterLab with a Local Docker Installation

  1. Install Docker and run the commands below in the terminal shell of your choice.
  2. Clone the repository: git clone https://github.com/SouthernMethodistUniversity/DHRI_python.git
  3. Change to the repository directory: cd DHRI_python
  4. Start the container: docker run --rm -p 8888:8888 -e JUPYTER_ENABLE_LAB=yes -v $HOME:/home/jovyan rkalescky/text_mining_with_python:latest
  5. Copy the last URL printed in the terminal and paste it into your web browser. The JupyterLab graphical interface will open.
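
The docker run command in step 4 packs several options into one line. The sketch below is a dry run that annotates each flag and prints the command rather than starting the container. The flag meanings are standard Docker behavior. The effect of JUPYTER_ENABLE_LAB inside this particular image is an assumption based on the variable's name, and the docker_cmd and image variables are illustrative.

```shell
# Image name taken from step 4.
image="rkalescky/text_mining_with_python:latest"

# --rm                        remove the container when it exits
# -p 8888:8888                publish container port 8888 on host port 8888
# -e JUPYTER_ENABLE_LAB=yes   set an environment variable in the container
#                             (assumed to make the image start JupyterLab)
# -v $HOME:/home/jovyan       mount your home directory at /home/jovyan
docker_cmd="docker run --rm -p 8888:8888 -e JUPYTER_ENABLE_LAB=yes -v $HOME:/home/jovyan $image"

# Dry run: print the command for review instead of executing it.
echo "$docker_cmd"
```

Because the volume mount covers your whole home directory, a DHRI_python folder cloned there should be reachable from inside the container under /home/jovyan.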