Assignment02 #6

drkostas · 2021-03-09T01:12:05Z

(25 points) Clone the kmeans repository into your own area at /lustre/haven/proj/UTK0150/$USER.
(25 points) Write a job script that will use a single node and a single process per node (so only one process total). Ensure the job runs on a compute node, and run the non-distributed kmeans (kmeans_vectorized.py). Make a note of the output directory and commit the job script to your cloned repo.
(25 points) Write another job script to run the distributed kmeans script on two compute nodes using 20 processes, using the same iris data we've been looking at, and submit the job, noting the output directory. This job should finish in a very short amount of time, so requesting a walltime of 5 minutes will help you get through the queue more quickly. Don't forget that you must launch your processes with mpirun inside the script.
(25 points) Modify the script to use the TCGA data in the /lustre/haven/proj/UTK0150/data directory (see README for a refresher on how to load the data). Run another job on ISAAC using 20 processes and 10 clusters, and time how long the script takes to run. Make a note of the time it takes. Also run with a single process on one node and verify that both jobs output identical cluster assignments and centroids: save the outputs of each job, load them once both jobs are complete, and check that they match. (Hint: success at this requires identical initialization.)
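One possible way to approach the verification step is sketched below. It is not the repository's code: the function names (`init_centroids`, `save_result`, `results_match`), the JSON file layout, and the tolerance are all assumptions made for illustration. The idea is that both runs draw their starting centroids from the same fixed seed, each run saves its assignments and centroids, and a final check loads both files and compares them.

```python
import json
import math
import random
from pathlib import Path


def init_centroids(points, k, seed=0):
    """Pick k starting centroids with a fixed seed (hypothetical helper).

    Using the same seed in the single-process and the 20-process run gives
    both runs the same starting point.
    """
    rng = random.Random(seed)
    return [list(p) for p in rng.sample(points, k)]


def save_result(path, assignments, centroids):
    """Write one job's cluster assignments and centroids to a JSON file."""
    payload = {"assignments": assignments, "centroids": centroids}
    Path(path).write_text(json.dumps(payload))


def results_match(path_a, path_b, tol=1e-9):
    """Return True if two saved runs agree.

    Assignments must be exactly equal; centroid coordinates are compared
    with a small tolerance because a distributed sum can differ from a
    serial sum in the last floating-point digits.
    """
    a = json.loads(Path(path_a).read_text())
    b = json.loads(Path(path_b).read_text())
    if a["assignments"] != b["assignments"]:
        return False
    if len(a["centroids"]) != len(b["centroids"]):
        return False
    for ca, cb in zip(a["centroids"], b["centroids"]):
        if len(ca) != len(cb):
            return False
        for x, y in zip(ca, cb):
            if not math.isclose(x, y, rel_tol=tol, abs_tol=tol):
                return False
    return True
```

If the outputs are kept as NumPy arrays instead, `numpy.array_equal` for the assignments and `numpy.allclose` for the centroids would play the same role.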
Submit a message here with the following information:
- path to your code on ISAAC.
- paths and brief description of relevant output log directories for ISAAC jobs that succeeded (please don't make us sort through your main output directory ourselves and sort through failed job IDs).
- Timings for k-means on Iris and TCGA data, with single process vs twenty. Do you achieve a 20x speedup in each case?
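The speedup question in the last bullet can be answered with a short calculation once the wall-clock times are known. The sketch below is illustrative only: the helper names are hypothetical, and the numbers in the usage lines are placeholders, not measurements from any ISAAC job.

```python
def speedup(t_serial, t_parallel):
    """Speedup = single-process wall time divided by parallel wall time."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("timings must be positive")
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, n_procs):
    """Parallel efficiency: the fraction of the ideal n_procs-fold speedup achieved."""
    if n_procs <= 0:
        raise ValueError("n_procs must be positive")
    return speedup(t_serial, t_parallel) / n_procs


if __name__ == "__main__":
    # Placeholder timings in seconds; substitute the values from your job logs.
    t1, t20 = 100.0, 8.0
    print(f"speedup:    {speedup(t1, t20):.2f}x")
    print(f"efficiency: {efficiency(t1, t20, 20):.1%}")
```

A 20x speedup corresponds to an efficiency of 1.0. Startup and communication costs do not shrink as processes are added, so a job that finishes very quickly on a small dataset would typically fall well short of that.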

Your assignment will not be graded unless you submit it here on Canvas; no exceptions.

drkostas created this issue from a note in DSE-512 (In Progress) Mar 9, 2021

drkostas added a commit that referenced this issue Mar 9, 2021

Update schedule script #6

2780b95

drkostas added a commit that referenced this issue Mar 9, 2021

PBS run logs #6

897edf0

drkostas added a commit that referenced this issue Mar 11, 2021

Added my vectorized version and integrated everythin in the assignment2.py #6

04b0b02

drkostas added a commit that referenced this issue Mar 15, 2021

Probably finished the distributed version of K-Means #6

44a3f4b

drkostas added a commit that referenced this issue Mar 16, 2021

Adapted the code for the TCGA dataset #6

ec69778

drkostas added a commit that referenced this issue Mar 16, 2021

Compacted the code into the kmeans.py, cleaned it and improved it #6

1338b7d

drkostas added a commit that referenced this issue Mar 16, 2021

Saving results to external files #6

06a598b

drkostas added a commit that referenced this issue Mar 16, 2021

Added intermediate timings #6

1ff6651

drkostas added a commit that referenced this issue Mar 16, 2021

Created configuration file for runnning on isaac #6

f85679b

drkostas added a commit that referenced this issue Mar 16, 2021

path change #6

4dbd1f9

drkostas added a commit that referenced this issue Mar 16, 2021

Revert "path change #6"

94f086c

This reverts commit 4dbd1f9

drkostas added a commit that referenced this issue Mar 16, 2021

path change #6

9fd63cf

drkostas added a commit that referenced this issue Mar 16, 2021

MPI processes write to same file as root process now #6

1c45495

drkostas added a commit that referenced this issue Mar 16, 2021

MPI processes write to same file as root process now #6

4da0cef

drkostas added a commit that referenced this issue Mar 16, 2021

Update pbs script for iris-2ppn10 #6

7f19008

drkostas added a commit that referenced this issue Mar 16, 2021

Update pbs script for iris-2ppn10 #6

a1947d9

drkostas added a commit that referenced this issue Mar 16, 2021

Updated all pbs scripts #6

6161fb3

drkostas added a commit that referenced this issue Mar 16, 2021

Update run script #6

89f7888

drkostas added a commit that referenced this issue Mar 16, 2021

Update the command that finds the real dir in run script #6

30996ea

drkostas added a commit that referenced this issue Mar 16, 2021

Add outputs from isaac side #6

15cead5

drkostas added a commit that referenced this issue Mar 16, 2021

Fixed bug that was using num_clusters for the number of processes #6

27c9948

drkostas added a commit that referenced this issue Mar 17, 2021

Manually adding the env bin path for mpirun command #6

2c9d15f

drkostas added a commit that referenced this issue Mar 17, 2021

Specifying memory per node at the pbs scripts #6

523f5ee

drkostas added a commit that referenced this issue Mar 17, 2021

Fix mpirun path for local runs #6

750e932

drkostas added a commit that referenced this issue Mar 17, 2021

Massively improved the time of distances calculation #6

f9a9975

drkostas added a commit that referenced this issue Mar 17, 2021

Deleted explicit memory request from the pbs scripts #6

efdaf2c

drkostas added a commit that referenced this issue Mar 18, 2021

Fix bad name replacement #6

288176a

drkostas added a commit that referenced this issue Mar 18, 2021

Fix mpi installation error on isaac #6

34f88a8

drkostas added a commit that referenced this issue Mar 18, 2021

Forgot to change the pbs scripts' paths #6

7f32877

drkostas added a commit that referenced this issue Mar 18, 2021

Still getting error with 2nodes-10 processes #6

d4acbb3

Changed back to 1 node-20 processes

drkostas added the assignment label Mar 18, 2021

drkostas self-assigned this Mar 18, 2021

drkostas closed this as completed in fb47033 Mar 18, 2021

DSE-512 automation moved this from In Progress to Done Mar 18, 2021
