National Tsing Hua University - CS 542200. Course instructor: Jerry Chou
This repo includes 4 (+1) homework assignments, all of which cover parallel computing techniques such as MPI, OpenMP, Pthread, and CUDA.
For the details of each homework (the homework spec and my report), see that homework's directory and look for the PDF files prefixed with PP_2020 and hw.
Homework List:
- HW1: [MPI] Odd-Even Sort
  - If you want better performance, rewrite the logic in C++ and call the native C++ libraries (STL, ...); this can give better performance than plain C.
- HW2: [MPI+OpenMP+Vectorization] Mandelbrot Set
  - My code ranks in the top 3.
  - Vectorization here is an Intel-based speedup method.
- HW3: [Pthread] All-Pairs Shortest Path
  - There are two ways to get more efficient code than mine:
    - Implement the Blocked Floyd-Warshall algorithm and use ternary expressions to implement the logic. (You can check `./hw4-1/seq.cc` to see how to implement the Blocked Floyd-Warshall algorithm on the CPU.)
    - Implement the Floyd-Warshall algorithm and use vectorization to implement the logic.
- HW4: [Cuda] All-Pairs Shortest Path (One GPU or Multiple GPUs)
  - My code ranks in the top 3.
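
As a rough illustration of the HW3 suggestion above (Blocked Floyd-Warshall with a ternary expression for the relaxation), here is a minimal CPU sketch. It is not the code in `./hw4-1/seq.cc`. The names `INF`, `relax_block`, and `blocked_floyd_warshall` are hypothetical, and it assumes a row-major `n x n` matrix of non-negative `int` weights with zeros on the diagonal.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical "no edge" sentinel, chosen so that INF + INF still fits in an int.
constexpr int INF = 1 << 29;

// Relax every cell of block (bi, bj) through the pivots k that belong to round r.
// dist is an n x n row-major matrix and B is the block size.
static void relax_block(std::vector<int>& dist, int n, int B, int r, int bi, int bj) {
    const int k_end = std::min((r + 1) * B, n);
    const int i_end = std::min((bi + 1) * B, n);
    const int j_end = std::min((bj + 1) * B, n);
    for (int k = r * B; k < k_end; ++k) {
        for (int i = bi * B; i < i_end; ++i) {
            for (int j = bj * B; j < j_end; ++j) {
                const int via = dist[i * n + k] + dist[k * n + j];
                int& d = dist[i * n + j];
                // Ternary expression instead of an if statement.
                d = (via < d) ? via : d;
            }
        }
    }
}

// Blocked Floyd-Warshall: each round runs three phases around pivot block (r, r).
void blocked_floyd_warshall(std::vector<int>& dist, int n, int B) {
    const int rounds = (n + B - 1) / B;
    for (int r = 0; r < rounds; ++r) {
        // Phase 1: the pivot block itself.
        relax_block(dist, n, B, r, r, r);
        // Phase 2: blocks that share a row or a column with the pivot block.
        for (int b = 0; b < rounds; ++b) {
            if (b == r) continue;
            relax_block(dist, n, B, r, r, b);
            relax_block(dist, n, B, r, b, r);
        }
        // Phase 3: all remaining blocks.
        for (int bi = 0; bi < rounds; ++bi) {
            if (bi == r) continue;
            for (int bj = 0; bj < rounds; ++bj) {
                if (bj == r) continue;
                relax_block(dist, n, B, r, bi, bj);
            }
        }
    }
}
```

Within a phase the blocks do not depend on each other, so phases 2 and 3 are the natural places to split work across threads. The block size `B` is a tuning parameter; the sketch accepts any positive value, including one that does not divide `n`.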
There are also 5 labs for your reference.
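
For the HW1 note on using the C++ standard library, the following is a minimal single-process sketch of the odd-even sort logic built on `std::sort` and `std::merge`. It is not the submitted homework code. There is no MPI here: `chunks[r]` stands in for the local data of rank `r`, the helper names `merge_split` and `odd_even_sort` are hypothetical, and all chunks are assumed to have the same size. In an MPI version, the neighbor's chunk would be exchanged over the network before the merge step.

```cpp
#include <algorithm>
#include <cstddef>
#include <iterator>
#include <vector>

// Hypothetical helper: merge two sorted chunks, then hand the smaller elements
// back to `low` and the larger ones to `high`, keeping both chunk sizes unchanged.
void merge_split(std::vector<float>& low, std::vector<float>& high) {
    std::vector<float> merged;
    merged.reserve(low.size() + high.size());
    std::merge(low.begin(), low.end(), high.begin(), high.end(),
               std::back_inserter(merged));
    const auto mid = merged.begin() + static_cast<std::ptrdiff_t>(low.size());
    std::copy(merged.begin(), mid, low.begin());
    std::copy(mid, merged.end(), high.begin());
}

// Single-process stand-in for the ranks: sort each chunk locally, then
// alternate even and odd phases of neighbor merge-splits.
void odd_even_sort(std::vector<std::vector<float>>& chunks) {
    for (auto& c : chunks) std::sort(c.begin(), c.end());
    const std::size_t p = chunks.size();
    for (std::size_t phase = 0; phase < p; ++phase) {
        // Even phases pair (0,1), (2,3), ...; odd phases pair (1,2), (3,4), ...
        for (std::size_t r = phase % 2; r + 1 < p; r += 2) {
            merge_split(chunks[r], chunks[r + 1]);
        }
    }
}
```

Because each chunk is sorted once up front, every later phase only needs a linear-time merge rather than a full sort.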