Matrix-vector parallel multiplication С++ implementation using std::thread. Matrix stored by rows. Matrix stored by columns.