Graceful BLAS is the reference generic matrix multiply routine for the experiments presented in the paper:
"Throughput-distortion computation of generic matrix multiplication: Toward a computation channel for digital signal processing systems", D. Anastasia and Y. Andreopoulos, IEEE Transactions on Signal Processing, April 2012.
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6082463