I was wondering if it is possible to (easily) obtain timing information per thread for the multi-threaded BLAS calls in the MKL library?I just want to check the efficiency of using multiple threads when calling some Level 3 BLAS routines within my Fortran 90 code.Any suggestions greatly ...