A class of parallel tiled linear algebra algorithms for multicore architectures A Buttari, J Langou, J Kurzak, J Dongarra Parallel Computing 35 (1), 38-53, 2009 | 598 | 2009 |
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects E Agullo, J Demmel, J Dongarra, B Hadri, J Kurzak, J Langou, H Ltaief, ... Journal of Physics: Conference Series 180 (1), 012037, 2009 | 489 | 2009 |
Communication-optimal parallel and sequential QR and LU factorizations J Demmel, L Grigori, M Hoemmen, J Langou SIAM Journal on Scientific Computing 34 (1), A206-A239, 2012 | 401 | 2012 |
Parallel tiled QR factorization for multicore architectures A Buttari, J Langou, J Kurzak, J Dongarra Concurrency and Computation: Practice and Experience 20 (13), 1573-1590, 2008 | 245 | 2008 |
Algorithm-based fault tolerance applied to high performance computing G Bosilca, R Delmas, J Dongarra, J Langou Journal of Parallel and Distributed Computing 69 (4), 410-416, 2009 | 227 | 2009 |
Algorithm 842: A set of GMRES routines for real and complex arithmetics on high performance computers V Frayssé, L Giraud, S Gratton, J Langou ACM Transactions on Mathematical Software (TOMS) 31 (2), 228-238, 2005 | 183 | 2005 |
Tiled QR factorization algorithms H Bouwmeester, M Jacquelin, J Langou, Y Robert SC'11: Proceedings of 2011 International Conference for High Performance …, 2011 | 181 | 2011 |
Accelerating scientific computations with mixed precision algorithms M Baboulin, A Buttari, J Dongarra, J Kurzak, J Langou, J Langou, ... Computer Physics Communications 180 (12), 2526-2533, 2009 | 179 | 2009 |
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ... 2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011 | 174* | 2011 |
Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems) J Langou, J Langou, P Luszczek, J Kurzak, A Buttari, J Dongarra SC'06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, 50-50, 2006 | 163 | 2006 |
The impact of multicore on math software A Buttari, J Dongarra, J Kurzak, J Langou, P Luszczek, S Tomov International Workshop on Applied Parallel Computing, 1-10, 2006 | 156 | 2006 |
The loss of orthogonality in the Gram-Schmidt orthogonalization process L Giraud, J Langou, M Rozloznik Computers & Mathematics with Applications 50 (7), 1069-1075, 2005 | 154 | 2005 |
Mixed precision iterative refinement techniques for the solution of dense linear systems A Buttari, J Dongarra, J Langou, J Langou, P Luszczek, J Kurzak The International Journal of High Performance Computing Applications 21 (4 …, 2007 | 138 | 2007 |
Rounding error analysis of the classical Gram-Schmidt orthogonalization process L Giraud, J Langou, M Rozložník, J van den Eshof Numerische Mathematik 101 (1), 87-100, 2005 | 135 | 2005 |
Fault tolerant high performance computing by a coding approach Z Chen, GE Fagg, E Gabriel, J Langou, T Angskun, G Bosilca, J Dongarra Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of …, 2005 | 130 | 2005 |
Handbook of parallel computing: models, algorithms and applications S Rajasekaran, J Reif CRC press, 2007 | 120* | 2007 |
LU factorization for accelerator-based systems E Agullo, C Augonnet, J Dongarra, M Faverge, J Langou, H Ltaief, ... 2011 9th IEEE/ACS International Conference on Computer Systems and …, 2011 | 80 | 2011 |
Recovery patterns for iterative methods in a parallel unstable environment J Langou, Z Chen, G Bosilca, J Dongarra SIAM Journal on Scientific Computing 30 (1), 102-116, 2008 | 62 | 2008 |
A set of GMRES routines for real and complex arithmetics V Frayssé, L Giraud, S Gratton, J Langou URL http://www. cerfacs. fr/algor/Softs/GMRES/index. html, 1997 | 60 | 1997 |
Plasma users guide E Agullo, J Dongarra, B Hadri, J Kurzak, J Langou, J Langou, H Ltaief, ... Technical report, ICL, UTK, 2009 | 57 | 2009 |