A class of parallel tiled linear algebra algorithms for multicore architectures
Buttari, Alfredo, Langou, Julien, Kurzak, Jakub, Dongarra, Jack
Published in Parallel computing (2009)
Published in Parallel computing (2009)
Get full text
Journal Article
Porting the PLASMA Numerical Library to the OpenMP Standard
YarKhan, Asim, Kurzak, Jakub, Luszczek, Piotr, Dongarra, Jack
Published in International journal of parallel programming (01.06.2017)
Published in International journal of parallel programming (01.06.2017)
Get full text
Journal Article
Parallel tiled QR factorization for multicore architectures
Buttari, Alfredo, Langou, Julien, Kurzak, Jakub, Dongarra, Jack
Published in Concurrency and computation (10.09.2008)
Published in Concurrency and computation (10.09.2008)
Get full text
Journal Article
Scheduling dense linear algebra operations on multicore processors
Kurzak, Jakub, Ltaief, Hatem, Dongarra, Jack, Badia, Rosa M.
Published in Concurrency and computation (01.01.2010)
Published in Concurrency and computation (01.01.2010)
Get full text
Journal Article
The PlayStation 3 for High-Performance Scientific Computing
Kurzak, Jakub, Buttari, Alfredo, Luszczek, Piotr, Dongarra, Jack
Published in Computing in science & engineering (01.05.2008)
Published in Computing in science & engineering (01.05.2008)
Get full text
Journal Article
Autotuning GEMM Kernels for the Fermi GPU
Kurzak, J., Tomov, S., Dongarra, J.
Published in IEEE transactions on parallel and distributed systems (01.11.2012)
Published in IEEE transactions on parallel and distributed systems (01.11.2012)
Get full text
Journal Article
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects
Agullo, Emmanuel, Demmel, Jim, Dongarra, Jack, Hadri, Bilel, Kurzak, Jakub, Langou, Julien, Ltaief, Hatem, Luszczek, Piotr, Tomov, Stanimire
Published in Journal of physics. Conference series (01.07.2009)
Published in Journal of physics. Conference series (01.07.2009)
Get full text
Journal Article
Autotuning Numerical Dense Linear Algebra for Batched Computation With GPU Hardware Accelerators
Dongarra, Jack, Gates, Mark, Kurzak, Jakub, Luszczek, Piotr, Tsai, Yaohung M.
Published in Proceedings of the IEEE (01.11.2018)
Published in Proceedings of the IEEE (01.11.2018)
Get full text
Journal Article
Randomized algorithms to update partial singular value decomposition on a hybrid CPU/GPU cluster
Yamazaki, Ichitaro, Kurzak, Jakub, Luszczek, Piotr, Dongarra, Jack
Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (15.11.2015)
Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (15.11.2015)
Get full text
Conference Proceeding
Experiences in autotuning matrix multiplication for energy minimization on GPUs
Anzt, Hartwig, Haugen, Blake, Kurzak, Jakub, Luszczek, Piotr, Dongarra, Jack
Published in Concurrency and computation (10.12.2015)
Published in Concurrency and computation (10.12.2015)
Get full text
Journal Article
Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs
Mary, Théo, Yamazaki, Ichitaro, Kurzak, Jakub, Luszczek, Piotr, Tomov, Stanimire, Dongarra, Jack
Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (15.11.2015)
Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (15.11.2015)
Get full text
Conference Proceeding
Accelerating scientific computations with mixed precision algorithms
Baboulin, Marc, Buttari, Alfredo, Dongarra, Jack, Kurzak, Jakub, Langou, Julie, Langou, Julien, Luszczek, Piotr, Tomov, Stanimire
Published in Computer physics communications (01.12.2009)
Published in Computer physics communications (01.12.2009)
Get full text
Journal Article
Looking back at dense linear algebra software
Luszczek, Piotr, Kurzak, Jakub, Dongarra, Jack
Published in Journal of parallel and distributed computing (01.07.2014)
Published in Journal of parallel and distributed computing (01.07.2014)
Get full text
Journal Article