Rajib Nath
Title
Cited by
Cited by
Year
Dense linear algebra solvers for multicore with GPU accelerators
S Tomov, R Nath, H Ltaief, J Dongarra
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
2652010
An improved MAGMA GEMM for Fermi graphics processing units
R Nath, S Tomov, J Dongarra
The International Journal of High Performance Computing Applications 24 (4 …, 2010
2322010
Accelerating GPU kernels for dense linear algebra
R Nath, S Tomov, J Dongarra
High Performance Computing for Computational Science–VECPAR 2010, 83-92, 2011
772011
Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
S Tomov, R Nath, J Dongarra
Parallel Computing 36 (12), 645-654, 2010
772010
A scalable high performant Cholesky factorization for multicore with GPU accelerators
H Ltaief, S Tomov, R Nath, P Du, J Dongarra
International Conference on High Performance Computing for Computational …, 2010
652010
Optimizing Symmetric Dense Matrix-Vector Multiplication on GPUs
R Nath, S Tomov, J Dongarra
Super Computing (SC), 2011
582011
JETC: Joint energy thermal and cooling management for memory and CPU subsystems in servers
R Ayoub, R Nath, T Rosing
IEEE International Symposium on High-Performance Comp Architecture, 1-12, 2012
412012
MAGMA version 0.2 User Guide
S Tomov, R Nath, P Du, J Dongarra
392009
MAGMA Users’ Guide
S Tomov, R Nath, P Du, J Dongarra
ICL, UTK (November 2009), 2011
382011
Hybrid multicore cholesky factorization with multiple gpu accelerators
H Ltaief, S Tomov, R Nath, J Dongarra
IEEE Transaction on Parallel and Distributed Systems 48, 2010
252010
An implementation of the tile QR factorization for a GPU and multiple CPUs
J Kurzak, R Nath, P Du, J Dongarra
PARA, 2010
202010
A fully empirical autotuned dense QR factorization for multicore architectures
E Agullo, J Dongarra, R Nath, S Tomov
Euro-Par 2011 Parallel Processing, 194-205, 2011
192011
The CRISP performance model for dynamic voltage and frequency scaling in a GPGPU
R Nath, D Tullsen
Proceedings of the 48th International Symposium on Microarchitecture, 281-293, 2015
182015
Auto-Tuning Stencil Computations on Multicore and Accelerators.
K Datta, S Williams, V Volkov, J Carter, L Oliker, J Shalf, KA Yelick
Scientific Computing with Multicore and Accelerators, 219-253, 2010
102010
BLAS for GPUs
R Nath, S Tomov, J Dongarra
92010
Cometc: Coordinated management of energy/thermal/cooling in servers
R Ayoub, R Nath, TS Rosing
ACM Transactions on Design Automation of Electronic Systems (TODAES) 19 (1 …, 2013
62013
Temperature aware thread block scheduling in GPGPUs
R Nath, R Ayoub, TS Rosing
Proceedings of the 50th Annual Design Automation Conference, 1-6, 2013
62013
Power Modeling and Thermal Management Techniques for Manycores
R Nath, D Carmean, T Rosing
62013
MAGMA: Matrix algebra on GPU and multicore architectures
A Tomov, R Nath, P Du, J Dongarra
62012
Fully empirical autotuned qr factorization for multicore architectures
E Agullo, J Dongarra, R Nath, S Tomov
arXiv preprint arXiv:1102.5328, 2011
62011
The system can't perform the operation now. Try again later.
Articles 1–20