Tareq Malas, Ph.D.
Verified email at intel.com
Title
Cited by
Year
Multicore-optimized wavefront diamond blocking for optimizing stencil updates
T Malas, G Hager, H Ltaief, H Stengel, G Wellein, D Keyes
SIAM Journal on Scientific Computing 37 (4), C439-C464, 2015
Cited by 70, 2015
Applying the roofline performance model to the Intel Xeon Phi Knights Landing processor
D Doerfler, J Deslippe, S Williams, L Oliker, B Cook, T Kurth, M Lobet, ...
International Conference on High Performance Computing, 339-353, 2016
Cited by 65, 2016
Deep learning at 15PF: supervised and semi-supervised classification for scientific data
T Kurth, J Zhang, N Satish, E Racah, I Mitliagkas, MMA Patwary, T Malas, ...
Proceedings of the International Conference for High Performance Computing …, 2017
Cited by 61, 2017
Evaluating and optimizing the NERSC workload on Knights Landing
T Barnes, B Cook, J Deslippe, D Doerfler, B Friesen, Y He, T Kurth, ...
2016 7th International Workshop on Performance Modeling, Benchmarking and …, 2016
Cited by 37, 2016
Multidimensional intratile parallelization for memory-starved stencil computations
TM Malas, G Hager, H Ltaief, DE Keyes
ACM Transactions on Parallel Computing (TOPC) 4 (3), 1-32, 2017
Cited by 28, 2017
Feature selection for recognizing handwritten Arabic letters
GA Abandah, TM Malas
Dirasat Engineering Sciences Journal 37 (2), 2010
Cited by 23, 2010
Toward optimal Arabic keyboard layout using genetic algorithm
TM Malas, SS Taifour, GA Abandah
Proc. 9th Int’l Middle Eastern Multiconf. on Simulation and Modeling (MESM …, 2008
Cited by 21, 2008
Optimization of an electromagnetics code with multicore wavefront diamond blocking and multi-dimensional intra-tile parallelization
TM Malas, J Hornich, G Hager, H Ltaief, C Pflaum, DE Keyes
2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016
Cited by 15, 2016
High-performance seismic modeling with finite-difference using spatial and temporal cache blocking
V Etienne, T Tonellot, T Malas, H Ltaief, S Kortas, P Thierry, D Keyes
Third EAGE Workshop on High Performance Computing for Upstream 2017 (1), 1-5, 2017
Cited by 8, 2017
Optimization of the sparse matrix-vector products of an IDR Krylov iterative solver in EMGeo for the Intel KNL manycore processor
T Malas, T Kurth, J Deslippe
International Conference on High Performance Computing, 378-389, 2016
Cited by 8, 2016
Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking
T Malas, G Hager, H Ltaief, D Keyes
arXiv preprint arXiv:1410.5561, 2014
Cited by 8, 2014
Optimizing the performance of streaming numerical kernels on the IBM Blue Gene/P PowerPC 450 processor
T Malas, AJ Ahmadia, J Brown, JA Gunnels, DE Keyes
The International journal of high performance computing applications 27 (2 …, 2013
Cited by 8, 2013
Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking. CoRR abs/1410.5561
TM Malas, G Hager, H Ltaief, DE Keyes
arXiv preprint arXiv:1410.5561, 2014
Cited by 5, 2014
Analyzing Performance of Selected NESAP Applications on the Cori HPC System
J Deslippe, D Doerfler, B Friesen, YH He, T Koskela, M Lobet, T Malas, ...
High Performance Computing: ISC High Performance 2017 International …, 2017
Cited by 2, 2017
Optimization of finite-difference kernels on multi-core architectures for seismic applications
V Etienne, T Tonellot, K Akbudak, H Ltaief, S Kortas, T Malas, P Thierry, ...
Intel EXtreme Performance Users Group, 2018
Cited by 1, 2018
Analyzing performance of selected NESAP applications on the Cori HPC system
T Kurth, W Arndt, T Barnes, B Cook, J Deslippe, D Doerfler, B Friesen, ...
International Conference on High Performance Computing, 334-347, 2017
Cited by 1, 2017
Tiling and asynchronous communication optimizations for stencil computations
TMY Malas
Cited by 1, 2015
Conference
T Kurth, E Racah, W Bhimji, J Deslippe, ZJ Prabhat, I Mitliagkas, N Satish, ...
2017
Towards Fast Reverse Time Migration Kernels using Multi-threaded Wavefront Diamond Tiling
T Malas, G Hager, H Ltaief, D Keyes
Third EAGE Workshop on Iraq 2015 (1), 1-5, 2015
2015
Optimizing Stencil Computations: Multicore-optimized wavefront diamond blocking on Shared and Distributed Memory Systems
TMY Malas, H Ltaief, G Hager, G Wellein, DE Keyes
2014