The READEX formalism for automatic tuning for energy efficiency J Schuchart, M Gerndt, PG Kjeldsberg, M Lysaght, D Horák, L Říha, ... Computing 99, 727-745, 2017 | 43 | 2017 |
Effective multi-GPU communication using multiple CUDA streams and threads M Sourouri, T Gillberg, SB Baden, X Cai 2014 20th IEEE International Conference on Parallel and Distributed Systems …, 2014 | 41 | 2014 |
Panda: A Compiler Framework for Concurrent CPU+GPU Execution of 3D Stencil Computations on GPU-accelerated Supercomputers M Sourouri, SB Baden, X Cai International Journal of Parallel Programming 45 (3), 711-729, 2017 | 31 | 2017 |
Scalable heterogeneous CPU-GPU computations for unstructured tetrahedral meshes J Langguth, M Sourouri, GT Lines, SB Baden, X Cai IEEE Micro 35 (4), 6-15, 2015 | 25 | 2015 |
Towards fine-grained dynamic tuning of HPC applications on modern multi-core architectures M Sourouri, EB Raknes, N Reissmann, J Langguth, D Hackenberg, ... Proceedings of the International Conference for High Performance Computing …, 2017 | 23 | 2017 |
CPU+GPU programming of stencil computations for resource-efficient use of GPU clusters M Sourouri, J Langguth, F Spiga, SB Baden, X Cai 2015 IEEE 18th International Conference on Computational Science and …, 2015 | 21 | 2015 |
Memory bandwidth contention: Communication vs computation tradeoffs in supercomputers with multicore architectures J Langguth, X Cai, M Sourouri 2018 IEEE 24th International Conference on Parallel and Distributed Systems …, 2018 | 19 | 2018 |
A new parallel 3D front propagation algorithm for fast simulation of geological folds T Gillberg, M Sourouri, X Cai Procedia Computer Science 9, 947-955, 2012 | 18 | 2012 |
Parallel solutions of static Hamilton-Jacobi equations for simulations of geological folds T Gillberg, AM Bruaset, Ø Hjelle, M Sourouri Journal of Mathematics in Industry 4, 1-31, 2014 | 13 | 2014 |
On the performance and energy efficiency of the pgas programming model on multicore architectures J Lagraviere, J Langguth, M Sourouri, PH Ha, X Cai 2016 International Conference on High Performance Computing & Simulation …, 2016 | 9 | 2016 |
Multi-gpu implementations of parallel 3d sweeping algorithms with application to geological folding E Krishnasamy, M Sourouri, X Cai Procedia Computer Science 51, 1494-1503, 2015 | 7 | 2015 |
Accelerating 3D Elastic Wave Equations on Knights Landing based Intel Xeon Phi processors M Sourouri, E Birger Raknes EGU General Assembly Conference Abstracts, 7790, 2017 | 1 | 2017 |
Scalable Heterogeneous Supercomputing: Programming Methodologies and Automated Code Generation M Sourouri | 1 | 2015 |
A parallel front propagation method: simulating geological folds on parallel architectures M Sourouri | 1 | 2012 |
Key exercise 1: Using Finite Difference Method to solve the 2D Wave Equation K Støverud, M Sourouri, I Drøsdal | | 2011 |
Document history Version Date Author/Editor Description A Gocht, USM TUD, M Lysaght, V Kannan, M Gerndt, A Chowdhury, ... | | |