Follow
Jens Domke
Jens Domke
RIKEN Center for Computational Science (R-CCS) / Tokyo Institute of Technology
Verified email at riken.jp - Homepage
Title
Cited by
Cited by
Year
Deadlock-free oblivious routing for arbitrary topologies
J Domke, T Hoefler, WE Nagel
2011 IEEE International Parallel & Distributed Processing Symposium, 616-627, 2011
802011
Fail-in-place network design: interaction between topology, routing algorithm and failures
J Domke, T Hoefler, S Matsuoka
SC'14: Proceedings of the International Conference for High Performance …, 2014
422014
Mitigating inter-job interference using adaptive flow-aware routing
SA Smith, CE Cromey, DK Lowenthal, J Domke, N Jain, JJ Thiagarajan, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
352018
Matrix engines for high performance computing: A paragon of performance or grasping at straws?
J Domke, E Vatai, A Drozd, P ChenT, Y Oyama, L Zhang, S Salaria, ...
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021
332021
High-performance routing with multipathing and path diversity in ethernet and hpc networks
M Besta, J Domke, M Schneider, M Konieczny, S Di Girolamo, ...
IEEE Transactions on Parallel and Distributed Systems 32 (4), 943-959, 2020
332020
Scheduling-aware routing for supercomputers
J Domke, T Hoefler
SC'16: Proceedings of the International Conference for High Performance …, 2016
302016
Why globally re-shuffle? Revisiting data shuffling in large scale deep learning
TT Nguyen, F Trahay, J Domke, A Drozd, E Vatai, J Liao, M Wahib, ...
2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022
282022
Routing on the dependency graph: A new approach to deadlock-free high-performance routing
J Domke, T Hoefler, S Matsuoka
proceedings of the 25th ACM international symposium on high-performance …, 2016
262016
MLPerf™ HPC: A holistic benchmark suite for scientific machine learning on HPC systems
S Farrell, M Emani, J Balma, L Drescher, A Drozd, A Fink, G Fox, D Kanter, ...
2021 IEEE/ACM Workshop on Machine Learning in High Performance Computing …, 2021
222021
Hyperx topology: First at-scale implementation and comparison to the fat-tree
J Domke, S Matsuoka, IR Ivanov, Y Tsushima, T Yuki, A Nomura, S Miura, ...
Proceedings of the International Conference for High Performance Computing …, 2019
222019
Preliminary performance analysis of multi-rail fat-tree networks
N Wolfe, M Mubarak, N Jain, J Domke, A Bhatele, CD Carothers, RB Ross
2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2017
212017
Scaling distributed deep learning workloads beyond the memory capacity with KARMA
M Wahib, H Zhang, TT Nguyen, A Drozd, J Domke, L Zhang, R Takano, ...
SC20: International Conference for High Performance Computing, Networking …, 2020
202020
Double-precision fpus in high-performance computing: an embarrassment of riches?
J Domke, K Matsumura, M Wahib, H Zhang, K Yashima, T Tsuchikawa, ...
2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019
202019
Hardware-centric analysis of network performance for MPI applications
KA Brown, J Domke, S Matsuoka
2015 IEEE 21st International Conference on Parallel and Distributed Systems …, 2015
152015
Toward reliable validation of hpc network simulation models
M Mubarak, N Jain, J Domke, N Wolfe, C Ross, K Li, A Bhatele, ...
2017 Winter Simulation Conference (WSC), 659-674, 2017
112017
Tracing data movements within MPI collectives
KA Brown, J Domke, S Matsuoka
Proceedings of the 21st European MPI Users' Group Meeting, 117-118, 2014
112014
Runtime tracing of the community earth system model: feasibility study and benefits
J Domke, D Wang
Procedia Computer Science 9, 1950-1958, 2012
112012
High-performance gpu-to-cpu transpilation and optimization via high-level parallel constructs
WS Moses, IR Ivanov, J Domke, T Endo, J Doerfert, O Zinenko
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and …, 2023
102023
A64FX–Your Compiler You Must Decide!
J Domke
2021 IEEE International Conference on Cluster Computing (CLUSTER), 736-740, 2021
92021
Optimizing asynchronous multi-level checkpoint/restart configurations with machine learning
T Dey, K Sato, B Nicolae, J Guo, J Domke, W Yu, F Cappello, K Mohror
2020 IEEE International Parallel and Distributed Processing Symposium …, 2020
92020
The system can't perform the operation now. Try again later.
Articles 1–20