Alexander Heinecke

Zitiert von

	Alle	Seit 2019
Zitate	4012	2758
h-index	30	24
i10-index	78	58

760

380

190

570

20102011201220132014201520162017201820192020202120222023202415 13 45 132 120 199 207 198 248 325 390 515 567 753 201

Öffentlicher Zugriff

Alle anzeigen

10 Artikel

3 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Folgen

Alexander Heinecke

Senior Principal Engineer at Intel Labs

Bestätigte E-Mail-Adresse bei intel.com - Startseite

HPC and Parallel Computing


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science A Marek, V Blum, R Johanni, V Havu, B Lang, T Auckenthaler, A Heinecke, ... Journal of Physics: Condensed Matter 26 (21), 213201, 2014	305	2014
A study of BFLOAT16 for deep learning training D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ... arXiv preprint arXiv:1905.12322, 2019	298	2019
Design and implementation of the linpack benchmark for single and multi-node systems based on intel® xeon phi coprocessor A Heinecke, K Vaidyanathan, M Smelyanskiy, A Kobotov, R Dubtsov, ... 2013 IEEE 27th International Symposium on Parallel and Distributed …, 2013	215	2013
LIBXSMM: accelerating small matrix multiplications by runtime code generation A Heinecke, G Henry, M Hutchinson, H Pabst SC'16: Proceedings of the International Conference for High Performance …, 2016	198	2016
Mixed precision training of convolutional neural networks using integer operations D Das, N Mellempudi, D Mudigere, D Kalamkar, S Avancha, K Banerjee, ... arXiv preprint arXiv:1802.00930, 2018	187	2018
Petascale high order dynamic rupture earthquake simulations on heterogeneous supercomputers A Heinecke, A Breuer, S Rettenberger, M Bader, AA Gabriel, C Pelties, ... SC'14: Proceedings of the International Conference for High Performance …, 2014	167	2014
ls1 mardyn: The Massively Parallel Molecular Dynamics Code for Large Systems C Niethammer, S Becker, M Bernreuther, M Buchholz, W Eckhardt, ... Journal of chemical theory and computation 10 (10), 4455-4464, 2014	158	2014
Anatomy of high-performance deep learning convolutions on simd architectures E Georganas, S Avancha, K Banerjee, D Kalamkar, G Henry, H Pabst, ... SC18: International Conference for High Performance Computing, Networking …, 2018	122	2018
591 TFLOPS multi-trillion particles simulation on SuperMUC W Eckhardt, A Heinecke, R Bader, M Brehm, N Hammer, H Huber, ... Supercomputing: 28th International Supercomputing Conference, ISC 2013 …, 2013	101	2013
Distgnn: Scalable distributed training for large-scale graph neural networks V Md, S Misra, G Ma, R Mohanty, E Georganas, A Heinecke, D Kalamkar, ... Proceedings of the International Conference for High Performance Computing …, 2021	99	2021
From gpgpu to many-core: Nvidia fermi and intel many integrated core architecture A Heinecke, M Klemm, HJ Bungartz Computing in Science & Engineering 14 (2), 78-83, 2012	88	2012
Sustained petascale performance of seismic simulations with SeisSol on SuperMUC A Breuer, A Heinecke, S Rettenberger, M Bader, AA Gabriel, C Pelties Supercomputing: 29th International Conference, ISC 2014, Leipzig, Germany …, 2014	87	2014
Fp8 formats for deep learning P Micikevicius, D Stosic, N Burgess, M Cornea, P Dubey, R Grisenthwaite, ... arXiv preprint arXiv:2209.05433, 2022	78	2022
Leveraging the bfloat16 artificial intelligence datatype for higher-precision computations G Henry, PTP Tang, A Heinecke 2019 IEEE 26th Symposium on Computer Arithmetic (ARITH), 69-76, 2019	68	2019
Efficient shared-memory implementation of high-performance conjugate gradient benchmark and its application to unstructured matrices J Park, M Smelyanskiy, K Vaidyanathan, A Heinecke, DD Kalamkar, X Liu, ... SC'14: Proceedings of the International Conference for High Performance …, 2014	67	2014
Performance optimizations for scalable implicit RANS calculations with SU2 TD Economon, D Mudigere, G Bansal, A Heinecke, F Palacios, J Park, ... Computers & Fluids 129, 146-158, 2016	55	2016
Methods and apparatus to detect anomalies of a monitored system M Agerstam, B Sadeghi, J Martin, J Ota, J Gottschlich, M Carranza, ... US Patent 10,802,942, 2020	50	2020
Petascale local time stepping for the ADER-DG finite element method A Breuer, A Heinecke, M Bader 2016 IEEE international parallel and distributed processing symposium (IPDPS …, 2016	49	2016
Optimized compute hardware for machine learning operations D Das, R Gramunt, M Smelyanskiy, J Corbal, D Mudigere, NK Mellempudi, ... US Patent 10,776,699, 2020	45	2020
Computer processor for higher precision computations using a mixed-precision decomposition of operations G Henry, A Heinecke US Patent 10,853,067, 2020	44	2020

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von