Saeed Maleki
xAI
Verified email at x.ai
Title
Cited by
Year
An evaluation of vectorizing compilers
S Maleki, Y Gao, MJ Garzarán, T Wong, DA Padua
2011 International Conference on Parallel Architectures and Compilation …, 2011
Cited by: 310; Year: 2011
CHET: an optimizing compiler for fully-homomorphic neural-network inferencing
R Dathathri, O Saarikivi, H Chen, K Laine, K Lauter, S Maleki, ...
Proceedings of the 40th ACM SIGPLAN conference on programming language …, 2019
Cited by: 231; Year: 2019
Performance portability with the Chapel language
A Sidelnik, S Maleki, BL Chamberlain, MJ Garzarán, D Padua
2012 IEEE 26th international parallel and distributed processing symposium …, 2012
Cited by: 66; Year: 2012
DSMR: A parallel algorithm for single-source shortest path problem
S Maleki, D Nguyen, A Lenharth, M Garzarán, D Padua, K Pingali
Proceedings of the 2016 International Conference on Supercomputing, 1-14, 2016
Cited by: 51; Year: 2016
Parallelizing dynamic programming through rank convergence
S Maleki, M Musuvathi, T Mytkowicz
ACM SIGPLAN Notices 49 (8), 219-232, 2014
Cited by: 45; Year: 2014
An empirical study of the effect of source-level loop transformations on compiler stability
Z Gong, Z Chen, J Szaday, D Wong, Z Sura, N Watkinson, S Maleki, ...
Proceedings of the ACM on Programming Languages 2 (OOPSLA), 1-29, 2018
Cited by: 36; Year: 2018
Synthesizing optimal collective algorithms
Z Cai, Z Liu, S Maleki, M Musuvathi, T Mytkowicz, J Nelson, O Saarikivi
Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of …, 2021
Cited by: 28; Year: 2021
Inter-disciplinary research challenges in computer systems for the 2020s
A Cohen, X Shen, J Torrellas, J Tuck, Y Zhou, S Adve, I Akturk, S Bagchi, ...
National Science Foundation, 2018
Cited by: 28; Year: 2018
Parallel dynamic programming through rank convergence
TD Mytkowicz, M Musuvathi, S Maleki
US Patent 9,195,436, 2015
Cited by: 26; Year: 2015
Breaking the computation and communication abstraction barrier in distributed machine learning workloads
A Jangda, J Huang, G Liu, AHN Sabet, S Maleki, Y Miao, M Musuvathi, ...
Proceedings of the 27th ACM International Conference on Architectural …, 2022
Cited by: 25; Year: 2022
The magazine archive includes every article published in Communications of the ACM for over the past 50 years.
MY Vardi
Communications of the ACM 54 (5), 5, 2011
Cited by: 23*; Year: 2011
Homomorphic evaluation of tensor programs
MS Musuvathi, K Laine, KE Lauter, H Chen, OI Saarikivi, S Maleki, ...
US Patent 11,177,935, 2021
Cited by: 18; Year: 2021
Implementing network security measures in response to a detected cyber attack
MS Musuvathi, TD Mytkowicz, S Maleki, Y Ding
US Patent 10,805,317, 2020
Cited by: 18; Year: 2020
CHET: compiler and runtime for homomorphic evaluation of tensor programs
R Dathathri, O Saarikivi, H Chen, K Laine, K Lauter, S Maleki, ...
arXiv preprint arXiv:1810.00845, 2018
Cited by: 18; Year: 2018
Lore: A loop repository for the evaluation of compilers
Z Chen, Z Gong, JJ Szaday, DC Wong, D Padua, A Nicolau, ...
2017 IEEE International Symposium on Workload Characterization (IISWC), 219-228, 2017
Cited by: 18; Year: 2017
Parallelizing WFST speech decoders
C Mendis, J Droppo, S Maleki, M Musuvathi, T Mytkowicz, G Zweig
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by: 18; Year: 2016
Efficient parallelization using rank convergence in dynamic programming algorithms
S Maleki, M Musuvathi, T Mytkowicz
Communications of the ACM 59 (10), 85-92, 2016
Cited by: 15; Year: 2016
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
A Shah, V Chidambaram, M Cowan, S Maleki, M Musuvathi, T Mytkowicz, ...
20th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2023
Cited by: 13; Year: 2023
Low-rank methods for parallelizing dynamic programming algorithms
S Maleki, M Musuvathi, T Mytkowicz
ACM Transactions on Parallel Computing (TOPC) 2 (4), 1-32, 2016
Cited by: 13; Year: 2016
Synthesizing collective communication algorithms for heterogeneous networks with TACCL
A Shah, V Chidambaram, M Cowan, S Maleki, M Musuvathi, T Mytkowicz, ...
arXiv preprint arXiv:2111.04867, 2021
Cited by: 12; Year: 2021