Beyond data and model parallelism for deep neural networks Z Jia, M Zaharia, A Aiken SysML 19, 2019 | 466 | 2019 |
TASO: Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions Z Jia, O Padon, J Thomas, T Warszawski, M Zaharia, A Aiken SOSP'19, 2019 | 242 | 2019 |
Improving the Accuracy, Scalability, and Performance of Graph Neural Networks with Roc Z Jia, S Lin, M Gao, M Zaharia, A Aiken MLSys'20, 2020 | 201 | 2020 |
Improving integer security for systems with KINT X Wang, H Chen, Z Jia, N Zeldovich, MF Kaashoek 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2012 | 159 | 2012 |
Undefined behavior: what happened to my code? X Wang, H Chen, A Cheung, Z Jia, N Zeldovich, MF Kaashoek Proceedings of the Asia-Pacific Workshop on Systems, 1-7, 2012 | 126 | 2012 |
Exploring hidden dimensions in parallelizing convolutional neural networks Z Jia, S Lin, CR Qi, A Aiken ICML 18, 2018 | 125 | 2018 |
Dorylus: Affordable, scalable, and accurate GNN training with distributed CPU servers and serverless threads J Thorpe, Y Qiao, J Eyolfson, S Teng, G Hu, Z Jia, J Wei, K Vora, ... 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2021 | 116 | 2021 |
Redundancy-Free Computation for Graph Neural Networks Z Jia, S Lin, R Ying, J You, J Leskovec, A Aiken KDD'20, 2019 | 83 | 2019 |
A distributed multi-GPU system for fast graph processing Z Jia, Y Kwon, G Shipman, P McCormick, M Erez, A Aiken Proceedings of the VLDB Endowment 11 (3), 297-310, 2017 | 83 | 2017 |
Optimizing DNN Computation With Relaxed Graph Substitutions Z Jia, J Thomas, T Warszawski, M Gao, M Zaharia, A Aiken SysML 2019, 2019 | 81 | 2019 |
Software-hardware co-design for fast and scalable training of deep learning recommendation models D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ... Proceedings of the 49th Annual International Symposium on Computer …, 2022 | 72 | 2022 |
Unity: Accelerating DNN training through joint optimization of algebraic transformations and parallelization C Unger, Z Jia, W Wu, S Lin, M Baines, CEQ Narvaez, V Ramakrishnaiah, ... 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022 | 56 | 2022 |
IOS: Inter-operator scheduler for CNN acceleration Y Ding, L Zhu, Z Jia, G Pekhimenko, S Han Proceedings of Machine Learning and Systems 3, 167-180, 2021 | 55 | 2021 |
SLIK: Scalable Low-Latency Indexes for a Key-Value Store A Kejriwal, A Gopalan, A Gupta, Z Jia, S Yang, J Ousterhout 2016 USENIX Annual Technical Conference (USENIX ATC 16), 57-70, 2016 | 55 | 2016 |
BOND: Benchmarking unsupervised outlier node detection on static attributed graphs K Liu, Y Dou, Y Zhao, X Ding, X Hu, R Zhang, K Ding, C Chen, H Peng, ... Advances in Neural Information Processing Systems 35, 27021-27035, 2022 | 50 | 2022 |
PET: Optimizing tensor programs with partially equivalent transformations and automated corrections H Wang, J Zhai, M Gao, Z Ma, S Tang, L Zheng, Y Li, K Rong, Y Chen, ... 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2021 | 50 | 2021 |
SpecInfer: Accelerating generative LLM serving with speculative inference and token tree verification X Miao, G Oliaro, Z Zhang, X Cheng, Z Wang, RYY Wong, Z Chen, ... arXiv preprint arXiv:2305.09781, 2023 | 44 | 2023 |
Exploring hidden dimensions in accelerating convolutional neural networks Z Jia, S Lin, CR Qi, A Aiken International Conference on Machine Learning, 2274-2283, 2018 | 44 | 2018 |
TopoOpt: Optimizing the Network Topology for Distributed DNN Training W Wang, M Khazraee, Z Zhong, Z Jia, D Mudigere, Y Zhang, A Kewitsch, ... arXiv preprint arXiv:2202.00433, 2022 | 43* | 2022 |
M. Khorashadi, P D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ... Bhattacharya, P. Lapukhov, M. Naumov, L. Qiao, M. Smelyanskiy, B. Jia, and V …, 2021 | 38 | 2021 |