Follow
Siddharth Singh
Siddharth Singh
PhD Student, Computer Science, University of Maryland
Verified email at umd.edu
Title
Cited by
Cited by
Year
Stance detection in web and social media: a comparative study
S Ghosh, P Singhania, S Singh, K Rudra, S Ghosh
Experimental IR Meets Multilinguality, Multimodality, and Interaction: 10th …, 2019
962019
Inducing Cooperation in Multi-Agent Games Through Status-Quo Loss
P Badjatiya, M Sarkar, A Sinha, S Singh, N Puri, B Krishnamurthy
arXiv preprint, 2020
9*2020
AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning
S Singh, A Bhatele
2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022
82022
A hybrid tensor-expert-data parallelism approach to optimize mixture-of-experts training
S Singh, O Ruwase, AA Awan, S Rajbhandari, Y He, A Bhatele
Proceedings of the 37th International Conference on Supercomputing, 203-214, 2023
72023
A survey and empirical evaluation of parallel deep learning frameworks
D Nichols, S Singh, SH Lin, A Bhatele
arXiv preprint arXiv:2111.04949, 2021
7*2021
Exploiting sparsity in pruned neural networks to optimize large model training
S Singh, A Bhatele
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
42023
PySchedCL: Leveraging Concurrency in Heterogeneous Data-Parallel Systems
A Ghose, S Singh, V Kulaharia, L Dokara, S Maity, S Dey
IEEE Transactions on Computers 71 (9), 2234-2247, 2021
22021
Loki: Low-Rank Keys for Efficient Sparse Attention
P Singhania, S Singh, S He, S Feizi, A Bhatele
arXiv preprint arXiv:2406.02542, 2024
12024
A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs
arXiv preprint arXiv:2305.13525, 2023
1*2023
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
A Hans, Y Wen, N Jain, J Kirchenbauer, H Kazemi, P Singhania, S Singh, ...
arXiv preprint arXiv:2406.10209, 2024
2024
Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization
S Singh, Z Sating, A Bhatele
arXiv preprint arXiv:2310.12298, 2023
2023
A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs
S Singh, P Singhania, AK Ranjan, Z Sating, A Bhatele
arXiv preprint arXiv:2305.13525, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–12