Jaewoong Sim

Cited by

	All	Since 2019
Citations	2976	2060
h-index	20	18
i10-index	23	23

Citations per year

Year	Citations
2012	14
2013	44
2014	69
2015	100
2016	121
2017	189
2018	332
2019	374
2020	391
2021	471
2022	412
2023	335
2024	77

Public access

4 articles available, 2 articles not available (based on funding mandates)

Co-authors

Hyesoon Kim	Georgia Tech	Verified email at cc.gatech.edu
Gabriel H. Loh	AMD Research and Advanced Development (RAD)	Verified email at amd.com
Asit Mishra	Nvidia	Verified email at nvidia.com
Srivatsan Krishnan	Harvard University	Verified email at seas.harvard.edu
Mike O'Connor	NVIDIA Research	Verified email at nvidia.com
Lifeng Nai	Google	Verified email at google.com
Chris Wilkerson	Intel	Verified email at intel.com
Alaa R. Alameldeen	Simon Fraser University	Verified email at cs.sfu.ca
Philip H.W. Leong	Professor of Computer Systems, The University of Sydney	Verified email at sydney.edu.au
Zeshan Chishti	Staff Research Scientist, Intel Corporation	Verified email at intel.com
Mithuna Thottethodi	Purdue University	Verified email at purdue.edu
Vilas Sridharan	AMD, Inc.	Verified email at amd.com
Richard Vuduc	Georgia Institute of Technology	Verified email at cc.gatech.edu
Moinuddin Qureshi	Professor, Georgia Institute of Technology	Verified email at gatech.edu
Jaekyu Lee	Arm Research	Verified email at arm.com

Jaewoong Sim

Seoul National University

Verified email at snu.ac.kr

Computer Architecture, Machine Learning


Title	Authors	Venue	Cited by	Year
Can FPGAs beat GPUs in accelerating next-generation deep neural networks?	E Nurvitadhi, G Venkatesh, J Sim, D Marr, R Huang, J Ong Gee Hock, ...	Proceedings of the 2017 ACM/SIGDA International Symposium on Field …, 2017	561	2017
Accelerating binarized neural networks: Comparison of FPGA, CPU, GPU, and ASIC	E Nurvitadhi, D Sheffield, J Sim, A Mishra, G Venkatesh, D Marr	2016 International Conference on Field-Programmable Technology (FPT), 77-84, 2016	387	2016
GraphPIM: Enabling instruction-level PIM offloading in graph computing frameworks	L Nai, R Hadidi, J Sim, H Kim, P Kumar, H Kim	2017 IEEE International Symposium on High Performance Computer Architecture …, 2017	327	2017
A performance analysis framework for identifying potential benefits in GPGPU applications	J Sim, A Dasgupta, H Kim, R Vuduc	Proceedings of the 17th ACM SIGPLAN Annual Symposium on Principles and …, 2012	266	2012
Accelerating recurrent neural networks in analytics servers: Comparison of FPGA, CPU, GPU, and ASIC	E Nurvitadhi, J Sim, D Sheffield, A Mishra, S Krishnan, D Marr	2016 26th International Conference on Field Programmable Logic and …, 2016	233	2016
Transparent hardware management of stacked DRAM as part of memory	J Sim, AR Alameldeen, Z Chishti, C Wilkerson, H Kim	2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 13-24, 2014	149	2014
A mostly-clean DRAM cache for effective hit speculation and self-balancing dispatch	J Sim, GH Loh, H Kim, M O'Connor, M Thottethodi	2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 247-257, 2012	126	2012
Dynamically configuring regions of a main memory in a write-back mode or a write-through mode	J Sim, MS Thottethodi, GH Loh	US Patent 9,552,294, 2017	109	2017
A customizable matrix multiplication framework for the Intel HARPv2 Xeon+FPGA platform: A deep learning case study	DJM Moss, S Krishnan, E Nurvitadhi, P Ratuszniak, C Johnson, J Sim, ...	Proceedings of the 2018 ACM/SIGDA International Symposium on Field …, 2018	100	2018
High performance binary neural networks on the Xeon+FPGA™ platform	DJM Moss, E Nurvitadhi, J Sim, A Mishra, D Marr, S Subhaschandra, ...	2017 27th International Conference on Field Programmable Logic and …, 2017	93	2017
MacSim: A CPU-GPU heterogeneous simulation framework user guide	H Kim, J Lee, NB Lakshminarayana, J Sim, J Lim, T Pho	Georgia Institute of Technology, 1-57, 2012	86	2012
BSSync: Processing near memory for machine learning workloads with bounded staleness consistency models	JH Lee, J Sim, H Kim	2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015	80	2015
Why compete when you can work together: FPGA-ASIC integration for persistent RNNs	E Nurvitadhi, D Kwon, A Jafari, A Boutros, J Sim, P Tomson, H Sumbul, ...	2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019	67	2019
Batch-aware unified memory management in GPUs for irregular workloads	H Kim, J Sim, P Gera, R Hadidi, H Kim	Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020	65	2020
Resilient die-stacked DRAM caches	J Sim, GH Loh, V Sridharan, M O'Connor	ACM SIGARCH Computer Architecture News 41 (3), 416-427, 2013	65	2013
FLEXclusion: Balancing cache capacity and on-chip bandwidth via flexible exclusion	J Sim, J Lee, MK Qureshi, H Kim	ACM SIGARCH Computer Architecture News 40 (3), 321-332, 2012	54	2012
Partitioning caches for sub-entities in computing devices	GH Loh, J Sim	US Patent 9,098,417, 2015	35	2015
Method and apparatus for implementing a heterogeneous memory subsystem	CB Wilkerson, AR Alameldeen, ZA Chishti, J Sim	US Patent 9,472,248, 2016	26	2016
CoolPIM: Thermal-aware source throttling for efficient PIM instruction offloading	L Nai, R Hadidi, H Xiao, H Kim, J Sim, H Kim	2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018	23	2018
Specializing FGPU for persistent deep learning	R Ma, JC Hsu, T Tan, E Nurvitadhi, D Sheffield, R Pelt, M Langhammer, ...	ACM Transactions on Reconfigurable Technology and Systems (TRETS) 14 (2), 1-23, 2021	20	2021
