CaffePresso: An optimized library for deep learning on embedded accelerator-based platforms G Hegde, Siddhartha, N Ramasamy, N Kapre Proceedings of the International Conference on Compilers, Architectures and …, 2016 | 63 | 2016 |
CaffePresso: Accelerating convolutional networks on embedded SoCs G Hegde, Siddhartha, N Kapre ACM Transactions on Embedded Computing Systems (TECS) 17 (1), 1-26, 2017 | 17 | 2017 |
LUXOR: An FPGA logic cell architecture for efficient compressor tree implementations SR Rasoulinezhad, Siddhartha, H Zhou, L Wang, D Boland, PHW Leong Proceedings of the 2020 ACM/SIGDA International Symposium on Field …, 2020 | 13 | 2020 |
A case for embedded fpga-based socs in energy-efficient acceleration of graph problems N Kapre, P Moorthy Supercomputing Frontiers and Innovations 2 (3), 76-86, 2015 | 5 | 2015 |
Evaluating embedded fpga accelerators for deep learning applications G Hegde, N Ramasamy, V Buddha, N Kapre 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom …, 2016 | 4 | 2016 |
Heterogeneous dataflow architectures for fpga-based sparse lu factorization N Kapre 2014 24th International Conference on Field Programmable Logic and …, 2014 | 3 | 2014 |
Limits of Statically-Scheduled Token Dataflow Processing N Kapre 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale …, 2014 | 3 | 2014 |
Breaking Sequential Dependencies in FPGA-based Sparse LU Factorization N Kapre 2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom …, 2014 | 3 | 2014 |
Vector FPGA acceleration of 1-D DWT computations using sparse matrix skeletons S Maheshwari, G Modi, N Kapre 2016 26th International Conference on Field Programmable Logic and …, 2016 | | 2016 |
Communication Optimization for the 16-Core Epiphany Floating-Point Processor Array N Kapre 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom …, 2016 | | 2016 |
FPGA Acceleration of Irregular Iterative Computations using Criticality-Aware Dataflow Optimizations Siddhartha, N Kapre Proceedings of the 2015 ACM/SIGDA International Symposium on Field …, 2015 | | 2015 |
Fanout decomposition dataflow optimizations for FPGA-based Sparse LU factorization N Kapre 2014 International Conference on Field-Programmable Technology (FPT), 252-255, 2014 | | 2014 |