Towards Neural Phrase-based Machine Translation PS Huang, C Wang, S Huang, D Zhou, L Deng Sixth International Conference on Learning Representations (ICLR), 2018 | 68 | 2018 |
FPGA/DNN co-design: An efficient design methodology for IoT intelligence on the edge C Hao, X Zhang, Y Li, S Huang, J Xiong, K Rupnow, W Hwu, D Chen 2019 56th ACM/IEEE Design Automation Conference (DAC), 1-6, 2019 | 58 | 2019 |
Hardware acceleration of the pair-HMM algorithm for DNA variant calling S Huang, GJ Manikandan, A Ramachandran, K Rupnow, WW Hwu, ... Proceedings of the 2017 ACM/SIGDA International Symposium on Field …, 2017 | 50 | 2017 |
Accelerating subsequence similarity search based on dynamic time warping distance with FPGA Z Wang, S Huang, L Wang, H Li, Y Wang, H Yang Proceedings of the ACM/SIGDA international symposium on Field programmable …, 2013 | 37 | 2013 |
Automatic generation of warp-level primitives and atomic instructions for fast and portable parallel reduction on GPUs SG De Gonzalo, S Huang, J Gómez-Luna, S Hammond, O Mutlu, W Hwu 2019 IEEE/ACM International Symposium on Code Generation and Optimization …, 2019 | 17 | 2019 |
Collaborative computing for heterogeneous integrated systems LW Chang, J Gómez-Luna, I El Hajj, S Huang, D Chen, W Hwu Proceedings of the 8th ACM/SPEC on International Conference on Performance …, 2017 | 15 | 2017 |
Accelerating frequent item counting with FPGA Y Sun, Z Wang, S Huang, L Wang, Y Wang, R Luo, H Yang Proceedings of the 2014 ACM/SIGDA international symposium on Field …, 2014 | 14 | 2014 |
Analysis and modeling of collaborative execution strategies for heterogeneous CPU-FPGA architectures S Huang, LW Chang, I El Hajj, S Garcia de Gonzalo, J Gómez-Luna, ... Proceedings of the 2019 ACM/SPEC International Conference on Performance …, 2019 | 11 | 2019 |
Hardware-software co-design for an analog-digital accelerator for machine learning J Ambrosi, A Ankit, R Antunes, SR Chalamalasetti, S Chatterjee, I El Hajj, ... 2018 IEEE International Conference on Rebooting Computing (ICRC), 1-13, 2018 | 8 | 2018 |
Triangle counting and truss decomposition using FPGA S Huang, M El-Hadedy, C Hao, Q Li, VS Mailthody, K Date, J Xiong, ... 2018 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2018 | 7 | 2018 |
DTW-based subsequence similarity search on AMD heterogeneous computing platform S Huang, G Dai, Y Sun, Z Wang, Y Wang, H Yang 2013 IEEE 10th International Conference on High Performance Computing and …, 2013 | 7 | 2013 |
Accelerating sparse deep neural networks on FPGAs S Huang, C Pearson, R Nagi, J Xiong, D Chen, W Hwu 2019 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2019 | 6 | 2019 |
Thoughts on massively-parallel heterogeneous computing for solving large problems W Hwu, M Hidayetoglu, WC Chew, C Pearson, S Garcia, S Huang, ... 2017 Computing and Electromagnetics International Workshop (CEM), 67-68, 2017 | 3 | 2017 |
Acceleration of the Pair-HMM algorithm for DNA variant calling GJ Manikandan, S Huang, K Rupnow, WMW Hwu, D Chen 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom …, 2016 | 3 | 2016 |
Analysis and optimization of I/O cache coherency strategies for SoC-FPGA device SW Min, S Huang, M El-Hadedy, J Xiong, D Chen, W Hwu 2019 29th International Conference on Field Programmable Logic and …, 2019 | 1 | 2019 |
Near-memory and in-storage FPGA acceleration for emerging cognitive computing workloads A Dhar, S Huang, J Xiong, D Jamsek, B Mesnet, J Huang, NS Kim, W Hwu, ... 2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 68-75, 2019 | 1 | 2019 |
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture SW Min, K Wu, S Huang, M Hidayetoğlu, J Xiong, E Ebrahimi, D Chen, ... arXiv preprint arXiv:2103.03330, 2021 | | 2021 |
Mind mappings: enabling efficient algorithm-accelerator mapping space search K Hegde, PA Tsai, S Huang, V Chandra, A Parashar, CW Fletcher arXiv preprint arXiv:2103.01489, 2021 | | 2021 |
PyLog: An Algorithm-Centric Python-Based FPGA Programming and Synthesis Flow S Huang, K Wu, H Jeong, C Wang, D Chen, W Hwu The 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays …, 2021 | | 2021 |
PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses SW Min, K Wu, S Huang, M Hidayetoğlu, J Xiong, E Ebrahimi, D Chen, ... arXiv preprint arXiv:2101.07956, 2021 | | 2021 |