Follow
Peng Chen
Title
Cited by
Cited by
Year
Matrix engines for high performance computing: A paragon of performance or grasping at straws?
J Domke, E Vatai, A Drozd, P Chen, Y Oyama, L Zhang, S Salaria, ...
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021
332021
A versatile software systolic execution model for GPU memory-bound kernels
Peng Chen , Wahib Mohamed, Takizawa Shinichiro,Takano Ryousei, Matsuoka Satoshi
Proceedings of the International Conference for High Performance Computing …, 2019
272019
Boosting the predictive performance with aqueous solubility dataset curation
J Meng, P Chen, M Wahib, M Yang, L Zheng, Y Wei, S Feng, W Liu
Scientific Data 9 (1), 71, 2022
122022
Automatic generation of high-performance convolution kernels on arm cpus for deep learning
J Meng, C Zhuang, P Chen, M Wahib, B Schmidt, X Wang, H Lan, D Wu, ...
IEEE Transactions on Parallel and Distributed Systems 33 (11), 2885-2899, 2022
92022
iFDK: a scalable framework for instant high-resolution image reconstruction
Peng Chen , Wahib Mohamed, Takizawa Shinichiro,Takano Ryousei, Matsuoka Satoshi
Proceedings of the International Conference for High Performance Computing …, 2019
9*2019
Efficient Algorithms for the Summed Area Tables Primitive on GPUs
Peng Chen , Wahib Mohamed, Takizawa Shinichiro,Takano Ryousei, Matsuoka Satoshi
IEEE International Conference on Cluster Computing (CLUSTER), 2018
82018
At the locus of performance: A case study in enhancing cpus with copious 3d-stacked cache
J Domke, E Vatai, B Gerofi, Y Kodama, M Wahib, A Podobas, S Mittal, ...
arXiv preprint arXiv:2204.02235, 2022
52022
Scalable FBP decomposition for cone-beam CT reconstruction
P Chen, M Wahib, X Wang, T Hirofuchi, H Ogawa, A Biguri, R Boardman, ...
Proceedings of the International Conference for High Performance Computing …, 2021
52021
Physics-Based Iterative Reconstruction for Dual Source and Flying Focal Spot Computed Tomography
X Wang, RD MacDougall, P Chen, CA Bouman, SK Warfield
arXiv e-prints, arXiv: 2001.09471, 2021
5*2021
Persistent Kernels for Iterative Memory-bound GPU Applications
L Zhang, M Wahib, P Chen, J Meng, X Wang, S Matsuoka
arXiv preprint arXiv:2204.02064, 2022
42022
PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications
L Zhang, M Wahib, P Chen, J Meng, X Wang, T Endo, S Matsuoka
Proceedings of the 37th International Conference on Supercomputing, 167-179, 2023
32023
Performance portable back-projection algorithms on cpus: Agnostic data locality and vectorization optimizations
P Chen, M Wahib, X Wang, S Takizawa, T Hirofuchi, H Ogawa, ...
Proceedings of the ACM International Conference on Supercomputing, 316-328, 2021
32021
Evolutionary Architecture Search for Generative Adversarial Networks Based On Weight Sharing
Y Xue, W Tong, F Neri, P Chen, T Luo, L Zhen, X Wang
IEEE Transactions on Evolutionary Computation, 2023
22023
Simeuro: A Hybrid CPU-GPU Parallel Simulator for Neuromorphic Computing Chips
H Zhang, NM Ho, YP Dogukan, P Chen, M Wahib, TT Nguyen, J Meng, ...
IEEE Transactions on Parallel and Distributed Systems, 2023
22023
Revisiting Temporal Blocking Stencil Optimizations
L Zhang, M Wahib, P Chen, J Meng, X Wang, T Endo, S Matsuoka
Proceedings of the 37th International Conference on Supercomputing, 251-263, 2023
22023
Image gradient decomposition for parallel and memory-efficient ptychographic reconstruction
X Wang, A Tsaris, D Mukherjee, M Wahib, P Chen, M Oxley, ...
Proceedings of the International Conference for High Performance Computing …, 2022
22022
At the locus of performance: Quantifying the effects of copious 3D-stacked cache on HPC workloads
J Domke, E Vatai, B Gerofi, Y Kodama, M Wahib, A Podobas, S Mittal, ...
ACM Transactions on Architecture and Code Optimization 20 (4), 1-26, 2023
12023
Ultra-Long Sequence Distributed Transformer
X Wang, I Lyngaas, A Tsaris, P Chen, S Dash, MC Shekar, T Luo, ...
arXiv preprint arXiv:2311.02382, 2023
12023
Pushing the Limits for 2D Convolution Computation On CUDA-enabled GPUs
P Chen, M Wahib, S Takizawa, S Matsuoka
Technical Report, HPC-163, 2018
12018
Neural Architecture Search With Progressive Evaluation and Sub-Population Preservation
Y Xue, J Zha, D Pelusi, P Chen, T Luo, L Zhen, Y Wang, M Wahib
IEEE Transactions on Evolutionary Computation, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20