Suivre
Xuhao Chen
Titre
Citée par
Citée par
Année
Adaptive Cache Management for Energy-efficient GPU Computing
X Chen, LW Chang, CI Rodrigues, J Lv, Z Wang, W Hwu
Proceedings of the 47th Annual IEEE/ACM International Symposium on …, 2014
1812014
Escoin: Efficient sparse convolutional neural network inference on gpus
X Chen
arXiv preprint arXiv:1802.10280, 2018
322018
Adaptive cache bypass and insertion for many-core accelerators
X Chen, S Wu, LW Chang, WS Huang, C Pearson, Z Wang, WMW Hwu
Proceedings of International Workshop on Manycore Embedded Systems, 1-8, 2014
302014
Pangolin: An efficient and flexible graph mining system on cpu and gpu
X Chen, R Dathathri, G Gill, K Pingali
Proceedings of the VLDB Endowment 13 (8), 1190-1205, 2020
252020
Architecting energy-efficient STT-RAM based register file on GPGPUs via delta compression
H Zhang, X Chen, N Xiao, F Liu
Proceedings of the 53rd Annual Design Automation Conference, 1-6, 2016
232016
DistTC: High performance distributed triangle counting
L Hoang, V Jatala, X Chen, U Agarwal, R Dathathri, G Gill, K Pingali
2019 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2019
192019
Efficient and portable ALS matrix factorization for recommender systems
J Chen, J Fang, W Liu, T Tang, X Chen, C Yang
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
142017
Performance model for OpenMP parallelized loops
Z Zheng, X Chen, Z Wang, L Shen, J Li
Proceedings 2011 International Conference on Transportation, Mechanical, and …, 2011
132011
Efficient and high‐quality sparse graph coloring on GPUs
X Chen, P Li, J Fang, T Tang, Z Wang, C Yang
Concurrency and Computation: Practice and Experience 29 (10), e4064, 2017
122017
High Performance Parallel Graph Coloring on GPGPUs
P Li, X Chen, Z Quan, J Fang, H Su, T Tang, C Yang
Proceedings of the 30th IEEE International Parallel & Distributed Processing …, 2016
92016
Red-shield: Shielding read disturbance for STT-RAM based register files on GPUs
H Zhang, X Chen, N Xiao, F Liu, Z Chen
2016 International Great Lakes Symposium on VLSI (GLSVLSI), 389-392, 2016
82016
Sandslash: a two-level framework for efficient graph pattern mining
X Chen, R Dathathri, G Gill, L Hoang, K Pingali
Proceedings of the ACM International Conference on Supercomputing, 378-391, 2021
72021
GraphCage: Cache aware graph processing on GPUs
X Chen
arXiv preprint arXiv:1904.02241, 2019
72019
Orchestrating parallel detection of strongly connected components on GPUs
X Chen, C Chen, J Shen, J Fang, T Tang, C Yang, Z Wang
Parallel Computing 78, 101-114, 2018
72018
High performance detection of strongly connected components in sparse graphs on GPUs
P Li, X Chen, J Shen, J Fang, T Tang, C Yang
Proceedings of the 8th International Workshop on Programming Models and …, 2017
72017
Shielding STT-RAM based register files on GPUs against read disturbance
H Zhang, X Chen, N Xiao, L Wang, F Liu, W Chen, Z Chen
ACM Journal on Emerging Technologies in Computing Systems (JETC) 13 (2), 1-17, 2016
72016
Gardenia: A graph processing benchmark suite for next-generation accelerators
Z Xu, X Chen, J Shen, Y Zhang, C Chen, C Yang
ACM Journal on Emerging Technologies in Computing Systems (JETC) 15 (1), 1-13, 2019
62019
Evaluating multiple streams on heterogeneous platforms
J Fang, P Zhang, Z Li, T Tang, X Chen, C Chen, C Yang
Parallel Processing Letters 26 (04), 1640002, 2016
62016
Evaluating the performance impact of multiple streams on the mic-based heterogeneous platform
Z Li, J Fang, T Tang, X Chen, C Chen, C Yang
2016 IEEE International Parallel and Distributed Processing Symposium …, 2016
62016
Characterizing fine-grain parallelism on modern multicore platform
X Chen, W Chen, J Li, Z Zheng, L Shen, Z Wang
2011 IEEE 17th International Conference on Parallel and Distributed Systems …, 2011
62011
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20