Follow
Yunquan Zhang
Title
Cited by
Cited by
Year
AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs
Q Wang, X Zhang, Y Zhang, Q Yi
Proceedings of the international conference on high performance computing …, 2013
2672013
Model-driven level 3 BLAS performance optimization on Loongson 3A processor
Z Xianyi, W Qian, Z Yunquan
2012 IEEE 18th international conference on parallel and distributed systems …, 2012
2412012
yaSpMV: Yet another SpMV framework on GPUs
S Yan, C Li, Y Zhang, H Zhou
Acm Sigplan Notices 49 (8), 107-118, 2014
1742014
StreamScan: fast scan algorithms for GPUs without global barrier synchronization
S Yan, G Long, Y Zhang
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of …, 2013
1152013
Parallel processing systems for big data: a survey
Y Zhang, T Cao, S Li, X Tian, L Yuan, H Jia, AV Vasilakos
Proceedings of the IEEE 104 (11), 2114-2136, 2016
1072016
Models of parallel computation: a survey and classification
Y Zhang, G Chen, G Sun, Q Miao
Frontiers of Computer Science in China 1, 156-165, 2007
482007
GPURoofline: a model for guiding performance optimizations on GPUs
H Jia, Y Zhang, G Long, J Xu, S Yan, Y Li
Euro-Par 2012 Parallel Processing: 18th International Conference, Euro-Par …, 2012
452012
MPFFT: An auto-tuning FFT library for OpenCL GPUs
Y Li, YQ Zhang, YQ Liu, GP Long, HP Jia
Journal of Computer Science and Technology 28 (1), 90-105, 2013
432013
A parallel shortest path algorithm based on graph-partitioning and iterative correcting
Y Tang, Y Zhang, H Chen
2008 10th IEEE International Conference on High Performance Computing and …, 2008
392008
Accelerating viola-jones facce detection algorithm on gpus
H Jia, Y Zhang, W Wang, J Xu
2012 IEEE 14th International Conference on High Performance Computing and …, 2012
382012
Study on parallel computing
GL Chen, GZ Sun, YQ Zhang, ZY Mo
Journal of Computer Science and Technology 21, 665-673, 2006
382006
Optimizing SpMV for diagonal sparse matrices on GPU
X Sun, Y Zhang, T Wang, X Zhang, L Yuan, L Rao
2011 International conference on parallel processing, 492-501, 2011
372011
Performance evaluation of allgather algorithms on terascale linux cluster with fast ethernet
J Chen, L Zhang, Y Zhang, W Yuan
Eighth International Conference on High-Performance Computing in Asia …, 2005
342005
Cache-oblivious MPI all-to-all communications based on Morton order
S Li, Y Zhang, T Hoefler
IEEE Transactions on Parallel and Distributed Systems 29 (3), 542-555, 2017
282017
Performance evaluation of multithreaded sparse matrix-vector multiplication using openmp
S Liu, Y Zhang, X Sun, RR Qiu
2009 11th IEEE International Conference on High Performance Computing and …, 2009
282009
Parallelization and performance optimization on face detection algorithm with OpenCL: A case study
W Wang, Y Zhang, S Yan, Y Zhang, H Jia
Tsinghua Science and Technology 17 (3), 287-295, 2012
232012
DRAM (h): a parallel computation model for high performance numerical computing
YQ Zhang
CHINESE JOURNAL OF COMPUTERS-CHINESE EDITION- 26 (12), 1660-1670, 2003
232003
pVOCL: Power-aware dynamic placement and migration in virtualized GPU environments
P Lama, Y Li, AM Aji, P Balaji, J Dinan, S Xiao, Y Zhang, W Feng, ...
2013 IEEE 33rd International Conference on Distributed Computing Systems …, 2013
222013
LogGPH: A parallel computational model with hierarchical communication awareness
L Yuan, Y Zhang, Y Tang, L Rao, X Sun
2010 13th IEEE International Conference on Computational Science and …, 2010
212010
Earth system model: CAS-ESM
Z Guangqing, Z Yunquan, J Jinrong, Z He, W Baodong, C Hang, W Tianyi, ...
Frontiers of Data and Domputing 2 (1), 38-54, 2020
192020
The system can't perform the operation now. Try again later.
Articles 1–20